Skip Header

Contribute Send feedback
Read comments (?) or add your own

Q12857 (NFIA_HUMAN) Reviewed, UniProtKB/Swiss-Prot

Last modified April 3, 2013. Version 124. Feed History...

Clusters with 100%, 90%, 50% identity | Documents (3) | Third-party data text xml rdf/xml gff fasta
to top of pageNames·Attributes·General annotation·Ontologies·Alt products·Sequence annotation·Sequences·References·Cross-refs·Entry info·DocumentsCustomize order

Names and origin

Protein namesRecommended name:
Nuclear factor 1 A-type

Short name=NF1-A
Short name=Nuclear factor 1/A
Alternative name(s):
CCAAT-box-binding transcription factor
Short name=CTF
Nuclear factor I/A
Short name=NF-I/A
Short name=NFI-A
TGGCA-binding protein
Gene names
Name:NFIA
Synonyms:KIAA1439
OrganismHomo sapiens (Human) [Reference proteome]
Taxonomic identifier9606 [NCBI]
Taxonomic lineageEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresPrimatesHaplorrhiniCatarrhiniHominidaeHomo

Protein attributes

Sequence length509 AA.
Sequence statusComplete.
Protein existenceEvidence at protein level

General annotation (Comments)

Function

Recognizes and binds the palindromic sequence 5'-TTGGCNNNNNGCCAA-3' present in viral and cellular promoters and in the origin of replication of adenovirus type 2. These proteins are individually capable of activating transcription and replication.

Subunit structure

Binds DNA as a homodimer.

Subcellular location

Nucleus.

Sequence similarities

Belongs to the CTF/NF-I family.

Contains 1 CTF/NF-I DNA-binding domain.

Sequence caution

The sequence BAA92677.1 differs from that shown. Reason: Erroneous initiation.

Ontologies

Alternative products

This entry describes 2 isoforms produced by alternative splicing. [Align] [Select]
Isoform 1 (identifier: Q12857-1)

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.
Isoform 2 (identifier: Q12857-2)

The sequence of this isoform differs from the canonical sequence as follows:
     474-509: TYSTPSTSPANRFVSVGPRDPSFVNIPQQTQSWYLG → ILVPGIKVAASHHPPDRPPDPFSTL

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Chain1 – 509509Nuclear factor 1 A-type
PRO_0000100191

Regions

DNA binding1 – 194194CTF/NF-I

Amino acid modifications

Modified residue2581Phosphoserine Ref.8
Modified residue2651Phosphoserine Ref.8 Ref.10
Modified residue2801Phosphoserine Ref.8 Ref.9 Ref.10
Modified residue2871Phosphoserine Ref.7 Ref.8 Ref.9 Ref.10
Modified residue3001Phosphoserine Ref.8 Ref.10
Modified residue3191Phosphoserine Ref.8
Modified residue3601Phosphoserine Ref.10

Natural variations

Alternative sequence474 – 50936TYSTP…SWYLG → ILVPGIKVAASHHPPDRPPD PFSTL in isoform 2.
VSP_036620

Experimental info

Sequence conflict1861A → G in AAA93124. Ref.5
Sequence conflict240 – 2434TGPN → PAPT in AAA93124. Ref.5

Sequences

Sequence LengthMass (Da)Tools
Isoform 1 [UniParc].

Last modified June 1, 2001. Version 2.
Checksum: 42090C6B8B229F87

FASTA50955,944
        10         20         30         40         50         60 
MYSPLCLTQD EFHPFIEALL PHVRAFAYTW FNLQARKRKY FKKHEKRMSK EEERAVKDEL 

        70         80         90        100        110        120 
LSEKPEVKQK WASRLLAKLR KDIRPEYRED FVLTVTGKKP PCCVLSNPDQ KGKMRRIDCL 

       130        140        150        160        170        180 
RQADKVWRLD LVMVILFKGI PLESTDGERL VKSPQCSNPG LCVQPHHIGV SVKELDLYLA 

       190        200        210        220        230        240 
YFVHAADSSQ SESPSQPSDA DIKDQPENGH LGFQDSFVTS GVFSVTELVR VSQTPIAAGT 

       250        260        270        280        290        300 
GPNFSLSDLE SSSYYSMSPG AMRRSLPSTS STSSTKRLKS VEDEMDSPGE EPFYTGQGRS 

       310        320        330        340        350        360 
PGSGSQSSGW HEVEPGMPSP TTLKKSEKSG FSSPSPSQTS SLGTAFTQHH RPVITGPRAS 

       370        380        390        400        410        420 
PHATPSTLHF PTSPIIQQPG PYFSHPAIRY HPQETLKEFV QLVCPDAGQQ AGQVGFLNPN 

       430        440        450        460        470        480 
GSSQGKVHNP FLPTPMLPPP PPPPMARPVP LPVPDTKPPT TSTEGGAASP TSPTYSTPST 

       490        500 
SPANRFVSVG PRDPSFVNIP QQTQSWYLG 

« Hide

Isoform 2 [UniParc].

Checksum: 0C47EA46EC78030C
Show »

FASTA49854,620

References

« Hide 'large scale' references
[1]"Prediction of the coding sequences of unidentified human genes. XVI. The complete sequences of 150 new cDNA clones from brain which code for large proteins in vitro."
Nagase T., Kikuno R., Ishikawa K., Hirosawa M., Ohara O.
DNA Res. 7:65-73(2000) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1).
Tissue: Brain.
[2]"The DNA sequence and biological annotation of human chromosome 1."
Gregory S.G., Barlow K.F., McLay K.E., Kaul R., Swarbreck D., Dunham A., Scott C.E., Howe K.L., Woodfine K., Spencer C.C.A., Jones M.C., Gillson C., Searle S., Zhou Y., Kokocinski F., McDonald L., Evans R., Phillips K. expand/collapse author list , Atkinson A., Cooper R., Jones C., Hall R.E., Andrews T.D., Lloyd C., Ainscough R., Almeida J.P., Ambrose K.D., Anderson F., Andrew R.W., Ashwell R.I.S., Aubin K., Babbage A.K., Bagguley C.L., Bailey J., Beasley H., Bethel G., Bird C.P., Bray-Allen S., Brown J.Y., Brown A.J., Buckley D., Burton J., Bye J., Carder C., Chapman J.C., Clark S.Y., Clarke G., Clee C., Cobley V., Collier R.E., Corby N., Coville G.J., Davies J., Deadman R., Dunn M., Earthrowl M., Ellington A.G., Errington H., Frankish A., Frankland J., French L., Garner P., Garnett J., Gay L., Ghori M.R.J., Gibson R., Gilby L.M., Gillett W., Glithero R.J., Grafham D.V., Griffiths C., Griffiths-Jones S., Grocock R., Hammond S., Harrison E.S.I., Hart E., Haugen E., Heath P.D., Holmes S., Holt K., Howden P.J., Hunt A.R., Hunt S.E., Hunter G., Isherwood J., James R., Johnson C., Johnson D., Joy A., Kay M., Kershaw J.K., Kibukawa M., Kimberley A.M., King A., Knights A.J., Lad H., Laird G., Lawlor S., Leongamornlert D.A., Lloyd D.M., Loveland J., Lovell J., Lush M.J., Lyne R., Martin S., Mashreghi-Mohammadi M., Matthews L., Matthews N.S.W., McLaren S., Milne S., Mistry S., Moore M.J.F., Nickerson T., O'Dell C.N., Oliver K., Palmeiri A., Palmer S.A., Parker A., Patel D., Pearce A.V., Peck A.I., Pelan S., Phelps K., Phillimore B.J., Plumb R., Rajan J., Raymond C., Rouse G., Saenphimmachak C., Sehra H.K., Sheridan E., Shownkeen R., Sims S., Skuce C.D., Smith M., Steward C., Subramanian S., Sycamore N., Tracey A., Tromans A., Van Helmond Z., Wall M., Wallis J.M., White S., Whitehead S.L., Wilkinson J.E., Willey D.L., Williams H., Wilming L., Wray P.W., Wu Z., Coulson A., Vaudin M., Sulston J.E., Durbin R.M., Hubbard T., Wooster R., Dunham I., Carter N.P., McVean G., Ross M.T., Harrow J., Olson M.V., Beck S., Rogers J., Bentley D.R.
Nature 441:315-321(2006) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
[3]Mural R.J., Istrail S., Sutton G.G., Florea L., Halpern A.L., Mobarry C.M., Lippert R., Walenz B., Shatkay H., Dew I., Miller J.R., Flanigan M.J., Edwards N.J., Bolanos R., Fasulo D., Halldorsson B.V., Hannenhalli S., Turner R. expand/collapse author list , Yooseph S., Lu F., Nusskern D.R., Shue B.C., Zheng X.H., Zhong F., Delcher A.L., Huson D.H., Kravitz S.A., Mouchard L., Reinert K., Remington K.A., Clark A.G., Waterman M.S., Eichler E.E., Adams M.D., Hunkapiller M.W., Myers E.W., Venter J.C.
Submitted (SEP-2005) to the EMBL/GenBank/DDBJ databases
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
[4]"The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)."
The MGC Project Team
Genome Res. 14:2121-2127(2004) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 2).
Tissue: Skeletal muscle.
[5]"Chromosomal localization of the four genes (NFIA, B, C, and X) for the human transcription factor nuclear factor I by FISH."
Qian F., Kruse U., Lichter P., Sippel A.E.
Genomics 28:66-73(1995) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [MRNA] OF 19-243.
[6]"Global, in vivo, and site-specific phosphorylation dynamics in signaling networks."
Olsen J.V., Blagoev B., Gnad F., Macek B., Kumar C., Mortensen P., Mann M.
Cell 127:635-648(2006) [PubMed] [Europe PMC] [Abstract]
Cited for: IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
Tissue: Cervix carcinoma.
[7]"Combining protein-based IMAC, peptide-based IMAC, and MudPIT for efficient phosphoproteomic analysis."
Cantin G.T., Yi W., Lu B., Park S.K., Xu T., Lee J.-D., Yates J.R. III
J. Proteome Res. 7:1346-1351(2008) [PubMed] [Europe PMC] [Abstract]
Cited for: PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-287, MASS SPECTROMETRY.
Tissue: Cervix carcinoma.
[8]"A quantitative atlas of mitotic phosphorylation."
Dephoure N., Zhou C., Villen J., Beausoleil S.A., Bakalarski C.E., Elledge S.J., Gygi S.P.
Proc. Natl. Acad. Sci. U.S.A. 105:10762-10767(2008) [PubMed] [Europe PMC] [Abstract]
Cited for: PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-258; SER-265; SER-280; SER-287; SER-300 AND SER-319, MASS SPECTROMETRY.
Tissue: Cervix carcinoma.
[9]"Quantitative phosphoproteomic analysis of T cell receptor signaling reveals system-wide modulation of protein-protein interactions."
Mayya V., Lundgren D.H., Hwang S.-I., Rezaul K., Wu L., Eng J.K., Rodionov V., Han D.K.
Sci. Signal. 2:RA46-RA46(2009) [PubMed] [Europe PMC] [Abstract]
Cited for: PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-280 AND SER-287, MASS SPECTROMETRY.
Tissue: Leukemic T-cell.
[10]"Quantitative phosphoproteomics reveals widespread full phosphorylation site occupancy during mitosis."
Olsen J.V., Vermeulen M., Santamaria A., Kumar C., Miller M.L., Jensen L.J., Gnad F., Cox J., Jensen T.S., Nigg E.A., Brunak S., Mann M.
Sci. Signal. 3:RA3-RA3(2010) [PubMed] [Europe PMC] [Abstract]
Cited for: PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-265; SER-280; SER-287; SER-300 AND SER-360, MASS SPECTROMETRY.
Tissue: Cervix carcinoma.
+Additional computationally mapped references.

Cross-references

Sequence databases

EMBL
GenBank
DDBJ
AB037860 mRNA. Translation: BAA92677.1. Different initiation.
AL445198, AC096534, AL096888 Genomic DNA. Translation: CAI13080.1.
AL445198, AC096534, AL096888 Genomic DNA. Translation: CAI13081.1.
AL096888, AC096534, AL445198 Genomic DNA. Translation: CAI23240.1.
AL096888, AC096534, AL445198 Genomic DNA. Translation: CAI23241.1.
CH471059 Genomic DNA. Translation: EAX06601.1.
BC022264 mRNA. Translation: AAH22264.1.
U07809 mRNA. Translation: AAA93124.1.
IPIIPI00029745.
IPI00923394.
RefSeqNP_001128145.1. NM_001134673.3.
NP_001138983.1. NM_001145511.1.
NP_001138984.1. NM_001145512.1.
NP_005586.1. NM_005595.4.
UniGeneHs.710546.
Hs.740757.

3D structure databases

ProteinModelPortalQ12857.
ModBaseSearch...

Protein-protein interaction databases

STRING9606.ENSP00000384523.

PTM databases

PhosphoSiteQ12857.

Polymorphism databases

DMDM14194959.

Proteomic databases

PaxDbQ12857.
PRIDEQ12857.

Protocols and materials databases

DNASU4774.
StructuralBiologyKnowledgebaseSearch...

Genome annotation databases

EnsemblENST00000403491; ENSP00000384523; ENSG00000162599.
ENST00000485903; ENSP00000419785; ENSG00000162599.
GeneID4774.
KEGGhsa:4774.
UCSCuc001czv.3. human.
uc001czw.3. human.

Organism-specific databases

CTD4774.
GeneCardsGC01P061260.
HGNCHGNC:7784. NFIA.
HPAHPA006111.
HPA008884.
MIM600727. gene.
neXtProtNX_Q12857.
PharmGKBPA31590.
HUGESearch...
GenAtlasSearch...

Phylogenomic databases

eggNOGNOG252172.
HOGENOMHOG000013028.
HOVERGENHBG006561.
KOK09168.
OrthoDBEOG4ZKJM9.

Enzyme and pathway databases

Pathway_Interaction_DBhnf3apathway. FOXA1 transcription factor network.

Gene expression databases

ArrayExpressQ12857.
BgeeQ12857.
CleanExHS_NFIA.
GenevestigatorQ12857.

Family and domain databases

InterProIPR000647. CTF/NFI.
IPR020604. CTF/NFI_DNA-bd-dom.
IPR019739. CTF/NFI_DNA-bd_CS.
IPR019548. CTF/NFI_DNA-bd_N.
IPR003619. MAD_homology1_Dwarfin-type.
[Graphical view]
PANTHERPTHR11492. PTHR11492. 1 hit.
PfamPF00859. CTF_NFI. 1 hit.
PF03165. MH1. 1 hit.
PF10524. NfI_DNAbd_pre-N. 1 hit.
[Graphical view]
SMARTSM00523. DWA. 1 hit.
[Graphical view]
PROSITEPS00349. CTF_NFI_1. 1 hit.
PS51080. CTF_NFI_2. 1 hit.
[Graphical view]
ProtoNetSearch...

Other

ChiTaRSNFIA. human.
GenomeRNAi4774.
NextBio18406.
SOURCESearch...

Entry information

Entry nameNFIA_HUMAN
AccessionPrimary (citable) accession number: Q12857
Secondary accession number(s): Q8TA97, Q9H3X9, Q9P2A9
Entry history
Integrated into UniProtKB/Swiss-Prot: June 1, 2001
Last sequence update: June 1, 2001
Last modified: April 3, 2013
This is version 124 of the entry and version 2 of the sequence. [Complete history]
Entry statusReviewed (UniProtKB/Swiss-Prot)
Annotation programChordata Protein Annotation Program
DisclaimerAny medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care.

Relevant documents

Human chromosome 1

Human chromosome 1: entries, gene names and cross-references to MIM

MIM cross-references

Online Mendelian Inheritance in Man (MIM) cross-references in UniProtKB/Swiss-Prot

SIMILARITY comments

Index of protein domains and families