Reviewed,
UniProtKB/Swiss-Prot P15822 (ZEP1_HUMAN)
Last modified
October 13, 2009.
Version 103.
History...
Clusters with 100%,
90%,
50% identity |
Documents (6) |
Third-party data |
Customize display | text xml rdf/xml gff fasta |
Names and origin
| Protein names | Recommended name: Zinc finger protein 40 Alternative name(s): Human immunodeficiency virus type I enhancer-binding protein 1 Short name=HIV-EP1 Major histocompatibility complex-binding protein 1 Short name=MBP-1 Positive regulatory domain II-binding factor 1 Short name=PRDII-BF1 Gate keeper of apoptosis-activating protein Short name=GAAP Cirhin interaction protein Short name=CIRIP | ||||
| Gene names |
| ||||
| Organism | Homo sapiens (Human) [Complete proteome] | ||||
| Taxonomic identifier | 9606 [NCBI] | ||||
| Taxonomic lineage | Eukaryota › Metazoa › Chordata › Craniata › Vertebrata › Euteleostomi › Mammalia › Eutheria › Euarchontoglires › Primates › Haplorrhini › Catarrhini › Hominidae › Homo |
Protein attributes
| Sequence length | 2718 AA. |
| Sequence status | Complete. |
| Sequence processing | The displayed sequence is not processed. |
| Protein existence | Evidence at protein level. |
General annotation (Comments)
| Function | This protein specifically binds to the DNA sequence 5'-GGGACTTTCC-3' which is found in the enhancer elements of numerous viral promoters such as those of SV40, CMV, or HIV-1. In addition, related sequences are found in the enhancer elements of a number of cellular promoters, including those of the class I MHC, interleukin-2 receptor, and interferon-beta genes. It may act in T-cell activation. Involved in activating HIV-1 gene expression. Isoform 2 and isoform 3 also bind to the IPCS (IRF1 and p53 common sequence) DNA sequence in the promoter region of interferon regulatory factor 1 and p53 genes and are involved in transcription regulation of these genes. Isoform 2 does not activate HIV-1 gene expression. Isoform 2 and isoform 3 may be involved in apoptosis. Ref.2 Ref.7 Ref.8 Ref.9 |
| Subcellular location | Isoform 1: Nucleus. Ref.2 Ref.9 |
| Induction | By mitogens and phorbol ester. |
| Domain | Contains two sets of 2 zinc-fingers, which are widely separated and recognize the same DNA sequence. There is a fifth zinc-finger in-between. |
| Sequence similarities | Contains 5 C2H2-type zinc fingers. |
| Sequence caution | The sequence CAA35798.1 differs from that shown. Reason: Frameshift at several positions. The sequence CAH73909.1 differs from that shown. Reason: Erroneous gene model prediction. The sequence CAH73982.1 differs from that shown. Reason: Erroneous gene model prediction. The sequence CAI14768.1 differs from that shown. Reason: Erroneous gene model prediction. The sequence CAI21070.1 differs from that shown. Reason: Erroneous gene model prediction. |
Ontologies
| Keywords | |
|---|---|
| Biological process | Transcription Transcription regulation |
| Cellular component | Cytoplasm Nucleus |
| Coding sequence diversity | Alternative splicing Polymorphism |
| Domain | Repeat Zinc-finger |
| Ligand | DNA-binding Metal-binding Zinc |
| Molecular function | Activator |
| PTM | Phosphoprotein |
| Technical term | 3D-structure Complete proteome |
| Gene Ontology (GO) | |
| Biological process | regulation of transcription Inferred from electronic annotation. Source: UniProtKB-KW transcriptionInferred from electronic annotation. Source: UniProtKB-KW |
| Cellular component | cytoplasm Inferred from electronic annotation. Source: UniProtKB-SubCell nucleus Ref.1Traceable author statement. Source: ProtInc |
| Molecular function | DNA binding Ref.1 Traceable author statement. Source: ProtInc protein bindingInferred from physical interaction. Source: IntAct zinc ion bindingInferred from electronic annotation. Source: UniProtKB-KW |
| Complete GO annotation... | |
Alternative products
| This entry describes 3 isoforms produced by alternative splicing. [Align] [Select] | ||||||
| Isoform 1 (identifier: P15822-1) This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry. | ||||||
| Isoform 2 (identifier: P15822-2) Also known as: Delta 2; GAAP-1; The sequence of this isoform differs from the canonical sequence as follows: 1-2017: Missing. 2018-2024: KWKSSLS → MGQKFQK | ||||||
| Isoform 3 (identifier: P15822-3) Also known as: GAAP-2; The sequence of this isoform differs from the canonical sequence as follows: 1-2002: Missing. 2003-2024: AITTHSKSDLLVYSSKWKSSLS → MGQKFQKKSYRLVLKELRNPLL |
Sequence annotation (Features)
| Feature key | Position(s) | Length | Description | Graphical view | Feature identifier | ||||||||||||||
Molecule processing | |||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Chain | 1 – 2718 | 2718 | Zinc finger protein 40 | PRO_0000047369 | |||||||||||||||
Regions | |||||||||||||||||||
| Zinc finger | 406 – 428 | 23 | C2H2-type 1 | ||||||||||||||||
| Zinc finger | 434 – 456 | 23 | C2H2-type 2 | ||||||||||||||||
| Zinc finger | 959 – 986 | 28 | C2H2-type 3 | ||||||||||||||||
| Zinc finger | 2088 – 2110 | 23 | C2H2-type 4 | ||||||||||||||||
| Zinc finger | 2116 – 2140 | 25 | C2H2-type 5 | ||||||||||||||||
| Compositional bias | 803 – 806 | 4 | Poly-Ser | ||||||||||||||||
Amino acid modifications | |||||||||||||||||||
| Modified residue | 429 | 1 | Phosphothreonine Ref.12 | ||||||||||||||||
| Modified residue | 484 | 1 | Phosphoserine Ref.12 | ||||||||||||||||
| Modified residue | 492 | 1 | Phosphoserine Ref.12 | ||||||||||||||||
| Modified residue | 495 | 1 | Phosphoserine Ref.12 | ||||||||||||||||
| Modified residue | 1036 | 1 | Phosphoserine Ref.12 | ||||||||||||||||
| Modified residue | 1051 | 1 | Phosphoserine Ref.10 | ||||||||||||||||
| Modified residue | 1735 | 1 | Phosphoserine Ref.12 | ||||||||||||||||
| Modified residue | 1749 | 1 | Phosphoserine Ref.12 | ||||||||||||||||
| Modified residue | 1753 | 1 | Phosphoserine Ref.12 | ||||||||||||||||
| Modified residue | 1873 | 1 | Phosphoserine Ref.11 | ||||||||||||||||
| Modified residue | 1874 | 1 | Phosphoserine Ref.11 | ||||||||||||||||
Natural variations | |||||||||||||||||||
| Alternative sequence | 1 – 2017 | 2017 | Missing in isoform 2. | VSP_037714 | |||||||||||||||
| Alternative sequence | 1 – 2002 | 2002 | Missing in isoform 3. | VSP_037715 | |||||||||||||||
| Alternative sequence | 2003 – 2024 | 22 | AITTH…KSSLS → MGQKFQKKSYRLVLKELRNP LL in isoform 3. | VSP_037716 | |||||||||||||||
| Alternative sequence | 2018 – 2024 | 7 | KWKSSLS → MGQKFQK in isoform 2. | VSP_037717 | |||||||||||||||
| Natural variant | 187 | 1 | T → M: dbSNP rs2228209. | VAR_057383 | |||||||||||||||
| Natural variant | 362 | 1 | P → L: dbSNP rs34221818. | VAR_057384 | |||||||||||||||
| Natural variant | 716 | 1 | T → A: dbSNP rs2228210. | VAR_057385 | |||||||||||||||
| Natural variant | 828 | 1 | V → I: dbSNP rs2228218. | VAR_057386 | |||||||||||||||
| Natural variant | 873 | 1 | T → A: dbSNP rs6900196. Ref.4 Ref.5 | VAR_057387 | |||||||||||||||
| Natural variant | 1074 | 1 | N → S: dbSNP rs2228220. Ref.1 | VAR_057388 | |||||||||||||||
| Natural variant | 1170 | 1 | K → N: dbSNP rs34258344. Ref.1 | VAR_057389 | |||||||||||||||
| Natural variant | 1520 | 1 | A → G: dbSNP rs2228212. Ref.4 | VAR_057390 | |||||||||||||||
| Natural variant | 1609 | 1 | M → I: dbSNP rs2228213. Ref.5 | VAR_057391 | |||||||||||||||
| Natural variant | 1915 | 1 | Q → R: dbSNP rs1126472. Ref.1 | VAR_057392 | |||||||||||||||
| Natural variant | 2444 | 1 | T → M: dbSNP rs2228214. | VAR_059892 | |||||||||||||||
| Natural variant | 2692 | 1 | A → G: dbSNP rs1042054. Ref.5 Ref.1 | VAR_059893 | |||||||||||||||
Experimental info | |||||||||||||||||||
| Sequence conflict | 515 | 1 | P → N in CAA35798. Ref.1 | ||||||||||||||||
| Sequence conflict | 1227 | 1 | V → I in CAA35798. Ref.1 | ||||||||||||||||
| Sequence conflict | 1436 | 1 | N → G in CAA35798. Ref.1 | ||||||||||||||||
| Sequence conflict | 1660 | 1 | V → E in AAA17534. Ref.5 | ||||||||||||||||
| Sequence conflict | 1883 | 1 | I → L in CAA35798. Ref.1 | ||||||||||||||||
| Sequence conflict | 2067 | 1 | F → C in AAV85766. Ref.6 | ||||||||||||||||
| Sequence conflict | 2080 | 1 | V → I in CAA35798. Ref.1 | ||||||||||||||||
| Sequence conflict | 2149 | 1 | V → I in CAA35798. Ref.1 | ||||||||||||||||
| Sequence conflict | 2388 | 1 | S → P in AAV85766. Ref.6 | ||||||||||||||||
Secondary structure | |||||||||||||||||||
Helix Strand Turn | |||||||||||||||||||
| Turn | 2091 – 2093 | 3 | |||||||||||||||||
| Helix | 2100 – 2109 | 10 | |||||||||||||||||
| Beta strand | 2119 – 2122 | 4 | |||||||||||||||||
| Beta strand | 2124 – 2127 | 4 | |||||||||||||||||
| Helix | 2128 – 2136 | 9 | |||||||||||||||||
| Beta strand | 2137 – 2140 | 4 | |||||||||||||||||
Sequences
| ||||||||||||||||||||||||||||||
References
| « Hide 'large scale' references | |
| [1] | "A DNA-binding protein containing two widely separated zinc finger motifs that recognize the same DNA sequence." Fan C.M., Maniatis T. Genes Dev. 4:29-42(1990) [PubMed: 2106471] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM 1), VARIANTS SER-1074; ASN-1170; ARG-1915 AND GLY-2692. |
| [2] | "Transcriptional regulator of genes involved in the control of cell growth or cell proliferation. Use of said regulator as a therapeutic or diagnostic agent." Tovey M. Patent number CA2448384, 12-DEC-2002 Cited for: NUCLEOTIDE SEQUENCE [MRNA] (ISOFORMS 2 AND 3), FUNCTION (ISOFORMS 2 AND 3), SUBCELLULAR LOCATION (ISOFORMS 2 AND 3). |
| [3] | "The DNA sequence and analysis of human chromosome 6." Mungall A.J., Palmer S.A., Sims S.K., Edwards C.A., Ashurst J.L., Wilming L., Jones M.C., Horton R., Hunt S.E., Scott C.E., Gilbert J.G.R., Clamp M.E., Bethel G., Milne S., Ainscough R., Almeida J.P., Ambrose K.D., Andrews T.D. Beck S.Nature 425:805-811(2003) [PubMed: 14574404] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. |
| [4] | "The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)." The MGC Project Team Genome Res. 14:2121-2127(2004) [PubMed: 15489334] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1), VARIANTS ALA-873 AND GLY-1520. |
| [5] | "A large protein containing zinc finger domains binds to related sequence elements in the enhancers of the class I major histocompatibility complex and kappa immunoglobulin genes." Baldwin A.S. Jr., LeClair K.P., Singh H., Sharp P.A. Mol. Cell. Biol. 10:1406-1414(1990) [PubMed: 2108316] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [MRNA] OF 817-2718 (ISOFORM 1), VARIANTS ALA-873; ILE-1609 AND GLY-2692. |
| [6] | Yu B., Mitchell G.A., Richter A. Submitted (JUL-2004) to the EMBL/GenBank/DDBJ databases Cited for: NUCLEOTIDE SEQUENCE [MRNA] OF 2035-2718 (ISOFORM 1). Tissue: Liver. |
| [7] | "Regulation of human immunodeficiency virus enhancer function by PRDII-BF1 and c-rel gene products." Muchardt C., Seeler J.S., Nirula A., Shurland D.L., Gaynor R.B. J. Virol. 66:244-250(1992) [PubMed: 1727488] [Abstract] Cited for: FUNCTION (ISOFORM 2), ALTERNATIVE SPLICING. |
| [8] | "Transcription factor PRDII-BF1 activates human immunodeficiency virus type 1 gene expression." Seeler J.S., Muchardt C., Suessle A., Gaynor R.B. J. Virol. 68:1002-1009(1994) [PubMed: 8289330] [Abstract] Cited for: FUNCTION (ISOFORM 1). |
| [9] | "GAAP-1: a transcriptional activator of p53 and IRF-1 possesses pro-apoptotic activity." Lallemand C., Palmieri M., Blanchard B., Meritet J.F., Tovey M.G. EMBO Rep. 3:153-158(2002) [PubMed: 11818340] [Abstract] Cited for: FUNCTION (ISOFORM 2), SUBCELLULAR LOCATION (ISOFORM 2), ALTERNATIVE SPLICING. |
| [10] | "Large-scale characterization of HeLa cell nuclear phosphoproteins." Beausoleil S.A., Jedrychowski M., Schwartz D., Elias J.E., Villen J., Li J., Cohn M.A., Cantley L.C., Gygi S.P. Proc. Natl. Acad. Sci. U.S.A. 101:12130-12135(2004) [PubMed: 15302935] [Abstract] Cited for: PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-1051, MASS SPECTROMETRY. Tissue: Epithelium. |
| [11] | "Combining protein-based IMAC, peptide-based IMAC, and MudPIT for efficient phosphoproteomic analysis." Cantin G.T., Yi W., Lu B., Park S.K., Xu T., Lee J.-D., Yates J.R. III J. Proteome Res. 7:1346-1351(2008) [PubMed: 18220336] [Abstract] Cited for: PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-1873 AND SER-1874, MASS SPECTROMETRY. |
| [12] | "A quantitative atlas of mitotic phosphorylation." Dephoure N., Zhou C., Villen J., Beausoleil S.A., Bakalarski C.E., Elledge S.J., Gygi S.P. Proc. Natl. Acad. Sci. U.S.A. 105:10762-10767(2008) [PubMed: 18669648] [Abstract] Cited for: PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT THR-429; SER-484; SER-492; SER-495; SER-1036; SER-1735; SER-1749 AND SER-1753, MASS SPECTROMETRY. |
| [13] | "High-resolution three-dimensional structure of a single zinc finger from a human enhancer binding protein in solution." Omichinski J.G., Clore G.M., Appella E., Sakaguchi K., Gronenborn A.M. Biochemistry 29:9324-9334(1990) [PubMed: 2248949] [Abstract] Cited for: STRUCTURE BY NMR OF 2114-2143. |
| [14] | "High-resolution solution structure of the double Cys2His2 zinc finger from the human enhancer binding protein MBP-1." Omichinski J.G., Clore G.M., Robien M., Sakaguchi K., Appella E., Gronenborn A.M. Biochemistry 31:3907-3917(1992) [PubMed: 1567844] [Abstract] Cited for: STRUCTURE BY NMR OF 2088-2143. |
Cross-references
Sequence databases | |||||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| X51435 mRNA. Translation: CAA35798.1. Frameshift. AL391828 Z98050 Genomic DNA. Translation: CAH73909.1. AL157373 Z98050 Genomic DNA. Translation: CAH73982.1. AL137221 Z98050 Genomic DNA. Translation: CAI14768.1. Z98050 AL391828 Genomic DNA. Translation: CAI21070.1. BC140816 mRNA. Translation: AAI40817.1. M32019 mRNA. Translation: AAA17534.1. AY673640 mRNA. Translation: AAV85766.1. | |||||||||||||||||||||||||
| IPI | IPI00783879. IPI00941780. IPI00942123. | ||||||||||||||||||||||||
| PIR | A34203. | ||||||||||||||||||||||||
| RefSeq | NP_002105.2. | ||||||||||||||||||||||||
| UniGene | Hs.567284 | ||||||||||||||||||||||||
3D structure databases | |||||||||||||||||||||||||
| |||||||||||||||||||||||||
| ModBase | Search... | ||||||||||||||||||||||||
Protein-protein interaction databases | |||||||||||||||||||||||||
| IntAct | P15822. 4 interactions. | ||||||||||||||||||||||||
| STRING | P15822. | ||||||||||||||||||||||||
PTM databases | |||||||||||||||||||||||||
| PhosphoSite | P15822. | ||||||||||||||||||||||||
Proteomic databases | |||||||||||||||||||||||||
| PRIDE | P15822. | ||||||||||||||||||||||||
Genome annotation databases | |||||||||||||||||||||||||
| Ensembl | ENST00000379382; ENSP00000368690; ENSG00000095951; Homo sapiens. [Genome view] ENST00000379388; ENSP00000368698; ENSG00000095951; Homo sapiens. [Genome view] ENST00000399469; ENSP00000382395; ENSG00000095951; Homo sapiens. [Genome view] ENST00000442081; ENSP00000409078; ENSG00000095951; Homo sapiens. [Genome view] | ||||||||||||||||||||||||
| GeneID | 3096. | ||||||||||||||||||||||||
| KEGG | hsa:3096. | ||||||||||||||||||||||||
| UCSC | uc003nac.1. human. | ||||||||||||||||||||||||
Organism-specific databases | |||||||||||||||||||||||||
| CTD | 3096. | ||||||||||||||||||||||||
| GeneCards | GC06P012120. | ||||||||||||||||||||||||
| HGNC | HGNC:4920. HIVEP1. | ||||||||||||||||||||||||
| MIM | 194540. gene. | ||||||||||||||||||||||||
| PharmGKB | PA29297. | ||||||||||||||||||||||||
| GenAtlas | Search... | ||||||||||||||||||||||||
Phylogenomic databases | |||||||||||||||||||||||||
| HOVERGEN | P15822. | ||||||||||||||||||||||||
Gene expression databases | |||||||||||||||||||||||||
| Bgee | P15822. | ||||||||||||||||||||||||
| CleanEx | HS_HIVEP1. | ||||||||||||||||||||||||
| Genevestigator | P15822. | ||||||||||||||||||||||||
| GermOnline | ENSG00000095951. Homo sapiens. | ||||||||||||||||||||||||
Family and domain databases | |||||||||||||||||||||||||
| InterPro | IPR007087. Znf_C2H2. IPR015880. Znf_C2H2-like. IPR013087. Znf_C2H2/integrase_DNA-bd. [Graphical view] | ||||||||||||||||||||||||
| Gene3D | G3DSA:3.30.160.60. Znf_C2H2/integrase_DNA-bd. 2 hits. | ||||||||||||||||||||||||
| Pfam | PF00096. zf-C2H2. 4 hits. [Graphical view] | ||||||||||||||||||||||||
| SMART | SM00355. ZnF_C2H2. 5 hits. [Graphical view] | ||||||||||||||||||||||||
| PROSITE | PS00028. ZINC_FINGER_C2H2_1. 4 hits. PS50157. ZINC_FINGER_C2H2_2. 4 hits. [Graphical view] | ||||||||||||||||||||||||
| ProtoNet | Search... | ||||||||||||||||||||||||
Other Resources | |||||||||||||||||||||||||
| NextBio | 12285. | ||||||||||||||||||||||||
| SOURCE | Search... | ||||||||||||||||||||||||
Entry information
| Entry name | ZEP1_HUMAN | ||||||||
| Accession | Primary (citable) accession number: P15822 Secondary accession number(s): B2RTU3 Q5VW60 | ||||||||
| Entry history |
| ||||||||
| Entry status | Reviewed (UniProtKB/Swiss-Prot) | ||||||||
| Annotation project | HPI (Human Proteome Initiative) | ||||||||
| Disclaimer | Any medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care. | ||||||||
Relevant documents
| Human chromosome 6 Human chromosome 6: entries, gene names and cross-references to MIM |
| Human entries with polymorphisms or disease mutations List of human entries with polymorphisms or disease mutations |
| Human polymorphisms and disease mutations Index of human polymorphisms and disease mutations |
| MIM cross-references Online Mendelian Inheritance in Man (MIM) cross-references in UniProtKB/Swiss-Prot |
| PDB cross-references Index of Protein Data Bank (PDB) cross-references |
| SIMILARITY comments Index of protein domains and families |

Clusters with


