
P31629 (ZEP2_HUMAN) Reviewed, UniProtKB/Swiss-Prot

Last modified July 9, 2014. Version 130.


Names and origin

Protein names
   Recommended name: Transcription factor HIVEP2
   Alternative name(s):
      Human immunodeficiency virus type I enhancer-binding protein 2 (short name: HIV-EP2)
      MHC-binding protein 2 (short name: MBP-2)
Gene names
   Name: HIVEP2
Organism: Homo sapiens (Human) [Reference proteome]
Taxonomic identifier: 9606 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Primates > Haplorrhini > Catarrhini > Hominidae > Homo

Protein attributes

Sequence length: 2446 AA.
Sequence status: Complete.
Protein existence: Evidence at protein level

General annotation (Comments)

Function

This protein specifically binds to the DNA sequence 5'-GGGACTTTCC-3', which is found in the enhancer elements of numerous viral promoters such as those of SV40, CMV, or HIV-1. In addition, related sequences are found in the enhancer elements of a number of cellular promoters, including those of the class I MHC, interleukin-2 receptor, somatostatin receptor II, and interferon-beta genes. It may act in T-cell activation.
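
As an illustration of what an exact match to this binding site looks like in practice, the sketch below scans a DNA string for 5'-GGGACTTTCC-3' on both strands. Only the 10-mer itself comes from this entry; the function names and the toy input sequence are invented for the example, and the scan is a plain exact-match search, so it does not cover the "related sequences" mentioned above.

```python
# Minimal sketch: exact-match scan for the binding site on both strands.
# SITE is taken from the entry; everything else is a hypothetical helper.

SITE = "GGGACTTTCC"


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def find_sites(dna, site=SITE):
    """Return sorted (1-based start, strand) pairs for exact matches."""
    dna = dna.upper()
    hits = []
    for strand, pattern in (("+", site), ("-", reverse_complement(site))):
        start = dna.find(pattern)
        while start != -1:
            hits.append((start + 1, strand))
            start = dna.find(pattern, start + 1)
    return sorted(hits)


# Toy sequence (made up): one forward site and one reverse-strand site.
toy_dna = "AAGGGACTTTCCTTGGAAAGTCCCAA"
hits = find_sites(toy_dna)
```

A real analysis would more likely use a position weight matrix so that near matches are scored rather than ignored.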

Subunit structure

Interacts with TCF4 (by similarity).

Subcellular location

Nucleus.

Tissue specificity

Expressed in brain and skeletal muscle. Ref.6

Induction

By mitogens and phorbol ester.

Sequence similarities

Contains 4 C2H2-type zinc fingers.

Sequence caution

The sequence AAB88218.1 differs from that shown. Reason: Erroneous initiation.

The sequence CAA46596.1 differs from that shown. Reason: Erroneous initiation.

Ontologies

Keywords
   Biological process: Transcription, Transcription regulation
   Cellular component: Nucleus
   Coding sequence diversity: Polymorphism
   Domain: Repeat, Zinc-finger
   Ligand: DNA-binding, Metal-binding, Zinc
   PTM: Isopeptide bond, Ubl conjugation
   Technical term: Complete proteome, Reference proteome
Gene Ontology (GO)
   Biological process
      regulation of transcription, DNA-templated. Inferred from electronic annotation. Source: UniProtKB-KW
      transcription, DNA-templated. Inferred from electronic annotation. Source: UniProtKB-KW
   Cellular component
      nucleus. Inferred from direct assay. Source: HPA
   Molecular function
      DNA binding. Traceable author statement Ref.1, Ref.5. Source: UniProtKB
      metal ion binding. Inferred from electronic annotation. Source: UniProtKB-KW


Sequence annotation (Features)

Feature key          Position(s)    Length   Description                                     Feature identifier

Molecule processing

Chain                1 – 2446       2446     Transcription factor HIVEP2                     PRO_0000047371

Regions

Repeat               2053 – 2056    4        1
Repeat               2059 – 2062    4        2
Repeat               2071 – 2074    4        3
Repeat               2083 – 2086    4        4
Repeat               2089 – 2092    4        5
Repeat               2106 – 2109    4        6
Repeat               2112 – 2115    4        7
Repeat               2118 – 2121    4        8
Repeat               2130 – 2133    4        9
Repeat               2145 – 2148    4        10
Zinc finger          189 – 211      23       C2H2-type 1
Zinc finger          217 – 239      23       C2H2-type 2
Zinc finger          1799 – 1821    23       C2H2-type 3
Zinc finger          1827 – 1851    25       C2H2-type 4
Region               2053 – 2148    96       10 X 4 AA tandem repeats of S-P-[RGMKC]-[RK]
Motif                937 – 943      7        Nuclear localization signal (Potential)
Compositional bias   950 – 982      33       Ser-rich
Compositional bias   1510 – 1586    77       Ser-rich
Compositional bias   1899 – 1923    25       Asp/Glu-rich (acidic)
Compositional bias   2073 – 2148    76       Arg-rich
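
The repeat annotation can be checked against the sequence itself. The sketch below is one way to do that: it takes residues 2041 to 2160 as printed in the Sequences section of this entry, extracts each annotated repeat span, and tests it against the consensus S-P-[RGMKC]-[RK] written as a regular expression. The variable and function names are invented for the example. It checks only the annotated spans, since a plain pattern scan over the region may also return hits that are not annotated as repeats.

```python
import re

# Residues 2041-2160, copied from the Sequences section of this entry.
FRAGMENT_START = 2041
FRAGMENT = (
    "PSSHHSSPGYDSSPCRDNSPKRYLIPKGDLSPRRHLSPRRDLSPMRHLSPRKEAALRREM"
    "SQRDVSPRRHLSPRRPVSPGKDITARRDLSPRRERRYMTTIRAPSPRRALYHNPPLSMGQ"
)

# Annotated repeat spans (1-based, inclusive), from the Regions table.
REPEATS = [
    (2053, 2056), (2059, 2062), (2071, 2074), (2083, 2086), (2089, 2092),
    (2106, 2109), (2112, 2115), (2118, 2121), (2130, 2133), (2145, 2148),
]

# Consensus S-P-[RGMKC]-[RK] expressed as a regular expression.
MOTIF = re.compile(r"SP[RGMKC][RK]")


def residues(start, end):
    """Return residues start..end (1-based, inclusive) from FRAGMENT."""
    return FRAGMENT[start - FRAGMENT_START:end - FRAGMENT_START + 1]


repeat_units = [residues(start, end) for start, end in REPEATS]
all_match = all(MOTIF.fullmatch(unit) for unit in repeat_units)
```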

Amino acid modifications

Cross-link           2092                    Glycyl lysine isopeptide (Lys-Gly) (interchain with G-Cter in ubiquitin) Ref.7

Natural variations

Natural variant      46             1        R → Q. Corresponds to variant rs17072013 [dbSNP | Ensembl].     VAR_052754
Natural variant      1041           1        A → V. Corresponds to variant rs34875559 [dbSNP | Ensembl].     VAR_052755
Natural variant      1293           1        L → I. Corresponds to variant rs35675714 [dbSNP | Ensembl].     VAR_052756
Natural variant      1538           1        L → P. Ref.1 Ref.2 Ref.3 Corresponds to variant rs109836 [dbSNP | Ensembl].     VAR_052757
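
To make the variant notation concrete, here is a minimal sketch that applies the R → Q substitution at position 46 to the first 60 residues of the sequence shown in this entry. The helper name `apply_variant` is hypothetical; the function simply confirms that the reference residue at the 1-based position matches before substituting it.

```python
# First 60 residues, copied from the Sequences section of this entry.
N_TERM = "MDTGDTALGQKATSRSGETDKASGRWRQEQSAVIKMSTFGSHEGQRQPQIEPEQIGNTAS"


def apply_variant(seq, position, ref, alt):
    """Substitute one residue at a 1-based position after checking ref."""
    found = seq[position - 1]
    if found != ref:
        raise ValueError(f"expected {ref} at {position}, found {found}")
    return seq[:position - 1] + alt + seq[position:]


# Natural variant VAR_052754: R -> Q at position 46.
variant_seq = apply_variant(N_TERM, 46, "R", "Q")
```

The same check works for the other listed variants if the corresponding stretch of sequence is supplied.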

Experimental info

Sequence conflict    2091           1        R → T in CAA46596. Ref.2

Sequences

P31629 [UniParc]. Length: 2,446 AA. Mass: 269,053 Da.

Last modified December 6, 2005. Version 2.
Checksum: 482E2C577EF9449A

        10         20         30         40         50         60 
MDTGDTALGQ KATSRSGETD KASGRWRQEQ SAVIKMSTFG SHEGQRQPQI EPEQIGNTAS 

        70         80         90        100        110        120 
AQLFGSGKLA SPSEVVQQVA EKQYPPHRPS PYSCQHSLSF PQHSLPQGVM HSTKPHQSLE 

       130        140        150        160        170        180 
GPPWLFPGPL PSVASEDLFP FPIHGHSGGY PRKKISSLNP AYSQYSQKSI EQAEEAHKKE 

       190        200        210        220        230        240 
HKPKKPGKYI CPYCSRACAK PSVLKKHIRS HTGERPYPCI PCGFSFKTKS NLYKHRKSHA 

       250        260        270        280        290        300 
HAIKAGLVPF TESAVSKLDL EAGFIDVEAE IHSDGEQSTD TDEESSLFAE ASDKMSPGPP 

       310        320        330        340        350        360 
IPLDIASRGG YHGSLEESLG GPMKVPILII PKSGIPLPNE SSQYIGPDML PNPSLNTKAD 

       370        380        390        400        410        420 
DSHTVKQKLA LRLSEKKGQD SEPSLNLLSP HSKGSTDSGY FSRSESAEQQ ISPPNTNAKS 

       430        440        450        460        470        480 
YEEIIFGKYC RLSPRNALSV TTTSQERAAM GRKGIMEPLP HVNTRLDVKM FEDPVSQLIP 

       490        500        510        520        530        540 
SKGDVDPSQT SMLKSTKFNS ESRQPQIIPS SIRNEGKLYP ANFQGSNPVL LEAPVDSSPL 

       550        560        570        580        590        600 
IRSNSVPTSS ATNLTIPPSL RGSHSFDERM TGSDDVFYPG TVGIPPQRML RRQAAFELPS 

       610        620        630        640        650        660 
VQEGHVEVEH HGRMLKGISS SSLKEKKLSP GDRVGYDYDV CRKPYKKWED SETPKQNYRD 

       670        680        690        700        710        720 
ISCLSSLKHG GEYFMDPVVP LQGVPSMFGT TCENRKRRKE KSVGDEEDTP MICSSIVSTP 

       730        740        750        760        770        780 
VGIMASDYDP KLQMQEGVRS GFAMAGHENL SHGHTERFDP CRPQLQPGSP SLVSEESPSA 

       790        800        810        820        830        840 
IDSDKMSDLG GRKPPGNVIS VIQHTNSLSR PNSFERSESA ELVACTQDKA PSPSETCDSE 

       850        860        870        880        890        900 
ISEAPVSPEW APPGDGAESG GKPSPSQQVQ QQSYHTQPRL VRQHNIQVPE IRVTEEPDKP 

       910        920        930        940        950        960 
EKEKEAQSKE PEKPVEEFQW PQRSETLSQL PAEKLPPKKK RLRLADMEHS SGESSFESTG 

       970        980        990       1000       1010       1020 
TGLSRSPSQE SNLSHSSSFS MSFEREETSK LSALPKQDEF GKHSEFLTVP AGSYSLSVPG 

      1030       1040       1050       1060       1070       1080 
HHHQKEMRRC SSEQMPCPHP AEVPEVRSKS FDYGNLSHAP VSGAAASTVS PSRERKKCFL 

      1090       1100       1110       1120       1130       1140 
VRQASFSGSP EISQGEVGMD QSVKQEQLEH LHAGLRSGWH HGPPAVLPPL QQEDPGKQVA 

      1150       1160       1170       1180       1190       1200 
GPCPPLSSGP LHLAQPQIMH MDSQESLRNP LIQPTSYMTS KHLPEQPHLF PHQETIPFSP 

      1210       1220       1230       1240       1250       1260 
IQNALFQFQY PTVCMVHLPA QQPPWWQAHF PHPFAQHPQK SYGKPSFQTE IHSSYPLEHV 

      1270       1280       1290       1300       1310       1320 
AEHTGKKPAE YAHTKEQTYP CYSGASGLHP KNLLPKFPSD QSSKSTETPS EQVLQEDFAS 

      1330       1340       1350       1360       1370       1380 
ANAGSLQSLP GTVVPVRIQT HVPSYGSVMY TSISQILGQN SPAIVICKVD ENMTQRTLVT 

      1390       1400       1410       1420       1430       1440 
NAAMQGIGFN IAQVLGQHAG LEKYPIWKAP QTLPLGLESS IPLCLPSTSD SVATLGGSKR 

      1450       1460       1470       1480       1490       1500 
MLSPASSLEL FMETKQQKRV KEEKMYGQIV EELSAVELTN SDIKKDLSRP QKPQLVRQGC 

      1510       1520       1530       1540       1550       1560 
ASEPKDGLQS GSSSFSSLSP SSSQDYPSVS PSSREPFLPS KEMLSGSRAP LPGQKSSGPS 

      1570       1580       1590       1600       1610       1620 
ESKESSDELD IDETASDMSM SPQSSSLPAG DGQLEEEGKG HKRPVGMLVR MASAPSGNVA 

      1630       1640       1650       1660       1670       1680 
DSTLLLTDMA DFQQILQFPS LRTTTTVSWC FLNYTKPNYV QQATFKSSVY ASWCISSCNP 

      1690       1700       1710       1720       1730       1740 
NPSGLNTKTT LALLRSKQKI TAEIYTLAAM HRPGTGKLTS SSAWKQFTQM KPDASFLFGS 

      1750       1760       1770       1780       1790       1800 
KLERKLVGNI LKERGKGDIH GDKDIGSKQT EPIRIKIFEG GYKSNEDYVY VRGRGRGKYI 

      1810       1820       1830       1840       1850       1860 
CEECGIRCKK PSMLKKHIRT HTDVRPYVCK LCNFAFKTKG NLTKHMKSKA HMKKCLELGV 

      1870       1880       1890       1900       1910       1920 
SMTSVDDTET EEAENLEDLH KAAEKHSMSS ISTDHQFSDA EESDGEDGDD NDDDDEDEDD 

      1930       1940       1950       1960       1970       1980 
FDDQGDLTPK TRSRSTSPQP PRFSSLPVNV GAVPHGVPSD SSLGHSSLIS YLVTLPSIRV 

      1990       2000       2010       2020       2030       2040 
TQLMTPSDSC EDTQMTEYQR LFQSKSTDSE PDKDRLDIPS CMDEECMLPS EPSSSPRDFS 

      2050       2060       2070       2080       2090       2100 
PSSHHSSPGY DSSPCRDNSP KRYLIPKGDL SPRRHLSPRR DLSPMRHLSP RKEAALRREM 

      2110       2120       2130       2140       2150       2160 
SQRDVSPRRH LSPRRPVSPG KDITARRDLS PRRERRYMTT IRAPSPRRAL YHNPPLSMGQ 

      2170       2180       2190       2200       2210       2220 
YLQAEPIVLG PPNLRRGLPQ VPYFSLYGDQ EGAYEHPGSS LFPEGPNDYV FSHLPLHSQQ 

      2230       2240       2250       2260       2270       2280 
QVRAPIPMVP VGGIQMVHSM PPALSSLHPS PTLPLPMEGF EEKKGASGES FSKDPYVLSK 

      2290       2300       2310       2320       2330       2340 
QHEKRGPHAL QSSGPPSTPS SPRLLMKQST SEDSLNATER EQEENIQTCT KAIASLRIAT 

      2350       2360       2370       2380       2390       2400 
EEAALLGPDQ PARVQEPHQN PLGSAHVSIR HFSRPEPGQP CTSATHPDLH DGEKDNFGTS 

      2410       2420       2430       2440 
QTPLAHSTFY SKSCVDDKQL DFHSSKELSS STEESKDPSS EKSQLH 
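
The sequence above is printed in blocks of ten residues under a position ruler. The sketch below shows one possible way to turn that layout into a plain string and then into FASTA-style text. It is run here on the first two rows only (residues 1 to 120); the function names and the header text are invented for the example.

```python
import re

# First two rows of the sequence block above (residues 1-120).
EXCERPT = """
        10         20         30         40         50         60
MDTGDTALGQ KATSRSGETD KASGRWRQEQ SAVIKMSTFG SHEGQRQPQI EPEQIGNTAS

        70         80         90        100        110        120
AQLFGSGKLA SPSEVVQQVA EKQYPPHRPS PYSCQHSLSF PQHSLPQGVM HSTKPHQSLE
"""


def parse_numbered_block(text):
    """Drop ruler lines and blanks, then join the residue groups."""
    chunks = []
    for line in text.splitlines():
        stripped = line.strip()
        # Skip empty lines and ruler lines made only of digits and spaces.
        if not stripped or re.fullmatch(r"[\d\s]+", stripped):
            continue
        chunks.append(stripped.replace(" ", ""))
    return "".join(chunks)


def to_fasta(header, sequence, width=60):
    """Format a sequence as FASTA text with fixed-width rows."""
    rows = [sequence[i:i + width] for i in range(0, len(sequence), width)]
    return ">" + header + "\n" + "\n".join(rows) + "\n"


sequence = parse_numbered_block(EXCERPT)
fasta = to_fasta("P31629 residues 1-120 (excerpt)", sequence)
```

Applied to the full block, the parsed string should have the length stated for this entry (2446 residues), which gives a quick sanity check on the parse.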


References

[1] "HIV-EP2, a new member of the gene family encoding the human immunodeficiency virus type 1 enhancer-binding protein. Comparison with HIV-EP1/PRDII-BF1/MBP-1."
Nomura N., Zhao M.-J., Nagase T., Maekawa T., Ishizaki R., Tabata S., Ishii S.
J. Biol. Chem. 266:8590-8594(1991)
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA], VARIANT PRO-1538.
[2] "Structure and expression of MBP-2: a 275 kDa zinc finger protein that binds to an enhancer of major histocompatibility complex class 1 genes."
Van't Veer L.J., Lutz P., Isselbacher K.J., Bernards R.
Proc. Natl. Acad. Sci. U.S.A. 89:8971-8975(1992)
Cited for: NUCLEOTIDE SEQUENCE [MRNA], VARIANT PRO-1538.
[3] "Characterization of the human MBP-2/HIV-EP2 gene: identification of multiple promoters and alternative splicing of 5' untranslated region."
Kukita Y., Komiya T., Tahira T., Asakawa S., Shimizu N., Suzuki Y., Sugano S., Hayashi K.
Submitted (MAY-1999) to the EMBL/GenBank/DDBJ databases
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA], VARIANT PRO-1538.
[4] "The DNA sequence and analysis of human chromosome 6."
Mungall A.J., Palmer S.A., Sims S.K., Edwards C.A., Ashurst J.L., Wilming L., Jones M.C., Horton R., Hunt S.E., Scott C.E., Gilbert J.G.R., Clamp M.E., Bethel G., Milne S., Ainscough R., Almeida J.P., Ambrose K.D., Andrews T.D., Ashwell R.I.S., Babbage A.K., Bagguley C.L., Bailey J., Banerjee R., Barker D.J., Barlow K.F., Bates K., Beare D.M., Beasley H., Beasley O., Bird C.P., Blakey S.E., Bray-Allen S., Brook J., Brown A.J., Brown J.Y., Burford D.C., Burrill W., Burton J., Carder C., Carter N.P., Chapman J.C., Clark S.Y., Clark G., Clee C.M., Clegg S., Cobley V., Collier R.E., Collins J.E., Colman L.K., Corby N.R., Coville G.J., Culley K.M., Dhami P., Davies J., Dunn M., Earthrowl M.E., Ellington A.E., Evans K.A., Faulkner L., Francis M.D., Frankish A., Frankland J., French L., Garner P., Garnett J., Ghori M.J., Gilby L.M., Gillson C.J., Glithero R.J., Grafham D.V., Grant M., Gribble S., Griffiths C., Griffiths M.N.D., Hall R., Halls K.S., Hammond S., Harley J.L., Hart E.A., Heath P.D., Heathcott R., Holmes S.J., Howden P.J., Howe K.L., Howell G.R., Huckle E., Humphray S.J., Humphries M.D., Hunt A.R., Johnson C.M., Joy A.A., Kay M., Keenan S.J., Kimberley A.M., King A., Laird G.K., Langford C., Lawlor S., Leongamornlert D.A., Leversha M., Lloyd C.R., Lloyd D.M., Loveland J.E., Lovell J., Martin S., Mashreghi-Mohammadi M., Maslen G.L., Matthews L., McCann O.T., McLaren S.J., McLay K., McMurray A., Moore M.J.F., Mullikin J.C., Niblett D., Nickerson T., Novik K.L., Oliver K., Overton-Larty E.K., Parker A., Patel R., Pearce A.V., Peck A.I., Phillimore B.J.C.T., Phillips S., Plumb R.W., Porter K.M., Ramsey Y., Ranby S.A., Rice C.M., Ross M.T., Searle S.M., Sehra H.K., Sheridan E., Skuce C.D., Smith S., Smith M., Spraggon L., Squares S.L., Steward C.A., Sycamore N., Tamlyn-Hall G., Tester J., Theaker A.J., Thomas D.W., Thorpe A., Tracey A., Tromans A., Tubby B., Wall M., Wallis J.M., West A.P., White S.S., Whitehead S.L., Whittaker H., Wild A., Willey D.J., Wilmer T.E., Wood J.M., Wray P.W., Wyatt J.C., Young L., Younger R.M., Bentley D.R., Coulson A., Durbin R.M., Hubbard T., Sulston J.E., Dunham I., Rogers J., Beck S.
Nature 425:805-811(2003)
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
[5] "Two genes encode factors with NF-kappa B- and H2TF1-like DNA-binding properties."
Rustgi A.K., Van't Veer L.J., Bernards R.
Proc. Natl. Acad. Sci. U.S.A. 87:8707-8710(1990)
Cited for: NUCLEOTIDE SEQUENCE [MRNA] OF 1797-1936.
[6] "Activation of somatostatin receptor II expression by transcription factors MIBP1 and SEF-2 in the murine brain."
Doerflinger U., Pscherer A., Moser M., Ruemmele P., Schuele R., Buettner R.
Mol. Cell. Biol. 19:3736-3747(1999)
Cited for: TISSUE SPECIFICITY.
Tissue: Brain.
[7] "Tryptic digestion of ubiquitin standards reveals an improved strategy for identifying ubiquitinated proteins by mass spectrometry."
Denis N.J., Vasilescu J., Lambert J.-P., Smith J.C., Figeys D.
Proteomics 7:868-874(2007)
Cited for: UBIQUITINATION [LARGE SCALE ANALYSIS] AT LYS-2092.
Tissue: Mammary cancer.
[8] "Quantitative phosphoproteomic analysis of T cell receptor signaling reveals system-wide modulation of protein-protein interactions."
Mayya V., Lundgren D.H., Hwang S.-I., Rezaul K., Wu L., Eng J.K., Rodionov V., Han D.K.
Sci. Signal. 2:RA46-RA46(2009)
Cited for: IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
Tissue: Leukemic T-cell.
[9] "N-terminal acetylome analyses and functional insights of the N-terminal acetyltransferase NatB."
Van Damme P., Lasa M., Polevoda B., Gazquez C., Elosegui-Artola A., Kim D.S., De Juan-Pardo E., Demeyer K., Hole K., Larrea E., Timmerman E., Prieto J., Arnesen T., Sherman F., Gevaert K., Aldabe R.
Proc. Natl. Acad. Sci. U.S.A. 109:12449-12454(2012)
Cited for: IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].

Cross-references

Sequence databases

EMBL / GenBank / DDBJ
   M60119 Genomic DNA. Translation: AAB88218.1. Different initiation.
   X65644 mRNA. Translation: CAA46596.1. Different initiation.
   AF153836 Genomic DNA. Translation: AAF81365.1.
   AL023584 Genomic DNA. Translation: CAA19042.1.
   M61744 mRNA. Translation: AAA36202.1.
CCDS: CCDS43510.1.
PIR: WMHUE2. S26661.
RefSeq: NP_006725.3. NM_006734.3.
   XP_005267014.1. XM_005266957.1.
UniGene: Hs.510172.

3D structure databases

ProteinModelPortal: P31629.
SMR: P31629. Positions 188-244, 1798-1854.

Protein-protein interaction databases

BioGrid: 109344. 9 interactions.
IntAct: P31629. 12 interactions.
MINT: MINT-7027293.
STRING: 9606.ENSP00000012134.

PTM databases

PhosphoSite: P31629.

Polymorphism databases

DMDM: 83305815.

Proteomic databases

MaxQB: P31629.
PaxDb: P31629.
PRIDE: P31629.

Protocols and materials databases

DNASU: 3097.

Genome annotation databases

Ensembl: ENST00000012134; ENSP00000012134; ENSG00000010818.
   ENST00000367603; ENSP00000356575; ENSG00000010818.
   ENST00000367604; ENSP00000356576; ENSG00000010818.
GeneID: 3097.
KEGG: hsa:3097.
UCSC: uc003qjd.3. human.

Organism-specific databases

CTD: 3097.
GeneCards: GC06M143114.
H-InvDB: HIX0032889.
HGNC: HGNC:4921. HIVEP2.
HPA: HPA055954.
MIM: 143054. gene.
neXtProt: NX_P31629.
PharmGKB: PA29298.

Phylogenomic databases

eggNOG: NOG296349.
HOGENOM: HOG000155774.
HOVERGEN: HBG007119.
InParanoid: P31629.
KO: K09239.
OMA: RKKCFLV.
OrthoDB: EOG7V1FPQ.
PhylomeDB: P31629.
TreeFam: TF331837.

Gene expression databases

ArrayExpress: P31629.
Bgee: P31629.
CleanEx: HS_HIVEP2.
Genevestigator: P31629.

Family and domain databases

Gene3D: 3.30.160.60. 4 hits.
InterPro: IPR007087. Znf_C2H2.
   IPR015880. Znf_C2H2-like.
   IPR013087. Znf_C2H2/integrase_DNA-bd.
SMART: SM00355. ZnF_C2H2. 4 hits.
PROSITE: PS00028. ZINC_FINGER_C2H2_1. 4 hits.
   PS50157. ZINC_FINGER_C2H2_2. 4 hits.

Other

GeneWiki: HIVEP2.
GenomeRNAi: 3097.
NextBio: 12289.
PRO: P31629.

Entry information

Entry name: ZEP2_HUMAN
Accession
   Primary (citable) accession number: P31629
   Secondary accession number(s): Q02646, Q5THT5, Q9NS05
Entry history
   Integrated into UniProtKB/Swiss-Prot: July 1, 1993
   Last sequence update: December 6, 2005
   Last modified: July 9, 2014
   This is version 130 of the entry and version 2 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Chordata Protein Annotation Program
Disclaimer: Any medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care.

Relevant documents

SIMILARITY comments

Index of protein domains and families

MIM cross-references

Online Mendelian Inheritance in Man (MIM) cross-references in UniProtKB/Swiss-Prot

Human polymorphisms and disease mutations

Index of human polymorphisms and disease mutations

Human entries with polymorphisms or disease mutations

List of human entries with polymorphisms or disease mutations

Human chromosome 6

Human chromosome 6: entries, gene names and cross-references to MIM