Protein

Transcription factor HIVEP2

Gene

Hivep2

Organism
Mus musculus (Mouse)
Status
Reviewed - Annotation score: 5 out of 5 - Experimental evidence at protein level

Function

Specifically binds to the DNA sequence 5'-GGGACTTTCC-3', which is found in the enhancer elements of numerous viral promoters such as those of SV40, CMV, or HIV1. In addition, related sequences are found in the enhancer elements of a number of cellular promoters, including those of the class I MHC, interleukin-2 receptor, somatostatin receptor II, and interferon-beta genes. It may act in T-cell activation (By similarity). Evidence: 1 publication.
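
As a minimal sketch of what the sequence-specific binding described above means for a search, the following Python scans a DNA string for the 5'-GGGACTTTCC-3' site on both strands. The function names and the toy inputs are assumptions made for illustration and are not part of the entry. An exact-match scan like this one does not cover the related sequences the entry mentions.

```python
# Illustrative only: exact-match scan for the enhancer site quoted in the entry.
SITE = "GGGACTTTCC"
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of an unambiguous DNA string."""
    return dna.upper().translate(COMPLEMENT)[::-1]


def find_sites(dna: str, site: str = SITE) -> list[tuple[int, str]]:
    """Return (1-based start, strand) for every exact occurrence of the site.

    A "-" hit means the site lies on the reverse strand; its start is given
    in forward-strand coordinates.
    """
    dna = dna.upper()
    hits = []
    for strand, pattern in (("+", site), ("-", reverse_complement(site))):
        start = dna.find(pattern)
        while start != -1:
            hits.append((start + 1, strand))
            start = dna.find(pattern, start + 1)
    return sorted(hits)


if __name__ == "__main__":
    # Toy sequence (hypothetical), not taken from any real promoter.
    print(find_sites("AAGGGACTTTCCTT"))
```

Reporting the strand together with the position keeps reverse-strand hits from being mistaken for forward-strand ones.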

Regions

Feature key    Position(s)    Description    Evidence                      Length
Zinc finger    189 – 211      C2H2-type 1    PROSITE-ProRule annotation    23
Zinc finger    217 – 239      C2H2-type 2    PROSITE-ProRule annotation    23
Zinc finger    1783 – 1805    C2H2-type 3    PROSITE-ProRule annotation    23
Zinc finger    1811 – 1835    C2H2-type 4    PROSITE-ProRule annotation    25

GO - Molecular function

  • metal ion binding Source: MGI
  • sequence-specific DNA binding Source: GO_Central
  • transcription factor activity, sequence-specific DNA binding Source: MGI
  • transcription regulatory region DNA binding Source: GO_Central

GO - Biological process

  • multicellular organism development Source: GO_Central
  • signal transduction Source: MGI
  • transcription, DNA-templated Source: MGI
  • transcription from RNA polymerase II promoter Source: GO_Central

Keywords - Biological process

Transcription, Transcription regulation

Keywords - Ligand

DNA-binding, Metal-binding, Zinc

Names & Taxonomy

Protein names
Recommended name: Transcription factor HIVEP2
Alternative name(s):
Human immunodeficiency virus type I enhancer-binding protein 2 homolog
Myc intron-binding protein 1
Short name: MIBP-1
Gene names
Name: Hivep2
Synonyms: Mibp1
Organism: Mus musculus (Mouse)
Taxonomic identifier: 10090 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Glires > Rodentia > Sciurognathi > Muroidea > Muridae > Murinae > Mus > Mus
Proteomes
  • UP000000589 Component: Chromosome 10

Organism-specific databases

MGI: MGI:1338076. Hivep2.

Subcellular location

GO - Cellular component

  • nucleoplasm Source: MGI
  • nucleus Source: GO_Central
  • transcription factor complex Source: MGI

Keywords - Cellular component

Nucleus

PTM / Processing

Molecule processing

Feature key    Position(s)    Description                    Feature identifier    Length
Chain          1 – 2430       Transcription factor HIVEP2    PRO_0000047372        2430

Amino acid modifications

Feature key         Position(s)    Description      Evidence            Length
Modified residue    811            Phosphoserine    By similarity       1
Modified residue    942            Phosphoserine    Combined sources    1
Modified residue    947            Phosphoserine    Combined sources    1
Modified residue    1040           Phosphoserine    Combined sources    1
Modified residue    1431           Phosphoserine    Combined sources    1
Modified residue    1435           Phosphoserine    Combined sources    1
Modified residue    2102           Phosphoserine    By similarity       1
Modified residue    2281           Phosphoserine    By similarity       1
Modified residue    2285           Phosphoserine    By similarity       1
Modified residue    2413           Phosphoserine    Combined sources    1
Modified residue    2415           Phosphoserine    Combined sources    1
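
A phosphoserine annotation is only coherent if the residue at that position is a serine. The sketch below checks this for two of the listed sites against residues 901 to 950, copied from the Sequence section of this entry. The helper names are hypothetical, and the check covers only positions that fall inside the chosen fragment.

```python
# Residues 901-950 as shown in the entry's sequence display (spaces removed).
FRAGMENT_START = 901
FRAGMENT = "KEPEKPVEEFQWPQRSETLSQLPAEKLPPKKKRLRLADLEHSSGESSFES"


def residue_at(position: int, fragment: str = FRAGMENT,
               start: int = FRAGMENT_START) -> str:
    """Return the residue at a 1-based protein position inside the fragment."""
    index = position - start
    if not 0 <= index < len(fragment):
        raise IndexError(f"position {position} is outside the fragment")
    return fragment[index]


def non_serine_positions(positions: list[int]) -> list[int]:
    """Return the positions whose residue is not serine (S)."""
    return [p for p in positions if residue_at(p) != "S"]


if __name__ == "__main__":
    # 942 and 947 are the annotated phosphoserines that fall in this fragment.
    print(non_serine_positions([942, 947]))
```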

Keywords - PTM

Phosphoprotein

Proteomic databases

PaxDb: Q3UHF7.
PRIDE: Q3UHF7.

PTM databases

iPTMnet: Q3UHF7.
PhosphoSitePlus: Q3UHF7.

Expression

Tissue specificity

Expressed in heart, lung, skeletal muscle and liver. In the brain, it is expressed in the cerebral cortex, hippocampus, corpora amygdala and cerebellar cortex. Evidence: 1 publication.

Developmental stage

At E13.5 and E15.5, expressed in the anterior neural tube over the primordial frontal cortex, spinal cord, dorsal root ganglia and developing skeletal muscle. Evidence: 1 publication.

Gene expression databases

Bgee: ENSMUSG00000015501.
ExpressionAtlas: Q3UHF7. baseline and differential.
Genevisible: Q3UHF7. MM.

Interaction

Subunit structure

Interacts with TCF4. Evidence: 1 publication.

Protein-protein interaction databases

BioGrid: 200314. 1 interactor.
STRING: 10090.ENSMUSP00000015645.

Structure

3D structure databases

ProteinModelPortal: Q3UHF7.
SMR: Q3UHF7.

Family & Domains

Domains and Repeats

Feature key    Position(s)    Description    Length
Repeat         2037 – 2040    1              4
Repeat         2043 – 2046    2              4
Repeat         2055 – 2058    3              4
Repeat         2067 – 2070    4              4
Repeat         2073 – 2076    5              4
Repeat         2090 – 2093    6              4
Repeat         2096 – 2099    7              4
Repeat         2102 – 2105    8              4
Repeat         2114 – 2117    9              4
Repeat         2129 – 2132    10             4

Region

Feature key    Position(s)    Description                                     Length
Region         2037 – 2132    10 X 4 AA tandem repeats of S-P-[RGMKC]-[RK]    96
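
The repeat consensus S-P-[RGMKC]-[RK] can be read as a regular expression. The sketch below scans residues 2037 to 2132, copied from the Sequence section, and converts each hit to a 1-based start position. The variable and function names are assumptions for illustration. Under this naive scan every annotated repeat is recovered, and one further hit appears at 2061 – 2064 that the entry does not list, so a pattern match alone should not be read as an annotated repeat.

```python
import re

# Residues 2037-2132 of the entry's sequence (spaces and ruler numbers removed).
REGION_START = 2037
REGION = (
    "SPCRDNSPKRYLIPKGDLSPRRHLSPRRDLSPMRHLSPRKEAALRREMSQGDAS"
    "PRRHLSPRRPLSPGKDITARRDLSPRRERRYMTTIRAPSPRR"
)

# Start positions of the ten repeats listed under "Domains and Repeats".
ANNOTATED_STARTS = [2037, 2043, 2055, 2067, 2073, 2090, 2096, 2102, 2114, 2129]

# S-P-[RGMKC]-[RK] written as a regular expression.
MOTIF = re.compile(r"SP[RGMKC][RK]")


def find_repeat_starts(region: str = REGION, start: int = REGION_START) -> list[int]:
    """Return the 1-based start position of every motif hit in the region."""
    return [start + match.start() for match in MOTIF.finditer(region)]


if __name__ == "__main__":
    found = find_repeat_starts()
    print("hits:", found)
    print("not annotated:", sorted(set(found) - set(ANNOTATED_STARTS)))
```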

Motif

Feature key    Position(s)    Description                    Evidence             Length
Motif          929 – 935      Nuclear localization signal    Sequence analysis    7

Compositional bias

Feature key           Position(s)    Description              Length
Compositional bias    942 – 974      Ser-rich                 33
Compositional bias    1499 – 1569    Ser-rich                 71
Compositional bias    1883 – 1907    Asp/Glu-rich (acidic)    25
Compositional bias    2057 – 2132    Arg-rich                 76
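
A compositional bias can be made concrete by counting residues. The sketch below computes the Asp/Glu fraction of residues 1883 to 1907, copied from the Sequence section. The function name is hypothetical, and the entry states no numeric threshold for calling a region "rich", so none is applied here.

```python
# Residues 1883-1907 of the entry's sequence (the annotated Asp/Glu-rich region).
ACIDIC_SEGMENT = "DAEESDGEDGDDNDDDDEDDDDFDD"


def residue_fraction(segment: str, residues: str) -> float:
    """Return the fraction of positions in segment occupied by any of residues."""
    if not segment:
        raise ValueError("segment must not be empty")
    return sum(1 for aa in segment if aa in residues) / len(segment)


if __name__ == "__main__":
    fraction = residue_fraction(ACIDIC_SEGMENT, "DE")
    print(f"Asp/Glu fraction over {len(ACIDIC_SEGMENT)} residues: {fraction:.2f}")
```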

Sequence similarities

Contains 4 C2H2-type zinc fingers (PROSITE-ProRule annotation).
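
To illustrate what a C2H2-type zinc finger looks like at the sequence level, the sketch below tests the first two annotated fingers (189 – 211 and 217 – 239, residues copied from the Sequence section) against a regular expression. The expression is an assumed transcription of the PROSITE pattern PS00028, C-x(2,4)-C-x(3)-[LIVMFYWC]-x(8)-H-x(3,5)-H, written from memory and not taken from this entry. The entry's own annotation is rule-based, so this is not necessarily the method that produced it.

```python
import re

# Assumed regex form of PROSITE PS00028 (C2H2 zinc finger pattern).
C2H2 = re.compile(r"C.{2,4}C.{3}[LIVMFYWC].{8}H.{3,5}H")

# First two annotated zinc fingers, keyed by their 1-based start position.
FINGERS = {
    189: "YICPYCSRACAKPSVLKKHIRSH",
    217: "YPCIPCGFSFKTKSNLYKHRKSH",
}


def match_span(start: int, segment: str):
    """Return the 1-based (first, last) span of the pattern match, or None."""
    match = C2H2.search(segment)
    if match is None:
        return None
    return (start + match.start(), start + match.end() - 1)


if __name__ == "__main__":
    for start, segment in FINGERS.items():
        print(start, match_span(start, segment))
```

In both segments the match begins at the first cysteine, two residues after the annotated start of the finger, and ends on the annotated last residue.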

Keywords - Domain

Repeat, Zinc-finger

Phylogenomic databases

eggNOG: KOG1721. Eukaryota.
eggNOG: COG5048. LUCA.
GeneTree: ENSGT00530000063161.
HOGENOM: HOG000155774.
HOVERGEN: HBG007119.
InParanoid: Q3UHF7.
KO: K09239.
OMA: RKKCFLV.
OrthoDB: EOG091G004W.
PhylomeDB: Q3UHF7.
TreeFam: TF331837.

Family and domain databases

Gene3D: 3.30.160.60. 4 hits.
InterPro: IPR007087. Znf_C2H2.
InterPro: IPR015880. Znf_C2H2-like.
InterPro: IPR013087. Znf_C2H2/integrase_DNA-bd.
SMART: SM00355. ZnF_C2H2. 4 hits.
PROSITE: PS00028. ZINC_FINGER_C2H2_1. 4 hits.
PROSITE: PS50157. ZINC_FINGER_C2H2_2. 4 hits.

Sequence

Sequence status: Complete.

Q3UHF7-1

        10         20         30         40         50
MDTGDTALGQ KATSRSGETD SVSGRWRQEQ SAVLKMSTFS SQEGPRQPQI
60 70 80 90 100
DPEQIGNAAS AQLFGSGKLA SPGEGLHQVT EKQYPPHRPS PYPCQHSLSF
110 120 130 140 150
PQHSLSQGMT HSHKPHQSLE GPPWLFPGPL PSVASEDLFP FPMHGHSGGY
160 170 180 190 200
PRKKISNLNP AYSQYSQKSI EQAEDAHKKE HKPKKPGKYI CPYCSRACAK
210 220 230 240 250
PSVLKKHIRS HTGERPYPCI PCGFSFKTKS NLYKHRKSHA HAIKAGLVPF
260 270 280 290 300
TESSVSKLDL EAGFIDVEAE IHSDGEQSTD TDEESSLFAE ASDKVSPGPP
310 320 330 340 350
VPLDIASRGG YHGSLEESLG GPMKVPILII PKSGIPLASE GSQYLSSEML
360 370 380 390 400
PNPSLNAKAD DSHTVKQKLA LRLSEKKGQD SEPSLNLLSP HSKGSTDSGY
410 420 430 440 450
FSRSESAEQQ ISPPNTNAKS YEEIIFGKYC RLSPRNTLSV TPTGQERTAM
460 470 480 490 500
GRRGIMEPLP HLNTRLEVKM FEDPISQLNP SKGEMDPGQI NMLKTTKFNS
510 520 530 540 550
ECRQPQAIPS SVRNEGKPYP GNFLGSNPML LEAPVDSSPL IRSNSMPTSS
560 570 580 590 600
ATNLSVPPSL RGSHSFDERM TGSDDVFYPG TVGIPPQRML RRQAAFELPS
610 620 630 640 650
VQEGHMESEH PARVSKGLAS PSLKEKKLLP GDRPGYDYDV CRKPYKKWED
660 670 680 690 700
SETLKQSYLG SFKQGGEYFM DPSVPVQGVP TMFGTTCENR KRRKEKSVGD
710 720 730 740 750
EEDVPMICGG MGNAPVGMMS SEYDPKLQDG GRSGFAMTAH ESLAHGHSDR
760 770 780 790 800
LDPARPQLPS RSPSLGSEDL PLAADPDKMT DLGKKPPGNV ISVIQHTNSL
810 820 830 840 850
SRPNSFERSE STEMVACPQD KTPSPAETCD SEVLEAPVSP EWAPPGDGGE
860 870 880 890 900
SGSKPTPSQQ VPQHSYHAQP RLVRQHNIQV PEIRVTEEPD KPEKEKEAPT
910 920 930 940 950
KEPEKPVEEF QWPQRSETLS QLPAEKLPPK KKRLRLADLE HSSGESSFES
960 970 980 990 1000
TGTGLSRSPS QESNLSHSSS FSMSFDREET VKLTAPPKQD ESGKHSEFLT
1010 1020 1030 1040 1050
VPAGSYSLSV PGHHHQKEMR RCSSEQMPCP HPTEVPEIRS KSFDYGNLSH
1060 1070 1080 1090 1100
APVAGTSPST LSPSRERKKC FLVRQASFSG SPEIAQGEAG VDPSVKQEHM
1110 1120 1130 1140 1150
EHLHAGLRAA WSSVLPPLPG DDPGKQVGTC GPLSSGPPLH LTQQQIMHMD
1160 1170 1180 1190 1200
SQESLRNPLI QPTSYMTSKH LPEQPHLFPH QDAVPFSPIQ NALFQFQYPT
1210 1220 1230 1240 1250
VCMVHLPAQQ PPWWQTHFPH PFAPHPQNSY SKPPFQADLH SSYPLEHVAE
1260 1270 1280 1290 1300
HTGKKSADYP HAKEQTYPCY SGTSGLHSKN LPLKFPSDPG SKSTETPTEQ
1310 1320 1330 1340 1350
LLREDFASEN AGPLQSLPGT VVPVRIQTHV PSYGSVMYTS ISQILGQNSP
1360 1370 1380 1390 1400
AIVICKVDEN MTQRTLVTNA AMQGIGLNIA QVLGQHTGLE KYPLWKVPQT
1410 1420 1430 1440 1450
LPLGLESSIP LCLPSTSDNA ASLGGSKRML SPASSLELFM ETKQQKRVKE
1460 1470 1480 1490 1500
EKMYGQIVEE LSAVELTNSD IKKGLSRPQK PQLVRQGCAS EPKDGCFQSR
1510 1520 1530 1540 1550
SSSFSSLSPS SSQDHPSASG PFPPNREILP GSRAPPRRKF SGPSESRESS
1560 1570 1580 1590 1600
DELDMDETSS DMSMSPQSSA LPTGGGQQEE EGKARKLPVS MLVHMASGPG
1610 1620 1630 1640 1650
GNVANSTLLF TDVADFQQIL QFPSLRTTTT VSWCFLNYTK PSFVQQATFK
1660 1670 1680 1690 1700
SSVYASWCIS SCNPNPSGLN TKTTLALLRS KQKITAEIYT LAAMHRPGAG
1710 1720 1730 1740 1750
KLTSSSVWKQ FAQMKPDAPF LFGNKLERKL AGNVLKERGK GEIHGDKDLG
1760 1770 1780 1790 1800
SKQTEPIRIK IFEGGYKSNE DYVYVRGRGR GKYICEECGI RCKKPSMLKK
1810 1820 1830 1840 1850
HIRTHTDVRP YVCKLCNFAF KTKGNLTKHM KSKAHMKKCL ELGVSMTSVD
1860 1870 1880 1890 1900
DTETEEAENM EELHKTSEKH SMSGISTDHQ FSDAEESDGE DGDDNDDDDE
1910 1920 1930 1940 1950
DDDDFDDQGD LTPKTRSRST SPQPPRFSSL PVNVGAVAHG VPSDSSLGHS
1960 1970 1980 1990 2000
SLISYLVTLP SIQVTQLMTP SDSCDDTQMT EYQRLFQSKS TDSEPDKDRL
2010 2020 2030 2040 2050
DIPSSMDEEA MLSSEPSSSP RDFSPSSYRS SPGYDSSPCR DNSPKRYLIP
2060 2070 2080 2090 2100
KGDLSPRRHL SPRRDLSPMR HLSPRKEAAL RREMSQGDAS PRRHLSPRRP
2110 2120 2130 2140 2150
LSPGKDITAR RDLSPRRERR YMTTIRAPSP RRALYPNPPL SMGQYLQTEP
2160 2170 2180 2190 2200
IVLGPPNLRR GIPQVPYFSL YGDQEGAYEH HGSSLFPEGP TDYVFSHLPL
2210 2220 2230 2240 2250
HSQQQVRAPI PMVPVGGIQM VHSLPPALSG LHPPPTLPLP TEGSEEKKGA
2260 2270 2280 2290 2300
PGEAFAKDPY ILSRRHEKQA PQVLQSSGLP SSPSSPRLLM KQSTSEDSLN
2310 2320 2330 2340 2350
STEREQEENI QTCTKAIASL RIATEEAALL GADPPTWVQE SPQKPLESAH
2360 2370 2380 2390 2400
VSIRHFGGPE PGQPCTSAAH PDLHDGEKDT FGTSQTAVAH PTFYSKSSVD
2410 2420 2430
EKRVDFQSSK ELSLSTEEGN EPSPEKNQLH
Length: 2,430
Mass (Da): 266,705
Last modified: October 11, 2005 - v1
Checksum: 265694210C8A7024

Experimental Info

Feature key          Position(s)    Description                             Evidence    Length
Sequence conflict    572            G → E in CAA75868 (PubMed:10207097).    Curated     1
Sequence conflict    661            S → P in CAA75868 (PubMed:10207097).    Curated     1
Sequence conflict    749            D → N in CAA75868 (PubMed:10207097).    Curated     1
Sequence conflict    762            S → G in CAA75868 (PubMed:10207097).    Curated     1
Sequence conflict    1112           S → P in CAA75868 (PubMed:10207097).    Curated     1
Sequence conflict    1211           P → S in CAA75868 (PubMed:10207097).    Curated     1
Sequence conflict    1223           A → G in CAA75868 (PubMed:10207097).    Curated     1
Sequence conflict    1496           C → S in CAA75868 (PubMed:10207097).    Curated     1
Sequence conflict    2001           D → E in BAE23294 (PubMed:16141072).    Curated     1
Sequence conflict    2091           P → A in CAA75868 (PubMed:10207097).    Curated     1
Sequence conflict    2122           M → L in CAA75868 (PubMed:10207097).    Curated     1
Sequence conflict    2263           S → C in BAE23336 (PubMed:16141072).    Curated     1
Sequence conflict    2294           T → A in BAE23294 (PubMed:16141072).    Curated     1
Sequence conflict    2314           T → A in CAA75868 (PubMed:10207097).    Curated     1
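
Each conflict row gives a position, the residue in this entry and the residue reported by the other record. The sketch below shows one way to work with such rows: it joins a ruler-formatted sequence display into a plain string, checks that the reference residue is where the row says it is, and builds the alternative sequence. The function names are hypothetical, and the input is the 551 to 600 row copied from the Sequence section.

```python
def parse_sequence_display(text: str) -> str:
    """Join a ruler-formatted sequence display into one residue string."""
    residues = []
    for line in text.splitlines():
        tokens = line.split()
        # Ruler lines hold only position numbers (10, 20, ...); skip them.
        if not tokens or all(token.isdigit() for token in tokens):
            continue
        residues.extend(tokens)
    return "".join(residues)


def apply_conflict(sequence: str, position: int, ref: str, alt: str,
                   first_position: int = 1) -> str:
    """Return sequence with ref replaced by alt at a 1-based position.

    first_position is the protein coordinate of sequence[0], so fragments
    can be used as well as the full-length sequence.
    """
    index = position - first_position
    if not 0 <= index < len(sequence):
        raise IndexError(f"position {position} is outside the sequence")
    if sequence[index] != ref:
        raise ValueError(f"expected {ref} at {position}, found {sequence[index]}")
    return sequence[:index] + alt + sequence[index + 1:]


# Ruler line and residue row for positions 551-600, as displayed in the entry.
ROWS_551_600 = """560 570 580 590 600
ATNLSVPPSL RGSHSFDERM TGSDDVFYPG TVGIPPQRML RRQAAFELPS"""

if __name__ == "__main__":
    fragment = parse_sequence_display(ROWS_551_600)
    # First listed conflict: G -> E at position 572 (CAA75868).
    print(apply_conflict(fragment, 572, "G", "E", first_position=551))
```

Checking the reference residue before substituting catches off-by-one errors between 1-based annotation coordinates and 0-based string indices.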

Sequence databases

EMBL/GenBank/DDBJ: Y15907 mRNA. Translation: CAA75868.1.
EMBL/GenBank/DDBJ: AK137291 mRNA. Translation: BAE23294.1.
EMBL/GenBank/DDBJ: AK137384 mRNA. Translation: BAE23336.1.
EMBL/GenBank/DDBJ: AK147244 mRNA. Translation: BAE27792.1.
EMBL/GenBank/DDBJ: AK147419 mRNA. Translation: BAE27900.1.
CCDS: CCDS23704.1.
RefSeq: NP_034567.2. NM_010437.2.
RefSeq: XP_006512619.1. XM_006512556.1.
RefSeq: XP_006512621.1. XM_006512558.2.
RefSeq: XP_006512622.1. XM_006512559.2.
RefSeq: XP_006512624.1. XM_006512561.3.
RefSeq: XP_011241431.1. XM_011243129.2.
RefSeq: XP_011241432.1. XM_011243130.2.
RefSeq: XP_011241433.1. XM_011243131.2.
RefSeq: XP_011241434.1. XM_011243132.2.
RefSeq: XP_011241435.1. XM_011243133.2.
RefSeq: XP_017169296.1. XM_017313807.1.
RefSeq: XP_017169297.1. XM_017313808.1.
RefSeq: XP_017169298.1. XM_017313809.1.
UniGene: Mm.42157.
UniGene: Mm.484135.

Genome annotation databases

Ensembl: ENSMUST00000015645; ENSMUSP00000015645; ENSMUSG00000015501.
Ensembl: ENSMUST00000187083; ENSMUSP00000140290; ENSMUSG00000015501.
Ensembl: ENSMUST00000191138; ENSMUSP00000140150; ENSMUSG00000015501.
GeneID: 15273.
KEGG: mmu:15273.
UCSC: uc007elg.1. mouse.

Cross-references

Organism-specific databases

CTD: 3097.
MGI: MGI:1338076. Hivep2.

Miscellaneous databases

ChiTaRS: Hivep2. mouse.
PRO: Q3UHF7.

Entry information

Entry name: ZEP2_MOUSE
Accession
Primary (citable) accession number: Q3UHF7
Secondary accession number(s): O55140, Q3UVD4, Q3UVH5
Entry history
Integrated into UniProtKB/Swiss-Prot: December 6, 2005
Last sequence update: October 11, 2005
Last modified: November 30, 2016
This is version 102 of the entry and version 1 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Chordata Protein Annotation Program
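
The primary and secondary accession numbers above follow a fixed format, which is useful when pulling accessions out of scraped text like this page. The sketch below validates them with a regular expression. The expression is an assumption based on the accession format as documented by UniProt, reproduced from memory, and the function name is hypothetical.

```python
import re

# Assumed UniProtKB accession format: 6 characters, or 10 in the extended form.
ACCESSION = re.compile(
    r"[OPQ][0-9][A-Z0-9]{3}[0-9]"
    r"|[A-NR-Z][0-9](?:[A-Z][A-Z0-9]{2}[0-9]){1,2}"
)


def is_accession(text: str) -> bool:
    """Return True if text as a whole looks like a UniProtKB accession."""
    return ACCESSION.fullmatch(text) is not None


if __name__ == "__main__":
    # Accessions listed in this entry, plus the entry name as a negative case.
    for candidate in ["Q3UHF7", "O55140", "Q3UVD4", "Q3UVH5", "ZEP2_MOUSE"]:
        print(candidate, is_accession(candidate))
```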

Miscellaneous

Keywords - Technical term

Complete proteome, Reference proteome

Documents

  1. MGD cross-references
    Mouse Genome Database (MGD) cross-references in UniProtKB/Swiss-Prot
  2. SIMILARITY comments
    Index of protein domains and families

Similar proteins

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%: UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%: UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a. seed sequence).
50%: UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.