Protein

General transcription factor II-I repeat domain-containing protein 1

Gene

Gtf2ird1

Organism
Mus musculus (Mouse)
Status
Reviewed - Annotation score: 5 out of 5 - Experimental evidence at transcript level

Function

May be a transcription regulator involved in cell-cycle progression and skeletal muscle differentiation. May repress GTF2I transcriptional functions, either by preventing its nuclear residency or by inhibiting its transcriptional activation. May contribute to slow-twitch fiber type specificity during myogenesis and in regenerating muscles. Binds the troponin I slow-muscle fiber enhancer (USE B1). Binds specifically and with high affinity to the EFG sequences derived from the early enhancer of HOXC8. (1 publication)

Keywords

Molecular function: Developmental protein, DNA-binding
Biological process: Transcription, Transcription regulation

Names & Taxonomy

Protein names
Recommended name: General transcription factor II-I repeat domain-containing protein 1
Short name: GTF2I repeat domain-containing protein 1
Alternative name(s): Binding factor for early enhancer
Gene names
Name: Gtf2ird1
Synonyms: Ben
Organism: Mus musculus (Mouse)
Taxonomic identifier: 10090 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Glires > Rodentia > Myomorpha > Muroidea > Muridae > Murinae > Mus > Mus
Proteomes
  • UP000000589 Component: Chromosome 5

Organism-specific databases

MGI: MGI:1861942. Gtf2ird1.

Subcellular location

Keywords - Cellular component

Nucleus

PTM / Processing

Molecule processing

Feature key | Position(s) | Description | Length
Chain (PRO_0000083871) | 1 – 1104 | General transcription factor II-I repeat domain-containing protein 1 | 1104

Amino acid modifications

Feature key | Position(s) | Description | Evidence
Cross-link | 27 | Glycyl lysine isopeptide (Lys-Gly) (interchain with G-Cter in SUMO2) | By similarity
Cross-link | 184 | Glycyl lysine isopeptide (Lys-Gly) (interchain with G-Cter in SUMO2) | By similarity
Cross-link | 212 | Glycyl lysine isopeptide (Lys-Gly) (interchain with G-Cter in SUMO2) | By similarity
Cross-link | 225 | Glycyl lysine isopeptide (Lys-Gly) (interchain with G-Cter in SUMO2) | By similarity
Cross-link | 238 | Glycyl lysine isopeptide (Lys-Gly) (interchain with G-Cter in SUMO2) | By similarity
Cross-link | 271 | Glycyl lysine isopeptide (Lys-Gly) (interchain with G-Cter in SUMO2) | By similarity
Cross-link | 337 | Glycyl lysine isopeptide (Lys-Gly) (interchain with G-Cter in SUMO2) | By similarity
Cross-link | 436 | Glycyl lysine isopeptide (Lys-Gly) (interchain with G-Cter in SUMO2) | By similarity
Cross-link | 439 | Glycyl lysine isopeptide (Lys-Gly) (interchain with G-Cter in SUMO2) | By similarity
Cross-link | 443 | Glycyl lysine isopeptide (Lys-Gly) (interchain with G-Cter in SUMO2) | By similarity
Modified residue | 448 | Phosphoserine | By similarity
Cross-link | 567 | Glycyl lysine isopeptide (Lys-Gly) (interchain with G-Cter in SUMO2) | By similarity
Cross-link | 579 | Glycyl lysine isopeptide (Lys-Gly) (interchain with G-Cter in SUMO2) | By similarity
Cross-link | 588 | Glycyl lysine isopeptide (Lys-Gly) (interchain with G-Cter in SUMO2) | By similarity
Cross-link | 622 | Glycyl lysine isopeptide (Lys-Gly) (interchain with G-Cter in SUMO2) | By similarity
Cross-link | 638 | Glycyl lysine isopeptide (Lys-Gly) (interchain with G-Cter in SUMO2) | By similarity
Cross-link | 669 | Glycyl lysine isopeptide (Lys-Gly) (interchain with G-Cter in SUMO2) | By similarity
Cross-link | 709 | Glycyl lysine isopeptide (Lys-Gly) (interchain with G-Cter in SUMO2) | By similarity
Cross-link | 717 | Glycyl lysine isopeptide (Lys-Gly) (interchain with G-Cter in SUMO2) | By similarity
Cross-link | 757 | Glycyl lysine isopeptide (Lys-Gly) (interchain with G-Cter in SUMO2) | By similarity
Cross-link | 759 | Glycyl lysine isopeptide (Lys-Gly) (interchain with G-Cter in SUMO2) | By similarity
Cross-link | 772 | Glycyl lysine isopeptide (Lys-Gly) (interchain with G-Cter in SUMO2) | By similarity
Cross-link | 841 | Glycyl lysine isopeptide (Lys-Gly) (interchain with G-Cter in SUMO2) | By similarity
Cross-link | 901 | Glycyl lysine isopeptide (Lys-Gly) (interchain with G-Cter in SUMO2) | By similarity

Keywords - PTM

Isopeptide bond, Phosphoprotein, Ubl conjugation

Proteomic databases

PaxDb: Q9JI57.
PRIDE: Q9JI57.

PTM databases

iPTMnet: Q9JI57.
PhosphoSitePlus: Q9JI57.

Expression

Tissue specificity

Widely expressed.

Developmental stage

Expressed in somites, neural tube and brain at E8-8.5. Expression remains constant from E9.5 to E12.5, with the highest expression levels in the limb buds, branchial arches, craniofacial area, brain and spinal cord.

Gene expression databases

Bgee: ENSMUSG00000023079.
ExpressionAtlas: Q9JI57. baseline and differential.
Genevisible: Q9JI57. MM.

Interaction

Subunit structure

Interacts with the retinoblastoma protein (RB1) via its C-terminus. (By similarity)

Protein-protein interaction databases

IntAct: Q9JI57. 1 interactor.
STRING: 10090.ENSMUSP00000098217.

Structure

3D structure databases

ProteinModelPortal: Q9JI57.
SMR: Q9JI57.

Family & Domains

Domains and Repeats

Feature key | Position(s) | Description | Length
Repeat | 119 – 213 | GTF2I-like 1 | 95
Repeat | 342 – 436 | GTF2I-like 2 | 95
Repeat | 556 – 650 | GTF2I-like 3 | 95
Repeat | 681 – 775 | GTF2I-like 4 | 95
Repeat | 805 – 899 | GTF2I-like 5 | 95
Repeat | 908 – 1002 | GTF2I-like 6 | 95
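The repeat coordinates are 1-based and inclusive, so each length is end minus start plus one. The following is a minimal sketch that checks this against the listed lengths and looks up which repeat, if any, covers a given residue. The names `REPEATS`, `span_length` and `repeat_at` are hypothetical helpers introduced for illustration; they are not part of any UniProt tool.

```python
# GTF2I-like repeats as (name, start, end); coordinates are 1-based and
# inclusive, copied from the Domains and Repeats table of this entry.
REPEATS = [
    ("GTF2I-like 1", 119, 213),
    ("GTF2I-like 2", 342, 436),
    ("GTF2I-like 3", 556, 650),
    ("GTF2I-like 4", 681, 775),
    ("GTF2I-like 5", 805, 899),
    ("GTF2I-like 6", 908, 1002),
]


def span_length(start, end):
    """Length of a 1-based inclusive interval."""
    return end - start + 1


def repeat_at(position):
    """Return the name of the repeat covering `position`, or None."""
    for name, start, end in REPEATS:
        if start <= position <= end:
            return name
    return None


if __name__ == "__main__":
    for name, start, end in REPEATS:
        print(name, span_length(start, end))
```

For instance, `repeat_at(184)` places the cross-linked lysine at position 184 inside GTF2I-like 1, while `repeat_at(901)` returns `None` because position 901 falls in the gap between repeats 5 and 6.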

Motif

Feature key | Position(s) | Description | Evidence | Length
Motif | 1012 – 1019 | Nuclear localization signal | By similarity | 8

Compositional bias

Feature key | Position(s) | Description | Length
Compositional bias | 1020 – 1043 | Ser-rich | 24

Sequence similarities

Belongs to the TFII-I family. (PROSITE-ProRule annotation)

Keywords - Domain

Repeat

Phylogenomic databases

eggNOG: ENOG410IEPZ. Eukaryota.
ENOG41100H8. LUCA.
GeneTree: ENSGT00530000063863.
HOVERGEN: HBG051855.
InParanoid: Q9JI57.
KO: K03121.
OMA: MYMVDYA.
OrthoDB: EOG091G019P.
PhylomeDB: Q9JI57.
TreeFam: TF352524.

Family and domain databases

Gene3D: 3.90.1460.10. 6 hits.
InterPro: IPR004212. GTF2I.
IPR036647. GTF2I-like_rpt_sf.
IPR016659. TF_II-I.
Pfam: PF02946. GTF2I. 6 hits.
PIRSF: PIRSF016441. TF_II-I. 1 hit.
SUPFAM: SSF117773. SSF117773. 6 hits.
PROSITE: PS51139. GTF2I. 6 hits.

Sequences (10)

Sequence status: Complete.

This entry describes 10 isoforms produced by alternative splicing.

Isoform 2 (identifier: Q9JI57-1)
Also known as: Beta, 3b7

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.

        10         20         30         40         50
MALLGKHCDI PTNGCGSERW NSTFARKDEL INSLVSALDS MCSALSKLNT
60 70 80 90 100
EVACVAVHNE SVFVMGTEKG RVFLNTRKEL QSDFLRFCRG PLWNDPEAGH
110 120 130 140 150
PKKVQRCEGG GRSLPRSSLE QCSDVYLLQK MVEEVFDVLY SEAMGRATVV
160 170 180 190 200
PLPYERLLRE PGLLAVQGLP EGLAFRRPAE YDPKALMAIL EHSHRIRFKL
210 220 230 240 250
RRPPDDGGQD TKALVEMNGI SLLPKGSRDC GLHGQASKVA PQDLTPTATP
260 270 280 290 300
SSMANFLYST SMPNHTIREL KQEVPTCPLT PSDLGMGWPV PEPHVPSTQD
310 320 330 340 350
FSDCCGQTPA GPAGPLIQNV HASKRILFSI VHDKSEKWDP FIKEMEDINT
360 370 380 390 400
LRECVQILFN SRYAEALGLD HMVPVPYRKI ACDPEAVEIV GIPDKIPFKR
410 420 430 440 450
PCTYGVPKLK RILEERHSIH FIIKRMFDER IFTGNKFTKD PMKLEPASPP
460 470 480 490 500
EDTSTEVCRD SMLDLAGTAW SDMSSVSEDC GPGTSGEIAM LRPIKIEPEE
510 520 530 540 550
LDIIQVTVSD PSPTSEEMTD SLPGHLPSED SGYGMEMPAD KGPSEEPWSE
560 570 580 590 600
ERPAEESPGD VIRPLRKQVE MLFNTKYAKA IGTSEPVKVP YSKFLMHPEE
610 620 630 640 650
LFVLGLPEGI SLRRPNCFGI AKLRKILEAS NSIQFVIKRP ELLTDGVKEP
660 670 680 690 700
VLDTQERDSW DRLVDETPKR QGLQENYNTR LSRIDIANTL REQVQDLFNK
710 720 730 740 750
KYGEALGIKY PVQVPYKRIK SNPGSVIIEG LPPGIPFRKP CTFGSQNLER
760 770 780 790 800
ILSVADKIKF TVTRPFQGLI PKPETKILTT GHEAGKTTRP RRLQQDTWQP
810 820 830 840 850
DEDDANRLGE KVILREQVKE LFNEKYGEAL GLNRPVLVPY KLIRDSPDAV
860 870 880 890 900
EVKGLPDDIP FRNPNTYDIH RLEKILKARE HVRMVIINQL QPFAEVCNDP
910 920 930 940 950
KVPEEDDSNK LGKKVILREQ VKELFNEKYG EALGLNRPVL VPYKLIRDSP
960 970 980 990 1000
DAVEVKGLPD DIPFRNPNTY DIHRLEKILK AREHVRMVII NQLQPFGDVC
1010 1020 1030 1040 1050
NNAKVPAKDN IPKRKRKRVS EGNSVSSSSS SSSSSSNPES VASTNQISLV
1060 1070 1080 1090 1100
VKSRGSELHP NSVWPLPLPR AGPSTAPGTG RHWALRGTQP TTEGQAHPLV

LPTR
Length: 1,104
Mass (Da): 123,483
Last modified: May 27, 2002 - v2
Checksum: C76B2AF3CD1BCD73
Isoform 1 (identifier: Q9JI57-2)
Also known as: Alpha, 3a7

The sequence of this isoform differs from the canonical sequence as follows:
     1051-1104: VKSRGSELHP...QAHPLVLPTR → QWPVYMVDYSGLNVQLPGPLDY

Length: 1,072
Mass (Da): 120,267
Checksum: 9E82AA6D7D6D02FC
Isoform 3 (identifier: Q9JI57-3)
Also known as: Gamma, 1a1

The sequence of this isoform differs from the canonical sequence as follows:
     774-800: Missing.
     864-966: Missing.
     1051-1104: VKSRGSELHP...QAHPLVLPTR → QWPVYMVDYSGLNVQLPGPLDY

Length: 942
Mass (Da): 105,235
Checksum: 699A0ECFF01933EF
Isoform 4 (identifier: Q9JI57-4)
Also known as: 1b1

The sequence of this isoform differs from the canonical sequence as follows:
     774-800: Missing.
     864-966: Missing.

Length: 974
Mass (Da): 108,452
Checksum: 243952898632E546
Isoform 5 (identifier: Q9JI57-5)
Also known as: 3b5

The sequence of this isoform differs from the canonical sequence as follows:
     774-800: Missing.

Length: 1,077
Mass (Da): 120,352
Checksum: A577262306C14333
Isoform 6 (identifier: Q9JI57-6)
Also known as: 3b3

The sequence of this isoform differs from the canonical sequence as follows:
     657-675: Missing.
     774-800: Missing.

Length: 1,058
Mass (Da): 118,041
Checksum: 99AA5EFF06E36681
Isoform 7 (identifier: Q9JI57-7)
Also known as: 3a3

The sequence of this isoform differs from the canonical sequence as follows:
     657-675: Missing.
     774-800: Missing.
     1051-1104: VKSRGSELHP...QAHPLVLPTR → QWPVYMVDYSGLNVQLPGPLDY

Length: 1,026
Mass (Da): 114,825
Checksum: 035A62B8AE290EC9
Isoform 8 (identifier: Q9JI57-8)
Also known as: 1b4

The sequence of this isoform differs from the canonical sequence as follows:
     864-966: Missing.

Length: 1,001
Mass (Da): 111,584
Checksum: FCEC832BCAD242BE
Isoform 9 (identifier: Q9JI57-9)
Also known as: 2a5

The sequence of this isoform differs from the canonical sequence as follows:
     703-800: Missing.
     1051-1104: VKSRGSELHP...QAHPLVLPTR → QWPVYMVDYSGLNVQLPGPLDY

Length: 974
Mass (Da): 109,349
Checksum: 4A312EAE7E473D2A
Isoform 10 (identifier: Q9JI57-10)
Also known as: 1a4

The sequence of this isoform differs from the canonical sequence as follows:
     864-966: Missing.
     1051-1104: VKSRGSELHP...QAHPLVLPTR → QWPVYMVDYSGLNVQLPGPLDY

Length: 969
Mass (Da): 108,367
Checksum: 9BF42994545C256B

Sequence caution

Q9JI57: The sequence AAF78367 differs from that shown. Reason: Frameshift at positions 217 and 243. (Curated)

Experimental Info

Feature key | Position(s) | Description | Evidence | Length
Sequence conflict | 79 | E → V in AAM02923 (PubMed:12780350). | Curated | 1
Sequence conflict | 94 | N → D in AAF78367 (PubMed:10861001). | Curated | 1
Sequence conflict | 134 | E → K in AAM02920 (PubMed:12780350). | Curated | 1
Sequence conflict | 252 | S → F in AAF78367 (PubMed:10861001). | Curated | 1
Sequence conflict | 258 | Y → H in AAF78367 (PubMed:10861001). | Curated | 1
Sequence conflict | 261 | S → L in AAF78367 (PubMed:10861001). | Curated | 1
Sequence conflict | 283 | D → G in AAF78367 (PubMed:10861001). | Curated | 1
Sequence conflict | 399 | K → R in AAL68980 (PubMed:12780350). | Curated | 1
Sequence conflict | 821 | L → F in AAM02920 (PubMed:12780350). | Curated | 1
Sequence conflict | 821 | L → F in AAP30731 (PubMed:12780350). | Curated | 1
Sequence conflict | 881 | H → Y in AAM02923 (PubMed:12780350). | Curated | 1
Sequence conflict | 1058 | L → P in AAP30733 (PubMed:12780350). | Curated | 1
Sequence conflict | 1058 | L → P in AAM02922 (PubMed:12780350). | Curated | 1

Alternative sequence

Feature key | Identifier | Position(s) | Description | Length
Alternative sequence | VSP_021376 | 657 – 675 | Missing in isoform 6 and isoform 7. 1 publication | 19
Alternative sequence | VSP_021377 | 703 – 800 | Missing in isoform 9. 1 publication | 98
Alternative sequence | VSP_003874 | 774 – 800 | Missing in isoform 3, isoform 4, isoform 5, isoform 6 and isoform 7. 2 publications | 27
Alternative sequence | VSP_003875 | 864 – 966 | Missing in isoform 3, isoform 4, isoform 8 and isoform 10. 2 publications | 103
Alternative sequence | VSP_003876 | 1051 – 1104 | VKSRG…VLPTR → QWPVYMVDYSGLNVQLPGPLDY in isoform 1, isoform 3, isoform 7, isoform 9 and isoform 10. 3 publications | 54
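Each isoform length follows from the canonical length (1,104) by subtracting every deleted or replaced span and adding back the length of any replacement. The sketch below reproduces that arithmetic from the alternative-sequence features listed for this entry. The names `VSP`, `ISOFORMS` and `isoform_length` are hypothetical, and the calculation assumes that the features applied to one isoform do not overlap, which holds for the combinations listed here.

```python
CANONICAL_LENGTH = 1104  # length of isoform 2 (Q9JI57-1)

# Alternative-sequence features: (start, end, replacement length).
# Coordinates are 1-based and inclusive; "Missing" means replacement length 0.
VSP = {
    "VSP_021376": (657, 675, 0),
    "VSP_021377": (703, 800, 0),
    "VSP_003874": (774, 800, 0),
    "VSP_003875": (864, 966, 0),
    "VSP_003876": (1051, 1104, len("QWPVYMVDYSGLNVQLPGPLDY")),
}

# Features applied to each isoform, as listed in this entry.
ISOFORMS = {
    "Isoform 1": ["VSP_003876"],
    "Isoform 2": [],
    "Isoform 3": ["VSP_003874", "VSP_003875", "VSP_003876"],
    "Isoform 4": ["VSP_003874", "VSP_003875"],
    "Isoform 5": ["VSP_003874"],
    "Isoform 6": ["VSP_021376", "VSP_003874"],
    "Isoform 7": ["VSP_021376", "VSP_003874", "VSP_003876"],
    "Isoform 8": ["VSP_003875"],
    "Isoform 9": ["VSP_021377", "VSP_003876"],
    "Isoform 10": ["VSP_003875", "VSP_003876"],
}


def isoform_length(feature_ids):
    """Canonical length adjusted by each (non-overlapping) feature."""
    length = CANONICAL_LENGTH
    for feature_id in feature_ids:
        start, end, replacement_length = VSP[feature_id]
        length += replacement_length - (end - start + 1)
    return length


if __name__ == "__main__":
    for name, feature_ids in ISOFORMS.items():
        print(name, isoform_length(feature_ids))
```

Isoform 1, for example, swaps the 54-residue C-terminal span 1051-1104 for a 22-residue sequence, giving 1104 - 54 + 22 = 1072, which matches the length reported for that isoform.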

Sequence databases

EMBL / GenBank / DDBJ:
AF260133 mRNA. Translation: AAF78367.1. Frameshift.
AF257475 mRNA. Translation: AAG44655.1.
AY030287 mRNA. Translation: AAK49782.1.
AY030288 mRNA. Translation: AAK49783.1.
AY030289 mRNA. Translation: AAK49784.1.
AF247161 mRNA. Translation: AAL68980.1.
AF343348 mRNA. Translation: AAM02920.1.
AF343349 mRNA. Translation: AAM02921.1.
AF343350 mRNA. Translation: AAM02922.1.
AF343351 mRNA. Translation: AAM02923.1.
AF497637 mRNA. Translation: AAP30728.1.
AF497638 mRNA. Translation: AAP30729.1.
AF497639 mRNA. Translation: AAP30730.1.
AF497640 mRNA. Translation: AAP30731.1.
AF497641 mRNA. Translation: AAP30732.1.
AF497642 mRNA. Translation: AAP30733.1.
AF289666 Genomic DNA. Translation: AAF99337.1.
AF289667 Genomic DNA. Translation: AAF99339.1.
CCDS: CCDS39302.1. [Q9JI57-7]
CCDS39303.1. [Q9JI57-9]
CCDS39305.1. [Q9JI57-2]
CCDS39306.1. [Q9JI57-5]
CCDS39307.1. [Q9JI57-1]
CCDS51657.1. [Q9JI57-8]
CCDS80420.1. [Q9JI57-10]
CCDS84967.1. [Q9JI57-6]
RefSeq: NP_001074931.1. NM_001081462.2. [Q9JI57-1]
NP_001074932.1. NM_001081463.2. [Q9JI57-5]
NP_001074935.1. NM_001081466.2. [Q9JI57-7]
NP_001074936.1. NM_001081467.2. [Q9JI57-8]
NP_001074938.1. NM_001081469.2. [Q9JI57-10]
NP_001074939.1. NM_001081470.2. [Q9JI57-3]
NP_001231865.1. NM_001244936.1. [Q9JI57-4]
NP_001334417.1. NM_001347488.1. [Q9JI57-6]
NP_065064.2. NM_020331.3. [Q9JI57-2]
XP_017176527.1. XM_017321038.1. [Q9JI57-1]
XP_017176528.1. XM_017321039.1. [Q9JI57-5]
UniGene: Mm.332735.

Genome annotation databases

Ensembl: ENSMUST00000073161; ENSMUSP00000072904; ENSMUSG00000023079. [Q9JI57-2]
ENSMUST00000074114; ENSMUSP00000073752; ENSMUSG00000023079. [Q9JI57-8]
ENSMUST00000100650; ENSMUSP00000098215; ENSMUSG00000023079. [Q9JI57-5]
ENSMUST00000100652; ENSMUSP00000098217; ENSMUSG00000023079. [Q9JI57-1]
ENSMUST00000100654; ENSMUSP00000098219; ENSMUSG00000023079. [Q9JI57-9]
ENSMUST00000111245; ENSMUSP00000106876; ENSMUSG00000023079. [Q9JI57-7]
ENSMUST00000167084; ENSMUSP00000132882; ENSMUSG00000023079. [Q9JI57-10]
ENSMUST00000200944; ENSMUSP00000143848; ENSMUSG00000023079. [Q9JI57-10]
ENSMUST00000202554; ENSMUSP00000143809; ENSMUSG00000023079. [Q9JI57-6]
GeneID: 57080.
KEGG: mmu:57080.
UCSC: uc008zvv.2. mouse. [Q9JI57-2]
uc008zvw.2. mouse. [Q9JI57-7]
uc008zvx.2. mouse. [Q9JI57-3]
uc008zvy.2. mouse. [Q9JI57-10]
uc008zvz.2. mouse. [Q9JI57-9]
uc008zwa.2. mouse. [Q9JI57-8]
uc008zwb.2. mouse. [Q9JI57-6]
uc008zwc.2. mouse. [Q9JI57-4]
uc008zwd.2. mouse. [Q9JI57-5]
uc008zwe.2. mouse. [Q9JI57-1]

Keywords - Coding sequence diversity

Alternative splicing

Entry information

Entry name: GT2D1_MOUSE
Accession: Primary (citable) accession number: Q9JI57
Secondary accession number(s): Q547E0, Q80WJ8, Q80WJ9, Q80WK0, Q80WK1, Q8R4X5, Q8R4X6, Q8R4X7, Q8R4X8, Q8VHD5, Q8VI58, Q9EQE7, Q9ESZ6, Q9ESZ7
Entry history: Integrated into UniProtKB/Swiss-Prot: May 27, 2002
Last sequence update: May 27, 2002
Last modified: October 25, 2017
This is version 135 of the entry and version 2 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Chordata Protein Annotation Program

Miscellaneous

Keywords - Technical term

Complete proteome, Reference proteome

Documents

  1. MGD cross-references
    Mouse Genome Database (MGD) cross-references in UniProtKB/Swiss-Prot
  2. SIMILARITY comments
    Index of protein domains and families