Protein

General transcription factor 3C polypeptide 1

Gene

GTF3C1

Organism
Homo sapiens (Human)
Status
Reviewed - Annotation score: 5 out of 5 - Experimental evidence at protein level

Function

Required for RNA polymerase III-mediated transcription. Component of TFIIIC that initiates transcription complex assembly on tRNA and is required for transcription of 5S rRNA and other stable nuclear and cytoplasmic RNAs. Binds to the box B promoter element.

GO - Molecular function

GO - Biological process

  • 5S class rRNA transcription from RNA polymerase III type 1 promoter Source: HGNC
  • rRNA transcription Source: ProtInc
  • transcription, DNA-templated Source: HGNC
  • transcription from RNA polymerase III promoter Source: HGNC
  • tRNA transcription Source: ProtInc
  • tRNA transcription from RNA polymerase III promoter Source: HGNC

Keywords - Biological process

Transcription

Keywords - Ligand

DNA-binding

Enzyme and pathway databases

BioCyc: ZFISH:ENSG00000077235-MONOMER.
Reactome: R-HSA-749476. RNA Polymerase III Abortive And Retractive Initiation.
  R-HSA-76061. RNA Polymerase III Transcription Initiation From Type 1 Promoter.
  R-HSA-76066. RNA Polymerase III Transcription Initiation From Type 2 Promoter.

Names & Taxonomy

Protein names
Recommended name:
General transcription factor 3C polypeptide 1
Alternative name(s):
TF3C-alpha
TFIIIC box B-binding subunit
Transcription factor IIIC 220 kDa subunit (short names: TFIIIC 220 kDa subunit, TFIIIC220)
Transcription factor IIIC subunit alpha
Gene names
Name: GTF3C1
Organism: Homo sapiens (Human)
Taxonomic identifier: 9606 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Primates > Haplorrhini > Catarrhini > Hominidae > Homo
Proteomes
  • UP000005640 Component: Chromosome 16

Organism-specific databases

HGNC: HGNC:4664. GTF3C1.

Subcellular location

GO - Cellular component

  • intracellular ribonucleoprotein complex Source: Ensembl
  • membrane Source: UniProtKB
  • nucleolus Source: HPA
  • nucleoplasm Source: Reactome
  • nucleus Source: HPA
  • transcription factor TFIIIC complex Source: HGNC

Keywords - Cellular component

Nucleus

Pathology & Biotech

Organism-specific databases

DisGeNET: 2975.
OpenTargets: ENSG00000077235.
PharmGKB: PA29052.

Polymorphism and mutation databases

BioMuta: GTF3C1.
DMDM: 215274233.

PTM / Processing

Molecule processing

Feature key   Identifier       Position(s)   Description                                     Length
Chain         PRO_0000209710   1 – 2109      General transcription factor 3C polypeptide 1   2109

Amino acid modifications

Feature key        Position   Description                                                             Evidence           Length
Modified residue   667        Phosphoserine                                                           Combined sources   1
Modified residue   739        Phosphoserine                                                           Combined sources   1
Cross-link         833        Glycyl lysine isopeptide (Lys-Gly) (interchain with G-Cter in SUMO2)    Combined sources
Modified residue   1062       Phosphoserine                                                           Combined sources   1
Modified residue   1068       Phosphoserine                                                           Combined sources   1
Cross-link         1142       Glycyl lysine isopeptide (Lys-Gly) (interchain with G-Cter in SUMO2)    Combined sources
Modified residue   1253       Phosphoserine                                                           Combined sources   1
Modified residue   1611       Phosphoserine                                                           Combined sources   1
Modified residue   1632       Phosphoserine                                                           Combined sources   1
Modified residue   1653       Phosphoserine                                                           Combined sources   1
Modified residue   1856       Phosphoserine                                                           Combined sources   1
Modified residue   1865       Phosphoserine                                                           Combined sources   1
Modified residue   1868       Phosphoserine                                                           Combined sources   1
Modified residue   1896       Phosphoserine                                                           By similarity      1
Modified residue   1911       Phosphoserine                                                           Combined sources   1
Modified residue   1969       Phosphoserine                                                           Combined sources   1

Keywords - PTM

Isopeptide bond, Phosphoprotein, Ubl conjugation

Proteomic databases

EPD: Q12789.
PaxDb: Q12789.
PeptideAtlas: Q12789.
PRIDE: Q12789.

PTM databases

iPTMnet: Q12789.
PhosphoSitePlus: Q12789.

Expression

Gene expression databases

Bgee: ENSG00000077235.
CleanEx: HS_GTF3C1.
ExpressionAtlas: Q12789. baseline and differential.
Genevisible: Q12789. HS.

Organism-specific databases

HPA: HPA051617.

Interaction

Subunit structure

Part of the TFIIIC subcomplex TFIIIC2, consisting of six subunits, GTF3C1, GTF3C2, GTF3C3, GTF3C4, GTF3C5 and GTF3C6. Interacts with IGHMBP2. (1 Publication)

Binary interactions

With     Entry    #Exp.   IntAct                    Notes
AGTRAP   Q6RW13   3       EBI-357956,EBI-741181
CMTM5    Q96DZ9   3       EBI-357956,EBI-2548702
MAGEA4   Q1RN33   3       EBI-357956,EBI-10194128

Protein-protein interaction databases

BioGrid: 109230. 89 interactors.
DIP: DIP-38212N.
IntAct: Q12789. 42 interactors.
MINT: MINT-1154812.
STRING: 9606.ENSP00000348510.

Structure

3D structure databases

ProteinModelPortal: Q12789.
SMR: Q12789.

Family & Domains

Compositional bias

Feature key          Position(s)   Description            Length
Compositional bias   343 – 351     Asp/Glu-rich (acidic)  9
Compositional bias   1205 – 1233   Arg/Lys-rich (basic)   29
Compositional bias   1613 – 1624   Asp/Glu-rich (acidic)  12
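
The "Asp/Glu-rich" annotations can be sanity-checked against the canonical sequence. The following is a minimal sketch, not part of the entry: the function name `residue_fraction` is hypothetical, and the two peptide strings are residues 343 – 351 and 1613 – 1624 copied from the canonical sequence shown in this entry.

```python
def residue_fraction(region: str, residues: str) -> float:
    """Return the fraction of positions in `region` whose residue is in `residues`."""
    if not region:
        raise ValueError("region must not be empty")
    wanted = set(residues)
    return sum(1 for aa in region if aa in wanted) / len(region)


# Residues 343-351 and 1613-1624 of the canonical sequence (isoform 1).
acidic_region_1 = "DHDDDEDEE"
acidic_region_2 = "EDDEDEEDDLDE"

fraction_1 = residue_fraction(acidic_region_1, "DE")  # 8 of 9 residues are Asp/Glu
fraction_2 = residue_fraction(acidic_region_2, "DE")  # 11 of 12 residues are Asp/Glu
```

In both regions all but one residue is Asp or Glu, consistent with the "acidic" label.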

Sequence similarities

Belongs to the TFIIIC subunit 1 family. (Curated)

Phylogenomic databases

eggNOG: KOG4560. Eukaryota.
  ENOG410XSJ5. LUCA.
GeneTree: ENSGT00390000008664.
HOGENOM: HOG000154556.
HOVERGEN: HBG057283.
InParanoid: Q12789.
KO: K15199.
OMA: VYPIHMI.
OrthoDB: EOG091G00PD.
PhylomeDB: Q12789.
TreeFam: TF351624.

Family and domain databases

InterPro: IPR007309. TFIIIC_Bblock-bd.
Pfam: PF04182. B-block_TFIIIC. 1 hit.

Sequences (2)

Sequence status: Complete.

This entry describes 2 isoforms produced by alternative splicing.

Isoform 1 (identifier: Q12789-2)

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.


        10         20         30         40         50
MDALESLLDE VALEGLDGLC LPALWSRLET RVPPFPLPLE PCTQEFLWRA
60 70 80 90 100
LATHPGISFY EEPRERPDLQ LQDRYEEIDL ETGILESRRD PVALEDVYPI
110 120 130 140 150
HMILENKDGI QGSCRYFKER KNITNDIRTK SLQPRCTMVE AFDRWGKKLI
160 170 180 190 200
IVASQAMRYR ALIGQEGDPD LKLPDFSYCI LERLGRSRWQ GELQRDLHTT
210 220 230 240 250
AFKVDAGKLH YHRKILNKNG LITMQSHVIR LPTGAQQHSI LLLLNRFHVD
260 270 280 290 300
RRSKYDILME KLSVMLSTRT NHIETLGKLR EELGLCERTF KRLYQYMLNA
310 320 330 340 350
GLAKVVSLRL QEIHPECGPC KTKKGTDVMV RCLKLLKEFK RNDHDDDEDE
360 370 380 390 400
EVISKTVPPV DIVFERDMLT QTYDLIERRG TKGISQAEIR VAMNVGKLEA
410 420 430 440 450
RMLCRLLQRF KVVKGFMEDE GRQRTTKYIS CVFAEESDLS RQYQREKARS
460 470 480 490 500
ELLTTVSLAS MQEESLLPEG EDTFLSESDS EEERSSSKRR GRGSQKDTRA
510 520 530 540 550
SANLRPKTQP HHSTPTKGGW KVVNLHPLKK QPPSFPGAAE ERACQSLASR
560 570 580 590 600
DSLLDTSSVS EPNVSFVSHC ADSNSGDIAV IEEVRMENPK ESSSSLKTGR
610 620 630 640 650
HSSGQDKPHE TYRLLKRRNL IIEAVTNLRL IESLFTIQKM IMDQEKQEGV
660 670 680 690 700
STKCCKKSIV RLVRNLSEEG LLRLYRTTVI QDGIKKKVDL VVHPSMDQND
710 720 730 740 750
PLVRSAIEQV RFRISNSSTA NRVKTSQPPV PQGEAEEDSQ GKEGPSGSGD
760 770 780 790 800
SQLSASSRSE SGRMKKSDNK MGITPLRNYH PIVVPGLGRS LGFLPKMPRL
810 820 830 840 850
RVVHMFLWYL IYGHPASNTV EKPSFISERR TIKQESGRAG VRPSSSGSAW
860 870 880 890 900
EACSEAPSKG SQDGVTWEAE VELATETVYV DDASWMRYIP PIPVHRDFGF
910 920 930 940 950
GWALVSDILL CLPLSIFIQI VQVSYKVDNL EEFLNDPLKK HTLIRFLPRP
960 970 980 990 1000
IRQQLLYKRR YIFSVVENLQ RLCYMGLLQF GPTEKFQDKD QVFIFLKKNA
1010 1020 1030 1040 1050
VIVDTTICDP HYNLARSSRP FERRLYVLNS MQDVENYWFD LQCVCLNTPL
1060 1070 1080 1090 1100
GVVRCPRVRK NSSTDQGSDE EGSLQKEQES AMDKHNLERK CAMLEYTTGS
1110 1120 1130 1140 1150
REVVDEGLIP GDGLGAAGLD SSFYGHLKRN WIWTSYIINQ AKKENTAAEN
1160 1170 1180 1190 1200
GLTVRLQTFL SKRPMPLSAR GNSRLNIWGE ARVGSELCAG WEEQFEVDRE
1210 1220 1230 1240 1250
PSLDRNRRVR GGKSQKRKRL KKDPGKKIKR KKKGEFPGEK SKRLRYHDEA
1260 1270 1280 1290 1300
DQSALQRMTR LRVTWSMQED GLLVLCRIAS NVLNTKVKGP FVTWQVVRDI
1310 1320 1330 1340 1350
LHATFEESLD KTSHSVGRRA RYIVKNPQAY LNYKVCLAEV YQDKALVGDF
1360 1370 1380 1390 1400
MNRRGDYDDP KVCANEFKEF VEKLKEKFSS ALRNSNLEIP DTLQELFARY
1410 1420 1430 1440 1450
RVLAIGDEKD QTRKEDELNS VDDIHFLVLQ NLIQSTLALS DSQMKSYQSF
1460 1470 1480 1490 1500
QTFRLYREYK DHVLVKAFME CQKRSLVNRR RVNHTLGPKK NRALPFVPMS
1510 1520 1530 1540 1550
YQLSQTYYRI FTWRFPSTIC TESFQFLDRM RAAGKLDQPD RFSFKDQDNN
1560 1570 1580 1590 1600
EPTNDMVAFS LDGPGGNCVA VLTLFSLGLI SVDVRIPEQI IVVDSSMVEN
1610 1620 1630 1640 1650
EVIKSLGKDG SLEDDEDEED DLDEGVGGKR RSMEVKPAQA SHTNYLLMRG
1660 1670 1680 1690 1700
YYSPGIVSTR NLNPNDSIVV NSCQMKFQLR CTPVPARLRP AAAPLEELTM
1710 1720 1730 1740 1750
GTSCLPDTFT KLINPQENTC SLEEFVLQLE LSGYSPEDLT AALEILEAII
1760 1770 1780 1790 1800
ATGCFGIDKE ELRRRFSALE KAGGGRTRTF ADCIQALLEQ HQVLEVGGNT
1810 1820 1830 1840 1850
ARLVAMGSAW PWLLHSVRLK DREDADIQRE DPQARPLEGS SSEDSPPEGQ
1860 1870 1880 1890 1900
APPSHSPRGT KRRASWASEN GETDAEGTQM TPAKRPALQD SNLAPSLGPG
1910 1920 1930 1940 1950
AEDGAEAQAP SPPPALEDTA AAGAAQEDQE GVGEFSSPGQ EQLSGQAQPP
1960 1970 1980 1990 2000
EGSEDPRGFT ESFGAANISQ AARERDCESV CFIGRPWRVV DGHLNLPVCK
2010 2020 2030 2040 2050
GMMEAMLYHI MTRPGIPESS LLRHYQGVLQ PVAVLELLQG LESLGCIRKR
2060 2070 2080 2090 2100
WLRKPRPVSL FSTPVVEEVE VPSSLDESPM AFYEPTLDCT LRLGRVFPHE

VNWNKWIHL
Length: 2,109
Mass (Da): 238,875
Last modified: November 25, 2008 - v4
Checksum: 37A03135EFE695FC
Isoform 2 (identifier: Q12789-3)

The sequence of this isoform differs from the canonical sequence as follows:
     1933-1957: Missing.

Note: No experimental confirmation available.
Length: 2,084
Mass (Da): 236,278
Checksum: D1D74F05BF1C6051
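
The isoform 2 sequence follows from the canonical sequence by deleting residues 1933 – 1957, which accounts for the length difference (2,109 − 25 = 2,084). A minimal sketch of that operation, assuming 1-based inclusive coordinates as used throughout this entry; the function name `remove_region` and the toy sequence are illustrative only:

```python
def remove_region(sequence: str, start: int, end: int) -> str:
    """Delete residues start..end (1-based, inclusive) from a protein sequence."""
    if not (1 <= start <= end <= len(sequence)):
        raise ValueError("region lies outside the sequence")
    # Python slices are 0-based and end-exclusive, hence start - 1 and end.
    return sequence[:start - 1] + sequence[end:]


# Toy example: drop residues 4-6 (E, F, G) from a 10-residue peptide.
toy_isoform = remove_region("ACDEFGHIKL", 4, 6)

# Length check for this entry, using a placeholder of the canonical length.
isoform_2_length = len(remove_region("A" * 2109, 1933, 1957))
```

Applied to the full canonical sequence, the same call with `start=1933` and `end=1957` would yield the 2,084-residue isoform 2.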

Sequence caution

The sequence AAA17985 differs from that shown. Reason: Frameshift at positions 152 and 189. (Curated)
The sequence AAA85638 differs from that shown. Contaminating sequence. Potential vector sequence at the N-terminus. (Curated)

Experimental Info

Feature key         Position(s)   Description                                   Evidence   Length
Sequence conflict   141           A → P in AAA17985 (PubMed:8127861).           Curated    1
Sequence conflict   153           A → P in AAA17985 (PubMed:8127861).           Curated    1
Sequence conflict   156           A → P in AAA17985 (PubMed:8127861).           Curated    1
Sequence conflict   161           A → P in AAA17985 (PubMed:8127861).           Curated    1
Sequence conflict   164           G → A in AAA17985 (PubMed:8127861).           Curated    1
Sequence conflict   977           L → V in AAA17985 (PubMed:8127861).           Curated    1
Sequence conflict   1015 – 1018   ARSS → GRRR in AAA17985 (PubMed:8127861).     Curated    4
Sequence conflict   1256          Q → H in AAA17985 (PubMed:8127861).           Curated    1
Sequence conflict   1316          V → L in AAA17985 (PubMed:8127861).           Curated    1

Natural variant

Feature key       Identifier   Position   Description                                                     Length
Natural variant   VAR_047534   1889       Q → E. Corresponds to variant rs35233306 (dbSNP, Ensembl).      1
Natural variant   VAR_047535   1959       F → S. Corresponds to variant rs12919017 (dbSNP, Ensembl).      1
Natural variant   VAR_047536   2077       E → K. Corresponds to variant rs2228248 (dbSNP, Ensembl).       1
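
Each natural variant is a single-residue substitution at a 1-based position in the canonical sequence. As a rough sketch of how such a variant could be applied and checked, the hypothetical helper `apply_substitution` below verifies the reference residue before replacing it. The `offset` parameter is an assumption of this example that lets a fragment stand in for the full sequence; the fragment shown is residues 1881 – 1890 of the canonical sequence.

```python
def apply_substitution(sequence: str, position: int, ref: str, alt: str,
                       offset: int = 0) -> str:
    """Replace the residue at 1-based `position` with `alt`.

    `offset` is the number of residues preceding `sequence` in the full
    protein (0 when `sequence` is the complete canonical sequence).
    """
    index = position - offset - 1
    if not 0 <= index < len(sequence):
        raise ValueError("position lies outside the supplied sequence")
    if sequence[index] != ref:
        raise ValueError(
            f"expected {ref} at position {position}, found {sequence[index]}")
    return sequence[:index] + alt + sequence[index + 1:]


# Residues 1881-1890 of the canonical sequence; VAR_047534 is Q -> E at 1889.
fragment = "TPAKRPALQD"
variant_fragment = apply_substitution(fragment, 1889, "Q", "E", offset=1880)
```

Checking the reference residue first guards against off-by-one errors when positions are taken from a table like the one above.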

Alternative sequence

Feature key            Identifier   Position(s)   Description                              Length
Alternative sequence   VSP_021819   1933 – 1957   Missing in isoform 2. (1 Publication)    25

Sequence databases

U02619 mRNA. Translation: AAA17985.1. Frameshift.
AC002303 Genomic DNA. Translation: AAB67637.1.
AC025275 Genomic DNA. No translation available.
AC002551 Genomic DNA. Translation: AAC05811.1.
BC044857 mRNA. Translation: AAH44857.1.
BC137229 mRNA. Translation: AAI37230.1.
U06485 mRNA. Translation: AAA85638.1. Sequence problems.
CCDS: CCDS32414.1. [Q12789-2]
  CCDS66988.1. [Q12789-3]
PIR: B56011.
  I38414.
RefSeq: NP_001273171.1. NM_001286242.1. [Q12789-3]
  NP_001511.2. NM_001520.3. [Q12789-2]
UniGene: Hs.371718.

Genome annotation databases

Ensembl: ENST00000356183; ENSP00000348510; ENSG00000077235. [Q12789-2]
  ENST00000561623; ENSP00000455417; ENSG00000077235. [Q12789-3]
GeneID: 2975.
KEGG: hsa:2975.
UCSC: uc002dou.4. human. [Q12789-2]

Keywords - Coding sequence diversity

Alternative splicing, Polymorphism

Cross-references

Organism-specific databases

CTD: 2975.
DisGeNET: 2975.
GeneCards: GTF3C1.
H-InvDB: HIX0012915.
HGNC: HGNC:4664. GTF3C1.
HPA: HPA051617.
MIM: 603246. gene.
neXtProt: NX_Q12789.
OpenTargets: ENSG00000077235.
PharmGKB: PA29052.

Miscellaneous databases

ChiTaRS: GTF3C1. human.
GeneWiki: GTF3C1.
GenomeRNAi: 2975.
PRO: Q12789.

Entry information

Entry name: TF3C1_HUMAN
Accession: Primary (citable) accession number: Q12789
Secondary accession number(s): B2RP21, Q12838, Q6DKN9, Q9Y4W9
Entry history
Integrated into UniProtKB/Swiss-Prot: June 7, 2004
Last sequence update: November 25, 2008
Last modified: November 2, 2016
This is version 143 of the entry and version 4 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Chordata Protein Annotation Program
Disclaimer: Any medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care.
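
The primary and secondary accession numbers listed under Entry information, and the isoform identifiers used elsewhere in this entry (Q12789-2, Q12789-3), follow a fixed format. The sketch below checks that format with the accession pattern given in the UniProt help pages; the optional `-N` isoform suffix and the name `is_accession` are assumptions added for this illustration.

```python
import re

# UniProtKB accession pattern as documented in the UniProt help pages,
# extended here (as an assumption) with an optional "-N" isoform suffix.
ACCESSION_RE = re.compile(
    r"^(?:[OPQ][0-9][A-Z0-9]{3}[0-9]"
    r"|[A-NR-Z][0-9](?:[A-Z][A-Z0-9]{2}[0-9]){1,2})"
    r"(?:-[0-9]+)?$"
)


def is_accession(text: str) -> bool:
    """Return True if `text` looks like a UniProtKB accession or isoform ID."""
    return ACCESSION_RE.match(text) is not None


accessions = ["Q12789", "B2RP21", "Q12838", "Q6DKN9", "Q9Y4W9"]
all_valid = all(is_accession(acc) for acc in accessions)
```

Note that the entry name `TF3C1_HUMAN` is a different kind of identifier and does not match this pattern.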

Miscellaneous

Keywords - Technical term

Complete proteome, Direct protein sequencing, Reference proteome

Documents

  1. Human chromosome 16
    Human chromosome 16: entries, gene names and cross-references to MIM
  2. Human entries with polymorphisms or disease mutations
    List of human entries with polymorphisms or disease mutations
  3. Human polymorphisms and disease mutations
    Index of human polymorphisms and disease mutations
  4. MIM cross-references
    Online Mendelian Inheritance in Man (MIM) cross-references in UniProtKB/Swiss-Prot
  5. SIMILARITY comments
    Index of protein domains and families

Similar proteins

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%: UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%: UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a. seed sequence).
50%: UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.