Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein

Cleavage stimulation factor subunit 3

Gene

CSTF3

Organism
Homo sapiens (Human)
Status
Reviewed-Annotation score: Annotation score: 5 out of 5-Experimental evidence at protein leveli

Functioni

One of the multiple factors required for polyadenylation and 3'-end cleavage of mammalian pre-mRNAs.

GO - Molecular functioni

  • mRNA binding Source: GO_Central
  • poly(A) RNA binding Source: UniProtKB
  • RNA binding Source: ProtInc

GO - Biological processi

  • mRNA 3'-end processing Source: Reactome
  • mRNA cleavage Source: ProtInc
  • mRNA polyadenylation Source: ProtInc
  • mRNA splicing, via spliceosome Source: Reactome
  • termination of RNA polymerase II transcription Source: Reactome
Complete GO annotation...

Keywords - Biological processi

mRNA processing

Enzyme and pathway databases

BioCyciZFISH:ENSG00000176102-MONOMER.
ReactomeiR-HSA-109688. Cleavage of Growing Transcript in the Termination Region.
R-HSA-72163. mRNA Splicing - Major Pathway.
R-HSA-72187. mRNA 3'-end processing.
R-HSA-77595. Processing of Intronless Pre-mRNAs.

Names & Taxonomyi

Protein namesi
Recommended name:
Cleavage stimulation factor subunit 3
Alternative name(s):
CF-1 77 kDa subunit
Cleavage stimulation factor 77 kDa subunit
Short name:
CSTF 77 kDa subunit
Short name:
CstF-77
Gene namesi
Name:CSTF3
OrganismiHomo sapiens (Human)
Taxonomic identifieri9606 [NCBI]
Taxonomic lineageiEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresPrimatesHaplorrhiniCatarrhiniHominidaeHomo
Proteomesi
  • UP000005640 Componenti: Chromosome 11

Organism-specific databases

HGNCiHGNC:2485. CSTF3.

Subcellular locationi

GO - Cellular componenti

  • nucleoplasm Source: HPA
  • nucleus Source: ProtInc
Complete GO annotation...

Keywords - Cellular componenti

Nucleus

Pathology & Biotechi

Organism-specific databases

OpenTargetsiENSG00000176102.
PharmGKBiPA26987.

Polymorphism and mutation databases

BioMutaiCSTF3.
DMDMi71153231.

PTM / Processingi

Molecule processing

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Initiator methionineiRemovedCombined sources
ChainiPRO_00002057222 – 717Cleavage stimulation factor subunit 3Add BLAST716

Amino acid modifications

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Modified residuei2N-acetylserineCombined sources1
Modified residuei691PhosphoserineCombined sources1

Keywords - PTMi

Acetylation, Phosphoprotein

Proteomic databases

EPDiQ12996.
MaxQBiQ12996.
PaxDbiQ12996.
PeptideAtlasiQ12996.
PRIDEiQ12996.

PTM databases

iPTMnetiQ12996.
PhosphoSitePlusiQ12996.
SwissPalmiQ12996.

Miscellaneous databases

PMAP-CutDBQ12996.

Expressioni

Gene expression databases

BgeeiENSG00000176102.
CleanExiHS_CSTF3.
ExpressionAtlasiQ12996. baseline and differential.
GenevisibleiQ12996. HS.

Organism-specific databases

HPAiHPA039743.
HPA040168.

Interactioni

Subunit structurei

Homodimer. The CSTF complex is composed of CSTF1 (50 kDa subunit), CSTF2 (64 kDa subunit) and CSTF3 (77 kDa subunit). CSTF3 directly interacts with CSTF1 and CSTF2. Interacts with FIP1L1.2 Publications

Protein-protein interaction databases

BioGridi107861. 38 interactors.
DIPiDIP-48674N.
IntActiQ12996. 10 interactors.
MINTiMINT-4654440.
STRINGi9606.ENSP00000315791.

Structurei

3D structure databases

ProteinModelPortaliQ12996.
SMRiQ12996.
ModBaseiSearch...
MobiDBiSearch...

Family & Domainsi

Domains and Repeats

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Repeati45 – 77HAT 1Add BLAST33
Repeati79 – 110HAT 2Add BLAST32
Repeati117 – 152HAT 3Add BLAST36
Repeati163 – 196HAT 4Add BLAST34
Repeati221 – 261HAT 5Add BLAST41
Repeati271 – 303HAT 6Add BLAST33
Repeati319 – 352HAT 7Add BLAST34
Repeati354 – 387HAT 8Add BLAST34
Repeati458 – 494HAT 9Add BLAST37

Compositional bias

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Compositional biasi559 – 626Pro-richAdd BLAST68

Sequence similaritiesi

Contains 9 HAT repeats.Curated

Keywords - Domaini

Repeat

Phylogenomic databases

eggNOGiKOG1914. Eukaryota.
COG5107. LUCA.
GeneTreeiENSGT00390000006758.
HOGENOMiHOG000231786.
HOVERGENiHBG053813.
InParanoidiQ12996.
KOiK14408.
OMAiDTNQMIP.
OrthoDBiEOG091G02LY.
PhylomeDBiQ12996.
TreeFamiTF105867.

Family and domain databases

Gene3Di1.25.40.10. 1 hit.
InterProiIPR003107. HAT.
IPR008847. Suf.
IPR011990. TPR-like_helical_dom.
[Graphical view]
PfamiPF05843. Suf. 1 hit.
[Graphical view]
SMARTiSM00386. HAT. 10 hits.
[Graphical view]
SUPFAMiSSF48452. SSF48452. 1 hit.

Sequences (3)i

Sequence statusi: Complete.

Sequence processingi: The displayed sequence is further processed into a mature form.

This entry describes 3 isoformsi produced by alternative splicing. AlignAdd to basket

Isoform 1 (identifier: Q12996-1) [UniParc]FASTAAdd to basket

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.

« Hide

        10         20         30         40         50
MSGDGATEQA AEYVPEKVKK AEKKLEENPY DLDAWSILIR EAQNQPIDKA
60 70 80 90 100
RKTYERLVAQ FPSSGRFWKL YIEAEIKAKN YDKVEKLFQR CLMKVLHIDL
110 120 130 140 150
WKCYLSYVRE TKGKLPSYKE KMAQAYDFAL DKIGMEIMSY QIWVDYINFL
160 170 180 190 200
KGVEAVGSYA ENQRITAVRR VYQRGCVNPM INIEQLWRDY NKYEEGINIH
210 220 230 240 250
LAKKMIEDRS RDYMNARRVA KEYETVMKGL DRNAPSVPPQ NTPQEAQQVD
260 270 280 290 300
MWKKYIQWEK SNPLRTEDQT LITKRVMFAY EQCLLVLGHH PDIWYEAAQY
310 320 330 340 350
LEQSSKLLAE KGDMNNAKLF SDEAANIYER AISTLLKKNM LLYFAYADYE
360 370 380 390 400
ESRMKYEKVH SIYNRLLAIE DIDPTLVYIQ YMKFARRAEG IKSGRMIFKK
410 420 430 440 450
AREDTRTRHH VYVTAALMEY YCSKDKSVAF KIFELGLKKY GDIPEYVLAY
460 470 480 490 500
IDYLSHLNED NNTRVLFERV LTSGSLPPEK SGEIWARFLA FESNIGDLAS
510 520 530 540 550
ILKVEKRRFT AFKEEYEGKE TALLVDRYKF MDLYPCSASE LKALGYKDVS
560 570 580 590 600
RAKLAAIIPD PVVAPSIVPV LKDEVDRKPE YPKPDTQQMI PFQPRHLAPP
610 620 630 640 650
GLHPVPGGVF PVPPAAVVLM KLLPPPICFQ GPFVQVDELM EIFRRCKIPN
660 670 680 690 700
TVEEAVRIIT GGAPELAVEG NGPVESNAVL TKAVKRPNED SDEDEEKGAV
710
VPPVHDIYRA RQQKRIR
Length:717
Mass (Da):82,922
Last modified:November 1, 1996 - v1
Checksum:i9ADE7AC86FBB3E93
GO
Isoform 2 (identifier: Q12996-2) [UniParc]FASTAAdd to basket

The sequence of this isoform differs from the canonical sequence as follows:
     76-103: IKAKNYDKVEKLFQRCLMKVLHIDLWKC → VTILFYFFLYQYCSIHCSDRKQVRNIAN
     104-717: Missing.

Note: No experimental confirmation available.
Show »
Length:103
Mass (Da):12,102
Checksum:i990E37CFFE2B2732
GO
Isoform 3 (identifier: Q12996-3) [UniParc]FASTAAdd to basket

The sequence of this isoform differs from the canonical sequence as follows:
     44-44: N → V
     45-717: Missing.

Note: No experimental confirmation available.
Show »
Length:44
Mass (Da):4,952
Checksum:i9303D542E58B7162
GO

Alternative sequence

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Alternative sequenceiVSP_04567544N → V in isoform 3. 1 Publication1
Alternative sequenceiVSP_04567645 – 717Missing in isoform 3. 1 PublicationAdd BLAST673
Alternative sequenceiVSP_04273176 – 103IKAKN…DLWKC → VTILFYFFLYQYCSIHCSDR KQVRNIAN in isoform 2. 1 PublicationAdd BLAST28
Alternative sequenceiVSP_042732104 – 717Missing in isoform 2. 1 PublicationAdd BLAST614

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
U15782 mRNA. Translation: AAA61417.1.
AK290836 mRNA. Translation: BAF83525.1.
AC131263 Genomic DNA. No translation available.
AL121926 Genomic DNA. Translation: CAC48252.1.
CH471064 Genomic DNA. Translation: EAW68200.1.
CH471064 Genomic DNA. Translation: EAW68201.1.
CH471064 Genomic DNA. Translation: EAW68202.1.
BC009792 mRNA. Translation: AAH09792.1.
BC010533 mRNA. Translation: AAH10533.1.
BC059948 mRNA. Translation: AAH59948.1.
BC108319 mRNA. Translation: AAI08320.1.
BM014288 mRNA. No translation available.
CCDSiCCDS44563.1. [Q12996-2]
CCDS44564.1. [Q12996-3]
CCDS7883.1. [Q12996-1]
PIRiS50852.
RefSeqiNP_001028677.1. NM_001033505.1. [Q12996-2]
NP_001028678.1. NM_001033506.1. [Q12996-3]
NP_001317.1. NM_001326.2. [Q12996-1]
UniGeneiHs.44402.

Genome annotation databases

EnsembliENST00000323959; ENSP00000315791; ENSG00000176102. [Q12996-1]
ENST00000431742; ENSP00000393064; ENSG00000176102. [Q12996-3]
ENST00000438862; ENSP00000388711; ENSG00000176102. [Q12996-2]
GeneIDi1479.
KEGGihsa:1479.
UCSCiuc001muh.4. human. [Q12996-1]

Keywords - Coding sequence diversityi

Alternative splicing

Cross-referencesi

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
U15782 mRNA. Translation: AAA61417.1.
AK290836 mRNA. Translation: BAF83525.1.
AC131263 Genomic DNA. No translation available.
AL121926 Genomic DNA. Translation: CAC48252.1.
CH471064 Genomic DNA. Translation: EAW68200.1.
CH471064 Genomic DNA. Translation: EAW68201.1.
CH471064 Genomic DNA. Translation: EAW68202.1.
BC009792 mRNA. Translation: AAH09792.1.
BC010533 mRNA. Translation: AAH10533.1.
BC059948 mRNA. Translation: AAH59948.1.
BC108319 mRNA. Translation: AAI08320.1.
BM014288 mRNA. No translation available.
CCDSiCCDS44563.1. [Q12996-2]
CCDS44564.1. [Q12996-3]
CCDS7883.1. [Q12996-1]
PIRiS50852.
RefSeqiNP_001028677.1. NM_001033505.1. [Q12996-2]
NP_001028678.1. NM_001033506.1. [Q12996-3]
NP_001317.1. NM_001326.2. [Q12996-1]
UniGeneiHs.44402.

3D structure databases

ProteinModelPortaliQ12996.
SMRiQ12996.
ModBaseiSearch...
MobiDBiSearch...

Protein-protein interaction databases

BioGridi107861. 38 interactors.
DIPiDIP-48674N.
IntActiQ12996. 10 interactors.
MINTiMINT-4654440.
STRINGi9606.ENSP00000315791.

PTM databases

iPTMnetiQ12996.
PhosphoSitePlusiQ12996.
SwissPalmiQ12996.

Polymorphism and mutation databases

BioMutaiCSTF3.
DMDMi71153231.

Proteomic databases

EPDiQ12996.
MaxQBiQ12996.
PaxDbiQ12996.
PeptideAtlasiQ12996.
PRIDEiQ12996.

Protocols and materials databases

DNASUi1479.
Structural Biology KnowledgebaseSearch...

Genome annotation databases

EnsembliENST00000323959; ENSP00000315791; ENSG00000176102. [Q12996-1]
ENST00000431742; ENSP00000393064; ENSG00000176102. [Q12996-3]
ENST00000438862; ENSP00000388711; ENSG00000176102. [Q12996-2]
GeneIDi1479.
KEGGihsa:1479.
UCSCiuc001muh.4. human. [Q12996-1]

Organism-specific databases

CTDi1479.
GeneCardsiCSTF3.
H-InvDBHIX0021822.
HGNCiHGNC:2485. CSTF3.
HPAiHPA039743.
HPA040168.
MIMi600367. gene.
neXtProtiNX_Q12996.
OpenTargetsiENSG00000176102.
PharmGKBiPA26987.
GenAtlasiSearch...

Phylogenomic databases

eggNOGiKOG1914. Eukaryota.
COG5107. LUCA.
GeneTreeiENSGT00390000006758.
HOGENOMiHOG000231786.
HOVERGENiHBG053813.
InParanoidiQ12996.
KOiK14408.
OMAiDTNQMIP.
OrthoDBiEOG091G02LY.
PhylomeDBiQ12996.
TreeFamiTF105867.

Enzyme and pathway databases

BioCyciZFISH:ENSG00000176102-MONOMER.
ReactomeiR-HSA-109688. Cleavage of Growing Transcript in the Termination Region.
R-HSA-72163. mRNA Splicing - Major Pathway.
R-HSA-72187. mRNA 3'-end processing.
R-HSA-77595. Processing of Intronless Pre-mRNAs.

Miscellaneous databases

GeneWikiiCSTF3.
GenomeRNAii1479.
PMAP-CutDBQ12996.
PROiQ12996.
SOURCEiSearch...

Gene expression databases

BgeeiENSG00000176102.
CleanExiHS_CSTF3.
ExpressionAtlasiQ12996. baseline and differential.
GenevisibleiQ12996. HS.

Family and domain databases

Gene3Di1.25.40.10. 1 hit.
InterProiIPR003107. HAT.
IPR008847. Suf.
IPR011990. TPR-like_helical_dom.
[Graphical view]
PfamiPF05843. Suf. 1 hit.
[Graphical view]
SMARTiSM00386. HAT. 10 hits.
[Graphical view]
SUPFAMiSSF48452. SSF48452. 1 hit.
ProtoNetiSearch...

Entry informationi

Entry nameiCSTF3_HUMAN
AccessioniPrimary (citable) accession number: Q12996
Secondary accession number(s): A8K471
, D3DR04, E9PB40, Q32P22, Q96FQ8, Q96QD6, Q96QK4
Entry historyi
Integrated into UniProtKB/Swiss-Prot: July 19, 2005
Last sequence update: November 1, 1996
Last modified: November 30, 2016
This is version 145 of the entry and version 1 of the sequence. [Complete history]
Entry statusiReviewed (UniProtKB/Swiss-Prot)
Annotation programChordata Protein Annotation Program
DisclaimerAny medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care.

Miscellaneousi

Keywords - Technical termi

Complete proteome, Reference proteome

Documents

  1. Human chromosome 11
    Human chromosome 11: entries, gene names and cross-references to MIM
  2. MIM cross-references
    Online Mendelian Inheritance in Man (MIM) cross-references in UniProtKB/Swiss-Prot
  3. SIMILARITY comments
    Index of protein domains and families

Similar proteinsi

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a seed sequence).
50%UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.