Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein

Cleavage stimulation factor subunit 1

Gene

Cstf1

Organism
Mus musculus (Mouse)
Status
Reviewed-Annotation score: Annotation score: 2 out of 5-Experimental evidence at protein leveli

Functioni

One of the multiple factors required for polyadenylation and 3'-end cleavage of mammalian pre-mRNAs. May be responsible for the interaction of CSTF with other factors to form a stable complex on the pre-mRNA (By similarity).By similarity

GO - Molecular functioni

GO - Biological processi

Complete GO annotation...

Keywords - Biological processi

mRNA processing

Enzyme and pathway databases

ReactomeiR-MMU-109688. Cleavage of Growing Transcript in the Termination Region.
R-MMU-72163. mRNA Splicing - Major Pathway.
R-MMU-72187. mRNA 3'-end processing.
R-MMU-77595. Processing of Intronless Pre-mRNAs.

Names & Taxonomyi

Protein namesi
Recommended name:
Cleavage stimulation factor subunit 1
Alternative name(s):
CF-1 50 kDa subunit
Cleavage stimulation factor 50 kDa subunit
Short name:
CSTF 50 kDa subunit
Short name:
CstF-50
Gene namesi
Name:Cstf1
OrganismiMus musculus (Mouse)
Taxonomic identifieri10090 [NCBI]
Taxonomic lineageiEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresGliresRodentiaSciurognathiMuroideaMuridaeMurinaeMusMus
Proteomesi
  • UP000000589 Componenti: Chromosome 2

Organism-specific databases

MGIiMGI:1914587. Cstf1.

Subcellular locationi

GO - Cellular componenti

Complete GO annotation...

Keywords - Cellular componenti

Nucleus

PTM / Processingi

Molecule processing

Feature keyPosition(s)DescriptionActionsGraphical viewLength
ChainiPRO_00000509451 – 431Cleavage stimulation factor subunit 1Add BLAST431

Proteomic databases

EPDiQ99LC2.
MaxQBiQ99LC2.
PaxDbiQ99LC2.
PeptideAtlasiQ99LC2.
PRIDEiQ99LC2.

PTM databases

iPTMnetiQ99LC2.
PhosphoSitePlusiQ99LC2.

Expressioni

Gene expression databases

BgeeiENSMUSG00000027498.
ExpressionAtlasiQ99LC2. baseline and differential.
GenevisibleiQ99LC2. MM.

Interactioni

Subunit structurei

Homodimer. The CSTF complex is composed of CSTF1 (50 kDa subunit), CSTF2 (64 kDa subunit) and CSTF3 (77 kDa subunit). Interacts directly with CSTF3. Interacts with BARD1 (By similarity).By similarity

Protein-protein interaction databases

BioGridi212117. 1 interactor.
IntActiQ99LC2. 1 interactor.
MINTiMINT-4124130.
STRINGi10090.ENSMUSP00000112076.

Structurei

3D structure databases

ProteinModelPortaliQ99LC2.
ModBaseiSearch...
MobiDBiSearch...

Family & Domainsi

Domains and Repeats

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Repeati106 – 145WD 1Add BLAST40
Repeati171 – 210WD 2Add BLAST40
Repeati215 – 254WD 3Add BLAST40
Repeati260 – 301WD 4Add BLAST42
Repeati303 – 343WD 5Add BLAST41
Repeati395 – 431WD 6Add BLAST37

Domaini

The WD6 domain is required for interaction with BARD1. WD domains are responsible for interaction with CSTF3 (By similarity).By similarity
N-terminus mediates homodimerization.By similarity

Sequence similaritiesi

Contains 6 WD repeats.PROSITE-ProRule annotation

Keywords - Domaini

Repeat, WD repeat

Phylogenomic databases

eggNOGiKOG0640. Eukaryota.
ENOG410XPWJ. LUCA.
GeneTreeiENSGT00860000133707.
HOGENOMiHOG000234077.
HOVERGENiHBG051144.
InParanoidiQ99LC2.
KOiK14406.
OMAiWELSTNR.
OrthoDBiEOG091G07AW.
PhylomeDBiQ99LC2.
TreeFamiTF314234.

Family and domain databases

Gene3Di2.130.10.10. 1 hit.
InterProiIPR032028. CSTF1_dimer.
IPR015943. WD40/YVTN_repeat-like_dom.
IPR001680. WD40_repeat.
IPR019775. WD40_repeat_CS.
IPR017986. WD40_repeat_dom.
[Graphical view]
PfamiPF16699. CSTF1_dimer. 1 hit.
PF00400. WD40. 5 hits.
[Graphical view]
SMARTiSM00320. WD40. 6 hits.
[Graphical view]
SUPFAMiSSF50978. SSF50978. 1 hit.
PROSITEiPS00678. WD_REPEATS_1. 1 hit.
PS50082. WD_REPEATS_2. 4 hits.
PS50294. WD_REPEATS_REGION. 1 hit.
[Graphical view]

Sequencei

Sequence statusi: Complete.

Q99LC2-1 [UniParc]FASTAAdd to basket

« Hide

        10         20         30         40         50
MYRTKVGLKD RQQLYKLIIS QLLYDGYISI ANGLINEIKP QSVCAPSEQL
60 70 80 90 100
LHLIKLGMEN DDTAVQYAIG RSDTVAPGTG IDLEFDADVQ TMSPEASEYE
110 120 130 140 150
TCYVTSHKGP CRVATYSRDG QLIATGSADA SIKILDTERM LAKSAMPIEV
160 170 180 190 200
MMNETAQQNM ENHPVIRTLY DHVDEVTCLA FHPTEQILAS GSRDYTLKLF
210 220 230 240 250
DYSKPSAKRA FKYIQEAEML RSISFHPSGD FILVGTQHPT LRLYDINTFQ
260 270 280 290 300
CFVSCNPQDQ HTDAICSVNY NPSANMYVTG SKDGCIKLWD GVSNRCITTF
310 320 330 340 350
EKAHDGAEVC SAIFSKNSKY ILSSGKDSVA KLWEISTGRT LVRYTGAGLS
360 370 380 390 400
GRQVHRTQAV FNHTEDYILL PDERTISLCC WDSRTAERRN LLSLGHNNIV
410 420 430
RCIVHSPTNP GFMTCSDDFR ARFWYRRSTT D
Length:431
Mass (Da):48,382
Last modified:June 1, 2001 - v1
Checksum:i8BD7BEFE422824D2
GO

Sequence cautioni

The sequence BAB27929 differs from that shown. Reason: Frameshift at position 314.Curated

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
AK011943 mRNA. Translation: BAB27929.1. Frameshift.
AK050686 mRNA. Translation: BAC34379.1.
AK078667 mRNA. Translation: BAC37353.1.
AK145001 mRNA. Translation: BAE26180.1.
BC003440 mRNA. Translation: AAH03440.1.
CCDSiCCDS17130.1.
RefSeqiNP_077161.1. NM_024199.2.
XP_006500135.2. XM_006500072.3.
UniGeneiMm.26944.

Genome annotation databases

EnsembliENSMUST00000116375; ENSMUSP00000112076; ENSMUSG00000027498.
GeneIDi67337.
KEGGimmu:67337.
UCSCiuc008ocp.1. mouse.

Cross-referencesi

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
AK011943 mRNA. Translation: BAB27929.1. Frameshift.
AK050686 mRNA. Translation: BAC34379.1.
AK078667 mRNA. Translation: BAC37353.1.
AK145001 mRNA. Translation: BAE26180.1.
BC003440 mRNA. Translation: AAH03440.1.
CCDSiCCDS17130.1.
RefSeqiNP_077161.1. NM_024199.2.
XP_006500135.2. XM_006500072.3.
UniGeneiMm.26944.

3D structure databases

ProteinModelPortaliQ99LC2.
ModBaseiSearch...
MobiDBiSearch...

Protein-protein interaction databases

BioGridi212117. 1 interactor.
IntActiQ99LC2. 1 interactor.
MINTiMINT-4124130.
STRINGi10090.ENSMUSP00000112076.

PTM databases

iPTMnetiQ99LC2.
PhosphoSitePlusiQ99LC2.

Proteomic databases

EPDiQ99LC2.
MaxQBiQ99LC2.
PaxDbiQ99LC2.
PeptideAtlasiQ99LC2.
PRIDEiQ99LC2.

Protocols and materials databases

DNASUi67337.
Structural Biology KnowledgebaseSearch...

Genome annotation databases

EnsembliENSMUST00000116375; ENSMUSP00000112076; ENSMUSG00000027498.
GeneIDi67337.
KEGGimmu:67337.
UCSCiuc008ocp.1. mouse.

Organism-specific databases

CTDi1477.
MGIiMGI:1914587. Cstf1.

Phylogenomic databases

eggNOGiKOG0640. Eukaryota.
ENOG410XPWJ. LUCA.
GeneTreeiENSGT00860000133707.
HOGENOMiHOG000234077.
HOVERGENiHBG051144.
InParanoidiQ99LC2.
KOiK14406.
OMAiWELSTNR.
OrthoDBiEOG091G07AW.
PhylomeDBiQ99LC2.
TreeFamiTF314234.

Enzyme and pathway databases

ReactomeiR-MMU-109688. Cleavage of Growing Transcript in the Termination Region.
R-MMU-72163. mRNA Splicing - Major Pathway.
R-MMU-72187. mRNA 3'-end processing.
R-MMU-77595. Processing of Intronless Pre-mRNAs.

Miscellaneous databases

PROiQ99LC2.
SOURCEiSearch...

Gene expression databases

BgeeiENSMUSG00000027498.
ExpressionAtlasiQ99LC2. baseline and differential.
GenevisibleiQ99LC2. MM.

Family and domain databases

Gene3Di2.130.10.10. 1 hit.
InterProiIPR032028. CSTF1_dimer.
IPR015943. WD40/YVTN_repeat-like_dom.
IPR001680. WD40_repeat.
IPR019775. WD40_repeat_CS.
IPR017986. WD40_repeat_dom.
[Graphical view]
PfamiPF16699. CSTF1_dimer. 1 hit.
PF00400. WD40. 5 hits.
[Graphical view]
SMARTiSM00320. WD40. 6 hits.
[Graphical view]
SUPFAMiSSF50978. SSF50978. 1 hit.
PROSITEiPS00678. WD_REPEATS_1. 1 hit.
PS50082. WD_REPEATS_2. 4 hits.
PS50294. WD_REPEATS_REGION. 1 hit.
[Graphical view]
ProtoNetiSearch...

Entry informationi

Entry nameiCSTF1_MOUSE
AccessioniPrimary (citable) accession number: Q99LC2
Secondary accession number(s): Q3UMB8, Q8BJW1, Q9CSU8
Entry historyi
Integrated into UniProtKB/Swiss-Prot: June 7, 2005
Last sequence update: June 1, 2001
Last modified: November 30, 2016
This is version 130 of the entry and version 1 of the sequence. [Complete history]
Entry statusiReviewed (UniProtKB/Swiss-Prot)
Annotation programChordata Protein Annotation Program

Miscellaneousi

Keywords - Technical termi

Complete proteome, Reference proteome

Documents

  1. MGD cross-references
    Mouse Genome Database (MGD) cross-references in UniProtKB/Swiss-Prot
  2. SIMILARITY comments
    Index of protein domains and families

Similar proteinsi

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a seed sequence).
50%UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.