Protein

Cleavage and polyadenylation specificity factor subunit 4

Gene

Cpsf4

Organism
Mus musculus (Mouse)
Status
Reviewed - Experimental evidence at transcript level

Function

Component of the cleavage and polyadenylation specificity factor (CPSF) complex that plays a key role in pre-mRNA 3'-end formation, recognizing the AAUAAA signal sequence and interacting with poly(A) polymerase and other factors to bring about cleavage and poly(A) addition. CPSF4 binds RNA polymers with a preference for poly(U) (By similarity). 1 Publication

Regions

Feature key   Position(s)   Description                               Length
Zinc finger   35 – 61       C3H1-type 1; PROSITE-ProRule annotation   27
Zinc finger   62 – 89       C3H1-type 2; PROSITE-ProRule annotation   28
Zinc finger   111 – 137     C3H1-type 3; PROSITE-ProRule annotation   27
Zinc finger   185 – 202     CCHC-type; PROSITE-ProRule annotation     18
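
The positions in the table above are 1-based and inclusive, and they refer to the canonical isoform 1 sequence given later in this entry. As a minimal sketch (the names CANONICAL, ZINC_FINGERS and region are illustrative and not part of UniProt), the annotated regions can be sliced out of the sequence and their lengths checked against the Length column:

```python
# Canonical isoform 1 sequence, copied from this entry (211 residues).
CANONICAL = (
    "MQEIIASVDHIKFDLEIAVEQQLGAQPLPFPGMDKSGAAVCEFFLKAACG"
    "KGGMCPFRHISGEKTVVCKHWLRGLCKKGDQCEFLHEYDMTKMPECYFYS"
    "KFGPLCRHRHTRRVICVNYLVGFCPEGPSCKFMHPRFELPMGTTEQPPLP"
    "QQTQPPTKRAPQVIGVMQSQNSSAGNRGPRPLEQVTCYKCGEKGHYANRC"
    "TKGHLAFLSGQ"
)

# (start, end, description, expected length), taken from the feature table.
ZINC_FINGERS = [
    (35, 61, "C3H1-type 1", 27),
    (62, 89, "C3H1-type 2", 28),
    (111, 137, "C3H1-type 3", 27),
    (185, 202, "CCHC-type", 18),
]


def region(seq, start, end):
    """Return residues start..end using 1-based inclusive coordinates."""
    return seq[start - 1:end]


for start, end, name, expected in ZINC_FINGERS:
    sub = region(CANONICAL, start, end)
    # The table length is simply end - start + 1.
    assert len(sub) == expected
    print(f"{name}: {start}-{end} ({len(sub)} aa) {sub}")
```

The only conversion needed is from the 1-based inclusive coordinates of the entry to Python's 0-based half-open slices, which is why the helper subtracts one from the start only.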

GO - Molecular function

GO - Biological process

  • pre-mRNA cleavage required for polyadenylation Source: GO_Central

Keywords

Molecular function: RNA-binding
Biological process: mRNA processing
Ligand: Metal-binding, Zinc

Enzyme and pathway databases

Reactome: R-MMU-109688 Cleavage of Growing Transcript in the Termination Region
          R-MMU-72163 mRNA Splicing - Major Pathway
          R-MMU-72187 mRNA 3'-end processing
          R-MMU-77595 Processing of Intronless Pre-mRNAs

Names & Taxonomy

Protein names
Recommended name:
Cleavage and polyadenylation specificity factor subunit 4
Alternative name(s):
Cleavage and polyadenylation specificity factor 30 kDa subunit
Short name:
CPSF 30 kDa subunit
Clipper homolog
Clipper/CPSF 30K
Gene names
Name: Cpsf4
Synonyms: Cpsf30
Organism: Mus musculus (Mouse)
Taxonomic identifier: 10090 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Glires > Rodentia > Myomorpha > Muroidea > Muridae > Murinae > Mus > Mus
Proteomes
  • UP000000589 Component: Chromosome 5

Organism-specific databases

MGI: MGI:1861602 Cpsf4

Subcellular location

[Figure: subcellular location diagram. Graphics by Christian Stolte; Source: COMPARTMENTS]

Keywords - Cellular component

Nucleus

PTM / Processing

Molecule processing

Feature key              Position(s)   Description                                                 Length
Chain (PRO_0000074403)   1 – 211       Cleavage and polyadenylation specificity factor subunit 4   211

Amino acid modifications

Feature key        Position(s)   Description                     Length
Modified residue   209           Phosphoserine (By similarity)   1

Keywords - PTM

Phosphoprotein

Proteomic databases

EPD: Q8BQZ5
PaxDb: Q8BQZ5
PeptideAtlas: Q8BQZ5
PRIDE: Q8BQZ5

PTM databases

iPTMnet: Q8BQZ5
PhosphoSitePlus: Q8BQZ5

Expression

Gene expression databases

Bgee: ENSMUSG00000029625 Expressed in 291 organ(s), highest expression level in floor plate of midbrain
CleanEx: MM_CPSF4
ExpressionAtlas: Q8BQZ5 baseline and differential
Genevisible: Q8BQZ5 MM

Interaction

Subunit structure

Component of the cleavage and polyadenylation specificity factor (CPSF) complex, composed of CPSF1, CPSF2, CPSF3, CPSF4 and FIP1L1. Interacts with FIP1L1 (By similarity).

Protein-protein interaction databases

BioGrid: 207591, 1 interactor
STRING: 10090.ENSMUSP00000069243

Structure

3D structure databases

ProteinModelPortal: Q8BQZ5
SMR: Q8BQZ5
ModBase: Search...
MobiDB: Search...

Family & Domains

Sequence similarities

Belongs to the CPSF4/YTH1 family. Curated

Keywords - Domain

Repeat, Zinc-finger

Phylogenomic databases

eggNOG: KOG1040 Eukaryota
        COG5084 LUCA
GeneTree: ENSGT00390000009627
HOGENOM: HOG000212457
HOVERGEN: HBG051108
InParanoid: Q8BQZ5
KO: K14404
PhylomeDB: Q8BQZ5
TreeFam: TF314871

Family and domain databases

InterPro: IPR000571 Znf_CCCH
          IPR036855 Znf_CCCH_sf
          IPR001878 Znf_CCHC
          IPR036875 Znf_CCHC_sf
Pfam: PF00642 zf-CCCH, 2 hits
      PF00098 zf-CCHC, 1 hit
SMART: SM00343 ZnF_C2HC, 1 hit
       SM00356 ZnF_C3H1, 4 hits
SUPFAM: SSF57756 SSF57756, 1 hit
        SSF90229 SSF90229, 2 hits
PROSITE: PS50103 ZF_C3H1, 3 hits
         PS50158 ZF_CCHC, 1 hit

Sequences (3+)

Sequence status: Complete.

This entry describes 3 isoforms produced by alternative splicing.

This entry has 3 described isoforms and 5 potential isoforms that are computationally mapped.

Isoform 1 (identifier: Q8BQZ5-1)

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.

        10         20         30         40         50
MQEIIASVDH IKFDLEIAVE QQLGAQPLPF PGMDKSGAAV CEFFLKAACG
        60         70         80         90        100
KGGMCPFRHI SGEKTVVCKH WLRGLCKKGD QCEFLHEYDM TKMPECYFYS
       110        120        130        140        150
KFGPLCRHRH TRRVICVNYL VGFCPEGPSC KFMHPRFELP MGTTEQPPLP
       160        170        180        190        200
QQTQPPTKRA PQVIGVMQSQ NSSAGNRGPR PLEQVTCYKC GEKGHYANRC
       210
TKGHLAFLSG Q
Note: No experimental confirmation available.
Length: 211
Mass (Da): 23,653
Last modified: March 1, 2003 - v1
Checksum: F5656741519E0E26

Isoform 2 (identifier: Q8BQZ5-2)

The sequence of this isoform differs from the canonical sequence as follows:
     103-103: G → GECSNKECPFLHIDPESKIKDCPWYDRGFCKHG
     158-158: K → KQ
     174-188: AGNRGPRPLEQVTCY → DSSSSSSSWNHCGAA
     189-211: Missing.

Length: 221
Mass (Da): 24,881
Checksum: 5DE80D92089DDC85

Isoform 3 (identifier: Q8BQZ5-3)

The sequence of this isoform differs from the canonical sequence as follows:
     103-103: G → GECSNKECPFLHIDPESKIKDCPWYDRGFCKHG
     159-180: RAPQVIGVMQSQNSSAGNRGPR → VLYPAASLATLACRDGLITHSV
     181-211: Missing.

Note: No experimental confirmation available.
Length: 212
Mass (Da): 23,958
Checksum: 75487F4FF13C64FB
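
The isoform descriptions above are expressed as edits against the canonical sequence, so isoforms 2 and 3 can be reconstructed from isoform 1. The following is a minimal sketch of that reconstruction, not UniProt's own tooling; the names apply_edits, ISOFORM_2_EDITS and ISOFORM_3_EDITS are hypothetical. Edits are applied from the highest position down so that earlier coordinates stay valid, and the original residues are checked before each substitution.

```python
# Canonical isoform 1 sequence, copied from this entry (211 residues).
CANONICAL = (
    "MQEIIASVDHIKFDLEIAVEQQLGAQPLPFPGMDKSGAAVCEFFLKAACG"
    "KGGMCPFRHISGEKTVVCKHWLRGLCKKGDQCEFLHEYDMTKMPECYFYS"
    "KFGPLCRHRHTRRVICVNYLVGFCPEGPSCKFMHPRFELPMGTTEQPPLP"
    "QQTQPPTKRAPQVIGVMQSQNSSAGNRGPRPLEQVTCYKCGEKGHYANRC"
    "TKGHLAFLSGQ"
)

INSERT_103 = "GECSNKECPFLHIDPESKIKDCPWYDRGFCKHG"

# Each edit is (start, end, original, replacement) in 1-based inclusive
# coordinates. original=None marks a "Missing" range (a pure deletion).
ISOFORM_2_EDITS = [
    (103, 103, "G", INSERT_103),
    (158, 158, "K", "KQ"),
    (174, 188, "AGNRGPRPLEQVTCY", "DSSSSSSSWNHCGAA"),
    (189, 211, None, ""),
]
ISOFORM_3_EDITS = [
    (103, 103, "G", INSERT_103),
    (159, 180, "RAPQVIGVMQSQNSSAGNRGPR", "VLYPAASLATLACRDGLITHSV"),
    (181, 211, None, ""),
]


def apply_edits(seq, edits):
    """Apply non-overlapping edits to seq, working right to left."""
    out = seq
    for start, end, original, replacement in sorted(
        edits, key=lambda e: e[0], reverse=True
    ):
        found = out[start - 1:end]
        # Guard against coordinate mistakes before changing anything.
        if original is not None and found != original:
            raise ValueError(f"{start}-{end}: expected {original}, found {found}")
        out = out[:start - 1] + replacement + out[end:]
    return out


isoform_2 = apply_edits(CANONICAL, ISOFORM_2_EDITS)
isoform_3 = apply_edits(CANONICAL, ISOFORM_3_EDITS)
print(len(isoform_2), len(isoform_3))
```

The resulting lengths can be compared with the Length fields listed for each isoform as a consistency check on the transcribed edits.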

Computationally mapped potential isoform sequences

There are 5 potential isoforms mapped to this entry.

Entry    Entry name     Protein names                              Gene names   Length
B2LVG6   B2LVG6_MOUSE   Cleavage and polyadenylation specif...     Cpsf4        244
E0CXT7   E0CXT7_MOUSE   Cleavage and polyadenylation-specif...     Cpsf4        269
B2LVG5   B2LVG5_MOUSE   Cleavage and polyadenylation specif...     Cpsf4        243
F7B6Z5   F7B6Z5_MOUSE   Cleavage and polyadenylation-specif...     Cpsf4        208
M0QWM8   M0QWM8_MOUSE   Cleavage and polyadenylation-specif...     Cpsf4        62

Sequence caution

The sequence AAC53567 differs from that shown. Reason: Erroneous initiation. Curated
The sequence AAH57067 differs from that shown. Reason: Erroneous initiation. Curated

Alternative sequence

Feature key                         Position(s)   Description                                                                        Length
Alternative sequence (VSP_008603)   103           G → GECSNKECPFLHIDPESKIKDCPWYDRGFCKHG in isoform 2 and isoform 3. 2 Publications   1
Alternative sequence (VSP_008604)   158           K → KQ in isoform 2. 1 Publication                                                 1
Alternative sequence (VSP_008605)   159 – 180     RAPQV…NRGPR → VLYPAASLATLACRDGLITHSV in isoform 3. 1 Publication                   22
Alternative sequence (VSP_008606)   174 – 188     AGNRG…QVTCY → DSSSSSSSWNHCGAA in isoform 2. 1 Publication                          15
Alternative sequence (VSP_008607)   181 – 211     Missing in isoform 3. 1 Publication                                                31
Alternative sequence (VSP_008608)   189 – 211     Missing in isoform 2. 1 Publication                                                23

Sequence databases

EMBL/GenBank/DDBJ: AK046064 mRNA Translation: BAC32587.1
                   AF033201 mRNA Translation: AAC53567.1 Different initiation.
                   BC057067 mRNA Translation: AAH57067.1 Different initiation.
CCDS: CCDS19859.1 [Q8BQZ5-1]
RefSeq: NP_001278177.1, NM_001291248.1
        NP_001278178.1, NM_001291249.1
        NP_848671.1, NM_178576.3 [Q8BQZ5-1]
UniGene: Mm.196884

Genome annotation databases

Ensembl: ENSMUST00000070487; ENSMUSP00000069243; ENSMUSG00000029625 [Q8BQZ5-1]
GeneID: 54188
KEGG: mmu:54188
UCSC: uc009amj.2 mouse [Q8BQZ5-1]

Keywords - Coding sequence diversity

Alternative splicing

Similar proteins

Cross-references

Protocols and materials databases

Structural Biology Knowledgebase: Search...

Organism-specific databases

CTD: 10898
MGI: MGI:1861602 Cpsf4

Miscellaneous databases

ChiTaRS: Cpsf4 mouse
PRO: PR:Q8BQZ5
SOURCE: Search...

Family and domain databases

ProtoNet: Search...

Entry information

Entry name: CPSF4_MOUSE
Accession: Primary (citable) accession number: Q8BQZ5
           Secondary accession number(s): O54930
Entry history: Integrated into UniProtKB/Swiss-Prot: October 24, 2003
               Last sequence update: March 1, 2003
               Last modified: November 7, 2018
               This is version 117 of the entry and version 1 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Chordata Protein Annotation Program

Miscellaneous

Keywords - Technical term

Complete proteome, Reference proteome

Documents

  1. SIMILARITY comments
    Index of protein domains and families
  2. MGD cross-references
    Mouse Genome Database (MGD) cross-references in UniProtKB/Swiss-Prot
UniProt is an ELIXIR core data resource
Main funding by: National Institutes of Health
