Q8BQZ5 (CPSF4_MOUSE) Reviewed, UniProtKB/Swiss-Prot
Last modified
May 29, 2013.
Version 78.
History...
Names·Attributes·General annotation·Ontologies·Alt products·Sequence annotation·Sequences·References·Cross-refs·Entry info·DocumentsCustomize order
Names·Attributes·General annotation·Ontologies·Alt products·Sequence annotation·Sequences·References·Cross-refs·Entry info·DocumentsCustomize orderNames and origin
| Protein names | Recommended name: Cleavage and polyadenylation specificity factor subunit 4 Alternative name(s): Cleavage and polyadenylation specificity factor 30 kDa subunit Short name=CPSF 30 kDa subunit Clipper homolog Clipper/CPSF 30K | ||||
| Gene names |
| ||||
| Organism | Mus musculus (Mouse) [Reference proteome] | ||||
| Taxonomic identifier | 10090 [NCBI] | ||||
| Taxonomic lineage | Eukaryota › Metazoa › Chordata › Craniata › Vertebrata › Euteleostomi › Mammalia › Eutheria › Euarchontoglires › Glires › Rodentia › Sciurognathi › Muroidea › Muridae › Murinae › Mus › Mus![]() |
Protein attributes
| Sequence length | 211 AA. |
| Sequence status | Complete. |
| Protein existence | Evidence at transcript level |
General annotation (Comments)
| Function | Component of the cleavage and polyadenylation specificity factor (CPSF) complex that play a key role in pre-mRNA 3'-end formation, recognizing the AAUAAA signal sequence and interacting with poly(A) polymerase and other factors to bring about cleavage and poly(A) addition. CPSF4 binds RNA polymers with a preference for poly(U) By similarity. Ref.2 |
| Subunit structure | Component of the cleavage and polyadenylation specificity factor (CPSF) complex, composed of CPSF1, CPSF2, CPSF3, CPSF4 and FIP1L1. Interacts with FIP1L1 By similarity. |
| Subcellular location | Nucleus By similarity. |
| Sequence similarities | Belongs to the CPSF4/YTH1 family. Contains 3 C3H1-type zinc fingers. Contains 1 CCHC-type zinc finger. |
| Sequence caution | The sequence AAC53567.1 differs from that shown. Reason: Erroneous initiation. The sequence AAH57067.1 differs from that shown. Reason: Erroneous initiation. |
Ontologies
| Keywords | |
|---|---|
| Biological process | mRNA processing |
| Cellular component | Nucleus |
| Coding sequence diversity | Alternative splicing |
| Domain | Repeat Zinc-finger |
| Ligand | Metal-binding RNA-binding Zinc |
| Technical term | Complete proteome Reference proteome |
| Gene Ontology (GO) | |
| Biological_process | mRNA processing Inferred from electronic annotation. Source: UniProtKB-KW |
| Cellular_component | mRNA cleavage and polyadenylation specificity factor complex Inferred from sequence or structural similarity. Source: UniProtKB mitochondrionInferred from electronic annotation. Source: Compara |
| Molecular_function | RNA binding Inferred from electronic annotation. Source: UniProtKB-KW zinc ion bindingInferred from electronic annotation. Source: InterPro |
| Complete GO annotation... | |
Alternative products
| This entry describes 3 isoforms produced by alternative splicing. [Align] [Select] | ||||||
| Isoform 1 (identifier: Q8BQZ5-1) This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry. | ||||||
| Note: No experimental confirmation available. | ||||||
| Isoform 2 (identifier: Q8BQZ5-2) The sequence of this isoform differs from the canonical sequence as follows: 103-103: G → GECSNKECPFLHIDPESKIKDCPWYDRGFCKHG 158-158: K → KQ 174-188: AGNRGPRPLEQVTCY → DSSSSSSSWNHCGAA 189-211: Missing. | ||||||
| Isoform 3 (identifier: Q8BQZ5-3) The sequence of this isoform differs from the canonical sequence as follows: 103-103: G → GECSNKECPFLHIDPESKIKDCPWYDRGFCKHG 159-180: RAPQVIGVMQSQNSSAGNRGPR → VLYPAASLATLACRDGLITHSV 181-211: Missing. | ||||||
| Note: No experimental confirmation available. |
Sequence annotation (Features)
| Feature key | Position(s) | Length | Description | Graphical view | Feature identifier | ||||
Molecule processing | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| Chain | 1 – 211 | 211 | Cleavage and polyadenylation specificity factor subunit 4 | PRO_0000074403 | |||||
Regions | |||||||||
| Zinc finger | 35 – 61 | 27 | C3H1-type 1 | ||||||
| Zinc finger | 62 – 89 | 28 | C3H1-type 2 | ||||||
| Zinc finger | 111 – 137 | 27 | C3H1-type 3 | ||||||
| Zinc finger | 185 – 202 | 18 | CCHC-type | ||||||
Natural variations | |||||||||
| Alternative sequence | 103 | 1 | G → GECSNKECPFLHIDPESKIK DCPWYDRGFCKHG in isoform 2 and isoform 3. | VSP_008603 | |||||
| Alternative sequence | 158 | 1 | K → KQ in isoform 2. | VSP_008604 | |||||
| Alternative sequence | 159 – 180 | 22 | RAPQV…NRGPR → VLYPAASLATLACRDGLITH SV in isoform 3. | VSP_008605 | |||||
| Alternative sequence | 174 – 188 | 15 | AGNRG…QVTCY → DSSSSSSSWNHCGAA in isoform 2. | VSP_008606 | |||||
| Alternative sequence | 181 – 211 | 31 | Missing in isoform 3. | VSP_008607 | |||||
| Alternative sequence | 189 – 211 | 23 | Missing in isoform 2. | VSP_008608 | |||||
Sequences
| ||||||||||||||||||||||||||||||
References
| « Hide 'large scale' references | |
| [1] | "The transcriptional landscape of the mammalian genome." Carninci P., Kasukawa T., Katayama S., Gough J., Frith M.C., Maeda N., Oyama R., Ravasi T., Lenhard B., Wells C., Kodzius R., Shimokawa K., Bajic V.B., Brenner S.E., Batalov S., Forrest A.R., Zavolan M., Davis M.J. Hayashizaki Y.Science 309:1559-1563(2005) [PubMed] [Europe PMC] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1). Strain: C57BL/6J. Tissue: Corpora quadrigemina. |
| [2] | "Drosophila clipper/CPSF 30K is a post-transcriptionally regulated nuclear protein that binds RNA containing GC clusters." Bai C., Tolias P.P. Nucleic Acids Res. 26:1597-1604(1998) [PubMed] [Europe PMC] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [MRNA] OF 17-211 (ISOFORM 2), FUNCTION. Strain: C57BL/6J. Tissue: Embryo. |
| [3] | "The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)." The MGC Project Team Genome Res. 14:2121-2127(2004) [PubMed] [Europe PMC] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 45-211 (ISOFORM 3). Strain: C57BL/6. Tissue: Brain. |
| + | Additional computationally mapped references. |
Cross-references
Sequence databases | |
|---|---|
| EMBL GenBank DDBJ | AK046064 mRNA. Translation: BAC32587.1. AF033201 mRNA. Translation: AAC53567.1. Different initiation. BC057067 mRNA. Translation: AAH57067.1. Different initiation. |
| IPI | IPI00309761. IPI00380450. IPI01027761. |
| RefSeq | NP_848671.1. NM_178576.2. |
| UniGene | Mm.196884. |
3D structure databases | |
| ProteinModelPortal | Q8BQZ5. |
| SMR | Q8BQZ5. Positions 61-103. |
| ModBase | Search... |
Protein-protein interaction databases | |
| MINT | MINT-89829. |
PTM databases | |
| PhosphoSite | Q8BQZ5. |
Proteomic databases | |
| PaxDb | Q8BQZ5. |
| PRIDE | Q8BQZ5. |
Protocols and materials databases | |
| StructuralBiologyKnowledgebase | Search... |
Genome annotation databases | |
| Ensembl | ENSMUST00000070487; ENSMUSP00000069243; ENSMUSG00000029625. |
| GeneID | 54188. |
| KEGG | mmu:54188. |
| UCSC | uc009amj.1. mouse. |
Organism-specific databases | |
| CTD | 10898. |
| MGI | MGI:1861602. Cpsf4. |
Phylogenomic databases | |
| eggNOG | COG5084. |
| GeneTree | ENSGT00390000009627. |
| HOGENOM | HOG000212457. |
| HOVERGEN | HBG051108. |
| KO | K14404. |
| OrthoDB | EOG4KH2VQ. |
Gene expression databases | |
| ArrayExpress | Q8BQZ5. |
| Bgee | Q8BQZ5. |
| CleanEx | MM_CPSF4. |
| Genevestigator | Q8BQZ5. |
| GermOnline | ENSMUSG00000029625. Mus musculus. |
Family and domain databases | |
| Gene3D | 4.10.1000.10. 2 hits. 4.10.60.10. 1 hit. |
| InterPro | IPR000571. Znf_CCCH. IPR001878. Znf_CCHC. [Graphical view] |
| Pfam | PF00642. zf-CCCH. 2 hits. PF00098. zf-CCHC. 1 hit. [Graphical view] |
| SMART | SM00343. ZnF_C2HC. 1 hit. SM00356. ZnF_C3H1. 4 hits. [Graphical view] |
| SUPFAM | SSF57756. SSF57756. 1 hit. |
| PROSITE | PS50103. ZF_C3H1. 3 hits. PS50158. ZF_CCHC. 1 hit. [Graphical view] |
| ProtoNet | Search... |
Other | |
| ChiTaRS | CPSF4. mouse. |
| NextBio | 311022. |
| SOURCE | Search... |
Entry information
| Entry name | CPSF4_MOUSE | ||||||||
| Accession | Primary (citable) accession number: Q8BQZ5 Secondary accession number(s): O54930 | ||||||||
| Entry history |
| ||||||||
| Entry status | Reviewed (UniProtKB/Swiss-Prot) | ||||||||
| Annotation program | Chordata Protein Annotation Program | ||||||||
Relevant documents
| MGD cross-references Mouse Genome Database (MGD) cross-references in UniProtKB/Swiss-Prot |
| SIMILARITY comments Index of protein domains and families |

Clusters with
