UniProtKB - A0A0R4IC37 (CPSF1_DANRE)
Cleavage and polyadenylation specificity factor subunit 1
cpsf1
Function
Component of the cleavage and polyadenylation specificity factor (CPSF) complex that plays a key role in pre-mRNA 3'-end formation, recognizing the AAUAAA signal sequence and interacting with poly(A) polymerase and other factors to bring about cleavage and poly(A) addition. This subunit is involved in the RNA recognition step of the polyadenylation reaction (By similarity).
Plays a role in eye morphogenesis and the development of retinal ganglion cell projections to the tectum (PubMed:30689892).
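As an illustration of what "recognizing the AAUAAA signal sequence" refers to, the following minimal sketch locates AAUAAA hexamers in an RNA string. It is only a string search and does not model CPSF binding. The function name and the toy sequence are hypothetical and are not taken from this entry.

```python
import re

POLY_A_SIGNAL = "AAUAAA"  # hexamer named in the function annotation above


def find_poly_a_signals(rna: str) -> list:
    """Return 1-based start positions of every AAUAAA hexamer (overlaps included)."""
    # Accept lower-case and DNA-style (T instead of U) input as a convenience.
    rna = rna.upper().replace("T", "U")
    # A zero-width lookahead lets overlapping occurrences be reported.
    return [m.start() + 1 for m in re.finditer(f"(?={POLY_A_SIGNAL})", rna)]


# Hypothetical toy sequence, invented for this example only.
toy = "GGCAAUAAAGCUUGCACAGG"
positions = find_poly_a_signals(toy)
print(positions)
```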
GO - Molecular function
- nucleic acid binding Source: InterPro
GO - Biological process
- definitive hemopoiesis Source: ZFIN
- mRNA polyadenylation Source: ZFIN
- retinal ganglion cell axon guidance Source: ZFIN
Enzyme and pathway databases
Reactome | R-DRE-72163, mRNA Splicing - Major Pathway; R-DRE-72187, mRNA 3'-end processing; R-DRE-77595, Processing of Intronless Pre-mRNAs |
Names & Taxonomy
Protein names | Recommended name: Cleavage and polyadenylation specificity factor subunit 1 (By similarity). Alternative name(s): Cleavage and polyadenylation specific factor 1 (Imported) |
Gene names | Name: cpsf1 (Imported) |
Organism | Danio rerio (Zebrafish) (Brachydanio rerio) |
Taxonomic identifier | 7955 [NCBI] |
Taxonomic lineage | Eukaryota › Metazoa › Chordata › Craniata › Vertebrata › Euteleostomi › Actinopterygii › Neopterygii › Teleostei › Ostariophysi › Cypriniformes › Danionidae › Danioninae › Danio |
Organism-specific databases
ZFIN | ZDB-GENE-040709-2, cpsf1 |
Subcellular location
Nucleus
- nucleoplasm By similarity
Nucleus
- mRNA cleavage and polyadenylation specificity factor complex Source: GO_Central
- nucleoplasm Source: UniProtKB-SubCell
- nucleus Source: GO_Central
Keywords - Cellular component
Nucleus
Pathology & Biotech
Disruption phenotype
PTM / Processing
Molecule processing
Feature key | Position(s) | Description | Length |
---|---|---|---|
Chain (PRO_0000451718) | 1 – 1451 | Cleavage and polyadenylation specificity factor subunit 1 | 1451 |
Family & Domains
Region
Feature key | Position(s) | Description | Length |
---|---|---|---|
Region | 401 – 432 | Disordered (Sequence analysis) | 32 |
Region | 548 – 572 | Disordered (Sequence analysis) | 25 |
Region | 753 – 789 | Disordered (Sequence analysis) | 37 |
Motif
Feature key | Position(s) | Description | Length |
---|---|---|---|
Motif | 901 – 916 | Nuclear localization signal (Sequence analysis) | 16 |
Compositional bias
Feature key | Position(s) | Description | Length |
---|---|---|---|
Compositional bias | 401 – 426 | Basic and acidic residues (Sequence analysis) | 26 |
Compositional bias | 755 – 785 | Polar residues (Sequence analysis) | 31 |
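The positions in the feature tables above are 1-based and inclusive, so each listed length equals end - start + 1. The sketch below checks that arithmetic for every feature in this entry and shows how such a range maps onto a 0-based string slice. The helper names are hypothetical. The residue row is the 901 to 950 row of the sequence in this entry with spaces removed.

```python
# (feature key, start, end, listed length), copied from the tables in this entry.
FEATURES = [
    ("Chain", 1, 1451, 1451),
    ("Region", 401, 432, 32),
    ("Region", 548, 572, 25),
    ("Region", 753, 789, 37),
    ("Motif", 901, 916, 16),
    ("Compositional bias", 401, 426, 26),
    ("Compositional bias", 755, 785, 31),
]


def feature_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive start and end positions."""
    return end - start + 1


def extract(residues: str, start: int, end: int, first_position: int = 1) -> str:
    """Slice a 1-based inclusive range out of a string whose first character
    sits at sequence position `first_position`."""
    return residues[start - first_position:end - first_position + 1]


# Features whose listed length disagrees with the coordinates (expected: none).
mismatches = [f for f in FEATURES if feature_length(f[1], f[2]) != f[3]]

# Residues 901-950 of this entry, spaces removed.
ROW_901_950 = "KKMPHNINYREKKVKVRKDKKPEGQGEDTLGVKGRVARFRYFQDISGYSG"
nls = extract(ROW_901_950, 901, 916, first_position=901)
print(mismatches, nls)
```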
Sequence similarities
Phylogenomic databases
GeneTree | ENSGT00950000183151 |
Family and domain databases
Gene3D | 2.130.10.10, 2 hits |
InterPro | IPR004871, Cleavage/polyA-sp_fac_asu_C; IPR018846, Cleavage/polyA-sp_fac_asu_N; IPR015943, WD40/YVTN_repeat-like_dom_sf |
Pfam | PF03178, CPSF_A, 1 hit; PF10433, MMS1_N, 1 hit |
Sequence
Sequence status: Complete.
This entry has 1 described isoform and 5 potential isoforms that are computationally mapped.
10 20 30 40 50
MYAVYRQAHP PTAVEFAVYC NFISSQEKNL VVAGTSQLYV YRIIYDVEST
60 70 80 90 100
SKSEKSSDGK SRKEKLEQVA SFSLFGNVMS MASVQLVGTN RDALLLSFKD
110 120 130 140 150
AKLSVVEYDP GTHDLKTLSL HYFEEPELRD GFVQNVHIPM VRVDPENRCA
160 170 180 190 200
VMLVYGTCLV VLPFRKDTLA DEQEGIVGEG QKSSFLPSYI IDVRELDEKL
210 220 230 240 250
LNIIDMKFLH GYYEPTLLIL FEPNQTWPGR VAVRQDTCSI VAISLNIMQK
260 270 280 290 300
VHPVIWSLSN LPFDCNQVMA VPKPIGGVVV FAVNSLLYLN QSVPPFGVSL
310 320 330 340 350
NSLTNGTTAF PLRPQEEVKI TLDCSQASFI TSDKMVISLK GGEIYVLTLI
360 370 380 390 400
TDGMRSVRAF HFDKAAASVL TTCMMTMEPG YLFLGSRLGN SLLLRYTEKL
410 420 430 440 450
QETPMEEGKE NEEKEKQEEP PNKKKRVDSN WAGCPGKGNL PDELDEIEVY
460 470 480 490 500
GSEAQSGTQL ATYSFEVCDS ILNIGPCASA SMGEPAFLSE EFQTNPEPDL
510 520 530 540 550
EVVVCSGYGK NGALSVLQKS IRPQVVTTFE LPGCHDMWTV IYCEEKPEKP
560 570 580 590 600
SAEGDGESPE EEKREPTIED DKKKHGFLIL SREDSTMILQ TGQEIMELDT
610 620 630 640 650
SGFATQGPTV YAGNIGDNKY IIQVSPMGIR LLEGVNQLHF IPVDLGSPIV
660 670 680 690 700
HCSVADPYVV IMTAEGVVTM FVLKNDSYMG KSHRLALQKP QIHTQSRVIT
710 720 730 740 750
LCAYRDVSGM FTTENKVSFL AKEEIAIRTN SETETIIQDI SNTVDDEEEM
760 770 780 790 800
LYGESNPLTS PNKEESSRGS AAASSAHTGK ESGSGRQEPS HWCLLVRENG
810 820 830 840 850
VMEIYQLPDW RLVFLVKNFP VGQRVLVDSS ASQSATQGEL KKEEVTRQGD
860 870 880 890 900
IPLVKEVALV SLGYNHSRPY LLAHVEQELL IYEAFPYDQQ QAQSNLKVRF
910 920 930 940 950
KKMPHNINYR EKKVKVRKDK KPEGQGEDTL GVKGRVARFR YFQDISGYSG
960 970 980 990 1000
VFICGPSPHW MLVTSRGAMR LHPMTIDGAI ESFSPFHNIN CPKGFLYFNK
1010 1020 1030 1040 1050
QGELRISVLP TYLSYDAPWP VRKIPLRCTV HYVSYHVESK VYAVCTSVKE
1060 1070 1080 1090 1100
PCTRIPRMTG EEKEFETIER DERYIHPQQD KFSIQLISPV SWEAIPNTRV
1110 1120 1130 1140 1150
DLEEWEHVTC MKTVALKSQE TVSGLKGYVA LGTCLMQGEE VTCRGRILIL
1160 1170 1180 1190 1200
DVIEVVPEPG QPLTKNKFKV LYEKEQKGPV TALCHCSGFL VSAIGQKIFL
1210 1220 1230 1240 1250
WSLKDNDLTG MAFIDTQLYI HQMYSIKNFI LAADVMKSIS LLRYQPESKT
1260 1270 1280 1290 1300
LSLVSRDAKP LEVYSIEFMV DNNQLGFLVS DRDKNLMVYM YLPEAKESFG
1310 1320 1330 1340 1350
GMRLLRRADF NVGSHVNAFW RMPCRGTLDT ANKKALTWDN KHITWFATLD
1360 1370 1380 1390 1400
GGVGLLLPMQ EKTYRRLLML QNALTTMLPH HAGLNPKAFR MLHCDRRTLQ
1410 1420 1430 1440 1450
NAVKNILDGE LLNKYLYLST MERSELAKKI GTTPDIILDD LLEIERVTAH
F
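The sequence above is displayed as ruler rows of position numbers alternating with rows of 10-residue groups. A minimal sketch for turning that display back into one continuous string follows. The function name is hypothetical, and only the first two residue rows of this entry are used as input.

```python
def parse_sequence_display(lines):
    """Join residue rows from the sequence display, skipping ruler rows."""
    residues = []
    for line in lines:
        tokens = line.split()
        # Ruler rows contain only position numbers (10, 20, ...); skip them,
        # along with any blank lines.
        if not tokens or all(token.isdigit() for token in tokens):
            continue
        residues.append("".join(tokens))
    return "".join(residues)


# First two ruler/residue row pairs of this entry.
EXCERPT = [
    "10 20 30 40 50",
    "MYAVYRQAHP PTAVEFAVYC NFISSQEKNL VVAGTSQLYV YRIIYDVEST",
    "60 70 80 90 100",
    "SKSEKSSDGK SRKEKLEQVA SFSLFGNVMS MASVQLVGTN RDALLLSFKD",
]

sequence = parse_sequence_display(EXCERPT)
print(len(sequence), sequence[:10])
```

Applied to the full display, the resulting string should have the chain length listed under Molecule processing (1451).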
Computationally mapped potential isoform sequences
There are 5 potential isoforms mapped to this entry.
Entry | Entry name | Protein name | Gene name(s) | Length |
---|---|---|---|---|
A0A0R4IJ92 | A0A0R4IJ92_DANRE | Poly [ADP-ribose] polymerase | parp10 cpsf1, si:ch1073-296d18.1 | 1,125 |
A0A2R8RJW1 | A0A2R8RJW1_DANRE | Poly [ADP-ribose] polymerase | parp10 | 1,178 |
E7F8U3 | E7F8U3_DANRE | Poly [ADP-ribose] polymerase | parp10 | 581 |
F6NXE7 | F6NXE7_DANRE | Cleavage and polyadenylation-specif... | cpsf1 | 1,448 |
A0A0R4IH63 | A0A0R4IH63_DANRE | Cleavage and polyadenylation-specif... | cpsf1 | 400 |
Sequence databases
EMBL / GenBank / DDBJ | CU467825, Genomic DNA (no translation available); FP236813, Genomic DNA (no translation available); FP325126, Genomic DNA (no translation available); LO018649, Genomic DNA (no translation available) |
Genome annotation databases
Ensembl | ENSDART00000163478; ENSDARP00000130523; ENSDARG00000034178 |
Similar proteins
Cross-references
3D structure databases
SMR | A0A0R4IC37 |
ModBase | Search... |
Protein-protein interaction databases
STRING | 7955.ENSDARP00000098742 |
Gene expression databases
ExpressionAtlas | A0A0R4IC37, baseline and differential |
Family and domain databases
MobiDB | Search... |
Entry information
Entry name | CPSF1_DANRE |
Accession | Primary (citable) accession number: A0A0R4IC37 |
Entry history | Integrated into UniProtKB/Swiss-Prot: December 2, 2020. Last sequence update: June 20, 2018. Last modified: February 23, 2022. This is version 30 of the entry and version 2 of the sequence. |
Entry status | Reviewed (UniProtKB/Swiss-Prot) |
Annotation program | Chordata Protein Annotation Program |
Miscellaneous
Keywords - Technical term
Reference proteome
Documents
- SIMILARITY comments: Index of protein domains and families