Protein

Cleavage and polyadenylation specificity factor subunit 1

Gene

CPSF1

Organism
Homo sapiens (Human)
Status
Reviewed - Experimental evidence at protein level

Function

Component of the cleavage and polyadenylation specificity factor (CPSF) complex that plays a key role in pre-mRNA 3'-end formation, recognizing the AAUAAA signal sequence and interacting with poly(A) polymerase and other factors to bring about cleavage and poly(A) addition. This subunit is involved in the RNA recognition step of the polyadenylation reaction. (1 publication)

GO - Molecular function

  • enzyme binding Source: UniProtKB
  • mRNA 3'-UTR AU-rich region binding Source: UniProtKB

GO - Biological process

  • mRNA 3'-end processing Source: Reactome
  • mRNA export from nucleus Source: Reactome
  • mRNA polyadenylation Source: UniProtKB
  • mRNA splicing, via spliceosome Source: Reactome
  • pre-mRNA cleavage required for polyadenylation Source: UniProtKB
  • termination of RNA polymerase II transcription Source: Reactome

Keywords

Molecular function: RNA-binding
Biological process: mRNA processing

Enzyme and pathway databases

Reactome: R-HSA-109688 Cleavage of Growing Transcript in the Termination Region
R-HSA-159231 Transport of Mature mRNA Derived from an Intronless Transcript
R-HSA-6784531 tRNA processing in the nucleus
R-HSA-72163 mRNA Splicing - Major Pathway
R-HSA-72187 mRNA 3'-end processing
R-HSA-77595 Processing of Intronless Pre-mRNAs
SIGNOR: Q10570

Names & Taxonomy

Protein names
Recommended name:
Cleavage and polyadenylation specificity factor subunit 1
Alternative name(s):
Cleavage and polyadenylation specificity factor 160 kDa subunit
Short name:
CPSF 160 kDa subunit
Gene names
Name: CPSF1
Synonyms: CPSF160
Organism: Homo sapiens (Human)
Taxonomic identifier: 9606 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Primates > Haplorrhini > Catarrhini > Hominidae > Homo
Proteomes
  • UP000005640 Component: Chromosome 8

Organism-specific databases

EuPathDB: HostDB:ENSG00000071894.14
HGNC: HGNC:2324 CPSF1
MIM: 606027 gene
neXtProt: NX_Q10570

Subcellular location

[Subcellular location graphic not reproduced. Graphics by Christian Stolte; Source: COMPARTMENTS]

Keywords - Cellular component

Nucleus

Pathology & Biotech

Organism-specific databases

DisGeNET: 29894
PharmGKB: PA26841

Polymorphism and mutation databases

BioMuta: CPSF1
DMDM: 23503048

PTM / Processing

Molecule processing

Feature key | Position(s) | Description | Length
Chain (PRO_0000074387) | 1 – 1443 | Cleavage and polyadenylation specificity factor subunit 1 | 1443

Amino acid modifications

Feature key | Position(s) | Description | Length
Modified residue | 756 | Phosphoserine (by similarity) | 1
Modified residue | 766 | Phosphoserine (combined sources) | 1
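
As a quick consistency check, the two annotated phosphoserine positions can be compared against the canonical sequence given in this entry. The sketch below is illustrative only: `FRAGMENT_751_800` is copied from the residues 751-800 row of the sequence section, and `residue_at` is a hypothetical helper, not part of any UniProt tool.

```python
# Residues 751-800 of Q10570-1, copied from the sequence section of this entry.
FRAGMENT_START = 751
FRAGMENT_751_800 = "SGSLFSPSKEEARRSSQPPADRDPAPFRAEPTHWCLLVRENGTMEIYQLP"


def residue_at(position, fragment=FRAGMENT_751_800, start=FRAGMENT_START):
    """Return the one-letter residue at a 1-based sequence position."""
    index = position - start
    if not 0 <= index < len(fragment):
        raise ValueError(f"position {position} is outside the fragment")
    return fragment[index]


# Both annotated positions should carry a serine (S).
for pos in (756, 766):
    print(pos, residue_at(pos))
```

Working from a fragment with an explicit start offset keeps the example short; with the full 1,443-residue sequence the same check reduces to `sequence[position - 1]`.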

Post-translational modification

The N-terminus is blocked.

Keywords - PTM

Phosphoprotein

Proteomic databases

EPD: Q10570
MaxQB: Q10570
PaxDb: Q10570
PeptideAtlas: Q10570
PRIDE: Q10570
ProteomicsDB: 58860

PTM databases

iPTMnet: Q10570
PhosphoSitePlus: Q10570
SwissPalm: Q10570

Expression

Gene expression databases

Bgee: ENSG00000071894 Expressed in 101 organ(s), highest expression level in right testis
CleanEx: HS_CPSF1
ExpressionAtlas: Q10570 baseline and differential
Genevisible: Q10570 HS

Organism-specific databases

HPA: HPA065167
HPA068906

Interaction

Subunit structure

Component of the cleavage and polyadenylation specificity factor (CPSF) complex, composed of CPSF1, CPSF2, CPSF3, CPSF4 and FIP1L1. Found in a complex with CPSF1, FIP1L1 and PAPOLA. Interacts with FIP1L1, TENT2/GLD2 and SRRM1. Interacts with TUT1; the interaction is direct and mediates the recruitment of the CPSF complex on the 3'UTR of selected pre-mRNAs. (3 publications)

Protein-protein interaction databases

BioGrid: 118946, 101 interactors
CORUM: Q10570
DIP: DIP-32694N
IntAct: Q10570, 31 interactors
MINT: Q10570
STRING: 9606.ENSP00000339353

Structure

3D structure databases

ProteinModelPortal: Q10570
SMR: Q10570
ModBase: Search...
MobiDB: Search...

Family & Domains

Motif

Feature key | Position(s) | Description | Length
Motif | 893 – 908 | Nuclear localization signal (sequence analysis) | 16
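
The residues covered by the annotated nuclear localization signal can be read off the canonical sequence in this entry. The following is a minimal sketch under stated assumptions: `FRAGMENT_851_950` is copied from the residues 851-950 rows of the sequence section, and `subsequence` is a hypothetical helper written for this illustration.

```python
# Residues 851-950 of Q10570-1, copied from the sequence section of this entry.
FRAGMENT_START = 851
FRAGMENT_851_950 = (
    "LVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNIN"
    "FREKKPKPSKKKAEGGGAEEGAGARGRVARFRYFEDIYGYSGVFICGPSP"
)


def subsequence(first, last, fragment=FRAGMENT_851_950, start=FRAGMENT_START):
    """Return residues first..last (1-based, inclusive) from the fragment."""
    if first < start or last >= start + len(fragment) or first > last:
        raise ValueError("requested range is outside the fragment")
    return fragment[first - start : last - start + 1]


# Motif coordinates taken from the feature table above.
nls = subsequence(893, 908)
print(nls, len(nls))
```

The extracted span is 16 residues long, matching the length column of the feature table, and is rich in lysine, as is typical for this kind of signal.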

Sequence similarities

Belongs to the CPSF1 family. (Curated)

Phylogenomic databases

eggNOG: KOG1896 Eukaryota
COG5161 LUCA
GeneTree: ENSGT00550000075040
HOGENOM: HOG000007904
HOVERGEN: HBG051105
InParanoid: Q10570
KO: K14401
OMA: PVAFMDM
OrthoDB: EOG091G00LQ
PhylomeDB: Q10570
TreeFam: TF314322

Family and domain databases

Gene3D: 2.130.10.10, 2 hits
InterPro: IPR004871 Cleavage/polyA-sp_fac_asu_C
IPR015943 WD40/YVTN_repeat-like_dom_sf
Pfam: PF03178 CPSF_A, 1 hit

Sequence (1+)

Sequence status: Complete.

This entry has 1 described isoform and 3 potential isoforms that are computationally mapped.

Q10570-1 [UniParc]
        10         20         30         40         50
MYAVYKQAHP PTGLEFSMYC NFFNNSERNL VVAGTSQLYV YRLNRDAEAL
60 70 80 90 100
TKNDRSTEGK AHREKLELAA SFSFFGNVMS MASVQLAGAK RDALLLSFKD
110 120 130 140 150
AKLSVVEYDP GTHDLKTLSL HYFEEPELRD GFVQNVHTPR VRVDPDGRCA
160 170 180 190 200
AMLVYGTRLV VLPFRRESLA EEHEGLVGEG QRSSFLPSYI IDVRALDEKL
210 220 230 240 250
LNIIDLQFLH GYYEPTLLIL FEPNQTWPGR VAVRQDTCSI VAISLNITQK
260 270 280 290 300
VHPVIWSLTS LPFDCTQALA VPKPIGGVVV FAVNSLLYLN QSVPPYGVAL
310 320 330 340 350
NSLTTGTTAF PLRTQEGVRI TLDCAQATFI SYDKMVISLK GGEIYVLTLI
360 370 380 390 400
TDGMRSVRAF HFDKAAASVL TTSMVTMEPG YLFLGSRLGN SLLLKYTEKL
410 420 430 440 450
QEPPASAVRE AADKEEPPSK KKRVDATAGW SAAGKSVPQD EVDEIEVYGS
460 470 480 490 500
EAQSGTQLAT YSFEVCDSIL NIGPCANAAV GEPAFLSEEF QNSPEPDLEI
510 520 530 540 550
VVCSGHGKNG ALSVLQKSIR PQVVTTFELP GCYDMWTVIA PVRKEEEDNP
560 570 580 590 600
KGEGTEQEPS TTPEADDDGR RHGFLILSRE DSTMILQTGQ EIMELDTSGF
610 620 630 640 650
ATQGPTVFAG NIGDNRYIVQ VSPLGIRLLE GVNQLHFIPV DLGAPIVQCA
660 670 680 690 700
VADPYVVIMS AEGHVTMFLL KSDSYGGRHH RLALHKPPLH HQSKVITLCL
710 720 730 740 750
YRDLSGMFTT ESRLGGARDE LGGRSGPEAE GLGSETSPTV DDEEEMLYGD
760 770 780 790 800
SGSLFSPSKE EARRSSQPPA DRDPAPFRAE PTHWCLLVRE NGTMEIYQLP
810 820 830 840 850
DWRLVFLVKN FPVGQRVLVD SSFGQPTTQG EARREEATRQ GELPLVKEVL
860 870 880 890 900
LVALGSRQSR PYLLVHVDQE LLIYEAFPHD SQLGQGNLKV RFKKVPHNIN
910 920 930 940 950
FREKKPKPSK KKAEGGGAEE GAGARGRVAR FRYFEDIYGY SGVFICGPSP
960 970 980 990 1000
HWLLVTGRGA LRLHPMAIDG PVDSFAPFHN VNCPRGFLYF NRQGELRISV
1010 1020 1030 1040 1050
LPAYLSYDAP WPVRKIPLRC TAHYVAYHVE SKVYAVATST NTPCARIPRM
1060 1070 1080 1090 1100
TGEEKEFETI ERDERYIHPQ QEAFSIQLIS PVSWEAIPNA RIELQEWEHV
1110 1120 1130 1140 1150
TCMKTVSLRS EETVSGLKGY VAAGTCLMQG EEVTCRGRIL IMDVIEVVPE
1160 1170 1180 1190 1200
PGQPLTKNKF KVLYEKEQKG PVTALCHCNG HLVSAIGQKI FLWSLRASEL
1210 1220 1230 1240 1250
TGMAFIDTQL YIHQMISVKN FILAADVMKS ISLLRYQEES KTLSLVSRDA
1260 1270 1280 1290 1300
KPLEVYSVDF MVDNAQLGFL VSDRDRNLMV YMYLPEAKES FGGMRLLRRA
1310 1320 1330 1340 1350
DFHVGAHVNT FWRTPCRGAT EGLSKKSVVW ENKHITWFAT LDGGIGLLLP
1360 1370 1380 1390 1400
MQEKTYRRLL MLQNALTTML PHHAGLNPRA FRMLHVDRRT LQNAVRNVLD
1410 1420 1430 1440
GELLNRYLYL STMERSELAK KIGTTPDIIL DDLLETDRVT AHF
Length: 1,443
Mass (Da): 160,884
Last modified: September 19, 2002 - v2
Checksum: 7E1DF4D8A93487A4
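
The sequence display above interleaves position rulers with residue rows grouped in blocks of ten. A small parser can turn that layout into a bare residue string for downstream use. This is a sketch only: `DISPLAY` holds just the first two rows of the display to keep the example short, and `parse_display` is a hypothetical helper, not a UniProt function.

```python
import re

# First two rows of the sequence display above; the full entry has 1,443 residues.
DISPLAY = """\
        10         20         30         40         50
MYAVYKQAHP PTGLEFSMYC NFFNNSERNL VVAGTSQLYV YRLNRDAEAL
60 70 80 90 100
TKNDRSTEGK AHREKLELAA SFSFFGNVMS MASVQLAGAK RDALLLSFKD
"""


def parse_display(text):
    """Drop position rulers and spacing, returning the bare residue string."""
    residues = []
    for line in text.splitlines():
        stripped = line.strip()
        # Ruler lines contain only digits and whitespace; skip them.
        if not stripped or re.fullmatch(r"[\d\s]+", stripped):
            continue
        residues.append(stripped.replace(" ", ""))
    return "".join(residues)


sequence = parse_display(DISPLAY)
print(len(sequence))
```

Applied to the complete display, the resulting string length should agree with the Length field reported for the entry.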

Computationally mapped potential isoform sequences

There are 3 potential isoforms mapped to this entry.

Entry | Entry name | Protein names | Gene names | Length
A0A087WTV4 | A0A087WTV4_HUMAN | Cleavage and polyadenylation-specif... | CPSF1 | 169
A0A087X101 | A0A087X101_HUMAN | Cleavage and polyadenylation-specif... | CPSF1 | 204
E9PIM1 | E9PIM1_HUMAN | Cleavage and polyadenylation-specif... | CPSF1 | 148

Experimental Info

Feature key | Position(s) | Description | Length
Sequence conflict | 12 | T → P in AAC50293 (PubMed:7590244). (Curated) | 1
Sequence conflict | 1318 | Missing in AAC50293 (PubMed:7590244). (Curated) | 1
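
To see what a sequence conflict means in practice, the reported difference can be applied to the canonical residues. The sketch below is illustrative: `CANONICAL_1_50` is copied from the first row of the sequence section, and `substitute` and `delete` are hypothetical helpers. Only the position 12 conflict is demonstrated on real data, because the position 1318 deletion would need the full sequence.

```python
# Residues 1-50 of Q10570-1, copied from the sequence section of this entry.
CANONICAL_1_50 = "MYAVYKQAHPPTGLEFSMYCNFFNNSERNLVVAGTSQLYVYRLNRDAEAL"


def substitute(sequence, position, expected, replacement):
    """Apply a single-residue substitution at a 1-based position."""
    if sequence[position - 1] != expected:
        raise ValueError(f"expected {expected} at position {position}")
    return sequence[: position - 1] + replacement + sequence[position:]


def delete(sequence, position):
    """Remove the residue at a 1-based position (a 'Missing' conflict)."""
    return sequence[: position - 1] + sequence[position:]


# T -> P at position 12, as reported for AAC50293.
variant = substitute(CANONICAL_1_50, 12, "T", "P")
print(variant[:15])
```

Checking the expected residue before substituting guards against off-by-one errors when converting between 1-based feature coordinates and 0-based string indices.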

Sequence databases

EMBL / GenBank / DDBJ:
U37012 mRNA Translation: AAC50293.1
BC017232 mRNA Translation: AAH17232.1
CCDS: CCDS34966.1
RefSeq: NP_037423.2, NM_013291.2
UniGene: Hs.493202

Genome annotation databases

Ensembl: ENST00000616140; ENSP00000484669; ENSG00000071894
ENST00000620219; ENSP00000478145; ENSG00000071894
ENST00000643746; ENSP00000495102; ENSG00000285049
ENST00000644539; ENSP00000495020; ENSG00000285049
GeneID: 29894
KEGG: hsa:29894
UCSC: uc003zcj.3 human

Cross-references

3D structure databases

PDB entry | Method | Resolution (Å) | Chain | Positions
6BLY | electron microscopy | 3.36 | A | 1-1443
6BM0 | electron microscopy | 3.80 | A | 1-1443
6DNH | electron microscopy | 3.40 | A | 1-1443
6F9N | X-ray | 2.50 | A | 1-1443
6FBS | electron microscopy | 3.07 | A | 1-1443
6FUW | electron microscopy | 3.07 | A | 1-1443

Protocols and materials databases

DNASU: 29894
Structural Biology Knowledgebase: Search...

Organism-specific databases

CTD: 29894
DisGeNET: 29894
EuPathDB: HostDB:ENSG00000071894.14
GeneCards: CPSF1
HGNC: HGNC:2324 CPSF1
HPA: HPA065167
HPA068906
MIM: 606027 gene
neXtProt: NX_Q10570
PharmGKB: PA26841
GenAtlas: Search...

Miscellaneous databases

ChiTaRS: CPSF1 human
GeneWiki: CPSF1
GenomeRNAi: 29894
PRO: PR:Q10570
SOURCE: Search...

Family and domain databases

Gene3D: 2.130.10.10, 2 hits
InterPro: IPR004871 Cleavage/polyA-sp_fac_asu_C
IPR015943 WD40/YVTN_repeat-like_dom_sf
Pfam: PF03178 CPSF_A, 1 hit
ProtoNet: Search...

Entry information

Entry name: CPSF1_HUMAN
Accession: Primary (citable) accession number: Q10570
Secondary accession number(s): Q96AF0
Entry history: Integrated into UniProtKB/Swiss-Prot: October 1, 1996
Last sequence update: September 19, 2002
Last modified: November 7, 2018
This is version 167 of the entry and version 2 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Chordata Protein Annotation Program
Disclaimer: Any medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care.
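
When handling the accession numbers listed in this entry programmatically, it can help to validate their format first. The sketch below assumes the regular expression that UniProt's help pages give for accession numbers; it is reproduced here from memory, so treat the pattern as an assumption and confirm it against the official documentation. `is_accession` is a hypothetical helper name.

```python
import re

# Assumed accession pattern (6-character and 10-character forms).
ACCESSION_RE = re.compile(
    r"[OPQ][0-9][A-Z0-9]{3}[0-9]|[A-NR-Z][0-9](?:[A-Z][A-Z0-9]{2}[0-9]){1,2}"
)


def is_accession(text):
    """Return True if the whole string looks like a UniProtKB accession."""
    return ACCESSION_RE.fullmatch(text) is not None


# Primary, secondary, and potential-isoform accessions from this entry.
for acc in ("Q10570", "Q96AF0", "A0A087WTV4", "A0A087X101", "E9PIM1"):
    print(acc, is_accession(acc))
```

Note that the entry name `CPSF1_HUMAN` is a different kind of identifier and is not expected to match this pattern.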

Miscellaneous

Keywords - Technical term

3D-structure, Complete proteome, Reference proteome

Documents

  1. SIMILARITY comments
    Index of protein domains and families
  2. MIM cross-references
    Online Mendelian Inheritance in Man (MIM) cross-references in UniProtKB/Swiss-Prot
  3. PDB cross-references
    Index of Protein Data Bank (PDB) cross-references
  4. Human chromosome 8
    Human chromosome 8: entries, gene names and cross-references to MIM
UniProt is an ELIXIR core data resource
Main funding by: National Institutes of Health
