Skip Header

 
Contribute Send feedback
Read comments (0) or add your own

Reviewed, UniProtKB/Swiss-Prot Q9P2I0 (CPSF2_HUMAN)

Last modified July 7, 2009. Version 59. Feed History...

Clusters with 100%, 90%, 50% identity | Documents (3) | Third-party data | Customize display text xml rdf/xml gff fasta
Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Binary interactions · Sequence annotation (Features) · Sequences · References · Cross-references · Entry information · Relevant documents

Names and origin

Protein namesRecommended name:
    Cleavage and polyadenylation specificity factor subunit 2
Alternative name(s):
    Cleavage and polyadenylation specificity factor 100 kDa subunit
      Short name=CPSF 100 kDa subunit
Gene names
Name: CPSF2
Synonyms: CPSF100, KIAA1367
OrganismHomo sapiens (Human) [Complete proteome]
Taxonomic identifier9606 [NCBI]
Taxonomic lineageEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresPrimatesHaplorrhiniCatarrhiniHominidaeHomo

Protein attributes

Sequence length782 AA.
Sequence statusComplete.
Sequence processingThe displayed sequence is not processed.
Protein existenceEvidence at protein level.

General annotation (Comments)

Function

Component of the cleavage and polyadenylation specificity factor (CPSF) complex that play a key role in pre-mRNA 3'-end formation, recognizing the AAUAAA signal sequence and interacting with poly(A) polymerase and other factors to bring about cleavage and poly(A) addition. Ref.4

Subunit structure

Component of the cleavage and polyadenylation specificity factor (CPSF) complex, composed of CPSF1, CPSF2, CPSF3, CPSF4 and FIP1L1.

Subcellular location

Nucleus Potential.

Sequence similarities

Belongs to the metallo-beta-lactamase superfamily. RNA-metabolizing metallo-beta-lactamase-like family. CPSF2/YSH1 subfamily.

Ontologies

Keywords
   Biological processmRNA processing
   Cellular componentNucleus
   LigandRNA-binding
   PTMPhosphoprotein
   Technical termComplete proteome
Gene Ontology (GO)
   Biological processnuclear mRNA splicing, via spliceosome

Inferred from Experiment. Source: Reactome

   Cellular componentcytoplasm

Inferred from direct assay. Source: HPA

nucleus

Inferred from direct assay. Source: HPA

   Molecular functionRNA binding

Inferred from electronic annotation. Source: UniProtKB-KW

hydrolase activity

Inferred from electronic annotation. Source: InterPro

protein binding

Inferred from physical interaction. Source: IntAct

Complete GO annotation...

Binary interactions

With

Entry

#Exp.

IntAct

Notes

RNPS1Q152871EBI-1043224,EBI-395959

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Chain1 – 782782Cleavage and polyadenylation specificity factor subunit 2
PRO_0000074393

Amino acid modifications

Modified residue3951Phosphotyrosine Ref.6
Modified residue4121Phosphoserine By similarity
Modified residue4191Phosphoserine Ref.6 Ref.7
Modified residue4201Phosphoserine Ref.6 Ref.7
Modified residue4231Phosphoserine Ref.6 Ref.7
Modified residue6601Phosphoserine Ref.7 Ref.5

Experimental info

Sequence conflict2891D → G in AAH70095. Ref.1
Sequence conflict6541K → R in AAH70095. Ref.1

Sequences

Sequence LengthMass (Da)Tools
Q9P2I0-1 [UniParc].

Last modified August 16, 2004. Version 2.
Checksum: F67B4813B9883CE8

FASTA78288,487
        10         20         30         40         50         60 
MTSIIKLTTL SGVQEESALC YLLQVDEFRF LLDCGWDEHF SMDIIDSLRK HVHQIDAVLL 

        70         80         90        100        110        120 
SHPDPLHLGA LPYAVGKLGL NCAIYATIPV YKMGQMFMYD LYQSRHNTED FTLFTLDDVD 

       130        140        150        160        170        180 
AAFDKIQQLK FSQIVNLKGK GHGLSITPLP AGHMIGGTIW KIVKDGEEEI VYAVDFNHKR 

       190        200        210        220        230        240 
EIHLNGCSLE MLSRPSLLIT DSFNATYVQP RRKQRDEQLL TNVLETLRGD GNVLIAVDTA 

       250        260        270        280        290        300 
GRVLELAQLL DQIWRTKDAG LGVYSLALLN NVSYNVVEFS KSQVEWMSDK LMRCFEDKRN 

       310        320        330        340        350        360 
NPFQFRHLSL CHGLSDLARV PSPKVVLASQ PDLECGFSRD LFIQWCQDPK NSIILTYRTT 

       370        380        390        400        410        420 
PGTLARFLID NPSEKITEIE LRKRVKLEGK ELEEYLEKEK LKKEAAKKLE QSKEADIDSS 

       430        440        450        460        470        480 
DESDIEEDID QPSAHKTKHD LMMKGEGSRK GSFFKQAKKS YPMFPAPEER IKWDEYGEII 

       490        500        510        520        530        540 
KPEDFLVPEL QATEEEKSKL ESGLTNGDEP MDQDLSDVPT KCISTTESIE IKARVTYIDY 

       550        560        570        580        590        600 
EGRSDGDSIK KIINQMKPRQ LIIVHGPPEA SQDLAECCRA FGGKDIKVYM PKLHETVDAT 

       610        620        630        640        650        660 
SETHIYQVRL KDSLVSSLQF CKAKDAELAW IDGVLDMRVS KVDTGVILEE GELKDDGEDS 

       670        680        690        700        710        720 
EMQVEAPSDS SVIAQQKAMK SLFGDDEKET GEESEIIPTL EPLPPHEVPG HQSVFMNEPR 

       730        740        750        760        770        780 
LSDFKQVLLR EGIQAEFVGG VLVCNNQVAV RRTETGRIGL EGCLCQDFYR IRDLLYEQYA 


IV 

« Hide

References

« Hide 'large scale' references
[1]"The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)."
The MGC Project Team
Genome Res. 14:2121-2127(2004) [PubMed: 15489334] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
Tissue: Testis.
[2]"Prediction of the coding sequences of unidentified human genes. XVI. The complete sequences of 150 new cDNA clones from brain which code for large proteins in vitro."
Nagase T., Kikuno R., Ishikawa K., Hirosawa M., Ohara O.
DNA Res. 7:65-73(2000) [PubMed: 10718198] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 204-782.
Tissue: Brain.
[3]"The full-ORF clone resource of the German cDNA consortium."
Bechtel S., Rosenfelder H., Duda A., Schmidt C.P., Ernst U., Wellenreuther R., Mehrle A., Schuster C., Bahr A., Blocker H., Heubner D., Hoerlein A., Michel G., Wedler H., Kohrer K., Ottenwalder B., Poustka A., Wiemann S., Schupp I.
BMC Genomics 8:399-399(2007) [PubMed: 17974005] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 279-782.
Tissue: Lymph node.
[4]"Human Fip1 is a subunit of CPSF that binds to U-rich RNA elements and stimulates poly(A) polymerase."
Kaufmann I., Martin G., Friedlein A., Langen H., Keller W.
EMBO J. 23:616-626(2004) [PubMed: 14749727] [Abstract]
Cited for: FUNCTION IN PRE-MRNA 3'-END PROCESSING, IDENTIFICATION IN THE CPSF COMPLEX.
[5]"Large-scale characterization of HeLa cell nuclear phosphoproteins."
Beausoleil S.A., Jedrychowski M., Schwartz D., Elias J.E., Villen J., Li J., Cohn M.A., Cantley L.C., Gygi S.P.
Proc. Natl. Acad. Sci. U.S.A. 101:12130-12135(2004) [PubMed: 15302935] [Abstract]
Cited for: PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-660, MASS SPECTROMETRY.
Tissue: Epithelium.
[6]"Global, in vivo, and site-specific phosphorylation dynamics in signaling networks."
Olsen J.V., Blagoev B., Gnad F., Macek B., Kumar C., Mortensen P., Mann M.
Cell 127:635-648(2006) [PubMed: 17081983] [Abstract]
Cited for: PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT TYR-395; SER-419; SER-420 AND SER-423, MASS SPECTROMETRY.
Tissue: Epithelium.
[7]"A quantitative atlas of mitotic phosphorylation."
Dephoure N., Zhou C., Villen J., Beausoleil S.A., Bakalarski C.E., Elledge S.J., Gygi S.P.
Proc. Natl. Acad. Sci. U.S.A. 105:10762-10767(2008) [PubMed: 18669648] [Abstract]
Cited for: PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-419; SER-420; SER-423 AND SER-660, MASS SPECTROMETRY.
[8]Colinge J., Superti-Furga G., Bennett K.L.
Submitted (OCT-2008) to UniProtKB
Cited for: IDENTIFICATION [LARGE SCALE ANALYSIS], MASS SPECTROMETRY.
+Additional computationally mapped references.

Cross-references

Sequence databases

BC070095 mRNA. Translation: AAH70095.1.
AB037788 mRNA. Translation: BAA92605.1.
AL442079 mRNA. Translation: CAC09445.1.
IPIIPI00419531.
RefSeqNP_059133.1.
UniGeneHs.657632

3D structure databases

ModBaseSearch...

Protein-protein interaction databases

IntActQ9P2I0. 1 interaction.

PTM databases

PhosphoSiteQ9P2I0.

Proteomic databases

PeptideAtlasQ9P2I0.
PRIDEQ9P2I0.

Genome annotation databases

EnsemblENSG00000165934. Homo sapiens. [Contig view]
GeneID53981.
KEGGhsa:53981.
UCSCuc001yah.1. human.

Organism-specific databases

GeneCardsGC14P091658.
H-InvDBHIX0020194.
HGNCHGNC:2325. CPSF2.
HPAHPA000870.
MIM606028. gene.
PharmGKBPA26842.
HUGESearch...
GenAtlasSearch...

Phylogenomic databases

HOGENOMQ9P2I0.
HOVERGENQ9P2I0.
OMAQ9P2I0. DYYKIRE.

Enzyme and pathway databases

ReactomeREACT_125. Processing of Capped Intron-Containing Pre-mRNA.
REACT_1788. Transcription.
REACT_71. Gene Expression.

Gene expression databases

ArrayExpressQ9P2I0.
BgeeQ9P2I0.
CleanExHS_CPSF2.
GermOnlineENSG00000165934. Homo sapiens.

Family and domain databases

InterProIPR001279. Blactmase-like.
IPR011108. RMMBL.
[Graphical view]
PfamPF00753. Lactamase_B. 1 hit.
PF07521. RMMBL. 1 hit.
[Graphical view]
ProtoNetSearch...

Other Resources

NextBio56268.
SOURCESearch...

Entry information

Entry nameCPSF2_HUMAN
AccessionPrimary (citable) accession number: Q9P2I0
Secondary accession number(s): Q6NSJ1, Q9H3W7
Entry history
Integrated into UniProtKB/Swiss-Prot: October 18, 2001
Last sequence update: August 16, 2004
Last modified: July 7, 2009
This is version 59 of the entry and version 2 of the sequence. [Complete history]
Entry statusReviewed (UniProtKB/Swiss-Prot)
Annotation projectHPI (Human Proteome Initiative)

Relevant documents

Human chromosome 14

Human chromosome 14: entries, gene names and cross-references to MIM

MIM cross-references

Online Mendelian Inheritance in Man (MIM) cross-references in UniProtKB/Swiss-Prot

SIMILARITY comments

Index of protein domains and families

Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Binary interactions · Sequence annotation (Features) · Sequences · References · Cross-references · Entry information · Relevant documents