Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein

RNA end formation protein 2

Gene

REF2

Organism
Saccharomyces cerevisiae (strain ATCC 204508 / S288c) (Baker's yeast)
Status
Reviewed-Annotation score: Annotation score: 5 out of 5-Experimental evidence at protein leveli

Functioni

RNA-binding component of the cleavage and polyadenylation factor (CPF) complex, which plays a key role in polyadenylation-dependent pre-mRNA 3'-end formation and cooperates with cleavage factors including the CFIA complex and NAB4/CFIB. Negative regulator of poly(A) synthesis. Component of the APT complex, which may be involved in polyadenylation-independent transcript 3'-end formation. REF2 is required for 3'-end formation of snoRNAs.4 Publications

GO - Molecular functioni

  • chromatin binding Source: SGD
  • RNA binding Source: SGD

GO - Biological processi

  • mRNA 3'-end processing Source: SGD
  • negative regulation of mRNA polyadenylation Source: SGD
  • snoRNA 3'-end processing Source: SGD
  • termination of RNA polymerase II transcription, poly(A)-coupled Source: SGD
Complete GO annotation...

Keywords - Biological processi

mRNA processing

Keywords - Ligandi

RNA-binding

Enzyme and pathway databases

BioCyciYEAST:G3O-29782-MONOMER.

Names & Taxonomyi

Protein namesi
Recommended name:
RNA end formation protein 2
Gene namesi
Name:REF2
Ordered Locus Names:YDR195W
ORF Names:YD9346.06
OrganismiSaccharomyces cerevisiae (strain ATCC 204508 / S288c) (Baker's yeast)
Taxonomic identifieri559292 [NCBI]
Taxonomic lineageiEukaryotaFungiDikaryaAscomycotaSaccharomycotinaSaccharomycetesSaccharomycetalesSaccharomycetaceaeSaccharomyces
Proteomesi
  • UP000002311 Componenti: Chromosome IV

Organism-specific databases

EuPathDBiFungiDB:YDR195W.
SGDiS000002603. REF2.

Subcellular locationi

GO - Cellular componenti

  • cytosol Source: SGD
  • mRNA cleavage and polyadenylation specificity factor complex Source: SGD
  • nucleus Source: SGD
Complete GO annotation...

Keywords - Cellular componenti

Nucleus

PTM / Processingi

Molecule processing

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Chaini1 – 533533RNA end formation protein 2PRO_0000097239Add
BLAST

Proteomic databases

MaxQBiP42073.

PTM databases

iPTMnetiP42073.

Interactioni

Subunit structurei

Interacts with FIR1. Component of the cleavage and polyadenylation factor (CPF) complex, which is composed of PTI1, SYC1, SSU72, GLC7, MPE1, REF2, PFS2, PTA1, YSH1/BRR5, SWD2, CFT2/YDH1, YTH1, CFT1/YHH1, FIP1 and PAP1. Component of the APT complex, which is a subcomplex of CPF, and is composed of PTI1, SYC1, SSU72, GLC7, REF2, PTA1 and SWD2.2 Publications

Binary interactionsi

WithEntry#Exp.IntActNotes
GLC7P325987EBI-14915,EBI-13715

Protein-protein interaction databases

BioGridi32247. 60 interactions.
DIPiDIP-871N.
IntActiP42073. 31 interactions.
MINTiMINT-394246.

Structurei

3D structure databases

ProteinModelPortaliP42073.
ModBaseiSearch...
MobiDBiSearch...

Family & Domainsi

Compositional bias

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Compositional biasi321 – 33111Ser/Thr-richAdd
BLAST

Phylogenomic databases

InParanoidiP42073.
KOiK15543.
OMAiINLCVLD.
OrthoDBiEOG092C29T4.

Sequencei

Sequence statusi: Complete.

P42073-1 [UniParc]FASTAAdd to basket

« Hide

        10         20         30         40         50
MSAPVPQLVN ISHALQASTI QQIRLDMVDF NKDCKLSSIQ LARIDKYIDS
60 70 80 90 100
LQAALNQFTK DNLHIERKEK NVTEADIQLY SGLKSMYLDY LNQLIKLKHE
110 120 130 140 150
KQHHSTPPIA NDVSLDFFVN QLPKFSPEER KNYIDNLILN KNSHNRLSKM
160 170 180 190 200
DGLVDAVINL CVLDTSVAEN VRSYMKLLDT LGFQKGSNST GTKANLKKKL
210 220 230 240 250
ASSKAKIKDS EKEKEKEKDK SKVKMKTKLK PSPLLNNDDK NSSPSPTAST
260 270 280 290 300
SSMKKLKSGL FNKNEAKSTE SLPTSSKKKL SFSKYLNKDD ADMTKLGTKR
310 320 330 340 350
SIDVDFKVNP EASTVASNII SSSTSGSSTT TVATPASSEE PLKKKTKISV
360 370 380 390 400
QDSNVQSILR NGKPKKARIS SIKFLDDSQL IKVYGDDLPN QGLQVSPTQL
410 420 430 440 450
KKILKPFKEG EPKEIILFED MSIKLKPLDL MFLKNTNSDD YMDISETKGG
460 470 480 490 500
PIHCETRTPL IYRKNFNHFN PDLNKRPPRE PIEFDLNGNT NSTPTIAKAF
510 520 530
GKNSLLLRKD RGGLPYKHVP IVKRNKYPPR PVH
Length:533
Mass (Da):59,819
Last modified:February 1, 1996 - v2
Checksum:i36B0982B005697C4
GO

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
U20261 Genomic DNA. Translation: AAA85866.1.
Z48784 Genomic DNA. Translation: CAA88708.1.
BK006938 Genomic DNA. Translation: DAA12038.1.
PIRiS52702.
RefSeqiNP_010481.3. NM_001180503.3.

Genome annotation databases

EnsemblFungiiYDR195W; YDR195W; YDR195W.
GeneIDi851776.
KEGGisce:YDR195W.

Cross-referencesi

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
U20261 Genomic DNA. Translation: AAA85866.1.
Z48784 Genomic DNA. Translation: CAA88708.1.
BK006938 Genomic DNA. Translation: DAA12038.1.
PIRiS52702.
RefSeqiNP_010481.3. NM_001180503.3.

3D structure databases

ProteinModelPortaliP42073.
ModBaseiSearch...
MobiDBiSearch...

Protein-protein interaction databases

BioGridi32247. 60 interactions.
DIPiDIP-871N.
IntActiP42073. 31 interactions.
MINTiMINT-394246.

PTM databases

iPTMnetiP42073.

Proteomic databases

MaxQBiP42073.

Protocols and materials databases

Structural Biology KnowledgebaseSearch...

Genome annotation databases

EnsemblFungiiYDR195W; YDR195W; YDR195W.
GeneIDi851776.
KEGGisce:YDR195W.

Organism-specific databases

EuPathDBiFungiDB:YDR195W.
SGDiS000002603. REF2.

Phylogenomic databases

InParanoidiP42073.
KOiK15543.
OMAiINLCVLD.
OrthoDBiEOG092C29T4.

Enzyme and pathway databases

BioCyciYEAST:G3O-29782-MONOMER.

Miscellaneous databases

PROiP42073.

Family and domain databases

ProtoNetiSearch...

Entry informationi

Entry nameiREF2_YEAST
AccessioniPrimary (citable) accession number: P42073
Secondary accession number(s): D6VSH8
Entry historyi
Integrated into UniProtKB/Swiss-Prot: November 1, 1995
Last sequence update: February 1, 1996
Last modified: September 7, 2016
This is version 129 of the entry and version 2 of the sequence. [Complete history]
Entry statusiReviewed (UniProtKB/Swiss-Prot)
Annotation programFungal Protein Annotation Program

Miscellaneousi

Miscellaneous

Present with 7450 molecules/cell in log phase SD medium.1 Publication

Keywords - Technical termi

Complete proteome, Reference proteome

Documents

  1. Yeast
    Yeast (Saccharomyces cerevisiae): entries, gene names and cross-references to SGD
  2. Yeast chromosome IV
    Yeast (Saccharomyces cerevisiae) chromosome IV: entries and gene names

Similar proteinsi

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a seed sequence).
50%UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.