Skip Header

 
Contribute Send feedback
Read comments (0) or add your own

Reviewed, UniProtKB/Swiss-Prot A5DCV3 (RPB1_PICGU)

Last modified November 24, 2009. Version 15. Feed History...

Clusters with 100%, 90%, 50% identity | Documents (1) | Third-party data | Customize display text xml rdf/xml gff fasta
Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Sequence annotation (Features) · Sequences · References · Cross-references · Entry information · Relevant documents

Names and origin

Protein namesRecommended name:
    DNA-directed RNA polymerase II subunit RPB1
      Short name=RNA polymerase II subunit B1
      Short name=RNA polymerase II subunit 1
    EC=2.7.7.6
Alternative name(s):
    DNA-directed RNA polymerase III largest subunit
Gene names
Name: RPB1
ORF Names: PGUG_01108/01105
OrganismPichia guilliermondii (Yeast) (Candida guilliermondii) [Complete proteome]
Taxonomic identifier4929 [NCBI]
Taxonomic lineageEukaryotaFungiDikaryaAscomycotaSaccharomycotinaSaccharomycetesSaccharomycetalesSaccharomycetaceaePichia

Protein attributes

Sequence length1579 AA.
Sequence statusFragments.
Sequence processingThe displayed sequence is not processed.
Protein existenceInferred from homology.

General annotation (Comments)

Function

DNA-dependent RNA polymerase catalyzes the transcription of DNA into RNA using the four ribonucleoside triphosphates as substrates. Largest and catalytic component of RNA polymerase II which synthesizes mRNA precursors and many functional non-coding RNAs. Forms the polymerase active center together with the second largest subunit. Pol II is the central component of the basal RNA polymerase II transcription machinery. It is composed of mobile elements that move relative to each other. RPB1 is part of the core element with the central large cleft, the clamp element that moves to open and close the cleft and the jaws that are thought to grab the incoming DNA template. At the start of transcription, a single stranded DNA template strand of the promoter is positioned within the central active site cleft of Pol II. A bridging helix emanates from RPB1 and crosses the cleft near the catalytic site and is thought to promote translocation of Pol II by acting as a ratchet that moves the RNA-DNA hybrid through the active site by switching from straight to bent conformations at each step of nucleotide addition. During transcription elongation, Pol II moves on the template as the transcript elongates. Elongation is influenced by the phosphorylation status of the C-terminal domain (CTD) of Pol II largest subunit (RPB1), which serves as a platform for assembly of factors that regulate transcription initiation, elongation, termination and mRNA processing By similarity.

Catalytic activity

Nucleoside triphosphate + RNA(n) = diphosphate + RNA(n+1).

Subunit structure

Component of the RNA polymerase II (Pol II) complex consisting of 12 subunits By similarity.

Subcellular location

Nucleus By similarity.

Post-translational modification

The tandem 7 residues repeats in the C-terminal domain (CTD) can be highly phosphorylated. The phosphorylation activates Pol II. Phosphorylation occurs mainly at residues 'Ser-2' and 'Ser-5' of the heptapepdtide repeat. The phosphorylation state is believed to result from the balanced action of site-specific CTD kinases and phosphataes, and a "CTD code" that specifies the position of Pol II within the transcription cycle has been proposed By similarity.

Miscellaneous

The binding of ribonucleoside triphosphate to the RNA polymerase II transcribing complex probably involves a two-step mechanism. The initial binding seems to occur at the entry (E) site and involves a magnesium ion temporarily coordinated by three conserved aspartate residues of the two largest RNA Pol II subunits. The ribonucleoside triphosphate is transferred by a rotation to the nucelotide addition (A) site for pairing with the template DNA. The catalytic A site involves three conserved aspartate residues of the RNA Pol II largest subunit which permanently coordinate a second magnesium ion.

Sequence similarities

Belongs to the RNA polymerase beta' chain family.

Contains 1 C2H2-type zinc finger.

Sequence caution

The sequence EDK37008.1 differs from that shown. Reason: Frameshift at positions 1480 and 1528.

The sequence EDK37010.1 differs from that shown. Reason: Frameshift at position 317.

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Chain1 – 15791579DNA-directed RNA polymerase II subunit RPB1
PRO_0000295034

Regions

Repeat1385 – 139171
Repeat1392 – 139872
Repeat1399 – 140573
Repeat1406 – 141274
Repeat1413 – 141975
Repeat1420 – 142676
Repeat1427 – 143377
Repeat1434 – 144078
Repeat1441 – 144779
Repeat1448 – 1454710
Repeat1455 – 1461711
Repeat1462 – 1468712
Repeat1469 – 1475713
Repeat1476 – 1482714
Repeat1483 – 1489715
Repeat1490 – 1496716
Repeat1497 – 1503717; approximate
Repeat1504 – 1510718
Repeat1511 – 1517719
Repeat1518 – 1524720
Repeat1525 – 1531721
Repeat1532 – 1538722
Repeat1539 – 1545723
Repeat1546 – 1552724
Repeat1553 – 1559725; approximate
Repeat1560 – 1566726; approximate
Zinc finger68 – 8417C2H2-type By similarity
Region642 – 65413Bridging helix By similarity
Region1385 – 156618226 X 7 AA approximate tandem repeats of Y-S-P-T-S-P-[S-A-Q]
Compositional bias244 – 2496Poly-Pro

Sites

Metal binding681Zinc 1 By similarity
Metal binding711Zinc 1 By similarity
Metal binding781Zinc 1 By similarity
Metal binding811Zinc 1 By similarity
Metal binding1081Zinc 2 By similarity
Metal binding1111Zinc 2 By similarity
Metal binding1491Zinc 2 By similarity
Metal binding1711Zinc 2 By similarity
Metal binding4851Magnesium 1; catalytic By similarity
Metal binding4851Magnesium 2; shared with RPB2 By similarity
Metal binding4871Magnesium 1; catalytic By similarity
Metal binding4871Magnesium 2; shared with RPB2 By similarity
Metal binding4891Magnesium 1; catalytic By similarity

Natural variations

Natural variant1981T → S in strain: MmRL 1635. Ref.2

Experimental info

Non-adjacent residues566 – 5672

Sequences

Sequence LengthMass (Da)Tools
A5DCV3-1 [UniParc].

Last modified July 10, 2007. Version 2.
Checksum: 7BE0B14CC70852F8

FASTA1,579175,204
        10         20         30         40         50         60 
MSRQFPHSSA PLRSVKEVQF GLLSPEEVRA ISVAKIEYPE TMDQATKRPR EGGLNDPRLG 

        70         80         90        100        110        120 
SIDRNFKCQT CGEDMAECPG HFGHIELAKP VFHIGFIAKI KKVCECVCMH CGKLLLDESN 

       130        140        150        160        170        180 
PAMAQAIRIR DPKKRFNAVW NLCKGKMICE ADVLQDDGEG NEPKRTSRGG CGHTQPVVRK 

       190        200        210        220        230        240 
DGLKLWGTWK QNKNYDETEQ PERRLLTPSE ILNVFKHINS EDCVRLGFNE DYARPEWMLI 

       250        260        270        280        290        300 
TVLPVPPPPV RPSIAFNDTA RGEDDLTFKL ADVIKANINV QRLELDGSPQ HVISEFEALL 

       310        320        330        340        350        360 
QFHVATYMDN DIAGQPQALQ KTGRPIKSIR ARLKGKEGRL RGNLMGKRVD FSARTVISGD 

       370        380        390        400        410        420 
PNLDLDQVGV PISIARTLTY PEIVTPYNIH RLTEYVRNGP NEHPGAKYVI RDTGDRIDLR 

       430        440        450        460        470        480 
YNKRAGDIAL QYGWKVERHL MDDDPVLFNR QPSLHKMSMM CHRVKVMPYS TFRLNLSVTS 

       490        500        510        520        530        540 
PYNADFDGDE MNLHVPQSPE TRAELSEICA VPLQIVSPQS NKPVMGIVQD TLCGVRKMTL 

       550        560        570        580        590        600 
RDIFIEYDQV MNMLYWIPDW DGVIPPMNLK DLNNVKQMVV SGSKGSFINI SQMSACVGQQ 

       610        620        630        640        650        660 
IVEGKRIPFG FADRSLPHFT KDDYSPESKG FVENSYLRGL TPQEFFFHAM AGREGLIDTA 

       670        680        690        700        710        720 
VKTAETGYIQ RRLVKALEDI MVHYDGTTRN SLGDIIQFVY GEDGVDGTQV ESQSVDVIPG 

       730        740        750        760        770        780 
TDESFERRYR VDLLDPEKCI RDSLLESGKE VRGDVNLQRV LDEEYEQLCN DRRYLREVCF 

       790        800        810        820        830        840 
TNGDYTWPLP VNLRRIIQNA QQLFHGGRHK ASDLRLEEIV EGVKELCKKL LVLRGDSPLV 

       850        860        870        880        890        900 
KESQENATLL FQCLVRSRLA TRRVIEEFRL NRMSFEWAMG EIETQFQKSL VHPGEMVGVI 

       910        920        930        940        950        960 
AAQSIGEPAT QMTLNTFHYA GVSSKNVTLG VPRLKEILNV AKNIKTPALT VYLEKDIAAD 

       970        980        990       1000       1010       1020 
IEKAKVVQSA IEHTTLKNVT ASTEIFYDPD PRSTVLEEDY DTVEAYFAIP DEKVEESIEK 

      1030       1040       1050       1060       1070       1080 
QSPWLLRLEL DRAKMLDKQL TMAQVAEKIS QNFGEDLFVI WSDDTADKLI IRCRVVRDPK 

      1090       1100       1110       1120       1130       1140 
SLDEDVDAEE DQILKRIEAH MLESISLRGI QGITRVFMMQ HKLSLPDETG EFKQGQEWVL 

      1150       1160       1170       1180       1190       1200 
ETDGVNLADV MAVPGVDAKR TYSNNFIEIL SVLGIEATRS ALFKEILNVI AFDGSYVNYR 

      1210       1220       1230       1240       1250       1260 
HMALLVDVMT ARGHLMAITR HGINRADTGA LMRCSFEETV EILLEAGAAA ELDDCHGISE 

      1270       1280       1290       1300       1310       1320 
NVVLGQLAPL GTGAFDVMVD DKILQSSPSN MSVPVGAAEG AGAYADDGGA TPYREYDMED 

      1330       1340       1350       1360       1370       1380 
DKIQFEEGAG FSPIHTAPVS DGAGALTAYG GQPGSPSPTS PFAYGATSPA YGGASPGYGV 

      1390       1400       1410       1420       1430       1440 
TSPTYSPTSP SYSPTSPSYS PTSPSYSPTS PSYSPTSPSY SPTSPSYSPT SPSYSPTSPS 

      1450       1460       1470       1480       1490       1500 
YSPTSPSYSP TSPSYSPTSP SYSPTSPSYS PTSPSYSPTS PSYSPTSPSY SPTSPSYSPT 

      1510       1520       1530       1540       1550       1560 
SPLYSPTSPS YSPTSPSYSP TSPSYSPTSP SYSPTSPQYS PTSPAYSPTS PQYSPNSPQY 

      1570 
SPRSPLYSPD QNDDKDKKQ 

« Hide

Cross-references

Sequence databases

CH408155 Genomic DNA. Translation: EDK37008.1. Frameshift.
CH408155 Genomic DNA. Translation: EDK37010.1. Frameshift.
AY497711 Genomic DNA. Translation: AAT12588.1.
AY497732 Genomic DNA. Translation: AAT12608.1.
AY497733 Genomic DNA. Translation: AAT12609.1.
AY497734 Genomic DNA. Translation: AAT12610.1.

3D structure databases

ModBaseSearch...

Phylogenomic databases

OrthoDBEOG9SXPVX

Enzyme and pathway databases

BRENDA2.7.7.6. 3704.

Family and domain databases

InterProIPR000722. RNA_pol_asu.
IPR000684. RNA_pol_II_repeat_euk.
IPR006592. RNA_pol_N.
IPR007080. RNA_pol_Rpb1_1.
IPR007066. RNA_pol_Rpb1_3.
IPR007083. RNA_pol_Rpb1_4.
IPR007081. RNA_pol_Rpb1_5.
IPR007075. RNA_pol_Rpb1_6.
IPR007073. RNA_pol_Rpb1_7.
[Graphical view]
Gene3DG3DSA:2.40.40.30. RNA_pol_A. 1 hit.
PfamPF04997. RNA_pol_Rpb1_1. 1 hit.
PF00623. RNA_pol_Rpb1_2. 1 hit.
PF04983. RNA_pol_Rpb1_3. 1 hit.
PF05000. RNA_pol_Rpb1_4. 1 hit.
PF04998. RNA_pol_Rpb1_5. 1 hit.
PF04992. RNA_pol_Rpb1_6. 1 hit.
PF04990. RNA_pol_Rpb1_7. 1 hit.
PF05001. RNA_pol_Rpb1_R. 15 hits.
[Graphical view]
SMARTSM00663. RPOLA_N. 1 hit.
[Graphical view]
PROSITEPS00115. RNA_POL_II_REPEAT. 21 hits.
[Graphical view]
ProtoNetSearch...

Entry information

Entry nameRPB1_PICGU
AccessionPrimary (citable) accession number: A5DCV3
Secondary accession number(s): A5DCV0 expand/collapse secondary AC list , Q6JE96, Q6JE97, Q6JE98
Entry history
Integrated into UniProtKB/Swiss-Prot: July 10, 2007
Last sequence update: July 10, 2007
Last modified: November 24, 2009
This is version 15 of the entry and version 2 of the sequence. [Complete history]
Entry statusReviewed (UniProtKB/Swiss-Prot)
Annotation projectFPAP (Fungal Proteome Annotation Project)

Relevant documents

SIMILARITY comments

Index of protein domains and families

Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Sequence annotation (Features) · Sequences · References · Cross-references · Entry information · Relevant documents