P03354 (POL_RSVP) Reviewed, UniProtKB/Swiss-Prot
Last modified
December 14, 2011.
Version 94.
History...
Names·Attributes·General annotation·Ontologies·Alt products·Sequence annotation·Sequences·References·Cross-refs·Entry info·DocumentsCustomize order
Names·Attributes·General annotation·Ontologies·Alt products·Sequence annotation·Sequences·References·Cross-refs·Entry info·DocumentsCustomize orderNames and origin
| Protein names | Recommended name: Gag-Pro-Pol polyprotein Cleaved into the following 12 chains:
| ||
| Gene names |
| ||
| Organism | Rous sarcoma virus (strain Prague C) (RSV-PrC) [Complete proteome] | ||
| Taxonomic identifier | 11888 [NCBI] | ||
| Taxonomic lineage | Viruses › Retro-transcribing viruses › Retroviridae › Orthoretrovirinae › Alpharetrovirus | ||
| Virus host | Gallus gallus (Chicken) [TaxID: 9031] |
Protein attributes
| Sequence length | 1603 AA. |
| Sequence status | Complete. |
| Sequence processing | The displayed sequence is further processed into a mature form. |
| Protein existence | Evidence at protein level |
General annotation (Comments)
| Function | Capsid protein p27 forms the spherical core of the virus that encapsulates the genomic RNA-nucleocapsid complex By similarity. The aspartyl protease mediates proteolytic cleavages of Gag and Gag-Pol polyproteins during or shortly after the release of the virion from the plasma membrane. Cleavages take place as an ordered, step-wise cascade to yield mature proteins. This process is called maturation. Displays maximal activity during the budding process just prior to particle release from the cell By similarity. |
| Catalytic activity | Deoxynucleoside triphosphate + DNA(n) = diphosphate + DNA(n+1). Endonucleolytic cleavage to 5'-phosphomonoester. |
| Cofactor | Binds 2 magnesium ions for reverse transcriptase polymerase activity By similarity. Binds 2 magnesium ions for ribonuclease H (RNase H) activity. Substrate-binding is a precondition for magnesium binding By similarity. Binds 8 magnesium ions per integrase homotetramer By similarity. |
| Subunit structure | The protease is active as a homodimer By similarity. The integrase forms a homotetramer. Reverse transcriptase is a heterodimer of alpha and beta subunits. Three forms of RT exist: alpha-alpha (alpha-Pol), beta-beta (beta-Pol), and alpha-beta, with the major form being the heterodimer. Both the polymerase and RNase H active sites are located in the alpha subunit of heterodimeric RT alpha-beta. Ref.6 |
| Subcellular location | Matrix protein p19: Virion Potential. Capsid protein p27: Virion Potential. Nucleocapsid protein p12: Virion Potential. |
| Domain | Late-budding domains (L domains) are short sequence motifs essential for viral particle release. They can occur individually or in close proximity within structural proteins. They interacts with sorting cellular proteins of the multivesicular body (MVB) pathway. Most of these proteins are class E vacuolar protein sorting factors belonging to ESCRT-I, ESCRT-II or ESCRT-III complexes. P2B contains one L domain: a PPXY motif which probably binds to the WW domains of HECT (homologous to E6-AP C-terminus) E3 ubiquitin ligases By similarity. Integrase core domain contains the D-x(n)-D-x(35)-E motif, named for the phylogenetically conserved glutamic acid and aspartic acid residues and the invariant 35 amino acid spacing between the second and third acidic residues. Each acidic residue of the D,D(35)E motif is independently essential for the 3'-processing and strand transfer activities of purified integrase protein By similarity. |
| Post-translational modification | Specific enzymatic cleavages in vivo yield mature proteins. |
| Miscellaneous | The reverse transcriptase is an error-prone enzyme that lacks a proof-reading function. High mutations rate is a direct consequence of this characteristic. RT also displays frequent template switching leading to high recombination rate. Recombination mostly occurs between homologous regions of the two copackaged RNA genomes. If these two RNA molecules derive from different viral strains, reverse transcription will give rise to highly recombinated proviral DNAs. |
| Sequence similarities | Contains 2 CCHC-type zinc fingers. Contains 1 integrase catalytic domain. Contains 1 integrase-type DNA-binding domain. Contains 1 integrase-type zinc finger. Contains 1 peptidase A2 domain. Contains 1 reverse transcriptase domain. Contains 1 RNase H domain. |
| Sequence caution | The sequence AAB59933.1 differs from that shown. Reason: Erroneous initiation. |
Ontologies
Alternative products
| This entry describes 2 isoforms produced by ribosomal frameshifting. [Align] [Select] Note: Translation results in the formation of the Gag-Pro. Ribosomal frameshifting at the gag-pro/pol genes boundary produces the Gag-Pro-Pol polyprotein. | ||||||
| Isoform Gag-Pro-Pol polyprotein (identifier: P03354-1) This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry. | ||||||
| Note: Produced by -1 ribosomal frameshifting. | ||||||
| Isoform Gag-Pro polyprotein (identifier: P03322-1) The sequence of this isoform can be found in the external entry P03322. Isoforms of the same protein are often annotated in two different entries if their sequences differ significantly. | ||||||
| Note: Produced by conventional translation. |
Sequence annotation (Features)
| Feature key | Position(s) | Length | Description | Graphical view | Feature identifier | ||||||||||||||||||||||||||||||||||||||||||||
Molecule processing | |||||||||||||||||||||||||||||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Chain | 1 – 155 | 155 | Matrix protein p19 | PRO_5000053588 | |||||||||||||||||||||||||||||||||||||||||||||
| Chain | 156 – 166 | 11 | p2A | PRO_0000397068 | |||||||||||||||||||||||||||||||||||||||||||||
| Chain | 167 – 177 | 11 | p2B | PRO_0000397069 | |||||||||||||||||||||||||||||||||||||||||||||
| Chain | 178 – 239 | 62 | p10 | PRO_5000053589 | |||||||||||||||||||||||||||||||||||||||||||||
| Chain | 240 – 479 | 240 | Capsid protein p27 | PRO_5000053590 | |||||||||||||||||||||||||||||||||||||||||||||
| Chain | 480 – 488 | 9 | p3 | PRO_0000397070 | |||||||||||||||||||||||||||||||||||||||||||||
| Chain | 489 – 577 | 89 | Nucleocapsid protein p12 | PRO_5000053591 | |||||||||||||||||||||||||||||||||||||||||||||
| Chain | 578 – 708 | 131 | Protease p15 | PRO_5000053592 | |||||||||||||||||||||||||||||||||||||||||||||
| Chain | 709 – 1567 | 859 | Reverse transcriptase beta-subunit | PRO_0000397071 | |||||||||||||||||||||||||||||||||||||||||||||
| Chain | 709 – 1280 | 572 | Reverse transcriptase alpha-subunit | PRO_0000040986 | |||||||||||||||||||||||||||||||||||||||||||||
| Chain | 1281 – 1567 | 287 | Integrase | PRO_0000040987 | |||||||||||||||||||||||||||||||||||||||||||||
| Chain | 1568 – 1603 | 36 | p4 | PRO_0000397072 | |||||||||||||||||||||||||||||||||||||||||||||
Regions | |||||||||||||||||||||||||||||||||||||||||||||||||
| Domain | 609 – 690 | 82 | Peptidase A2 | ||||||||||||||||||||||||||||||||||||||||||||||
| Domain | 750 – 938 | 189 | Reverse transcriptase | ||||||||||||||||||||||||||||||||||||||||||||||
| Domain | 1149 – 1280 | 132 | RNase H | ||||||||||||||||||||||||||||||||||||||||||||||
| Domain | 1333 – 1496 | 164 | Integrase catalytic | ||||||||||||||||||||||||||||||||||||||||||||||
| Zinc finger | 507 – 524 | 18 | CCHC-type 1 | ||||||||||||||||||||||||||||||||||||||||||||||
| Zinc finger | 533 – 550 | 18 | CCHC-type 2 | ||||||||||||||||||||||||||||||||||||||||||||||
| Zinc finger | 1280 – 1321 | 42 | Integrase-type | ||||||||||||||||||||||||||||||||||||||||||||||
| DNA binding | 1502 – 1550 | 49 | Integrase-type | ||||||||||||||||||||||||||||||||||||||||||||||
| Motif | 172 – 175 | 4 | PPXY motif By similarity | ||||||||||||||||||||||||||||||||||||||||||||||
| Compositional bias | 171 – 174 | 4 | Poly-Pro | ||||||||||||||||||||||||||||||||||||||||||||||
Sites | |||||||||||||||||||||||||||||||||||||||||||||||||
| Active site | 614 | 1 | For protease activity; shared with dimeric partner By similarity | ||||||||||||||||||||||||||||||||||||||||||||||
| Metal binding | 815 | 1 | Magnesium; catalytic; for reverse transcriptase activity By similarity | ||||||||||||||||||||||||||||||||||||||||||||||
| Metal binding | 890 | 1 | Magnesium; catalytic; for reverse transcriptase activity By similarity | ||||||||||||||||||||||||||||||||||||||||||||||
| Metal binding | 891 | 1 | Magnesium; catalytic; for reverse transcriptase activity By similarity | ||||||||||||||||||||||||||||||||||||||||||||||
| Metal binding | 1158 | 1 | Magnesium; catalytic; for RNase H activity By similarity | ||||||||||||||||||||||||||||||||||||||||||||||
| Metal binding | 1192 | 1 | Magnesium; catalytic; for RNase H activity By similarity | ||||||||||||||||||||||||||||||||||||||||||||||
| Metal binding | 1213 | 1 | Magnesium; catalytic; for RNase H activity By similarity | ||||||||||||||||||||||||||||||||||||||||||||||
| Metal binding | 1272 | 1 | Magnesium; catalytic; for RNase H activity By similarity | ||||||||||||||||||||||||||||||||||||||||||||||
| Metal binding | 1344 | 1 | Magnesium; catalytic; for integrase activity By similarity | ||||||||||||||||||||||||||||||||||||||||||||||
| Metal binding | 1401 | 1 | Magnesium; catalytic; for integrase activity By similarity | ||||||||||||||||||||||||||||||||||||||||||||||
| Metal binding | 1437 | 1 | Magnesium; catalytic; for integrase activity By similarity | ||||||||||||||||||||||||||||||||||||||||||||||
| Site | 155 – 156 | 2 | Cleavage; by viral protease p15 | ||||||||||||||||||||||||||||||||||||||||||||||
| Site | 166 – 167 | 2 | Cleavage; by viral protease p15 | ||||||||||||||||||||||||||||||||||||||||||||||
| Site | 177 – 178 | 2 | Cleavage; by viral protease p15 | ||||||||||||||||||||||||||||||||||||||||||||||
| Site | 239 – 240 | 2 | Cleavage; by viral protease p15 | ||||||||||||||||||||||||||||||||||||||||||||||
| Site | 479 – 480 | 2 | Cleavage; by viral protease p15 | ||||||||||||||||||||||||||||||||||||||||||||||
| Site | 488 – 489 | 2 | Cleavage; by viral protease p15 | ||||||||||||||||||||||||||||||||||||||||||||||
| Site | 577 – 578 | 2 | Cleavage; by viral protease p15 | ||||||||||||||||||||||||||||||||||||||||||||||
| Site | 708 – 709 | 2 | Cleavage; by viral protease p15 | ||||||||||||||||||||||||||||||||||||||||||||||
| Site | 1280 – 1281 | 2 | Cleavage; by viral protease p15 | ||||||||||||||||||||||||||||||||||||||||||||||
| Site | 1567 – 1568 | 2 | Cleavage; by viral protease p15 | ||||||||||||||||||||||||||||||||||||||||||||||
Natural variations | |||||||||||||||||||||||||||||||||||||||||||||||||
| Natural variant | 722 | 1 | P → S. | ||||||||||||||||||||||||||||||||||||||||||||||
| Natural variant | 724 | 1 | H → R. | ||||||||||||||||||||||||||||||||||||||||||||||
| Natural variant | 884 | 1 | C → R. | ||||||||||||||||||||||||||||||||||||||||||||||
| Natural variant | 907 | 1 | E → K. | ||||||||||||||||||||||||||||||||||||||||||||||
| Natural variant | 955 | 1 | A → T. | ||||||||||||||||||||||||||||||||||||||||||||||
| Natural variant | 1012 | 1 | R → Q. | ||||||||||||||||||||||||||||||||||||||||||||||
| Natural variant | 1182 | 1 | A → V. | ||||||||||||||||||||||||||||||||||||||||||||||
| Natural variant | 1243 | 1 | S → G. | ||||||||||||||||||||||||||||||||||||||||||||||
| Natural variant | 1575 | 1 | E → G. | ||||||||||||||||||||||||||||||||||||||||||||||
| Natural variant | 1577 | 1 | E → K. | ||||||||||||||||||||||||||||||||||||||||||||||
Experimental info | |||||||||||||||||||||||||||||||||||||||||||||||||
| Sequence conflict | 756 | 1 | E → V in CAA48535. Ref.3 | ||||||||||||||||||||||||||||||||||||||||||||||
| Sequence conflict | 1206 | 1 | T → A in CAA48535. Ref.3 | ||||||||||||||||||||||||||||||||||||||||||||||
| Sequence conflict | 1274 | 1 | Q → K in CAA48535. Ref.3 | ||||||||||||||||||||||||||||||||||||||||||||||
| Sequence conflict | 1381 | 1 | V → A in CAA48535. Ref.3 | ||||||||||||||||||||||||||||||||||||||||||||||
Secondary structure | |||||||||||||||||||||||||||||||||||||||||||||||||
Helix Strand Turn | |||||||||||||||||||||||||||||||||||||||||||||||||
| Beta strand | 1339 – 1347 | 9 | |||||||||||||||||||||||||||||||||||||||||||||||
| Helix | 1349 – 1351 | 3 | |||||||||||||||||||||||||||||||||||||||||||||||
| Beta strand | 1356 – 1362 | 7 | |||||||||||||||||||||||||||||||||||||||||||||||
| Turn | 1363 – 1365 | 3 | |||||||||||||||||||||||||||||||||||||||||||||||
| Beta strand | 1368 – 1373 | 6 | |||||||||||||||||||||||||||||||||||||||||||||||
| Helix | 1378 – 1391 | 14 | |||||||||||||||||||||||||||||||||||||||||||||||
| Beta strand | 1396 – 1399 | 4 | |||||||||||||||||||||||||||||||||||||||||||||||
| Turn | 1405 – 1407 | 3 | |||||||||||||||||||||||||||||||||||||||||||||||
| Helix | 1409 – 1417 | 9 | |||||||||||||||||||||||||||||||||||||||||||||||
| Beta strand | 1421 – 1423 | 3 | |||||||||||||||||||||||||||||||||||||||||||||||
| Helix | 1434 – 1451 | 18 | |||||||||||||||||||||||||||||||||||||||||||||||
| Helix | 1462 – 1464 | 3 | |||||||||||||||||||||||||||||||||||||||||||||||
| Helix | 1468 – 1477 | 10 | |||||||||||||||||||||||||||||||||||||||||||||||
| Helix | 1488 – 1493 | 6 | |||||||||||||||||||||||||||||||||||||||||||||||
| Beta strand | 1503 – 1507 | 5 | |||||||||||||||||||||||||||||||||||||||||||||||
| Beta strand | 1518 – 1521 | 4 | |||||||||||||||||||||||||||||||||||||||||||||||
| Beta strand | 1527 – 1530 | 4 | |||||||||||||||||||||||||||||||||||||||||||||||
| Turn | 1532 – 1534 | 3 | |||||||||||||||||||||||||||||||||||||||||||||||
| Beta strand | 1537 – 1540 | 4 | |||||||||||||||||||||||||||||||||||||||||||||||
| Helix | 1542 – 1544 | 3 | |||||||||||||||||||||||||||||||||||||||||||||||
| Beta strand | 1545 – 1547 | 3 | |||||||||||||||||||||||||||||||||||||||||||||||
Sequences
| ||||||||||||||||||||||||
References
| [1] | "Nucleotide sequence of Rous sarcoma virus." Schwartz D., Tizard R., Gilbert W. Cell 32:853-869(1983) [PubMed: 6299578] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA]. |
| [2] | "Rous sarcoma virus encodes a transcriptional activator." Broome S., Gilbert W. Cell 40:537-546(1985) [PubMed: 2982497] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [GENOMIC RNA], POST-TRANSLATIONAL MODIFICATIONS. |
| [3] | "Complete nucleotide sequence of Rous sarcoma virus variants adapted to duck cells." Kashuba V.I., Kavsan V.M., Ryndich A.V., Lazurkevich Z.V., Zubak S.V., Popov S.V., Dostalova V., Glozhanek I. Mol. Biol. (Mosk.) 27:436-450(1993) [PubMed: 8387633] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA]. |
| [4] | Chappey C. Submitted (NOV-1997) to the EMBL/GenBank/DDBJ databases Cited for: NUCLEOTIDE SEQUENCE [GENOMIC RNA]. |
| [5] | "Comparative studies on the substrate specificity of avian myeloblastosis virus proteinase and lentiviral proteinases." Tozser J., Bagossi P., Weber I.T., Copeland T.D., Oroszlan S. J. Biol. Chem. 271:6781-6788(1996) [PubMed: 8636100] [Abstract] Cited for: PROTEOLYTIC PROCESSING OF POLYPROTEIN. |
| [6] | "Asymmetric subunit organization of heterodimeric Rous sarcoma virus reverse transcriptase alphabeta: localization of the polymerase and RNase H active sites in the alpha subunit." Werner S., Woehrl B.M. J. Virol. 74:3245-3252(2000) [PubMed: 10708441] [Abstract] Cited for: SUBUNIT. |
| [7] | "Structural basis for specificity of retroviral proteases." Wu J., Adomat J.M., Ridky T.W., Louis J.M., Leis J., Harrison R.W., Weber I.T. Biochemistry 37:4518-4526(1998) [PubMed: 9521772] [Abstract] Cited for: X-RAY CRYSTALLOGRAPHY (2.4 ANGSTROMS) OF 578-701 OF MUTANT SER-615; ILE-619; ILE-621; MET-650; ALA-677; VAL-681; ARG-682; GLY-683; SER-684 WITH INHIBITOR. |
| [8] | "Crystal structure of an active two-domain derivative of Rous sarcoma virus integrase." Yang Z.-N., Mueser T.C., Bushman F.D., Hyde C.C. J. Mol. Biol. 296:535-548(2000) [PubMed: 10669607] [Abstract] Cited for: X-RAY CRYSTALLOGRAPHY (2.53 ANGSTROMS) OF 1329-1566. Strain: Clone pATV8. |
| + | Additional computationally mapped references. |
Cross-references
Sequence databases | |||||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| EMBL GenBank DDBJ | V01197 Genomic DNA. No translation available. J02342 Genomic RNA. Translation: AAB59933.1. Different initiation. X68524 Genomic DNA. Translation: CAA48535.1. AF033808 Genomic RNA. Translation: AAC82561.1. | ||||||||||||||||||||||||
| PIR | GNFV1R. A03955. | ||||||||||||||||||||||||
| RefSeq | NP_056886.1. NC_001407.1. | ||||||||||||||||||||||||
3D structure databases | |||||||||||||||||||||||||
| PDBe RCSB PDB PDBj |
| ||||||||||||||||||||||||
| ProteinModelPortal | P03354. | ||||||||||||||||||||||||
| SMR | P03354. Positions 2-87, 214-469, 503-552, 578-701, 1329-1550. | ||||||||||||||||||||||||
| ModBase | Search... | ||||||||||||||||||||||||
Protocols and materials databases | |||||||||||||||||||||||||
| StructuralBiologyKnowledgebase | Search... | ||||||||||||||||||||||||
Family and domain databases | |||||||||||||||||||||||||
| InterPro | IPR004028. Gag_M. IPR000721. Gag_p24. IPR001037. Integrase_C_retrovir. IPR001584. Integrase_cat-core. IPR017856. Integrase_Zn-bd_dom-like_N. IPR003308. Integrase_Zn-bd_dom_N. IPR012344. Matrix_N_HIV/RSV. IPR018061. Pept_A2A_retrovirus_sg. IPR001995. Peptidase_A2_cat. IPR021109. Peptidase_aspartic. IPR001969. Peptidase_aspartic_AS. IPR009007. Peptidase_aspartic_catalytic. IPR008916. Retrov_capsid_C. IPR008919. Retrov_capsid_N. IPR010999. Retrovr_matrix_N. IPR012337. RNaseH-like_dom. IPR002156. RNaseH_domain. IPR000477. RVT. IPR013084. Znf_CCH_retrovir. IPR001878. Znf_CCHC. [Graphical view] | ||||||||||||||||||||||||
| Gene3D | G3DSA:2.30.30.10. Integrase_C. 1 hit. G3DSA:1.10.10.200. Intgrase_N_Zn_bd. 1 hit. G3DSA:1.10.150.90. Matrix_HIV/RSV_N. 1 hit. G3DSA:2.40.70.10. Pept_Aspartc_cat. 1 hit. G3DSA:1.10.1200.30. Retrov_capsid_C. 1 hit. G3DSA:1.10.375.10. Retrov_capsid_N. 1 hit. G3DSA:4.10.60.10. Znf_CCH_retrovir. 1 hit. | ||||||||||||||||||||||||
| Pfam | PF00607. Gag_p24. 1 hit. PF00552. IN_DBD_C. 1 hit. PF02022. Integrase_Zn. 1 hit. PF02813. Retro_M. 1 hit. PF00075. RNase_H. 1 hit. PF00665. rve. 1 hit. PF00077. RVP. 1 hit. PF00078. RVT_1. 1 hit. PF00098. zf-CCHC. 2 hits. [Graphical view] | ||||||||||||||||||||||||
| ProDom | PD002871. Gag_M. 1 hit. [Graphical view] [Entries sharing at least one domain] | ||||||||||||||||||||||||
| SMART | SM00343. ZnF_C2HC. 2 hits. [Graphical view] | ||||||||||||||||||||||||
| SUPFAM | SSF50122. Integrase_C. 1 hit. SSF46919. Integrase_Zn_N. 1 hit. SSF50630. Pept_Aspartic. 1 hit. SSF47353. Retrov_capsid_C. 1 hit. SSF47943. Retrov_capsid_N. 1 hit. SSF47836. Retrovir_matrix. 1 hit. SSF53098. RNaseH_fold. 2 hits. | ||||||||||||||||||||||||
| PROSITE | PS50175. ASP_PROT_RETROV. 1 hit. PS00141. ASP_PROTEASE. 1 hit. PS50994. INTEGRASE. 1 hit. PS51027. INTEGRASE_DBD. 1 hit. PS50879. RNASE_H. 1 hit. PS50878. RT_POL. 1 hit. PS50158. ZF_CCHC. 1 hit. PS50876. ZF_INTEGRASE. 1 hit. [Graphical view] | ||||||||||||||||||||||||
| ProtoNet | Search... | ||||||||||||||||||||||||
Entry information
| Entry name | POL_RSVP | ||||||||
| Accession | Primary (citable) accession number: P03354 Secondary accession number(s): O92805, Q07462, Q64983 | ||||||||
| Entry history |
| ||||||||
| Entry status | Reviewed (UniProtKB/Swiss-Prot) | ||||||||
| Annotation program | Viral Protein Annotation Program | ||||||||
Relevant documents
| Peptidase families Classification of peptidase families and list of entries |
| PDB cross-references Index of Protein Data Bank (PDB) cross-references |
| SIMILARITY comments Index of protein domains and families |

Clusters with