Skip Header

 
Contribute Send feedback
Read comments (0) or add your own

Reviewed, UniProtKB/Swiss-Prot P03354 (POL_RSVP)

Last modified June 16, 2009. Version 81. Feed History...

Clusters with 100%, 90%, 50% identity | Documents (2) | Third-party data | Customize display text xml rdf/xml gff fasta
Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Sequence annotation (Features) · Sequences · References · Cross-references · Entry information · Relevant documents

Names and origin

Protein namesRecommended name:
    Pol polyprotein
Cleaved into the following 2 chains:
    1- Recommended name:
            Reverse transcriptase/ribonuclease H alpha-subunit
                Short name=RT
              EC=2.7.7.49
              EC=2.7.7.7
              EC=3.1.26.4
    2- Recommended name:
            Integrase
                Short name=IN
        Alternative name(s):
            pp32
Gene names
Name: pol
OrganismRous sarcoma virus (strain Prague C) (RSV-PrC)
Taxonomic identifier11888 [NCBI]
Taxonomic lineageVirusesRetro-transcribing virusesRetroviridaeOrthoretrovirinaeAlpharetrovirus
Virus hostGallus gallus (Chicken) [TaxID: 9031]

Protein attributes

Sequence length895 AA.
Sequence statusComplete.
Sequence processingThe displayed sequence is further processed into a mature form.
Protein existenceEvidence at protein level.

General annotation (Comments)

Function

During replicative cycle of retroviruses, the reverse-transcribed viral DNA is integrated into the host chromosome by the viral integrase enzyme. RNase H activity is associated with the reverse transcriptase.

Catalytic activity

Deoxynucleoside triphosphate + DNA(n) = diphosphate + DNA(n+1).

Endonucleolytic cleavage to 5'-phosphomonoester.

Cofactor

Binds 8 manganese or magnesium ions per integrase homotetramer By similarity.

Subunit structure

The integrase forms a homotetramer. Reverse transcriptase is a heterodimer of alpha and beta subunits.

Domain

The integrase is composed of three domains. The N-terminal domain is a zinc binding domain. The central domain is the catalytic domain. The C-terminal domain is a non-specific DNA binding domain.

Post-translational modification

Specific enzymatic cleavages in vivo yield mature proteins. The pol-derived portion of gag-pol precursor is processed to yield the reverse transcriptase beta subunit, which in turn is processed to yield the reverse transcriptase alpha subunit and the integrase.

Miscellaneous

This protein is synthesized as a Gag-Pol polyprotein.

Sequence similarities

Contains 1 integrase catalytic domain.

Contains 1 integrase-type DNA-binding domain.

Contains 1 integrase-type zinc finger.

Contains 1 reverse transcriptase domain.

Contains 1 RNase H domain.

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Chain1 – 572572Reverse transcriptase/ribonuclease H alpha-subunit Potential
PRO_0000040986
Chain573 – 895323Integrase Potential
PRO_0000040987

Regions

Domain42 – 230189Reverse transcriptase
Domain441 – 571131RNase H
Domain625 – 788164Integrase catalytic
Zinc finger573 – 61341Integrase-type
DNA binding794 – 84249Integrase-type

Sites

Metal binding6361Magnesium; catalytic; for integrase activity By similarity
Metal binding6931Magnesium; catalytic; for integrase activity By similarity
Metal binding7291Magnesium; catalytic; for integrase activity By similarity

Natural variations

Natural variant141P → S
Natural variant161H → R
Natural variant1761C → R
Natural variant1991E → K
Natural variant2471A → T
Natural variant3041R → Q
Natural variant4741A → V
Natural variant5351S → G
Natural variant8671E → G
Natural variant8691E → K

Experimental info

Sequence conflict481E → V in CAA48535. Ref.3
Sequence conflict4981T → A in CAA48535. Ref.3
Sequence conflict5661Q → K in CAA48535. Ref.3
Sequence conflict6731V → A in CAA48535. Ref.3

Secondary structure

.......................................... 895
Helix Strand Turn

Details...

Sequences

Sequence LengthMass (Da)Tools
P03354-1 [UniParc].

Last modified July 21, 1986. Version 1.
Checksum: 7C28319ED8985465

FASTA89598,662
        10         20         30         40         50         60 
TVALHLAIPL KWKPDHTPVW IDQWPLPEGK LVALTQLVEK ELQLGHIEPS LSCWNTPVFV 

        70         80         90        100        110        120 
IRKASGSYRL LHDLRAVNAK LVPFGAVQQG APVLSALPRG WPLMVLDLKD CFFSIPLAEQ 

       130        140        150        160        170        180 
DREAFAFTLP SVNNQAPARR FQWKVLPQGM TCSPTICQLV VGQVLEPLRL KHPSLCMLHY 

       190        200        210        220        230        240 
MDDLLLAASS HDGLEAAGEE VISTLERAGF TISPDKVQRE PGVQYLGYKL GSTYVAPVGL 

       250        260        270        280        290        300 
VAEPRIATLW DVQKLVGSLQ WLRPALGIPP RLMGPFYEQL RGSDPNEARE WNLDMKMAWR 

       310        320        330        340        350        360 
EIVRLSTTAA LERWDPALPL EGAVARCEQG AIGVLGQGLS THPRPCLWLF STQPTKAFTA 

       370        380        390        400        410        420 
WLEVLTLLIT KLRASAVRTF GKEVDILLLP ACFREDLPLP EGILLALKGF AGKIRSSDTP 

       430        440        450        460        470        480 
SIFDIARPLH VSLKVRVTDH PVPGPTVFTD ASSSTHKGVV VWREGPRWEI KEIADLGASV 

       490        500        510        520        530        540 
QQLEARAVAM ALLLWPTTPT NVVTDSAFVA KMLLKMGQEG VPSTAAAFIL EDALSQRSAM 

       550        560        570        580        590        600 
AAVLHVRSHS EVPGFFTEGN DVADSQATFQ AYPLREAKDL HTALHIGPRA LSKACNISMQ 

       610        620        630        640        650        660 
QAREVVQTCP HCNSAPALEA GVNPRGLGPL QIWQTDFTLE PRMAPRSWLA VTVDTASSAI 

       670        680        690        700        710        720 
VVTQHGRVTS VAVQHHWATA IAVLGRPKAI KTDNGSCFTS KSTREWLARW GIAHTTGIPG 

       730        740        750        760        770        780 
NSQGQAMVER ANRLLKDRIR VLAEGDGFMK RIPTSKQGEL LAKAMYALNH FERGENTKTP 

       790        800        810        820        830        840 
IQKHWRPTVL TEGPPVKIRI ETGEWEKGWN VLVWGRGYAA VKNRDTDKVI WVPSRKVKPD 

       850        860        870        880        890 
ITQKDEVTKK DEASPLFAGI SDWIPWEDEQ EGLQGETASN KQERPGEDTL AANES 

« Hide

References

[1]"Nucleotide sequence of Rous sarcoma virus."
Schwartz D., Tizard R., Gilbert W.
Cell 32:853-869(1983) [PubMed: 6299578] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE.
[2]"Rous sarcoma virus encodes a transcriptional activator."
Broome S., Gilbert W.
Cell 40:537-546(1985) [PubMed: 2982497] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE, POST-TRANSLATIONAL MODIFICATIONS.
[3]"Complete nucleotide sequence of Rous sarcoma virus variants adapted to duck cells."
Kashuba V.I., Kavsan V.M., Ryndich A.V., Lazurkevich Z.V., Zubak S.V., Popov S.V., Dostalova V., Glozhanek I.
Mol. Biol. (Mosk.) 27:436-450(1993) [PubMed: 8387633] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA].
[4]"Crystal structure of an active two-domain derivative of Rous sarcoma virus integrase."
Yang Z.-N., Mueser T.C., Bushman F.D., Hyde C.C.
J. Mol. Biol. 296:535-548(2000) [PubMed: 10669607] [Abstract]
Cited for: X-RAY CRYSTALLOGRAPHY (2.53 ANGSTROMS) OF 621-858.
Strain: Clone pATV8.

Cross-references

Sequence databases

V01197 Genomic DNA. No translation available.
J02342 Genomic RNA. Translation: AAB59933.1. Different initiation.
X68524 Genomic DNA. Translation: CAA48535.1.
PIRGNFV1R. A03955.

3D structure databases

EntryMethodResolution (Å)ChainPositionsPDBsum
1C0MX-ray2.53A/B/C/D621-858[»]
1C1AX-ray3.10A/B621-858[»]
ModBaseSearch...

Family and domain databases

InterProIPR000477. DNA_pol_RVTase.
IPR001037. Integrase_C_retrovir.
IPR001584. Integrase_cat-core.
IPR003308. Integrase_Zn-bd_dom_N.
IPR002156. RNase_H.
IPR010661. RVT_thumb.
[Graphical view]
Gene3DG3DSA:2.30.30.10. Integrase_C. 1 hit.
PfamPF00552. Integrase. 1 hit.
PF02022. Integrase_Zn. 1 hit.
PF00075. RnaseH. 1 hit.
PF00665. rve. 1 hit.
PF00078. RVT_1. 1 hit.
PF06817. RVT_thumb. 1 hit.
[Graphical view]
PROSITEPS50994. INTEGRASE. 1 hit.
PS51027. INTEGRASE_DBD. 1 hit.
PS50879. RNASE_H. 1 hit.
PS50878. RT_POL. 1 hit.
PS50876. ZF_INTEGRASE. 1 hit.
[Graphical view]
ProtoNetSearch...

Entry information

Entry namePOL_RSVP
AccessionPrimary (citable) accession number: P03354
Secondary accession number(s): Q07462, Q64983
Entry history
Integrated into UniProtKB/Swiss-Prot: July 21, 1986
Last sequence update: July 21, 1986
Last modified: June 16, 2009
This is version 81 of the entry and version 1 of the sequence. [Complete history]
Entry statusReviewed (UniProtKB/Swiss-Prot)
Annotation projectVirus (Virus annotation project)

Relevant documents

PDB cross-references

Index of Protein Data Bank (PDB) cross-references

SIMILARITY comments

Index of protein domains and families

Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Sequence annotation (Features) · Sequences · References · Cross-references · Entry information · Relevant documents