Skip Header

Contribute Send feedback
Read comments (0) or add your own

Reviewed, UniProtKB/Swiss-Prot O71304 (DPOL_WMHBV)

Last modified January 19, 2010. Version 44. Feed History...

Clusters with 100%, 90%, 50% identity | Documents (1) | Third-party data | Customize display text xml rdf/xml gff fasta
Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Sequence annotation (Features) · Sequences · References · Cross-references · Entry information · Relevant documents

Names and origin

Protein namesRecommended name:
    Protein P
Including the following 3 domains:
    1- Recommended name:
            DNA-directed DNA polymerase
              EC=2.7.7.7
    2- Recommended name:
            RNA-directed DNA polymerase
              EC=2.7.7.49
    3- Recommended name:
            Ribonuclease H
              EC=3.1.26.4
Gene names
Name: P
OrganismWoolly monkey hepatitis B virus (isolate Louisville) (WMHBV) [Complete proteome]
Taxonomic identifier490134 [NCBI]
Taxonomic lineageVirusesRetro-transcribing virusesHepadnaviridaeOrthohepadnavirus
Virus hostLagothrix lagotricha (Common woolly monkey) [TaxID: 9519]

Protein attributes

Sequence length835 AA.
Sequence statusComplete.
Protein existenceInferred from homology.

General annotation (Comments)

Function

Multifunctional enzyme that converts the viral RNA genome into dsDNA in viral cytoplasmic capsids. This enzyme displays a DNA polymerase activity that can copy either DNA or RNA templates, and a ribonuclease H (RNase H) activity that cleaves the RNA strand of RNA-DNA heteroduplexes in a partially processive 3'- to 5'-endonucleasic mode. Neo-synthesized pregenomic RNA (pgRNA) are encapsidated together with the P protein, and reverse-transcribed inside the nucleocapsid. Initiation of reverse-transcription occurs first by binding the epsilon loop on the pgRNA genome, and is initiated by protein priming, thereby the 5'-end of (-)DNA is covalently linked to P protein. Partial (+)DNA is synthesized from the (-)DNA template and generates the relaxed circular DNA (RC-DNA) genome. After budding and infection, the RC-DNA migrates in the nucleus, and is converted into a plasmid-like covalently closed circular DNA (cccDNA). The activity of P protein does not seem to be necessary for cccDNA generation, and is presumably released from (+)DNA by host nuclear DNA repair machinery By similarity.

Catalytic activity

Deoxynucleoside triphosphate + DNA(n) = diphosphate + DNA(n+1).

Endonucleolytic cleavage to 5'-phosphomonoester.

Enzyme regulation

Activated by host HSP70 and HSP40 in vitro to be able to bind the epsilon loop of the pgRNA. Because deletion of the RNase H region renders the protein partly chaperone-independent, the chaperones may be needed indirectly to releive occlusion of the RNA-binding site by this domain. Inhibited by several reverse-transcriptase inhibitors: Lamivudine, Adefovir and Entecavir By similarity.

Domain

Terminal protein domain (TP) is hepadnavirus-specific. Spacer domain is highly variable and separates the TP and RT domains. Polymerase/reverse-transcriptase domain (RT) and ribonuclease H domain (RH) are similar to retrovirus reverse transcriptase/RNase H By similarity.

The polymerase/reverse transcriptase (RT) and ribonuclease H (RH) domains are structured in five subdomains: finger, palm, thumb, connection and RNase H. Within the palm subdomain, the 'primer grip' region is thought to be involved in the positioning of the primer terminus for accommodating the incoming nucleotide. The RH domain stabilizes the association of RT with primer-template By similarity.

Sequence similarities

Belongs to the hepadnaviridae P protein family.

Contains 1 reverse transcriptase domain.

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Chain1 – 835835Protein P
PRO_0000323286

Regions

Domain345 – 590246Reverse transcriptase
Region1 – 176176Terminal protein domain (TP) By similarity
Region177 – 334158Spacer By similarity
Region335 – 680346Polymerase/reverse transcriptase domain (RT) By similarity
Region681 – 835155RnaseH domain (RH) By similarity

Sites

Metal binding4171Magnesium; catalytic By similarity
Metal binding5411Magnesium; catalytic By similarity
Metal binding5421Magnesium; catalytic By similarity
Site621Priming of reverse-transcription by covalently linking the first nucleotide of the (-)DNA By similarity

Sequences

Sequence LengthMass (Da)Tools
O71304-1 [UniParc].

Last modified August 1, 1998. Version 1.
Checksum: 88623B654D96107F

FASTA83593,843
        10         20         30         40         50         60 
MPLSYQHFRK LLLLDEGDPL EDALPRLADE DLNRRVAEGL NLQHLPVSIP WTHKVGPFSG 

        70         80         90        100        110        120 
LYSVSTLTFN PQWKTPQFPL IHLKEDLIPF IESYFGPLTS NEKRRLKLVL PARFYPKATK 

       130        140        150        160        170        180 
YFPLEKGIKP HYPNDVVNHY YQVQHYLHTL WEAGVLYKRE TTHSASFFGT PYTWEHKLQH 

       190        200        210        220        230        240 
GTQPVNVQPA GILSQSSAGP PVQGQCRLSR LGQKSKQGPL ATSPRHGSGG LWSRTSATPW 

       250        260        270        280        290        300 
RPSGVEFTSS GFVCHSARHP SSSINQSRQR KETNTSYSSS ERHSPTSHDL EHVLLPELSS 

       310        320        330        340        350        360 
ESKGQRPLLS CWWLNFKHCQ PCSDHCLHHI VKLLDDWGPC QHHGHHFIRI PRTPSRITGG 

       370        380        390        400        410        420 
VFLVDKNPHN ATESRLVVDF SQFSRGNTSV SWPKFAVPNL QSLTNLLSTD LSWVSLDVFA 

       430        440        450        460        470        480 
AFYHLPLHPA SMPHLLVGSS GLPRYVARVS SSTNSYRNNN NNGTLQDLHA NCSRHLFVSL 

       490        500        510        520        530        540 
MLLYQTYGRK LHLYSHPLIM GFRKVPMGLG LSPFLLAQFT SAICSVVRRA FPHCMAFSYM 

       550        560        570        580        590        600 
DDVVLGAKSV QHLESLLASV TTFLLALGIH LNPEKTKRWG KALNFMGYVI GGYGSLPQQH 

       610        620        630        640        650        660 
IRDKIALCFQ KLPCNRPIDW KVCQRIVGLL GFVAPFTQCG YAALMPIYTC IQKHQAFTFS 

       670        680        690        700        710        720 
LVYKTFLKDQ YMHLYPVARQ RAGHCQVFAD ATPTGWGLVM GNQRMRGTFL SPLPIHTAEL 

       730        740        750        760        770        780 
LAACFARCWS GAKLIGTDNA VVLSRKYTHF PWLLGCAATW ILRGTCFVYV PSKLNPADDP 

       790        800        810        820        830 
SRGCLGLLKP LPRLLFQPST GRTSLYAVSP PVPFHRPGRV LFASPLQPGD AWRPP 

« Hide

References

[1]"Isolation of a hepadnavirus from the woolly monkey, a New World primate."
Lanford R.E., Chavez D., Brasky K.M., Burns R.B. III, Rico-Hesse R.
Proc. Natl. Acad. Sci. U.S.A. 95:5757-5761(1998) [PubMed: 9576957] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA].
[2]"Hepatitis B virus replication."
Beck J., Nassal M.
World J. Gastroenterol. 13:48-64(2007) [PubMed: 17206754] [Abstract]
Cited for: REVIEW.

Cross-references

Sequence databases

EMBL
GenBank
DDBJ
AF046996 Genomic DNA. Translation: AAC16908.1.

3D structure databases

SMRO71304. Positions 346-655.
ModBaseSearch...

Family and domain databases

InterProIPR001462. DNApol_viral_C.
IPR000201. DNApol_viral_N.
IPR000477. Reverse_transcriptase.
[Graphical view]
PfamPF00336. DNA_pol_viral_C. 1 hit.
PF00242. DNA_pol_viral_N. 1 hit.
PF00078. RVT_1. 1 hit.
[Graphical view]
PROSITEPS50878. RT_POL. 1 hit.
[Graphical view]
ProtoNetSearch...

Entry information

Entry nameDPOL_WMHBV
AccessionPrimary (citable) accession number: O71304
Entry history
Integrated into UniProtKB/Swiss-Prot: March 18, 2008
Last sequence update: August 1, 1998
Last modified: January 19, 2010
This is version 44 of the entry and version 1 of the sequence. [Complete history]
Entry statusReviewed (UniProtKB/Swiss-Prot)
Annotation projectVirus (Virus annotation project)

Relevant documents

SIMILARITY comments

Index of protein domains and families

Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Sequence annotation (Features) · Sequences · References · Cross-references · Entry information · Relevant documents