Reviewed, UniProtKB/Swiss-Prot P63133 (POK6_HUMAN)

Last modified January 19, 2010. Version 40.

Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Sequence annotation (Features) · Sequences · References · Cross-references · Entry information · Relevant documents

Names and origin

Protein names
Recommended name:
    HERV-K_8p23.1 provirus ancestral Pol protein
Alternative name(s):
    HERV-K115 Pol protein
Including the following 3 domains:
    1- Recommended name:
            Reverse transcriptase
                Short name=RT
              EC=2.7.7.49
    2- Recommended name:
            Ribonuclease H
                Short name=RNase H
              EC=3.1.26.4
    3- Recommended name:
            Integrase
                Short name=IN
Organism: Homo sapiens (Human) [Complete proteome]
Taxonomic identifier: 9606 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Primates > Haplorrhini > Catarrhini > Hominidae > Homo

Protein attributes

Sequence length: 956 AA.
Sequence status: Complete.
Protein existence: Inferred from homology.

General annotation (Comments)

Function

Early post-infection, the reverse transcriptase converts the viral RNA genome into double-stranded viral DNA. The RNase H domain of the reverse transcriptase performs two functions. It degrades the RNA template and specifically removes the RNA primer from the RNA/DNA hybrid. Following nuclear import, the integrase catalyzes the insertion of the linear, double-stranded viral DNA into the host cell chromosome. Endogenous Pol proteins may have kept, lost or modified their original function during evolution.

Catalytic activity

Deoxynucleoside triphosphate + DNA(n) = diphosphate + DNA(n+1).

Endonucleolytic cleavage to 5'-phosphomonoester.

Domain

The LPQG and YXDD motifs are catalytically important and conserved among many retroviruses.

Miscellaneous

This protein is synthesized as part of the Gag-Pro and Gag-Pro-Pol polyprotein precursors. These polyproteins are thought, by similarity with type-B retroviruses, to be generated by -1 frameshifts occurring at the Gag-Pro and Pro-Pol gene boundaries.
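
As a rough illustration of what a -1 frameshift does to the reading frame, the Python sketch below splits a made-up RNA string into codons and steps back one nucleotide at a chosen point. The sequence, the slip position and the function name are all hypothetical. They are not taken from HERV-K and say nothing about where the actual slip sites lie.

```python
def codons_with_minus1_shift(rna, codons_before_slip):
    """Return codons read in frame 0, then in the -1 frame after the slip."""
    slip = codons_before_slip * 3
    # Codons read normally, up to the slip site.
    head = [rna[i:i + 3] for i in range(0, slip, 3)]
    # Stepping back one nucleotide: the last base already read becomes
    # the first base of the next codon.
    tail = [rna[i:i + 3] for i in range(slip - 1, len(rna) - 2, 3)]
    return head + tail


toy_rna = "AUGAAAUUUACGGCCA"  # hypothetical sequence, for illustration only

# Reading without any slip.
in_frame = [toy_rna[i:i + 3] for i in range(0, len(toy_rna) - 2, 3)]
# Reading with a -1 slip after the third codon.
shifted = codons_with_minus1_shift(toy_rna, 3)

print(in_frame)
print(shifted)
```

After the slip every downstream codon differs from the in-frame reading, which is how a single transcript can give rise to both the shorter and the longer precursor.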

Exact N-terminus of this protein has not been formally described.

Insertional polymorphism. Provirus present in 16% of tested individuals.

Intragenic, in first intron of DEFB107 gene.

Sequence similarities

Belongs to the beta type-B retroviral polymerase family. HERV class-II K(HML-2) subfamily.

Contains 1 integrase catalytic domain.

Contains 1 integrase-type DNA-binding domain.

Contains 1 integrase-type zinc finger.

Contains 1 reverse transcriptase domain.

Contains 1 RNase H domain.

Sequence annotation (Features)

Feature key   Position(s)   Length   Description                                    Feature identifier

Molecule processing

Chain         1 – 956       956      HERV-K_8p23.1 provirus ancestral Pol protein   PRO_0000186766

Regions

Domain        57 – 245      189      Reverse transcriptase
Domain        461 – 590     130      RNase H
Domain        642 – 803     162      Integrase catalytic
Zinc finger   587 – 628     42       Integrase-type
DNA binding   811 – 859     49       Integrase-type
Motif         161 – 164     4        LPQG
Motif         195 – 198     4        YXDD
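
The Length column follows from the positions, which are 1-based and inclusive, so length = end - start + 1. The short Python sketch below re-derives each length from the rows above. The tuple layout and variable names are assumptions made for this example only.

```python
# (feature, start, end, listed length), copied from the feature table.
features = [
    ("Chain", 1, 956, 956),
    ("Domain: Reverse transcriptase", 57, 245, 189),
    ("Domain: RNase H", 461, 590, 130),
    ("Domain: Integrase catalytic", 642, 803, 162),
    ("Zinc finger: Integrase-type", 587, 628, 42),
    ("DNA binding: Integrase-type", 811, 859, 49),
    ("Motif: LPQG", 161, 164, 4),
    ("Motif: YXDD", 195, 198, 4),
]


def feature_length(start, end):
    """Length of a 1-based, inclusive position range."""
    return end - start + 1


# Collect any row whose listed length disagrees with its positions.
mismatches = [
    name for name, start, end, listed in features
    if feature_length(start, end) != listed
]
print(mismatches)
```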

Sequences

Sequence: P63133-1 [UniParc]
Length: 956
Mass (Da): 107,703
Last modified September 13, 2004. Version 1.
Checksum: 2A267FE1EF31F06B

        10         20         30         40         50         60 
NKSKKRRNRV SFLGVATIEP PKPIPLTWKT EKLVWVNQWP LPKQKLEALH LLANEQLEKG 

        70         80         90        100        110        120 
HIEPSFSPWN SPVFVIQKKS GKWRMLTDLR AVNAVIQPMG PLQPGLPSPA MIPKDWPLII 

       130        140        150        160        170        180 
IDLKDCFFTI PLAEQDCEKF AFTIPAINNK EPATRFQWKV LPQGMLNSPT ICQTFVGRAL 

       190        200        210        220        230        240 
QPVRKKFSDC YIIHYIDDIL CAAETKDKLI DCYTFLQAEV ASAGLAIASD KIQTSTPFHY 

       250        260        270        280        290        300 
LGMQIENRKI KPQKIEIRKD TLKTLNDFQK LLGDINWIQP TLGIPTYAMS NLFSILRGDS 

       310        320        330        340        350        360 
DLNSKRILTP EATKEIKLVE EKIQSAQINR IDPLAPLQLL IFATAHSPTG IIIQNTDLVE 

       370        380        390        400        410        420 
WSFLPHSTVK TFTLYLDQIA TLIGQTRLRI IKLCGNDPDK IVVPLTKEQV RQAFINSGAW 

       430        440        450        460        470        480 
QIGLANFVGI IDNHYPKTKI FQFLKLTTWI LPKITRREPL ENALTVFTDG SSNGKAAYTG 

       490        500        510        520        530        540 
PKERVIKTPY QSAQRAELVA VITVLQDFDQ PINIISDSAY VVQATRVVET ALIKYSMDDQ 

       550        560        570        580        590        600 
LNQLFNLLQQ TVRKRNFPFY ITHIRAHTNL PGPLTKANEQ ADLLVSSALI KAQELHALTH 

       610        620        630        640        650        660 
VNAAGLKNKF DVTWKQAKDI VQHCTQCQVL HLPTQEAGVN PRGLCPNALW QMDVTHVPSF 

       670        680        690        700        710        720 
GRLSYVHVTV DTYSHFIWAT CQTGESTSHV KKHLLSCFAV MGVPEKIKTD NGPGYCSKAF 

       730        740        750        760        770        780 
QKFLSQWKIS HTTGIPYNSQ GQAIVERTNR TLKTQLVKQK EGGDSKECTT PQMQLNLALY 

       790        800        810        820        830        840 
TLNFLNIYRN QTTTSAEQHL TGKKNSPHEG KLIWWKDNKN KTWEIGKVIT WGRGFACVSP 

       850        860        870        880        890        900 
GENQLPVWIP TRHLKFYNEP IRDAKKSTSA ETETPQSSTV DSQDEQNGDV RRTDEVAIHQ 

       910        920        930        940        950 
EGRAADLGTT KEADAVSYKI SREHKGDTNP REYAACSLDD CINGGKSPYA CRSSCS 
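
The numbered display above can be turned back into a plain residue string by dropping the ruler lines and the spaces between blocks of ten. The Python sketch below does this for the first four rows only (residues 1 to 240) and then looks up the two motif positions given in the feature table. The function and variable names are assumptions made for this example.

```python
import re

# First four rows of the sequence display, rulers included.
display = """
        10         20         30         40         50         60
NKSKKRRNRV SFLGVATIEP PKPIPLTWKT EKLVWVNQWP LPKQKLEALH LLANEQLEKG

        70         80         90        100        110        120
HIEPSFSPWN SPVFVIQKKS GKWRMLTDLR AVNAVIQPMG PLQPGLPSPA MIPKDWPLII

       130        140        150        160        170        180
IDLKDCFFTI PLAEQDCEKF AFTIPAINNK EPATRFQWKV LPQGMLNSPT ICQTFVGRAL

       190        200        210        220        230        240
QPVRKKFSDC YIIHYIDDIL CAAETKDKLI DCYTFLQAEV ASAGLAIASD KIQTSTPFHY
"""


def parse_display(text):
    """Join residue blocks, skipping blank lines and numeric ruler lines."""
    residues = []
    for line in text.splitlines():
        tokens = line.split()
        if not tokens or all(token.isdigit() for token in tokens):
            continue
        residues.extend(tokens)
    return "".join(residues)


def region(seq, start, end):
    """Slice a 1-based, inclusive position range out of the sequence."""
    return seq[start - 1:end]


seq = parse_display(display)
lpqg = region(seq, 161, 164)
yxdd = region(seq, 195, 198)

print(len(seq), lpqg, yxdd)
# X in YXDD stands for any residue, so match it with a wildcard.
print(bool(re.fullmatch(r"Y.DD", yxdd)))
```

Applied to all sixteen rows, the same parser should give a string of 956 residues, the length listed for the entry.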

References

[1] "Insertional polymorphisms of full-length endogenous retroviruses in humans."
Turner G., Barbulescu M., Su M., Jensen-Seaman M.I., Kidd K.K., Lenz J.
Curr. Biol. 11:1531-1535(2001) [PubMed: 11591322]
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA].

Cross-references

Sequence databases

EMBL/GenBank/DDBJ: AY037929 Genomic DNA. No translation available.
IPI: IPI00454617.

3D structure databases

SMR: P63133. Positions 23-589, 589-629, 643-785.

Proteomic databases

PRIDE: P63133.

Phylogenomic databases

HOVERGEN: P63133.

Enzyme and pathway databases

BRENDA: 2.7.7.49. 247.
        3.1.26.4. 247.

Gene expression databases

Genevestigator: P63133.

Family and domain databases

InterPro: IPR001037. Integrase_C_retrovir.
          IPR001584. Integrase_cat-core.
          IPR003308. Integrase_Zn-bd_dom_N.
          IPR012337. PolynucTfrase_RNaseH_fold.
          IPR000477. Reverse_transcriptase.
          IPR002156. RNase_H.
          IPR010661. RVT_thumb.
Pfam: PF00552. Integrase. 1 hit.
      PF02022. Integrase_Zn. 1 hit.
      PF00075. RnaseH. 1 hit.
      PF00665. rve. 1 hit.
      PF00078. RVT_1. 1 hit.
      PF06817. RVT_thumb. 1 hit.
PROSITE: PS50994. INTEGRASE. 1 hit.
         PS51027. INTEGRASE_DBD. 1 hit.
         PS50879. RNASE_H. 1 hit.
         PS50878. RT_POL. 1 hit.
         PS50876. ZF_INTEGRASE. 1 hit.

Entry information

Entry name: POK6_HUMAN
Accession: Primary (citable) accession number: P63133
Entry history:
    Integrated into UniProtKB/Swiss-Prot: September 13, 2004
    Last sequence update: September 13, 2004
    Last modified: January 19, 2010
    This is version 40 of the entry and version 1 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation project: HPI (Human Proteome Initiative)
Disclaimer: Any medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care.
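
When handling entries like this one in scripts, it can help to tell the accession (P63133) apart from the entry name (POK6_HUMAN). The Python sketch below uses the accession pattern as published in the UniProt help pages, reproduced here from memory, so treat it as an assumption and check it against the current documentation. The helper names are hypothetical.

```python
import re

# UniProtKB accession pattern (assumed to match the published format).
ACCESSION_RE = re.compile(
    r"[OPQ][0-9][A-Z0-9]{3}[0-9]"
    r"|[A-NR-Z][0-9](?:[A-Z][A-Z0-9]{2}[0-9]){1,2}"
)


def is_accession(identifier):
    """True if the whole string has the shape of a UniProtKB accession."""
    return ACCESSION_RE.fullmatch(identifier) is not None


def split_entry_name(entry_name):
    """Split an entry name into its protein and species mnemonics."""
    protein_code, species_code = entry_name.split("_", 1)
    return protein_code, species_code


protein_code, species_code = split_entry_name("POK6_HUMAN")
print(is_accession("P63133"), is_accession("POK6_HUMAN"))
print(protein_code, species_code)
```

The accession is the stable identifier to cite. The entry name is a readable mnemonic and is better treated as a label than as a key.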

Relevant documents

Human chromosome 8: entries, gene names and cross-references to MIM

SIMILARITY comments: Index of protein domains and families