Reviewed, UniProtKB/Swiss-Prot P10266 (POK10_HUMAN)

Last modified January 19, 2010. Version 81.

Names and origin

Protein names
Recommended name:
    HERV-K_5q33.3 provirus ancestral Pol protein
Alternative name(s):
    HERV-K10 Pol protein
    HERV-K107 Pol protein
Including the following 3 domains:
    1- Recommended name:
            Reverse transcriptase
                Short name=RT
              EC=2.7.7.49
    2- Recommended name:
            Ribonuclease H
                Short name=RNase H
              EC=3.1.26.4
    3- Recommended name:
            Integrase
                Short name=IN
Organism: Homo sapiens (Human) [Complete proteome]
Taxonomic identifier: 9606 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Primates > Haplorrhini > Catarrhini > Hominidae > Homo

Protein attributes

Sequence length: 1014 AA.
Sequence status: Complete.
Protein existence: Inferred from homology.

General annotation (Comments)

Function

Early post-infection, the reverse transcriptase converts the viral RNA genome into double-stranded viral DNA. The RNase H domain of the reverse transcriptase performs two functions. It degrades the RNA template and specifically removes the RNA primer from the RNA/DNA hybrid. Following nuclear import, the integrase catalyzes the insertion of the linear, double-stranded viral DNA into the host cell chromosome. Endogenous Pol proteins may have kept, lost or modified their original function during evolution.

Catalytic activity

Deoxynucleoside triphosphate + DNA(n) = diphosphate + DNA(n+1).

Endonucleolytic cleavage to 5'-phosphomonoester.

Domain

The LPQG and YXDD motifs are catalytically important and conserved among many retroviruses.
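
As a quick check, the sketch below scans residues 121 to 240 of the sequence given in this entry for the LPQG motif and for the YXDD pattern, reading X as "any residue". The helper name `find_motif` and the use of a regular expression are illustrative choices, not part of the UniProt record.

```python
import re

# Residues 121-240 of P10266, copied from the sequence in this entry.
SEGMENT_START = 121
SEGMENT = (
    "IDLKDCFFTIPLAEQDCEKFAFTIPAINNKEPATRFQWKVLPQGMLNSPTICQTFVGRAL"
    "QPVREKFSDCYIIHYIDDILCAAETKDKLIDCYTFLQAEVANAGLAIASDKIQTSTPFHY"
)

def find_motif(pattern, segment=SEGMENT, start=SEGMENT_START):
    """Return (first, last, matched_text) tuples in 1-based protein coordinates."""
    hits = []
    for m in re.finditer(pattern, segment):
        first = start + m.start()      # convert 0-based offset to protein position
        last = start + m.end() - 1     # m.end() is exclusive
        hits.append((first, last, m.group()))
    return hits

lpqg_hits = find_motif("LPQG")
yxdd_hits = find_motif("Y.DD")  # "." stands in for the X of YXDD
print(lpqg_hits)
print(yxdd_hits)
```

In this segment the X position of YXDD is occupied by isoleucine, so the matched text is YIDD.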

Miscellaneous

This protein is synthesized as Gag-Pro and Gag-Pro-Pol polyprotein precursors. These polyproteins are thought, by similarity with type-B retroviruses, to be generated by -1 frameshifts occurring at the Gag-Pro and Pro-Pol gene boundaries.
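
To make the notion of a -1 frameshift concrete, here is a toy sketch. The RNA string and the slip position are invented for illustration and are not HERV-K sequence; `read_with_minus1_shift` is a hypothetical helper that only shows how the reading frame changes, not how the ribosome decides to slip.

```python
def read_with_minus1_shift(rna, slip_at):
    """Split rna into codons, slipping back one nucleotide at slip_at.

    slip_at is a 0-based index on a codon boundary of the original frame.
    Codons before it are read in frame 0; reading then resumes one
    nucleotide upstream, i.e. in the -1 frame. Incomplete trailing
    codons are dropped.
    """
    if slip_at < 3 or slip_at % 3 != 0:
        raise ValueError("slip_at must be a positive multiple of 3")
    upstream = [rna[i:i + 3] for i in range(0, slip_at, 3)]
    restart = slip_at - 1  # step back by one nucleotide
    downstream = [rna[i:i + 3] for i in range(restart, len(rna) - 2, 3)]
    return upstream, downstream

toy_rna = "AAAUUUGGGCCC"  # made-up example, not viral sequence
upstream, downstream = read_with_minus1_shift(toy_rna, 6)
print(upstream)    # codons read before the slip
print(downstream)  # codons read in the -1 frame after the slip
```

The last nucleotide of the final upstream codon is read a second time as the first nucleotide of the next codon, which is what places the downstream gene in frame.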

HERV-K_5q33.3 has a type 1 genome. The HERV-K(HML-2) family contains type 1 and type 2 genomes depending on the absence or presence of 292 nucleotides at the 5'-end of the env gene. Type 1 genomes lack a pol stop codon, leading to expression of a fusion protein containing a portion of the Env sequence.

The exact N-terminus of this protein has not been formally described.

Sequence similarities

Belongs to the beta type-B retroviral polymerase family. HERV class-II K(HML-2) subfamily.

Contains 1 integrase catalytic domain.

Contains 1 integrase-type DNA-binding domain.

Contains 1 integrase-type zinc finger.

Contains 1 reverse transcriptase domain.

Contains 1 RNase H domain.

Sequence caution

The sequence AAD51796.1 differs from that shown. Reason: Frameshift at several positions. A -1 frameshift is presumed to occur at the N-terminus, at the Pro-Pol gene boundary.

Sequence annotation (Features)

Feature key    Position(s)    Length    Description                                     Feature identifier

Molecule processing

Chain          1 – 1014       1014      HERV-K_5q33.3 provirus ancestral Pol protein    PRO_0000186769

Regions

Domain         57 – 245       189       Reverse transcriptase
Domain         460 – 589      130       RNase H
Domain         642 – 803      162       Integrase catalytic
Zinc finger    587 – 628      42        Integrase-type
DNA binding    811 – 859      49        Integrase-type
Motif          161 – 164      4         LPQG
Motif          195 – 198      4         YXDD
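
The Length column follows from the inclusive positions as last - first + 1. A minimal sketch that re-checks every row of the feature list above; the tuple layout and the name `span_length` are assumptions made for this example.

```python
# (feature key, description, first residue, last residue, stated length)
FEATURES = [
    ("Chain", "HERV-K_5q33.3 provirus ancestral Pol protein", 1, 1014, 1014),
    ("Domain", "Reverse transcriptase", 57, 245, 189),
    ("Domain", "RNase H", 460, 589, 130),
    ("Domain", "Integrase catalytic", 642, 803, 162),
    ("Zinc finger", "Integrase-type", 587, 628, 42),
    ("DNA binding", "Integrase-type", 811, 859, 49),
    ("Motif", "LPQG", 161, 164, 4),
    ("Motif", "YXDD", 195, 198, 4),
]

def span_length(first, last):
    """Length of an inclusive 1-based residue range."""
    return last - first + 1

# Collect any rows whose stated length disagrees with their positions.
mismatches = [
    (key, desc)
    for key, desc, first, last, stated in FEATURES
    if span_length(first, last) != stated
]
print(mismatches)
```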

Experimental info

Feature key          Position    Length    Description
Sequence conflict    1           1         N → M (Ref. 2)
Sequence conflict    496         1         D → A (Ref. 3)
Sequence conflict    496         1         D → A (Ref. 4)
Sequence conflict    563         1         Y → H in AAD51796 (Ref. 4)
Sequence conflict    674         1         S → SYS (Ref. 4)
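
Each conflict names the residue found in this entry's sequence at the given position, so the list can be cross-checked against the sequence itself. The sketch below does this for the four distinct positions, using only the 60-residue rows that contain them; `ROWS` and `residue_at` are hypothetical names introduced for the example.

```python
# 60-residue rows copied from the sequence in this entry, keyed by the
# 1-based position of their first residue.
ROWS = {
    1:   "NKSRKRRNRV SFLGAVTVEP PKPIPLTWKT EKPVWVNQWP LPKQKLEALH LLANEQLEKG",
    481: "PKERVIKTPY QSAQRDELVA VITVLQDFDQ PINIISDSAY VVQATRDVET ALIKYSMDDQ",
    541: "LNQLFNLLQQ TVRKRNFPFY ITYIRAHTNL PGPLTKANEQ ADLLVSSALI KAQELHALTH",
    661: "GRLSYVHVTV DTYSHFIWAT CQTGESTSHV KKHLLSCFAV MGVPEKIKTD NGPGYCSKAF",
}

def residue_at(position):
    """Return the residue at a 1-based position, if its row is in ROWS."""
    for start, row in ROWS.items():
        residues = row.replace(" ", "")
        if start <= position < start + len(residues):
            return residues[position - start]
    raise KeyError(f"position {position} is not covered by ROWS")

# (position, residue expected in this entry's sequence)
CONFLICT_SITES = [(1, "N"), (496, "D"), (563, "Y"), (674, "S")]
observed = [(pos, residue_at(pos)) for pos, _ in CONFLICT_SITES]
print(observed)
```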

Sequences

Sequence: P10266-1 [UniParc]
Length: 1,014 AA
Mass: 114,827 Da

Last modified September 13, 2004. Version 2.
Checksum: 7E320F8F5DEE19FE

        10         20         30         40         50         60 
NKSRKRRNRV SFLGAVTVEP PKPIPLTWKT EKPVWVNQWP LPKQKLEALH LLANEQLEKG 

        70         80         90        100        110        120 
HIEPSFSPWN SPVFVIQKKS GKWHTLTDLR AVNAVIQPMG PLQPGLPSPA MIPKDWPLII 

       130        140        150        160        170        180 
IDLKDCFFTI PLAEQDCEKF AFTIPAINNK EPATRFQWKV LPQGMLNSPT ICQTFVGRAL 

       190        200        210        220        230        240 
QPVREKFSDC YIIHYIDDIL CAAETKDKLI DCYTFLQAEV ANAGLAIASD KIQTSTPFHY 

       250        260        270        280        290        300 
LGMQIENRKI KPQKIEIRKD TLKTLNDFQK LLGDINWIRP TLGIPTYAMS NLFSILRGDS 

       310        320        330        340        350        360 
DLNSQRILTP EATKEIKLVE EKIQSAQINR IDPLAPLQLL IFATAHSPTG IIIQNTDLVE 

       370        380        390        400        410        420 
WSFLPHSTVK TFTLYLDQIA TLIGQTRLRI TKLCGNDPDK IVVPLTKEQV RQAFINSGAW 

       430        440        450        460        470        480 
QIGLANFVGL IDNHYPKTKI FQFLKLTTWI LPKITRREPL ENALTVFTDG SSNGKAAYTG 

       490        500        510        520        530        540 
PKERVIKTPY QSAQRDELVA VITVLQDFDQ PINIISDSAY VVQATRDVET ALIKYSMDDQ 

       550        560        570        580        590        600 
LNQLFNLLQQ TVRKRNFPFY ITYIRAHTNL PGPLTKANEQ ADLLVSSALI KAQELHALTH 

       610        620        630        640        650        660 
VNAAGLKNKF DVTWKQAKDI VQHCTQCQVL HLPTQEAGVN PRGLCPNALW QMDVTHVPSF 

       670        680        690        700        710        720 
GRLSYVHVTV DTYSHFIWAT CQTGESTSHV KKHLLSCFAV MGVPEKIKTD NGPGYCSKAF 

       730        740        750        760        770        780 
QKFLSQWKIS HTTGIPYNSQ GQAIVERTNR TLKTQLVKQK EGGDSKECTT PQMQLNLALY 

       790        800        810        820        830        840 
TLNFLNIYRN QTTTSAEQHL TGKKNSPHEG KLIWWKDNKN KTWEIGKVIT WGRGFACVSP 

       850        860        870        880        890        900 
GENQLPVWLP TRHLKFYNEP IGDAKKRAST EMVTPVTWMD NPIEVYVNDS IWVPGPIDDR 

       910        920        930        940        950        960 
CPAKPEEEGM MINISIGYRY PPICLGRAPG CLMPAVQNWL VEVPTVSPIS RFTYHMVSGM 

       970        980        990       1000       1010 
SLRPRVNYLQ DFSYQRSLKF RPKGKPCPKE IPKESKNTEV LVWEECVANS AVIL 
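
The block layout above (a ruler line of positions followed by groups of ten residues) has to be flattened before the sequence can be used programmatically. A minimal sketch, assuming the layout is exactly as shown here; `parse_sequence_block` is a hypothetical helper and the sample is limited to the first 120 residues.

```python
def parse_sequence_block(text):
    """Join residue lines, skipping blank lines and numeric ruler lines."""
    residues = []
    for line in text.splitlines():
        tokens = line.split()
        if not tokens or all(tok.isdigit() for tok in tokens):
            continue  # blank line or position ruler such as "10 20 30"
        residues.append("".join(tokens))
    return "".join(residues)

sample = """
        10         20         30         40         50         60
NKSRKRRNRV SFLGAVTVEP PKPIPLTWKT EKPVWVNQWP LPKQKLEALH LLANEQLEKG

        70         80         90        100        110        120
HIEPSFSPWN SPVFVIQKKS GKWHTLTDLR AVNAVIQPMG PLQPGLPSPA MIPKDWPLII
"""
sequence = parse_sequence_block(sample)
print(len(sequence))
print(sequence[:10])
```

Applied to the full block, the resulting string should have the 1014 residues stated under Protein attributes.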

References

[1] "The DNA sequence and comparative analysis of human chromosome 5."
Schmutz J., Martin J., Terry A., Couronne O., Grimwood J., Lowry S., Gordon L.A., Scott D., Xie G., Huang W., Hellsten U., Tran-Gyamfi M., She X., Prabhakar S., Aerts A., Altherr M., Bajorek E., Black S., Branscomb E., Caoile C., Challacombe J.F., Chan Y.M., Denys M., Detter J.C., Escobar J., Flowers D., Fotopulos D., Glavina T., Gomez M., Gonzales E., Goodstein D., Grigoriev I., Groza M., Hammon N., Hawkins T., Haydu L., Israni S., Jett J., Kadner K., Kimball H., Kobayashi A., Lopez F., Lou Y., Martinez D., Medina C., Morgan J., Nandkeshwar R., Noonan J.P., Pitluck S., Pollard M., Predki P., Priest J., Ramirez L., Retterer J., Rodriguez A., Rogers S., Salamov A., Salazar A., Thayer N., Tice H., Tsai M., Ustaszewska A., Vo N., Wheeler J., Wu K., Yang J., Dickson M., Cheng J.-F., Eichler E.E., Olsen A., Pennacchio L.A., Rokhsar D.S., Richardson P., Lucas S.M., Myers R.M., Rubin E.M.
Nature 431:268-274(2004) [PubMed: 15372022]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
[2] "Nucleotide sequence of human endogenous retrovirus genome related to the mouse mammary tumor virus genome."
Ono M., Yasunaga T., Miyata T., Ushikubo H.
J. Virol. 60:589-598(1986) [PubMed: 3021993]
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 1-740.
[3] "Characterization of human endogenous retrovirus type K (HERV-K) virus-like particles generated from recombinant baculoviruses."
Toenjes R.R., Boller K., Limbach R., Lugert R., Kurth R.
Virology 233:280-291(1997) [PubMed: 9217052]
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 1-740.
[4] "Many human endogenous retrovirus K (HERV-K) proviruses are unique to humans."
Barbulescu M., Turner G., Seaman M.I., Deinard A.S., Kidd K.K., Lenz J.
Curr. Biol. 9:861-868(1999) [PubMed: 10469592]
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 1-830.

Cross-references

Sequence databases

EMBL / GenBank / DDBJ:
    AC016577 Genomic DNA. No translation available.
    M14123 Genomic DNA. Translation: AAA88033.1. Sequence problems.
    Y10391 Genomic DNA. Translation: CAA71417.1.
    AF164613 Genomic DNA. Translation: AAD51796.1. Sequence problems.
IPI: IPI00382531.
PIR: GNHUER. D24483.

3D structure databases

SMR: P10266. Positions 24-423, 589-629, 643-785.
ModBase: Search...

Protein-protein interaction databases

STRING: P10266.

Proteomic databases

PRIDE: P10266.

Organism-specific databases

H-InvDB:
    HIX0056908.
    HIX0057602.

Phylogenomic databases

HOVERGEN: P10266.

Enzyme and pathway databases

BRENDA:
    2.7.7.49. 247.
    3.1.26.4. 247.

Gene expression databases

Genevestigator: P10266.

Family and domain databases

InterPro:
    IPR001037. Integrase_C_retrovir.
    IPR001584. Integrase_cat-core.
    IPR003308. Integrase_Zn-bd_dom_N.
    IPR012337. PolynucTfrase_RNaseH_fold.
    IPR000477. Reverse_transcriptase.
    IPR002156. RNase_H.
    IPR010661. RVT_thumb.
Pfam:
    PF00552. Integrase. 1 hit.
    PF02022. Integrase_Zn. 1 hit.
    PF00075. RnaseH. 1 hit.
    PF00665. rve. 1 hit.
    PF00078. RVT_1. 1 hit.
    PF06817. RVT_thumb. 1 hit.
PROSITE:
    PS50994. INTEGRASE. 1 hit.
    PS51027. INTEGRASE_DBD. 1 hit.
    PS50879. RNASE_H. 1 hit.
    PS50878. RT_POL. 1 hit.
    PS50876. ZF_INTEGRASE. 1 hit.
ProtoNet: Search...

Entry information

Entry name: POK10_HUMAN
Accession:
    Primary (citable) accession number: P10266
    Secondary accession number(s): P87890, Q14273
Entry history:
    Integrated into UniProtKB/Swiss-Prot: July 1, 1989
    Last sequence update: September 13, 2004
    Last modified: January 19, 2010
    This is version 81 of the entry and version 2 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation project: HPI (Human Proteome Initiative)
Disclaimer: Any medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care.

Relevant documents

Human chromosome 5

Human chromosome 5: entries, gene names and cross-references to MIM

SIMILARITY comments

Index of protein domains and families