P10266 (POK10_HUMAN) Reviewed, UniProtKB/Swiss-Prot

Last modified December 14, 2011. Version 93.

Names and origin

Protein names
Recommended name:
HERV-K_5q33.3 provirus ancestral Pol protein
Alternative name(s):
HERV-K10 Pol protein
HERV-K107 Pol protein

Including the following 3 domains:

  1. Reverse transcriptase
    Short name=RT
    EC=2.7.7.49
  2. Ribonuclease H
    Short name=RNase H
    EC=3.1.26.4
  3. Integrase
    Short name=IN
Organism: Homo sapiens (Human)
Taxonomic identifier: 9606 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Primates > Haplorrhini > Catarrhini > Hominidae > Homo

Protein attributes

Sequence length: 1014 AA.
Sequence status: Complete.
Protein existence: Inferred from homology

General annotation (Comments)

Function

Early post-infection, the reverse transcriptase converts the viral RNA genome into double-stranded viral DNA. The RNase H domain of the reverse transcriptase performs two functions. It degrades the RNA template and specifically removes the RNA primer from the RNA/DNA hybrid. Following nuclear import, the integrase catalyzes the insertion of the linear, double-stranded viral DNA into the host cell chromosome. Endogenous Pol proteins may have kept, lost or modified their original function during evolution.

Catalytic activity

Deoxynucleoside triphosphate + DNA(n) = diphosphate + DNA(n+1).

Endonucleolytic cleavage to 5'-phosphomonoester.

Domain

The LPQG and YXDD motifs are catalytically important and conserved among many retroviruses.

Miscellaneous

This protein is synthesized as Gag-Pro and Gag-Pro-Pol polyprotein precursors. These polyproteins are thought, by similarity with type-B retroviruses, to be generated by -1 frameshifts occurring at the Gag-Pro and Pro-Pol gene boundaries.

HERV-K_5q33.3 has a type 1 genome. The HERV-K(HML-2) family contains type 1 and type 2 genomes depending on the absence or presence of 292 nucleotides at the 5'-end of the env gene. Type 1 genomes lack a pol stop codon, leading to expression of a fusion protein containing a portion of the Env sequence.

The exact N-terminus of this protein has not been formally described.

Sequence similarities

Belongs to the beta type-B retroviral polymerase family. HERV class-II K(HML-2) pol subfamily.

Contains 1 integrase catalytic domain.

Contains 1 integrase-type DNA-binding domain.

Contains 1 integrase-type zinc finger.

Contains 1 reverse transcriptase domain.

Contains 1 RNase H domain.

Sequence caution

The sequence AAD51796.1 differs from that shown. Reason: Frameshift at several positions. A -1 frameshift presumed to occur at the N-terminus at the Pro-Pol gene boundary.

Sequence annotation (Features)

Feature key   Position(s)   Length   Description                                    Feature identifier

Molecule processing

Chain         1 – 1014      1014     HERV-K_5q33.3 provirus ancestral Pol protein   PRO_0000186769

Regions

Domain        57 – 245      189      Reverse transcriptase
Domain        460 – 589     130      RNase H
Domain        642 – 803     162      Integrase catalytic
Zinc finger   587 – 628     42       Integrase-type
DNA binding   811 – 859     49       Integrase-type
Motif         161 – 164     4        LPQG
Motif         195 – 198     4        YXDD
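
Feature positions in this table are 1-based and inclusive, so a feature's length is end - start + 1 and the matching Python slice is seq[start - 1:end]. The following is a minimal sketch of that arithmetic, not part of the UniProt entry. The helper names `feature_length` and `feature_slice` are hypothetical, and the fragment is residues 121 to 200 copied from this entry's sequence listing, so an offset is needed to map entry coordinates onto it.

```python
# Residues 121-200 of P10266, copied from the sequence listing in this entry.
# FRAGMENT_START is the entry coordinate of the first residue in FRAGMENT.
FRAGMENT_START = 121
FRAGMENT = (
    "IDLKDCFFTI" "PLAEQDCEKF" "AFTIPAINNK" "EPATRFQWKV"
    "LPQGMLNSPT" "ICQTFVGRAL" "QPVREKFSDC" "YIIHYIDDIL"
)


def feature_length(start, end):
    """Length of a feature given 1-based inclusive start and end positions."""
    return end - start + 1


def feature_slice(fragment, fragment_start, start, end):
    """Return residues start..end (1-based, inclusive, entry coordinates)
    from a fragment whose first residue sits at fragment_start."""
    lo = start - fragment_start
    hi = end - fragment_start + 1
    if lo < 0 or hi > len(fragment):
        raise ValueError("feature lies outside the supplied fragment")
    return fragment[lo:hi]


# The two motifs listed in the table.
lpqg = feature_slice(FRAGMENT, FRAGMENT_START, 161, 164)
yxdd = feature_slice(FRAGMENT, FRAGMENT_START, 195, 198)
print(lpqg, yxdd)

# Lengths recomputed from the positions in the table.
for start, end in [(57, 245), (460, 589), (642, 803), (587, 628), (811, 859)]:
    print(start, end, feature_length(start, end))
```

In this fragment the YXDD motif appears as YIDD, with X standing for any residue.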

Experimental info

Sequence conflict   1     1   N → M Ref.2
Sequence conflict   496   1   D → A Ref.3
Sequence conflict   496   1   D → A Ref.4
Sequence conflict   563   1   Y → H in AAD51796. Ref.4
Sequence conflict   674   1   S → SYS Ref.4
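
Each conflict line gives a 1-based position, the length of the affected span, and the change in the form "residue shown here → residue reported in the cited reference". The following is a minimal sketch of how such a change could be applied to a sequence to rebuild the reported variant. The helper `apply_conflict` is hypothetical, and the fragment is residues 661 to 680 copied from this entry's sequence listing.

```python
# Residues 661-680 of P10266, copied from the sequence listing in this entry.
FRAGMENT_START = 661
FRAGMENT = "GRLSYVHVTV" "DTYSHFIWAT"


def apply_conflict(fragment, fragment_start, position, original, variant):
    """Replace `original` at a 1-based entry position with `variant`.

    Raises ValueError if the fragment does not carry `original` at that
    position, which guards against off-by-one coordinate mistakes.
    """
    idx = position - fragment_start
    if idx < 0 or fragment[idx:idx + len(original)] != original:
        raise ValueError("original residue(s) not found at this position")
    return fragment[:idx] + variant + fragment[idx + len(original):]


# Conflict at position 674: S -> SYS (an insertion of two residues).
variant_fragment = apply_conflict(FRAGMENT, FRAGMENT_START, 674, "S", "SYS")
print(variant_fragment)
```

Checking the original residue before substituting is a deliberate choice: a mismatch usually means the coordinates and the sequence version do not belong together.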

Sequences

Sequence: P10266 [UniParc].
Length: 1,014
Mass (Da): 114,827
Last modified September 13, 2004. Version 2.
Checksum: 7E320F8F5DEE19FE

        10         20         30         40         50         60 
NKSRKRRNRV SFLGAVTVEP PKPIPLTWKT EKPVWVNQWP LPKQKLEALH LLANEQLEKG 

        70         80         90        100        110        120 
HIEPSFSPWN SPVFVIQKKS GKWHTLTDLR AVNAVIQPMG PLQPGLPSPA MIPKDWPLII 

       130        140        150        160        170        180 
IDLKDCFFTI PLAEQDCEKF AFTIPAINNK EPATRFQWKV LPQGMLNSPT ICQTFVGRAL 

       190        200        210        220        230        240 
QPVREKFSDC YIIHYIDDIL CAAETKDKLI DCYTFLQAEV ANAGLAIASD KIQTSTPFHY 

       250        260        270        280        290        300 
LGMQIENRKI KPQKIEIRKD TLKTLNDFQK LLGDINWIRP TLGIPTYAMS NLFSILRGDS 

       310        320        330        340        350        360 
DLNSQRILTP EATKEIKLVE EKIQSAQINR IDPLAPLQLL IFATAHSPTG IIIQNTDLVE 

       370        380        390        400        410        420 
WSFLPHSTVK TFTLYLDQIA TLIGQTRLRI TKLCGNDPDK IVVPLTKEQV RQAFINSGAW 

       430        440        450        460        470        480 
QIGLANFVGL IDNHYPKTKI FQFLKLTTWI LPKITRREPL ENALTVFTDG SSNGKAAYTG 

       490        500        510        520        530        540 
PKERVIKTPY QSAQRDELVA VITVLQDFDQ PINIISDSAY VVQATRDVET ALIKYSMDDQ 

       550        560        570        580        590        600 
LNQLFNLLQQ TVRKRNFPFY ITYIRAHTNL PGPLTKANEQ ADLLVSSALI KAQELHALTH 

       610        620        630        640        650        660 
VNAAGLKNKF DVTWKQAKDI VQHCTQCQVL HLPTQEAGVN PRGLCPNALW QMDVTHVPSF 

       670        680        690        700        710        720 
GRLSYVHVTV DTYSHFIWAT CQTGESTSHV KKHLLSCFAV MGVPEKIKTD NGPGYCSKAF 

       730        740        750        760        770        780 
QKFLSQWKIS HTTGIPYNSQ GQAIVERTNR TLKTQLVKQK EGGDSKECTT PQMQLNLALY 

       790        800        810        820        830        840 
TLNFLNIYRN QTTTSAEQHL TGKKNSPHEG KLIWWKDNKN KTWEIGKVIT WGRGFACVSP 

       850        860        870        880        890        900 
GENQLPVWLP TRHLKFYNEP IGDAKKRAST EMVTPVTWMD NPIEVYVNDS IWVPGPIDDR 

       910        920        930        940        950        960 
CPAKPEEEGM MINISIGYRY PPICLGRAPG CLMPAVQNWL VEVPTVSPIS RFTYHMVSGM 

       970        980        990       1000       1010 
SLRPRVNYLQ DFSYQRSLKF RPKGKPCPKE IPKESKNTEV LVWEECVANS AVIL 
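
The listing above interleaves ruler lines (position numbers) with residue lines split into blocks of ten. The following is a minimal sketch for collapsing such a listing into one plain string. The helper name `parse_listing` is hypothetical, and the sample is only the first two rows of the listing.

```python
def parse_listing(text):
    """Collapse a ruler-annotated sequence listing into one residue string.

    Lines made up only of numbers are treated as rulers and skipped;
    whitespace between ten-residue blocks is removed.
    """
    residues = []
    for line in text.splitlines():
        tokens = line.split()
        if not tokens or all(tok.isdigit() for tok in tokens):
            continue  # blank line or position ruler
        residues.append("".join(tokens))
    return "".join(residues)


SAMPLE = """\
        10         20         30         40         50         60
NKSRKRRNRV SFLGAVTVEP PKPIPLTWKT EKPVWVNQWP LPKQKLEALH LLANEQLEKG

        70         80         90        100        110        120
HIEPSFSPWN SPVFVIQKKS GKWHTLTDLR AVNAVIQPMG PLQPGLPSPA MIPKDWPLII
"""

seq = parse_listing(SAMPLE)
print(len(seq), seq[:10])
```

Applied to the full listing, the result should contain 1,014 residues, matching the sequence length stated for this entry.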

References

[1] "The DNA sequence and comparative analysis of human chromosome 5."
Schmutz J., Martin J., Terry A., Couronne O., Grimwood J., Lowry S., Gordon L.A., Scott D., Xie G., Huang W., Hellsten U., Tran-Gyamfi M., She X., Prabhakar S., Aerts A., Altherr M., Bajorek E., Black S., Branscomb E., Caoile C., Challacombe J.F., Chan Y.M., Denys M., Detter J.C., Escobar J., Flowers D., Fotopulos D., Glavina T., Gomez M., Gonzales E., Goodstein D., Grigoriev I., Groza M., Hammon N., Hawkins T., Haydu L., Israni S., Jett J., Kadner K., Kimball H., Kobayashi A., Lopez F., Lou Y., Martinez D., Medina C., Morgan J., Nandkeshwar R., Noonan J.P., Pitluck S., Pollard M., Predki P., Priest J., Ramirez L., Retterer J., Rodriguez A., Rogers S., Salamov A., Salazar A., Thayer N., Tice H., Tsai M., Ustaszewska A., Vo N., Wheeler J., Wu K., Yang J., Dickson M., Cheng J.-F., Eichler E.E., Olsen A., Pennacchio L.A., Rokhsar D.S., Richardson P., Lucas S.M., Myers R.M., Rubin E.M.
Nature 431:268-274(2004) [PubMed: 15372022]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
[2] "Nucleotide sequence of human endogenous retrovirus genome related to the mouse mammary tumor virus genome."
Ono M., Yasunaga T., Miyata T., Ushikubo H.
J. Virol. 60:589-598(1986) [PubMed: 3021993]
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 1-740.
[3] "Characterization of human endogenous retrovirus type K (HERV-K) virus-like particles generated from recombinant baculoviruses."
Toenjes R.R., Boller K., Limbach R., Lugert R., Kurth R.
Virology 233:280-291(1997) [PubMed: 9217052]
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 1-740.
[4] "Many human endogenous retrovirus K (HERV-K) proviruses are unique to humans."
Barbulescu M., Turner G., Seaman M.I., Deinard A.S., Kidd K.K., Lenz J.
Curr. Biol. 9:861-868(1999) [PubMed: 10469592]
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 1-830.

Cross-references

Sequence databases

EMBL / GenBank / DDBJ:
  AC016577 Genomic DNA. No translation available.
  M14123 Genomic DNA. Translation: AAA88033.1. Sequence problems.
  Y10391 Genomic DNA. Translation: CAA71417.1.
  AF164613 Genomic DNA. Translation: AAD51796.1. Sequence problems.
IPI: IPI00382531.
PIR: GNHUER. D24483.

3D structure databases

ProteinModelPortal: P10266.
SMR: P10266. Positions 18-856.

Protein-protein interaction databases

IntAct: P10266. 1 interaction.

Polymorphism databases

DMDM: 52001473.

Proteomic databases

PRIDE: P10266.

Organism-specific databases

neXtProt: NX_P10266.

Phylogenomic databases

HOVERGEN: HBG053630.

Gene expression databases

Genevestigator: P10266.

Family and domain databases

InterPro:
  IPR001037. Integrase_C_retrovir.
  IPR001584. Integrase_cat-core.
  IPR017856. Integrase_Zn-bd_dom-like_N.
  IPR003308. Integrase_Zn-bd_dom_N.
  IPR012337. RNaseH-like_dom.
  IPR002156. RNaseH_domain.
  IPR000477. RVT.
  IPR010661. RVT_thumb.
Gene3D:
  G3DSA:2.30.30.10. Integrase_C. 1 hit.
  G3DSA:1.10.10.200. Intgrase_N_Zn_bd. 1 hit.
Pfam:
  PF00552. IN_DBD_C. 1 hit.
  PF02022. Integrase_Zn. 1 hit.
  PF00075. RNase_H. 1 hit.
  PF00665. rve. 1 hit.
  PF00078. RVT_1. 1 hit.
  PF06817. RVT_thumb. 1 hit.
SUPFAM:
  SSF50122. Integrase_C. 1 hit.
  SSF46919. Integrase_Zn_N. 1 hit.
  SSF53098. RNaseH_fold. 2 hits.
PROSITE:
  PS50994. INTEGRASE. 1 hit.
  PS51027. INTEGRASE_DBD. 1 hit.
  PS50879. RNASE_H. 1 hit.
  PS50878. RT_POL. 1 hit.
  PS50876. ZF_INTEGRASE. 1 hit.

Entry information

Entry name: POK10_HUMAN
Accession:
  Primary (citable) accession number: P10266
  Secondary accession number(s): P87890, Q14273
Entry history:
  Integrated into UniProtKB/Swiss-Prot: July 1, 1989
  Last sequence update: September 13, 2004
  Last modified: December 14, 2011
  This is version 93 of the entry and version 2 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Chordata Protein Annotation Program
Disclaimer: Any medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care.

Relevant documents

Human chromosome 5

Human chromosome 5: entries, gene names and cross-references to MIM

SIMILARITY comments

Index of protein domains and families