Protein

Endogenous retrovirus group K member 19 Pol protein

Gene

ERVK-19

Organism
Homo sapiens (Human)
Status
Reviewed - Experimental evidence at transcript level

Function

Early post-infection, the reverse transcriptase converts the viral RNA genome into double-stranded viral DNA. The RNase H domain of the reverse transcriptase performs two functions. It degrades the RNA template and specifically removes the RNA primer from the RNA/DNA hybrid. Following nuclear import, the integrase catalyzes the insertion of the linear, double-stranded viral DNA into the host cell chromosome. Endogenous Pol proteins may have kept, lost or modified their original function during evolution.

Miscellaneous

The exact N-terminus of this protein has not been formally described.

Catalytic activity

Deoxynucleoside triphosphate + DNA(n) = diphosphate + DNA(n+1). (PROSITE-ProRule annotation)
Endonucleolytic cleavage to 5'-phosphomonoester. (PROSITE-ProRule annotation)

Regions

Feature key   Position(s)   Description                                   Length
Zinc finger   590 – 631     Integrase-type (PROSITE-ProRule annotation)   42
DNA binding   814 – 862     Integrase-type (PROSITE-ProRule annotation)   49

Keywords

Molecular function: DNA-binding, Endonuclease, Hydrolase, Multifunctional enzyme, Nuclease, Nucleotidyltransferase, RNA-directed DNA polymerase, Transferase
Biological process: DNA integration, DNA recombination
Ligand: Metal-binding, Zinc

Names & Taxonomy

Protein names
Recommended name:
Endogenous retrovirus group K member 19 Pol protein
Alternative name(s):
HERV-K(C19) Pol protein
HERV-K_19q11 provirus ancestral Pol protein
Including the following 3 domains:
Reverse transcriptase (EC:2.7.7.49), short name: RT
Ribonuclease H (EC:3.1.26.4), short name: RNase H
Integrase, short name: IN
Gene names
Name: ERVK-19
Organism: Homo sapiens (Human)
Taxonomic identifier: 9606 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Primates > Haplorrhini > Catarrhini > Hominidae > Homo
Proteomes
  • UP000005640 Component: Unplaced

Organism-specific databases

HGNC: HGNC:39026 ERVK-19
neXtProt: NX_Q9WJR5

Pathology & Biotech

Organism-specific databases

DisGeNET: 100862685

PTM / Processing

Molecule processing

Feature key              Position(s)   Description                                           Length
Chain (PRO_0000186763)   1 – 959       Endogenous retrovirus group K member 19 Pol protein   959

Proteomic databases

PeptideAtlas: Q9WJR5
PRIDE: Q9WJR5
ProteomicsDB: 85576

Interaction

Protein-protein interaction databases

IntAct: Q9WJR5, 3 interactors

Structure

3D structure databases

ProteinModelPortal: Q9WJR5
SMR: Q9WJR5

Family & Domains

Domains and Repeats

Feature key   Position(s)   Description                                          Length
Domain        57 – 248      Reverse transcriptase (PROSITE-ProRule annotation)   192
Domain        464 – 593     RNase H (PROSITE-ProRule annotation)                 130
Domain        645 – 806     Integrase catalytic (PROSITE-ProRule annotation)     162
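
The listed lengths follow from the 1-based, inclusive positions (length = end - start + 1). The sketch below is a minimal check of that arithmetic, using the domain coordinates above together with the zinc finger and DNA-binding regions listed under Regions. The names `FEATURES`, `span_length` and `overlaps` are illustrative only and are not part of any UniProt tool.

```python
# (feature, start, end, listed length), copied from the feature tables in this entry.
FEATURES = [
    ("Reverse transcriptase", 57, 248, 192),
    ("RNase H", 464, 593, 130),
    ("Zinc finger, integrase-type", 590, 631, 42),
    ("Integrase catalytic", 645, 806, 162),
    ("DNA binding, integrase-type", 814, 862, 49),
]


def span_length(start, end):
    """Length of a 1-based, inclusive residue range."""
    return end - start + 1


def overlaps(features):
    """Return pairs of feature names whose ranges share at least one residue."""
    pairs = []
    for i, (name_a, start_a, end_a, _) in enumerate(features):
        for name_b, start_b, end_b, _ in features[i + 1:]:
            if start_a <= end_b and start_b <= end_a:
                pairs.append((name_a, name_b))
    return pairs


if __name__ == "__main__":
    for name, start, end, listed in FEATURES:
        status = "ok" if span_length(start, end) == listed else "mismatch"
        print(f"{name}: {start}-{end} -> {span_length(start, end)} ({status})")
    print("Overlapping features:", overlaps(FEATURES))
```

Under these coordinates the only overlap is between the annotated RNase H domain (464 – 593) and the integrase-type zinc finger (590 – 631), which share residues 590 – 593.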

Motif

Feature key   Position(s)   Description   Length
Motif         164 – 167     LPQG          4
Motif         198 – 201     YXDD          4

Domain

The LPQG and YXDD motifs are catalytically important and conserved among many retroviruses.
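
As an illustration, the two motifs can be located in the canonical sequence with a simple pattern search. The sketch below uses only residues 151 – 210 of isoform Q9WJR5-1, copied from the sequence in this entry, and treats the X in YXDD as "any residue". The helper name `find_motifs` and the constants are hypothetical and do not correspond to any UniProt or PROSITE API.

```python
import re

# Residues 151-210 of isoform Q9WJR5-1, copied from the sequence in this entry.
FRAGMENT = (
    "NNKEPATRFQWKVLPQGMLNSPTICQTFVGRALQPVREKFSDCYIIHYID"
    "DILCAAEMKD"
)
FRAGMENT_START = 151  # 1-based position of the first residue in FRAGMENT

# "X" in YXDD stands for any residue, hence the "." wildcard (an assumption
# about how the motif name should be read as a pattern).
MOTIF_PATTERNS = {"LPQG": "LPQG", "YXDD": "Y.DD"}


def find_motifs(fragment, start, patterns):
    """Return {motif name: [(first, last), ...]} in 1-based entry coordinates."""
    hits = {}
    for name, pattern in patterns.items():
        # A lookahead is used so that overlapping matches are not skipped.
        regex = re.compile(f"(?=({pattern}))")
        hits[name] = [
            (start + m.start(), start + m.start() + len(m.group(1)) - 1)
            for m in regex.finditer(fragment)
        ]
    return hits


if __name__ == "__main__":
    for name, spans in find_motifs(FRAGMENT, FRAGMENT_START, MOTIF_PATTERNS).items():
        print(name, spans)
```

On this fragment the search recovers the annotated positions: LPQG at 164 – 167 and YXDD (here YIDD) at 198 – 201.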

Zinc finger

Feature key   Position(s)   Description                                   Length
Zinc finger   590 – 631     Integrase-type (PROSITE-ProRule annotation)   42

Keywords - Domain

Zinc-finger

Phylogenomic databases

HOVERGEN: HBG053630
InParanoid: Q9WJR5
PhylomeDB: Q9WJR5

Family and domain databases

Gene3D:
1.10.10.200, 1 hit
2.30.30.10, 1 hit
3.30.420.10, 2 hits
InterPro:
IPR017856 Integrase-like_N
IPR036862 Integrase_C_dom_sf_retrovir
IPR001037 Integrase_C_retrovir
IPR001584 Integrase_cat-core
IPR003308 Integrase_Zn-bd_dom_N
IPR012337 RNaseH-like_sf
IPR002156 RNaseH_domain
IPR036397 RNaseH_sf
IPR000477 RT_dom
IPR010661 RVT_thumb
Pfam:
PF00552 IN_DBD_C, 1 hit
PF02022 Integrase_Zn, 1 hit
PF00075 RNase_H, 1 hit
PF00665 rve, 1 hit
PF00078 RVT_1, 1 hit
PF06817 RVT_thumb, 1 hit
SUPFAM:
SSF46919 SSF46919, 1 hit
SSF50122 SSF50122, 1 hit
SSF53098 SSF53098, 2 hits
PROSITE:
PS50994 INTEGRASE, 1 hit
PS51027 INTEGRASE_DBD, 1 hit
PS50879 RNASE_H, 1 hit
PS50878 RT_POL, 1 hit
PS50876 ZF_INTEGRASE, 1 hit

Sequence (1+)

Sequence status: Complete.

This entry describes 1 isoform produced by ribosomal frameshifting.
Note: This protein is synthesized as Gag-Pro and Gag-Pro-Pol polyprotein precursors. These polyproteins are thought, by similarity with type-B retroviruses, to be generated by -1 frameshifts occurring at the Gag-Pro and Pro-Pol gene boundaries.

This entry has 1 described isoform and 4 potential isoforms that are computationally mapped.

Isoform 1 (identifier: Q9WJR5-1)

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.

        10         20         30         40         50
NKSKKRRNRV SFLGAATVEP PKPIPLTWKT EKPVWVNQWP LPKQKLEALH
        60         70         80         90        100
LLANEQLEKG HIEPSFSPWN SPVFVIQKKS GKWRMLTDLR AVNAVNAVIQ
       110        120        130        140        150
PMGPLQPGLP SLAMIPKDWP LIIIDLKDCF FTIPLAEQDC EKFAFTIPAI
       160        170        180        190        200
NNKEPATRFQ WKVLPQGMLN SPTICQTFVG RALQPVREKF SDCYIIHYID
       210        220        230        240        250
DILCAAEMKD KLIDCYTFLQ AEVANAGLAI ASDKIQTSTP FHYLEMQIEN
       260        270        280        290        300
RKIKPPKIEI RKDTLKTLND FQKLLGDINW IRPTLGIPTY AMSNLFSILR
       310        320        330        340        350
GDSDLNSKRM LTPEATKEIK LVEEKIQSAQ INRIDPLAPL QLLIFATAHS
       360        370        380        390        400
PTGIIIQNTD LVEWSFLPHS TVKTFTLYLD QMATLIGQTR LRIIKLCGND
       410        420        430        440        450
PDKIVVPLTK EQVRQAFINS GAWQIGLANF VGIIDNHYPK TKIFQFLKMT
       460        470        480        490        500
TWILPKITRR EPLENALTVF TDGSSNGKAA YTGPKERVIK TQYQSAQRAE
       510        520        530        540        550
LVAVITVLQD FDQPINIISD SAYVVQATRD VETALIKYSM DDQLNQLFNL
       560        570        580        590        600
LQQTVRKRNF PFYITHIRAH TNLPGPLTKA NEQADLLVSS ALIKAQELHA
       610        620        630        640        650
LTHVNVAGLK NKFDVTWKQA KDIVQHCTQC QVLHLPTQEA GVNPRGLCPN
       660        670        680        690        700
ALWQMDVTHV SSFGRLSYIH VTVDTYSHFI WATCQTGEST SHVKKHLLSC
       710        720        730        740        750
FAVMGVPEKI KTDNGPGYCS KAFQKFLSQW KISHTTGIPY NSQGQAIVER
       760        770        780        790        800
TNRTLKTQLV KQKEGGDSKE CTTPQMQLNL ALYTLNFLNI YRNQTTTSAE
       810        820        830        840        850
QHLTGKKNSP HEGKLIWWKD NKNKTWEIGK VITWGRGFAC VSPGENQLPV
       860        870        880        890        900
WIPTRHLKFY NEPIGDAKKS TSAETETPQS STVDSQDEQN GDVRRTDEVA
       910        920        930        940        950
IHQESRAADL GTTKEADAVS YKISREHKGD TNPREYAACG LDDCINGGKS
PYACRSSCS
Length: 959
Mass (Da): 108,106
Last modified: September 13, 2004 - v2
Checksum: 60276896681286A9

Computationally mapped potential isoform sequences

There are 4 potential isoforms mapped to this entry.

Entry    Entry name    Protein names                             Gene names   Length
O71037   ENK19_HUMAN   Endogenous retrovirus group K membe...   ERVK-19      699
Q9YNA8   GAK19_HUMAN   Endogenous retrovirus group K membe...   ERVK-19      666
P63120   VPK19_HUMAN   Endogenous retrovirus group K membe...   ERVK-19      156
P61572   REC19_HUMAN   Endogenous retrovirus group K membe...   ERVK-19      105

Sequence caution

The sequence AC112702 differs from that shown. Reason: Frameshift at position 562. The frameshift results from the integration of a SINE element AluYa5. (Curated)
The sequence CAA76882 differs from that shown. Reason: Erroneous initiation. (Curated)

Experimental Info

Feature key         Position(s)   Description                                      Length
Sequence conflict   15            A → V in AC112702 (PubMed:15057824). (Curated)   1
Sequence conflict   208           M → T in AC112702 (PubMed:15057824). (Curated)   1
Sequence conflict   245           E → G in AC112702 (PubMed:15057824). (Curated)   1
Sequence conflict   339           P → T in AC112702 (PubMed:15057824). (Curated)   1

Sequence databases

Y17833 Genomic DNA Translation: CAA76882.1 Different initiation.
AC112702 Genomic DNA No translation available.
U87592 mRNA Translation: AAB63115.1

Keywords - Coding sequence diversity

Ribosomal frameshifting

Cross-references

Organism-specific databases

DisGeNET: 100862685
GeneCards: ERVK-19
HGNC: HGNC:39026 ERVK-19
neXtProt: NX_Q9WJR5

Entry information

Entry name: POK19_HUMAN
Accession: Primary (citable) accession number: Q9WJR5
Secondary accession number(s): O15312
Entry history: Integrated into UniProtKB/Swiss-Prot: September 13, 2004
Last sequence update: September 13, 2004
Last modified: June 20, 2018
This is version 117 of the entry and version 2 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Chordata Protein Annotation Program
Disclaimer: Any medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care.

Miscellaneous

Keywords - Technical term

Complete proteome, ERV, Reference proteome, Transposable element

Documents

  1. SIMILARITY comments
    Index of protein domains and families
  2. Human chromosome 19
    Human chromosome 19: entries, gene names and cross-references to MIM
