Protein

DNA polymerase

Gene

pol

Organism
Thermococcus aggregans
Status
Reviewed; protein inferred from homology

Function

Catalytic activity

Deoxynucleoside triphosphate + DNA(n) = diphosphate + DNA(n+1).

GO - Molecular function

GO - Biological process

Keywords

Molecular function: DNA-binding, DNA-directed DNA polymerase, Endonuclease, Hydrolase, Nuclease, Nucleotidyltransferase, Transferase
Biological process: DNA replication

Names & Taxonomy

Protein names
Recommended name: DNA polymerase (EC:2.7.7.7)
Alternative name(s): Pol Tfu
Cleaved into the following 3 chains:
Tag pol-1 intein
Alternative name(s): Intein I, Tsp-TY pol-1
Tag pol-2 intein
Alternative name(s): Intein II, Tsp-TY pol-2
Tag pol-3 intein
Alternative name(s): Intein III, Tsp-TY pol-3
Gene names
Name: pol
Organism: Thermococcus aggregans
Taxonomic identifier: 110163 [NCBI]
Taxonomic lineage: Archaea > Euryarchaeota > Thermococci > Thermococcales > Thermococcaceae > Thermococcus

PTM / Processing

Molecule processing

Feature key  Identifier      Position(s)  Description               Evidence           Length
Chain        PRO_0000007353  1 – 409      DNA polymerase, 1st part  Sequence analysis  409
Chain        PRO_0000007354  410 – 769    Tag pol-1 intein          Sequence analysis  360
Chain        PRO_0000007355  770 – 855    DNA polymerase, 2nd part  Sequence analysis  86
Chain        PRO_0000007356  856 – 1392   Tag pol-2 intein          Sequence analysis  537
Chain        PRO_0000007357  1393 – 1441  DNA polymerase, 3rd part  Sequence analysis  49
Chain        PRO_0000007358  1442 – 1598  Tag pol-3 intein          Sequence analysis  157
Chain        PRO_0000007359  1599 – 1829  DNA polymerase, 4th part  Sequence analysis  231

Post-translational modification

This protein undergoes protein self-splicing: the three intervening regions (inteins) are excised post-translationally, and the flanking polymerase parts are then joined by peptide ligation.
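
As a rough illustration of the excision-and-ligation step described above, the following Python sketch takes the chain coordinates from the Molecule processing table and joins the four polymerase parts in order. The "extein" and "intein" labels, the function names, and the placeholder precursor string are assumptions made for this example and are not part of the entry. The sketch models only the bookkeeping of positions, not the chemistry.

```python
# Chain coordinates from the Molecule processing table (1-based, inclusive).
# The "extein"/"intein" labels are an assumption added for this sketch.
CHAINS = [
    ("DNA polymerase, 1st part", 1, 409, "extein"),
    ("Tag pol-1 intein", 410, 769, "intein"),
    ("DNA polymerase, 2nd part", 770, 855, "extein"),
    ("Tag pol-2 intein", 856, 1392, "intein"),
    ("DNA polymerase, 3rd part", 1393, 1441, "extein"),
    ("Tag pol-3 intein", 1442, 1598, "intein"),
    ("DNA polymerase, 4th part", 1599, 1829, "extein"),
]


def total_length(chains, kind):
    """Sum the lengths of all chains of the given kind."""
    return sum(end - start + 1 for _, start, end, k in chains if k == kind)


def splice(precursor, chains):
    """Drop the intein segments and join the remaining parts in order."""
    return "".join(
        precursor[start - 1:end]  # 1-based inclusive range -> Python slice
        for _, start, end, kind in chains
        if kind == "extein"
    )


# Hypothetical stand-in for the 1,829-residue precursor sequence.
placeholder = "A" * 1829
print(total_length(CHAINS, "extein"))    # residues kept after splicing
print(total_length(CHAINS, "intein"))    # residues excised
print(len(splice(placeholder, CHAINS)))  # length of the joined product
```

Under these coordinates the four polymerase parts sum to 775 residues and the three inteins to 1,054, which together account for the 1,829-residue precursor.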

Keywords - PTM

Autocatalytic cleavage, Protein splicing

Proteomic databases

PRIDE: O33845

Structure

3D structure databases

ProteinModelPortal: O33845

Family & Domains

Domains and Repeats

Feature key  Position(s)  Description                     Evidence                    Length
Domain       527 – 668    DOD-type homing endonuclease 1  PROSITE-ProRule annotation  142
Domain       1136 – 1269  DOD-type homing endonuclease 2  PROSITE-ProRule annotation  134
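
The domain positions can be compared with the chain coordinates listed under Molecule processing to see which chain each homing endonuclease domain falls in. The sketch below is a minimal containment check in Python. The function name and data layout are assumptions for illustration only.

```python
# Coordinates copied from this entry (1-based, inclusive).
CHAINS = [
    ("DNA polymerase, 1st part", 1, 409),
    ("Tag pol-1 intein", 410, 769),
    ("DNA polymerase, 2nd part", 770, 855),
    ("Tag pol-2 intein", 856, 1392),
    ("DNA polymerase, 3rd part", 1393, 1441),
    ("Tag pol-3 intein", 1442, 1598),
    ("DNA polymerase, 4th part", 1599, 1829),
]

DOMAINS = [
    ("DOD-type homing endonuclease 1", 527, 668),
    ("DOD-type homing endonuclease 2", 1136, 1269),
]


def enclosing_chain(start, end, chains):
    """Return the name of the chain that fully contains [start, end], or None."""
    for name, chain_start, chain_end in chains:
        if chain_start <= start and end <= chain_end:
            return name
    return None


for name, start, end in DOMAINS:
    print(name, "->", enclosing_chain(start, end, CHAINS))
```

With the coordinates in this entry, domain 1 (527 – 668) lies inside the Tag pol-1 intein (410 – 769) and domain 2 (1136 – 1269) inside the Tag pol-2 intein (856 – 1392).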

Sequence similarities

Belongs to the DNA polymerase type-B family. (Curated)

Keywords - Domain

Repeat

Family and domain databases

Gene3D:
3.10.28.10, 2 hits
3.30.420.10, 1 hit
3.90.1600.10, 4 hits
InterPro:
IPR006172 DNA-dir_DNA_pol_B
IPR006133 DNA-dir_DNA_pol_B_exonuc
IPR006134 DNA-dir_DNA_pol_B_multi_dom
IPR023211 DNA_pol_palm_dom_sf
IPR003586 Hint_dom_C
IPR003587 Hint_dom_N
IPR036844 Hint_dom_sf
IPR027434 Homing_endonucl
IPR006142 INTEIN
IPR030934 Intein_C
IPR004042 Intein_endonuc
IPR006141 Intein_N
IPR004860 LAGLIDADG_2
IPR012337 RNaseH-like_sf
IPR036397 RNaseH_sf
Pfam:
PF00136 DNA_pol_B, 3 hits
PF03104 DNA_pol_B_exo1, 2 hits
PF14528 LAGLIDADG_3, 1 hit
PRINTS:
PR00379 INTEIN
SMART:
SM00305 HintC, 3 hits
SM00306 HintN, 3 hits
SM00486 POLBc, 1 hit
SUPFAM:
SSF51294 SSF51294, 5 hits
SSF53098 SSF53098, 1 hit
SSF55608 SSF55608, 2 hits
TIGRFAMs:
TIGR01443 intein_Cterm, 3 hits
TIGR01445 intein_Nterm, 2 hits
PROSITE:
PS50818 INTEIN_C_TER, 3 hits
PS50819 INTEIN_ENDONUCLEASE, 2 hits
PS50817 INTEIN_N_TER, 3 hits

Sequence

Sequence status: Complete.

Sequence processing: The displayed sequence is further processed into a mature form.

O33845-1 [UniParc]

        10         20         30         40         50
MILDTDYITK DGKPIIRIFK KENGEFKIEL DPHFQPYIYA LLKDDSAIDE
        60         70         80         90        100
IKAIKGERHG KIVRVVDAVK VKKKFLGRDV EVWKLIFEHP QDVPALRGKI
       110        120        130        140        150
REHPAVIDIY EYDIPFAKRY LIDKGLIPME GDEELKLMAF DIETFYHEGD
       160        170        180        190        200
EFGKGEIIMI SYADEEEARV ITWKNIDLPY VDVVSNEREM IKRFVQIVRE
       210        220        230        240        250
KDPDVLITYN GDNFDLPYLI KRAEKLGVTL LLGRDKEHPE PKIHRMGDSF
       260        270        280        290        300
AVEIKGRIHF DLFPVVRRTI NLPTYTLEAV YEAVLGKTKS KLGAEEIAAI
       310        320        330        340        350
WETEESMKKL AQYSMEDARA TYELGKEFFP MEAELAKLIG QSVWDVSRSS
       360        370        380        390        400
TGNLVEWYLL RVAYERNELA PNKPDEEEYR RRLRTTYLGG YVKEPERGLW
       410        420        430        440        450
ENIAYLDFRC HPADTKVIVK GKGIVNISDV KEGDYILGID GWQRVKKVWK
       460        470        480        490        500
YHYEGKLINI NGLKCTPNHK VPVVTENDRQ TRIRDSLAKS FLSGKVKGKI
       510        520        530        540        550
ITTKLFEKIA EFEKNKPSEE EILKGELSGI ILAEGTLLRK DIEYFDSSRG
       560        570        580        590        600
KKRISHQYRV EITIGENEKE LLERILYIFD KLFGIRPSVK KKGDTNALKI
       610        620        630        640        650
TTAKKAVYLQ IEELLKNIES LYAPAVLRGF FERDATVNKI RSTIVVTQGT
       660        670        680        690        700
NNKWKIDIVA KLLDSLGIPY SRYEYKYIEN GKELTKHILE ITGRDGLILF
       710        720        730        740        750
QTLVGFISSE KNEALEKAIE VREMNRLKNN SFYNLSTFEV SSEYYKGEVY
       760        770        780        790        800
DLTLEGNPYY FANGILTHNS LYPSIIVTHN VSPDTLEREG CKNYDVAPIV
       810        820        830        840        850
GYKFCKDFPG FIPSILGELI TMRQEIKKKM KATIDPIEKK MLDYRQRAVK
       860        870        880        890        900
LLANSILPNE WLPIIENGEV KFVKIGEFID RYMEEQKDKV RTVDNTEVLE
       910        920        930        940        950
VDNIFAFSLN KESKKSEIKK VKALIRHKYK GEAYEVELNS GRKIHITRGH
       960        970        980        990       1000
SLFTIRNGKI KEIWGEEVKV GDLIIVPKKV KLNEKEAVIN IPELISKLPD
      1010       1020       1030       1040       1050
EDTADVVMTT PVKGRKNFFK GMLRTLKWIF GEESKRIRTF NRYLFHLEEL
      1060       1070       1080       1090       1100
GFVKLLPRGY EVTDWEGLKR YRQLYEKLVK NLRYNGNKRE YLVRFNDIKD
      1110       1120       1130       1140       1150
SVSCFPRKEL EEWKIGTXKG FRXKCILKVD EDFGKFLGYY VSEGYAGAQK
      1160       1170       1180       1190       1200
NKTGGMSYSV KLYNENPNVL KDMKNIAEKF FGKVRVGKNC VDIPKKMAYL
      1210       1220       1230       1240       1250
LAKSLCGVTA ENKRIPSIIF DSSEPVRWAF LRAYFVGDGD IHPSKRLRLS
      1260       1270       1280       1290       1300
TKSELLANQL VFLLNSLGVS SIKIGFDSGV YRVYINEDLP FLQTSRQKNT
      1310       1320       1330       1340       1350
YYPNLIPKEV LEEIFGRKFQ KNITFEKFKE LADSGKLDKR KVKLLDFLLN
      1360       1370       1380       1390       1400
GDIVLDRVKN VEKREYEGYV YDLSVEDNEN FLVGFGLLYA HNSYYGYMGY
      1410       1420       1430       1440       1450
PKARWYSKEC AESVTAWGRH YIEMTIKEIE EKFGFKVLYA DSVTGDTEII
      1460       1470       1480       1490       1500
VKRNGRIEFV PIEKLFERVD YRIGEKEYCI LEDVEALTLD NRGKLIWKKV
      1510       1520       1530       1540       1550
PYVMRHRAKK KVYRIWITNS WYIDVTEDHS LIVAEDGLKE ARPMEIEGKS
      1560       1570       1580       1590       1600
LIATKDDLSG VEYIKPHAIE EISYNGYVYD IEVEGTHRFF ANGILVHNTD
      1610       1620       1630       1640       1650
GFYATIPGEK PETIKKKAKE FLKYINSKLP GLLELEYEGF YLRGFFVAKK
      1660       1670       1680       1690       1700
RYAVIDEEGR ITTRGLEVVR RDWSEIAKET QAKVLEAILK EDSVEKAVEI
      1710       1720       1730       1740       1750
VKDVVEEIAK YQVPLEKLVI HEQITKDLSE YKAIGPHVAI AKRLAAKGIK
      1760       1770       1780       1790       1800
VRPGTIISYI VLRGSGKISD RVILLSEYDP KKHKYDPDYY IENQVLPAVL
      1810       1820
RILEAFGYRK EDLKYQSSKQ VGLDAWLKK
Length: 1,829
Mass (Da): 211,879
Last modified: January 1, 1998 - v1
Checksum: A113A8BC57EB9CB3
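
The sequence listing above interleaves position rulers with rows of 10-residue blocks, so it has to be cleaned before it can be used as a plain string. A minimal Python sketch of that cleanup follows. The function name is hypothetical, and the sample holds only the first two rows of the listing (100 residues), not the full sequence.

```python
def parse_sequence_listing(text):
    """Concatenate residue blocks, skipping blank lines and digit-only ruler lines."""
    residues = []
    for line in text.splitlines():
        tokens = line.split()
        if not tokens or all(token.isdigit() for token in tokens):
            continue  # blank line or position ruler
        block = "".join(tokens)
        if not block.isalpha():
            raise ValueError(f"unexpected characters in sequence line: {line!r}")
        residues.append(block.upper())
    return "".join(residues)


# First two rows of the listing in this entry, rulers included.
SAMPLE = """\
        10         20         30         40         50
MILDTDYITK DGKPIIRIFK KENGEFKIEL DPHFQPYIYA LLKDDSAIDE
60 70 80 90 100
IKAIKGERHG KIVRVVDAVK VKKKFLGRDV EVWKLIFEHP QDVPALRGKI
"""

sequence = parse_sequence_listing(SAMPLE)
print(len(sequence))
```

Applied to the complete listing, the length of the returned string can be compared with the Length value reported for the entry (1,829) as a basic integrity check.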

Sequence databases

EMBL / GenBank / DDBJ: Y13030 Genomic DNA. Translation: CAA73475.1

Entry information

Entry name: DPOL_THEAG
Accession: Primary (citable) accession number: O33845
Entry history: Integrated into UniProtKB/Swiss-Prot: December 15, 1998
Last sequence update: January 1, 1998
Last modified: April 25, 2018
This is version 112 of the entry and version 1 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Prokaryotic Protein Annotation Program

Miscellaneous

Documents

  1. Intein-containing proteins
    List of intein-containing protein entries
  2. SIMILARITY comments
    Index of protein domains and families