Protein

DNA polymerase theta

Gene

mus308

Organism
Drosophila melanogaster (Fruit fly)
Status
Reviewed - Annotation score: 5 out of 5 - Experimental evidence at protein level

Function

DNA polymerase that promotes microhomology-mediated end-joining (MMEJ), an alternative non-homologous end-joining (NHEJ) machinery triggered in response to double-strand breaks in DNA (PubMed:20617203). MMEJ is an error-prone repair pathway that produces deletions of sequences from the strand being repaired and promotes genomic rearrangements, such as telomere fusions.

Catalytic activity

Deoxynucleoside triphosphate + DNA(n) = diphosphate + DNA(n+1). (By similarity)

Regions

Feature key | Position(s) | Length | Description
Nucleotide binding | 256 – 263 | 8 | ATP (PROSITE-ProRule annotation)

GO - Molecular function

  • ATP binding Source: UniProtKB-KW
  • DNA binding Source: InterPro
  • DNA-directed DNA polymerase activity Source: FlyBase
  • helicase activity Source: FlyBase

GO - Biological process

  • DNA-dependent DNA replication Source: InterPro
  • DNA repair Source: FlyBase
  • double-strand break repair via alternative nonhomologous end joining Source: FlyBase
  • nucleotide-excision repair Source: FlyBase

Keywords - Molecular function

DNA-directed DNA polymerase, Hydrolase, Nucleotidyltransferase, Transferase

Keywords - Biological process

DNA damage, DNA repair

Keywords - Ligand

ATP-binding, Nucleotide-binding

Names & Taxonomy

Protein names
Recommended name:
DNA polymerase theta (EC:2.7.7.7) (By similarity)
Alternative name(s):
Mutagen-sensitive protein 308 (1 publication)
Gene names
Name: mus308 (1 publication)
ORF Names: CG6019
Organism: Drosophila melanogaster (Fruit fly)
Taxonomic identifier: 7227 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Ecdysozoa > Arthropoda > Hexapoda > Insecta > Pterygota > Neoptera > Endopterygota > Diptera > Brachycera > Muscomorpha > Ephydroidea > Drosophilidae > Drosophila > Sophophora
Proteomes
  • UP000000803 Component: Chromosome 3R

Organism-specific databases

FlyBase: FBgn0002905. mus308.

Subcellular location

  • Nucleus (1 publication)

Keywords - Cellular component

Nucleus

Pathology & Biotech

Disruption phenotype

Hypersensitivity to DNA-cross-linking agents. (1 publication)

Mutagenesis

Feature key | Position(s) | Length | Description
Mutagenesis | 621 – 621 | 1 | G → S in mus308(3294); flies are unable to repair interstrand cross-links. (1 publication)
Mutagenesis | 781 – 781 | 1 | P → L in mus308(D5); flies are unable to repair interstrand cross-links. (1 publication)
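
The two substitutions listed above can be checked against the displayed sequence. The following is a minimal sketch, not part of the entry: it assumes two 50-residue rows copied from the Sequence section (positions 601-650 and 751-800, spaces removed), and the helper name `substitute` is hypothetical.

```python
# Rows copied from the sequence display of this entry (assumed transcription).
ROW_601 = "SSLTYRQMIGRAGRMGKDTLGESILICNEINARMGRDLVVSELQPITSCL"  # positions 601-650
ROW_751 = "TDGLILFAELQKSRRSFVLESELHAVYLVTPYSVCYQLQDIDWLLYVHMW"  # positions 751-800


def substitute(fragment: str, fragment_start: int, position: int,
               wild_type: str, mutant: str) -> str:
    """Apply a single-residue substitution at a 1-based sequence position.

    fragment_start is the 1-based position of fragment[0] in the full protein.
    Raises ValueError if the position is outside the fragment or if the
    residue found there does not match the expected wild-type residue.
    """
    index = position - fragment_start
    if not 0 <= index < len(fragment):
        raise ValueError(f"position {position} is outside the fragment")
    if fragment[index] != wild_type:
        raise ValueError(
            f"expected {wild_type} at {position}, found {fragment[index]}"
        )
    return fragment[:index] + mutant + fragment[index + 1:]


# mus308(3294): G -> S at position 621; mus308(D5): P -> L at position 781.
allele_3294 = substitute(ROW_601, 601, 621, "G", "S")
allele_d5 = substitute(ROW_751, 751, 781, "P", "L")
print(allele_3294[18:23], allele_d5[28:33])
```

Checking the wild-type residue before substituting guards against off-by-one errors when converting the entry's 1-based positions to 0-based string indices.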

PTM / Processing

Molecule processing

Feature key | Position(s) | Length | Description | Feature identifier
Chain | 1 – 2059 | 2059 | DNA polymerase theta | PRO_0000432704

Proteomic databases

PaxDb: O18475.
PRIDE: O18475.

Expression

Gene expression databases

Bgee: FBgn0002905.
ExpressionAtlas: O18475. differential.
Genevisible: O18475. DM.

Interaction

Protein-protein interaction databases

IntAct: O18475. 4 interactions.
STRING: 7227.FBpp0082131.

Structure

3D structure databases

ProteinModelPortal: O18475.
SMR: O18475. Positions 251-626, 1647-2053.

Family & Domains

Domains and Repeats

Feature key | Position(s) | Length | Description
Domain | 243 – 416 | 174 | Helicase ATP-binding (PROSITE-ProRule annotation)
Domain | 464 – 666 | 203 | Helicase C-terminal (PROSITE-ProRule annotation)

Motif

Feature key | Position(s) | Length | Description
Motif | 357 – 360 | 4 | DEAH box (PROSITE-ProRule annotation)
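
Feature positions in this entry are 1-based and inclusive, so a feature's length is end - start + 1. As a rough illustration (not part of the entry), the sketch below recomputes the domain lengths and extracts the ATP-binding region (256 – 263) and the DEAH box motif (357 – 360) from residues 251-400 of the displayed sequence. The constant and function names are hypothetical, and the fragment is an assumed transcription of three rows from the Sequence section.

```python
# Residues 251-400, copied from the sequence display (spaces removed).
FRAGMENT_START = 251  # 1-based position of FRAGMENT[0] in the full protein
FRAGMENT = (
    "NLVYSAPTSAGKTLVSEILMLKTVLERGKKVLLILPFISVVREKMFYMQD"  # 251-300
    "LLTPAGYRVEGFYGGYTPPGGFESLHVAICTIEKANSIVNKLMEQGKLET"  # 301-350
    "IGMVVVDEVHLISDKGRGYILELLLAKILYMSRRNGLQIQVITMSATLEN"  # 351-400
)


def feature_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def extract(start: int, end: int) -> str:
    """Return the residues of a feature lying within FRAGMENT."""
    last = FRAGMENT_START + len(FRAGMENT) - 1
    if start > end or start < FRAGMENT_START or end > last:
        raise ValueError(f"{start}-{end} is not inside {FRAGMENT_START}-{last}")
    # Convert 1-based inclusive coordinates to a 0-based half-open slice.
    return FRAGMENT[start - FRAGMENT_START:end - FRAGMENT_START + 1]


print(feature_length(243, 416))  # helicase ATP-binding domain -> 174
print(feature_length(464, 666))  # helicase C-terminal domain -> 203
print(extract(256, 263))         # ATP-binding region -> APTSAGKT
print(extract(357, 360))         # DEAH box motif -> DEVH
```

The computed lengths agree with the Length column of the tables above.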

Sequence similarities

Belongs to the DNA polymerase type-A family. (Curated)
Contains 1 helicase ATP-binding domain. (PROSITE-ProRule annotation)
Contains 1 helicase C-terminal domain. (PROSITE-ProRule annotation)

Phylogenomic databases

eggNOG: KOG0950. Eukaryota.
        COG0749. LUCA.
        COG1204. LUCA.
GeneTree: ENSGT00640000091272.
KO: K02349.
OMA: QCFVLES.
OrthoDB: EOG091G005B.
PhylomeDB: O18475.

Family and domain databases

Gene3D: 3.30.420.10. 1 hit.
        3.40.50.300. 3 hits.
InterPro: IPR011545. DEAD/DEAH_box_helicase_dom.
          IPR001098. DNA-dir_DNA_pol_A_palm_dom.
          IPR002298. DNA_polymerase_A.
          IPR014001. Helicase_ATP-bd.
          IPR001650. Helicase_C.
          IPR027417. P-loop_NTPase.
          IPR012337. RNaseH-like_dom.
          IPR011991. WHTH_DNA-bd_dom.
Pfam: PF00270. DEAD. 1 hit.
      PF00476. DNA_pol_A. 1 hit.
      PF00271. Helicase_C. 1 hit.
PRINTS: PR00868. DNAPOLI.
SMART: SM00487. DEXDc. 1 hit.
       SM00490. HELICc. 1 hit.
       SM00482. POLAc. 1 hit.
SUPFAM: SSF46785. SSF46785. 1 hit.
        SSF52540. SSF52540. 3 hits.
PROSITE: PS51192. HELICASE_ATP_BIND_1. 1 hit.
         PS51194. HELICASE_CTER. 1 hit.

Sequence

Sequence status: Complete.

O18475-1 [UniParc]

        10         20         30         40         50
MAFSQSFNFG NSTLMALEKG MQADDKENAQ PGNGNIQVQS AGNEVNSEIQ
60 70 80 90 100
EINSEFFRDE FSYEVNQAHK PAEQSVVNVS QVQQHMAVVS NQDSEDQSRS
110 120 130 140 150
SALNDQICTQ SSFEGEDAGA DAVLDQPNLD ENSFLCPAQD EEASEQLKED
160 170 180 190 200
ILHSHSVLAK QEFYQEISQV TQNLSSMSPN QLRVSPNSSR IREAMPERPA
210 220 230 240 250
MPLDLNTLRS ISAWNLPMSI QAEYKKKGVV DMFDWQVECL SKPRLLFEHC
260 270 280 290 300
NLVYSAPTSA GKTLVSEILM LKTVLERGKK VLLILPFISV VREKMFYMQD
310 320 330 340 350
LLTPAGYRVE GFYGGYTPPG GFESLHVAIC TIEKANSIVN KLMEQGKLET
360 370 380 390 400
IGMVVVDEVH LISDKGRGYI LELLLAKILY MSRRNGLQIQ VITMSATLEN
410 420 430 440 450
VQLLQSWLDA ELYITNYRPV ALKEMIKVGT VIYDHRLKLV RDVAKQKVLL
460 470 480 490 500
KGLENDSDDV ALLCIETLLE GCSVIVFCPS KDWCENLAVQ LATAIHVQIK
510 520 530 540 550
SETVLGQRLR TNLNPRAIAE VKQQLRDIPT GLDGVMSKAI TYACAFHHAG
560 570 580 590 600
LTTEERDIIE ASFKAGALKV LVATSTLSSG VNLPARRVLI RSPLFGGKQM
610 620 630 640 650
SSLTYRQMIG RAGRMGKDTL GESILICNEI NARMGRDLVV SELQPITSCL
660 670 680 690 700
DMDGSTHLKR ALLEVISSGV ANTKEDIDFF VNCTLLSAQK AFHAKEKPPD
710 720 730 740 750
EESDANYIND ALDFLVEYEF VRLQRNEERE TAVYVATRLG AACLASSMPP
760 770 780 790 800
TDGLILFAEL QKSRRSFVLE SELHAVYLVT PYSVCYQLQD IDWLLYVHMW
810 820 830 840 850
EKLSSPMKKV GELVGVRDAF LYKALRGQTK LDYKQMQIHK RFYIALALEE
860 870 880 890 900
LVNETPINVV VHKYKCHRGM LQSLQQMAST FAGIVTAFCN SLQWSTLALI
910 920 930 940 950
VSQFKDRLFF GIHRDLIDLM RIPDLSQKRA RALFDAGITS LVELAGADPV
960 970 980 990 1000
ELEKVLYNSI SFDSAKQHDH ENADEAAKRN VVRNFYITGK AGMTVSEAAK
1010 1020 1030 1040 1050
LLIGEARQFV QHEIGLGTIK WTQTQAGVEI ASRAIHDGGE VDLHMSLEEE
1060 1070 1080 1090 1100
QPPVKRKLSI EENGTANSQK NPRLETVVDT QRGYKVDKNI ANQSKMNPNL
1110 1120 1130 1140 1150
KEIDAQNKAR RNSTAHMDNL NPISNDPCQN NVNVKTAQPI ISNLNDIQKQ
1160 1170 1180 1190 1200
GSQIEKMKIN PATVVCSPQL ANEEKPSTSQ SARRKLVNEG MAERRRVALM
1210 1220 1230 1240 1250
KIQQRTQKEN QSKDQPIQAS RSNQLSSPVN RTPANRWTQS ENPNNEMNNS
1260 1270 1280 1290 1300
QLPRRNPRNQ SPVPNANRTA SRKVSNAEED LFMADDSFML NTGLAAALTA
1310 1320 1330 1340 1350
AESKIASCTE ADVIPSSQPK EPEVIGALTP HASRLKRSDQ LRSQRIQSPS
1360 1370 1380 1390 1400
PTPQREIEID LESKNESNGV SSMEISDMSM ENPLMKNPLH LNASHIMSCS
1410 1420 1430 1440 1450
KVDETASSFS SIDIIDVCGH RNAFQAAIIE INNATRLGFS VGLQAQAGKQ
1460 1470 1480 1490 1500
KPLIGSNLLI NQVAAAENRE AAARERVLFQ VDDTNFISGV SFCLADNVAY
1510 1520 1530 1540 1550
YWNMQIDERA AYQGVPTPLK VQELCNLMAR KDLTLVMHDG KEQLKMLRKA
1560 1570 1580 1590 1600
IPQLKRISAK LEDAKVANWL LQPDKTVNFL NMCQTFAPEC TGLANLCGSG
1610 1620 1630 1640 1650
RGYSSYGLDT SSAILPRIRT AIESCVTLHI LQGQTENLSR IGNGDLLKFF
1660 1670 1680 1690 1700
HDIEMPIQLT LCQMELVGFP AQKQRLQQLY QRMVAVMKKV ETKIYEQHGS
1710 1720 1730 1740 1750
RFNLGSSQAV AKVLGLHRKA KGRVTTSRQV LEKLNSPISH LILGYRKLSG
1760 1770 1780 1790 1800
LLAKSIQPLM ECCQADRIHG QSITYTATGR ISMTEPNLQN VAKEFSIQVG
1810 1820 1830 1840 1850
SDVVHISCRS PFMPTDESRC LLSADFCQLE MRILAHMSQD KALLEVMKSS
1860 1870 1880 1890 1900
QDLFIAIAAH WNKIEESEVT QDLRNSTKQV CYGIVYGMGM RSLAESLNCS
1910 1920 1930 1940 1950
EQEARMISDQ FHQAYKGIRD YTTRVVNFAR SKGFVETITG RRRYLENINS
1960 1970 1980 1990 2000
DVEHLKNQAE RQAVNSTIQG SAADIAKNAI LKMEKNIERY REKLALGDNS
2010 2020 2030 2040 2050
VDLVMHLHDE LIFEVPTGKA KKIAKVLSLT MENCVKLSVP LKVKLRIGRS

WGEFKEVSV
Length: 2,059
Mass (Da): 229,873
Last modified: January 1, 1998 - v1
Checksum: E93B3CD9A5F75F16
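
The sequence display above interleaves ruler lines (position numbers) with rows of residues in blocks of ten. A minimal sketch of turning that layout into a plain residue string follows; the function name `parse_sequence_display` is hypothetical, and the sample is only the first two rows of the display.

```python
import re


def parse_sequence_display(text: str) -> str:
    """Join residues from a ruler-annotated sequence display.

    Digits (ruler numbers) and all whitespace are dropped; anything left
    must be uppercase letters, otherwise ValueError is raised.
    """
    residues = re.sub(r"[\d\s]", "", text)
    if not re.fullmatch(r"[A-Z]*", residues):
        raise ValueError("unexpected characters in sequence display")
    return residues


# First two rows of the display, copied as shown in the entry.
SAMPLE = """        10         20         30         40         50
MAFSQSFNFG NSTLMALEKG MQADDKENAQ PGNGNIQVQS AGNEVNSEIQ
60 70 80 90 100
EINSEFFRDE FSYEVNQAHK PAEQSVVNVS QVQQHMAVVS NQDSEDQSRS
"""

sequence = parse_sequence_display(SAMPLE)
print(len(sequence))  # 100 residues in the two sample rows
```

Applied to the full display, the resulting string would be expected to have the length stated above (2,059), which gives a quick consistency check on a transcription.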

Experimental Info

Feature key | Position(s) | Length | Description
Sequence conflict | 1930 – 1930 | 1 | R → H in AAX33507 (Ref. 4). (Curated)

Sequence databases

EMBL / GenBank / DDBJ:
L76559 Genomic DNA. Translation: AAB67306.1.
AE014297 Genomic DNA. Translation: AAF54858.1.
BT021359 mRNA. Translation: AAX33507.1.
BT044169 mRNA. Translation: ACH92234.1.
PIR: T13858.
RefSeq: NP_524333.1. NM_079609.3.
UniGene: Dm.23677.

Genome annotation databases

EnsemblMetazoa: FBtr0082662; FBpp0082131; FBgn0002905.
GeneID: 41571.
KEGG: dme:Dmel_CG6019.
UCSC: CG6019-RA. d. melanogaster.

Cross-references

Organism-specific databases

CTD: 41571.
FlyBase: FBgn0002905. mus308.

Miscellaneous databases

GenomeRNAi: 41571.
PRO: O18475.

Entry information

Entry name: DPOLQ_DROME
Accession: Primary (citable) accession number: O18475
Secondary accession number(s): Q5BI65
Entry history
Integrated into UniProtKB/Swiss-Prot: April 1, 2015
Last sequence update: January 1, 1998
Last modified: September 7, 2016
This is version 141 of the entry and version 1 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Drosophila annotation project

Miscellaneous

Keywords - Technical term

Complete proteome, Reference proteome

Documents

  1. Drosophila
    Drosophila: entries, gene names and cross-references to FlyBase
  2. SIMILARITY comments
    Index of protein domains and families

Similar proteins

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%: UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%: UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a. seed sequence).
50%: UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.
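
The clustering criteria described above can be written as simple threshold checks. This is only a sketch of the stated criteria, not the UniRef clustering procedure itself: it assumes identity and overlap have already been computed elsewhere as fractions between 0 and 1, and all names are hypothetical.

```python
# Thresholds as stated in the descriptions above.
MIN_IDENTITY = {"UniRef90": 0.90, "UniRef50": 0.50}
MIN_OVERLAP = 0.80           # overlap with the longest (seed) sequence
MIN_FRAGMENT_RESIDUES = 11   # UniRef100 sub-fragment minimum


def is_uniref100_member(sequence: str, representative: str) -> bool:
    """True if sequence is identical to, or a sub-fragment (>= 11 residues)
    of, the representative sequence."""
    if sequence == representative:
        return True
    return (len(sequence) >= MIN_FRAGMENT_RESIDUES
            and sequence in representative)


def meets_cluster_threshold(level: str, identity: float, overlap: float) -> bool:
    """True if precomputed identity/overlap fractions satisfy the stated
    UniRef90 or UniRef50 criteria relative to the seed sequence."""
    if level not in MIN_IDENTITY:
        raise ValueError(f"unknown level: {level}")
    return identity >= MIN_IDENTITY[level] and overlap >= MIN_OVERLAP


print(meets_cluster_threshold("UniRef90", identity=0.95, overlap=0.85))  # True
print(meets_cluster_threshold("UniRef90", identity=0.95, overlap=0.70))  # False
```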