Skip Header

Contribute Send feedback
Read comments (?) or add your own

O62218 (O62218_CAEEL) Unreviewed, UniProtKB/TrEMBL

Last modified May 1, 2013. Version 92. Feed History...

Clusters with 100%, 90%, 50% identity | Third-party data text xml rdf/xml gff fasta
to top of pageNames·Attributes·General annotation·Ontologies·Sequences·References·Cross-refs·Entry infoCustomize order

Names and origin

Protein namesRecommended name:
DNA polymerase RuleBase RU000442

EC=2.7.7.7 RuleBase RU000442
Gene names
ORF Names:CELE_F33H2.5 EMBL CAB04263.1, F33H2.5 EMBL CAB04263.1 WormBase F33H2.5
OrganismCaenorhabditis elegans [Reference proteome] EMBL CAB04263.1
Taxonomic identifier6239 [NCBI]
Taxonomic lineageEukaryotaMetazoaEcdysozoaNematodaChromadoreaRhabditidaRhabditoideaRhabditidaePeloderinaeCaenorhabditis

Protein attributes

Sequence length2144 AA.
Sequence statusComplete.
Protein existenceInferred from homology

General annotation (Comments)

Catalytic activity

Deoxynucleoside triphosphate + DNA(n) = diphosphate + DNA(n+1). RuleBase RU000442

Sequence similarities

Belongs to the DNA polymerase type-B family. RuleBase RU000442

Sequences

Sequence LengthMass (Da)Tools
O62218 [UniParc].

Last modified August 1, 1998. Version 1.
Checksum: D73E820585EB1B75

FASTA2,144244,709
        10         20         30         40         50         60 
MSSKDDILAQ AVENDSNYKE RLALIRSNDE IDAKLGFSRY TGLQEKKGFL INIQPSELVD 

        70         80         90        100        110        120 
EQTKVIISVV DYFFISDMDE RFKISYPFRP YFYIATLDGF EFQVSSYLSK KYGAQTAVEH 

       130        140        150        160        170        180 
MDREDLDLKD HLSGLKKTYI KLSFTSTVEL IKIRKELMPL VRKNTDRIKK ESAYADYLAR 

       190        200        210        220        230        240 
NLSGKGGDSK DQQLNGDILN QIVDIREYDV PFHMRVSIDE KIFVGLWYDV KGIGPNRVPT 

       250        260        270        280        290        300 
IKRKDLPLFH AKPKVLAFDI ETTKLPLKFP DRESDEIMMI SYMVDGRGFL IINREIVSAD 

       310        320        330        340        350        360 
INAFEYTPKA EYIGEFTVWN EKDEAALIRK FFDHFLQVRP NIVVTYNGDF FDWPFVEARA 

       370        380        390        400        410        420 
KIRGFNMERE IGFSKDSADE YKSRNCIHMD AFRWVKRDSY LPVGSQNLKA VTKAKLRYDP 

       430        440        450        460        470        480 
VEVEPELMCK MAREQPQQLA NYSVSDAVST YYLYMKYVHQ FIFALCTIIP LGADDVLRKG 

       490        500        510        520        530        540 
SGTLCEALLM VEAFHNNIVF PNKYTGPEET RFSKDGHRVE SETYVGGHVE ALEAGVFRAD 

       550        560        570        580        590        600 
IPAKFRLSVP ALEQLKSEIQ ETLRKELARE FEVTLDQVVD FDEQCAEVQD AFDGMINVPT 

       610        620        630        640        650        660 
RLENPRIYHL DVGAMYPNII LTNRLQPCAM VTEEICMGCS YNKPDAECKR TMAWEWRGEL 

       670        680        690        700        710        720 
TPATRGEYQQ IMQQLEAESF GKPPKHFHML ERSEREAIEM KRIKDYSRRV YGKTHLTRLE 

       730        740        750        760        770        780 
MRETTICQRE NHFYVETVKA FRDRRYEYKD MLKKAKGRFD QAQATNDLAT MTTSKLEMVL 

       790        800        810        820        830        840 
YESLQLAHKC ILNSFYGYVM RKGSRWFSME MAGIVCHTGA NIIREARKLV EQIGTPLELD 

       850        860        870        880        890        900 
TDGIWCLIPA SFPENVTFKL KNHKRSSVTV SYPGAMLNAL VYEGFTNHQY HTLEKDGSYS 

       910        920        930        940        950        960 
KSSENSIYFE VDGPYQCMIL PASKEEGKKL KKRYAVFNLD GSLAEMKGFE LKRRGELNII 

       970        980        990       1000       1010       1020 
KHFQGCVFKT FLNGKTLEET YKAVAADADH WLDILHSHGA DLTDEELFDL ISENRSMSRK 

      1030       1040       1050       1060       1070       1080 
LEDYGAQKST SISTAKRLAE FLGDDMVKDA GLACMFIISK HPIGAPVTER AIPVAIFKSD 

      1090       1100       1110       1120       1130       1140 
AKVRSHYIRK WTKQVDFNED TDIRDMLDWD YYLERFGSCI QKIITIPAAL QGISNPVPRV 

      1150       1160       1170       1180       1190       1200 
PHPDWLQNKI RNKFDAHRQP RINQIFAACQ KPSTSQMDNG KRRRTPDDDV ASEDAMDSQD 

      1210       1220       1230       1240       1250       1260 
DIIIDDDKEN GAKRQKNTKK VHTTEVVLEK KTLVEHGFDE WMGFLKKKWR VQRKERKTQL 

      1270       1280       1290       1300       1310       1320 
SSKDSDVVEA IVRGAREAEH DKEWHILSVE PTADASFFNV WLAVQGQMHK VIMKIGRRII 

      1330       1340       1350       1360       1370       1380 
VDSRAPRGDR DTIRRILPHH KTPGFLYEFR TDENQLTALM DKLYSETCSS TIDGIYESEV 

      1390       1400       1410       1420       1430       1440 
PTSFRAVLQL GSIVRPDHGI SLGGHQLTLE NLKPMEKAPY LPLDQKIRTI FLYKFSQDSR 

      1450       1460       1470       1480       1490       1500 
HVYSLIDSSG SAAYFYIVNT GDVQMPNMDS LYTSAYTKMM STERGQLCHT SESMPFTVKR 

      1510       1520       1530       1540       1550       1560 
FSSNTECERQ LGRALRVYRE VSSKTAIVLL LSDTDPFRLA RKLPNLGLFP NVQLHITEPS 

      1570       1580       1590       1600       1610       1620 
SLLNQIDWQK VVARRVLQHY FNSFFFLADY LEWARYLRVP IGNLPADHAL FGLDLLFARN 

      1630       1640       1650       1660       1670       1680 
LQKSGHALWA TRASRPDLGG KEMDDVRLSV DWNPLSIDDT VLLNRETFCE TACVELQLSA 

      1690       1700       1710       1720       1730       1740 
VAVTALVQRS RVLEAEGADD VVTFDSMNTI AQQSVTGGAV NSIACYDEGA AVDATIKVLK 

      1750       1760       1770       1780       1790       1800 
QMLTECVRHI AHQGNARADE VVMTVSRWLN TRSALLFDAA LTRSVSVLES KLVLLLCAEC 

      1810       1820       1830       1840       1850       1860 
ERIGAKVIHA TAQKLVLNTG KSTSEEAKGF AEMLIQSLST NVVFAALHIT PVKFFDAMLW 

      1870       1880       1890       1900       1910       1920 
MDAHNHTGIR ISEKTESSPD VIADEEESSC TEFETTAIWK IAEEMPTEAN IQEEFLQMIG 

      1930       1940       1950       1960       1970       1980 
AYILEFLETN RKMHFDSESG ATFRSDCISQ KISHRLYRIV NKMVHNNADI AHCSVYLANA 

      1990       2000       2010       2020       2030       2040 
LCRALSCDQT SQLAVEGIRD NAKRLLHNSV VEADMTPLRS TTLFVSNVFC NSCSQASNVF 

      2050       2060       2070       2080       2090       2100 
LSSTDEILTC ATCQSKLNSD VIDMMICDRL NQLLTAYQIQ DHQCTKCKSV RHDTLSMYCE 

      2110       2120       2130       2140 
CCSQFIPQIT PAQLKHEAST VETVSIVRNF ALSSELATWV LKML 

« Hide

References

[1]"Genome sequence of the nematode C. elegans: a platform for investigating biology."
Caenorhabditis elegans Sequencing Consortium
Science 282:2012-2018(1998) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: Bristol N2 EMBL CAB04263.1.
+Additional computationally mapped references.

Cross-references

Sequence databases

EMBL
GenBank
DDBJ
Z81526 Genomic DNA. Translation: CAB04263.1.
PIRT21712.
RefSeqNP_493616.1. NM_061215.3.

3D structure databases

ProteinModelPortalO62218.
SMRO62218. Positions 82-573, 759-1039.
ModBaseSearch...

Protein-protein interaction databases

STRING6239.F33H2.5.
O62218.

Proteomic databases

PaxDbO62218.

Protocols and materials databases

StructuralBiologyKnowledgebaseSearch...

Genome annotation databases

EnsemblMetazoaF33H2.5; F33H2.5; F33H2.5.
GeneID173368.
KEGGcel:CELE_F33H2.5.
UCSCF33H2.5. c. elegans.

Organism-specific databases

CTD173368.
WormBaseF33H2.5; CE17767; WBGene00009368.

Phylogenomic databases

eggNOGCOG0417.
GeneTreeENSGT00390000010194.
HOGENOMHOG000196287.
InParanoidO62218.
KOK02324.
OMAQEESQDL.

Family and domain databases

InterProIPR006172. DNA-dir_DNA_pol_B.
IPR006133. DNA-dir_DNA_pol_B_exonuc.
IPR006134. DNA-dir_DNA_pol_B_multi_dom.
IPR013697. DNA_pol_e_suA_C.
IPR012337. RNaseH-like_dom.
[Graphical view]
PfamPF00136. DNA_pol_B. 1 hit.
PF03104. DNA_pol_B_exo1. 1 hit.
PF08490. DUF1744. 1 hit.
[Graphical view]
SMARTSM00486. POLBc. 1 hit.
[Graphical view]
SUPFAMSSF53098. RNaseH_fold. 1 hit.
ProtoNetSearch...

Other

NextBio879359.

Entry information

Entry nameO62218_CAEEL
AccessionPrimary (citable) accession number: O62218
Entry history
Integrated into UniProtKB/TrEMBL: August 1, 1998
Last sequence update: August 1, 1998
Last modified: May 1, 2013
This is version 92 of the entry and version 1 of the sequence. [Complete history]
Entry statusUnreviewed (UniProtKB/TrEMBL)