Unreviewed, UniProtKB/TrEMBL P71140 (P71140_CLOTM)

Last modified June 16, 2009. Version 56.

Names and origin

Protein names: Submitted name:
    Endoglucanase J (EMBL BAA12070.1)
Gene names:
    Name: celJ (EMBL BAA12070.1)
Organism: Clostridium thermocellum (EMBL BAA12070.1)
Taxonomic identifier: 1515 [NCBI]
Taxonomic lineage: Bacteria > Firmicutes > Clostridia > Clostridiales > Clostridiaceae > Clostridium

Protein attributes

Sequence length: 1601 AA.
Sequence status: Complete.
Sequence processing: The displayed sequence is not processed.
Protein existence: Evidence at protein level.

Ontologies

Gene Ontology (GO)
   Biological process: polysaccharide catabolic process
      Inferred from electronic annotation. Source: InterPro
   Molecular function: cellulase activity
      Inferred from electronic annotation. Source: InterPro

Sequences

Sequence: P71140-1 [UniParc]
Length: 1,601 AA
Mass: 178,059 Da
Last modified February 1, 1997. Version 1.
Checksum: 31E85D77F8642565

        10         20         30         40         50         60 
MAKRRLSLLL VLAIMFTMVV PQISASAETV APEGYRKLLD VQIFKDSPVV GWSGSGMGEL 

        70         80         90        100        110        120 
ETIGDTLPVD TTVTYNGLPT LRLNVQTTVQ SGWWISLLTL RGWNTHDLSQ YVENGYLEFD 

       130        140        150        160        170        180 
IKGKEGGEDF VIGFRDKVYE RVYGLEIDVT TVISNYVTVT TDWQHVKIPL RDLMKINNGF 

       190        200        210        220        230        240 
DPSSVTCLVF SKRYADPFTV WFSDIKITSE DNEKSAPAIK VNQLGFIPEA EKYALVTGFA 

       250        260        270        280        290        300 
EELAVSEGDE FAVINAADNS VAYTGKLTLV TEYEPLDSGE KILKADFSDL TVPGKYYISI 

       310        320        330        340        350        360 
EGLDNSPKFE IGEGIYGPLV VDAARYFYYQ RQGIELEEPY AQGYPRKDVT PQDAYAVFAS 

       370        380        390        400        410        420 
GKKDPIDITK GWYDAGDFGK YVNAGATGVS DLFWAYEMFP SQFVDGQFNI PESGNGVPDI 

       430        440        450        460        470        480 
LDEARWELEW MLKMQDKESG GFYPRVQSDN DENIKSRIIR DQNGCTTDDT ACAAGILAHA 

       490        500        510        520        530        540 
YLIYKDIDPD FAQECLDAAI NAWKFLEKNP ENIVSPPGPY NVYDDSGDRL WAAASLYRAT 

       550        560        570        580        590        600 
GEEVYHTYFK QNYKSFAQKF ESPTAYAHTW GDMWLTAFLS YLKAENKDQE VVDWIDTEFG 

       610        620        630        640        650        660 
IWLENILTRY ENNPWKNAIV PGNYFWGINM QVMNVPMDAI IGSQLLGKYS DRIEKLGFGS 

       670        680        690        700        710        720 
LNWLLGTNPL RFSFVSGYGE DSVKGVFSNI YNTDGKQGIP KGYMPGGPNA YEGAGLSRFA 

       730        740        750        760        770        780 
AKCYTRSTGD WVANEHTVYW NSALVFMAAF ANQGSEVNPG PAPEPGVTPN PTEPAKVVDI 

       790        800        810        820        830        840 
RIDTSAERKP ISPYIYGSNQ ELDATVTAKR FGGNRTTGYN WENNFSNAGS DWLHYSDTYL 

       850        860        870        880        890        900 
LEDGGVPKGE WSTPASVVTT FHDKALSKNV PYTLITLQAA GYVSADGNGP VSQEETAPSS 

       910        920        930        940        950        960 
RWKEVKFEKG APFSLTPDTE DDYVYMDEFV NYLVNKYGNA STPTGIKGYS IDNEPALWSH 

       970        980        990       1000       1010       1020 
THPRIHPDNV TAKELIEKSV ALSKAVKKVD PYAEIFGPAL YGFAAYETLQ SAPDWGTEGE 

      1030       1040       1050       1060       1070       1080 
GYRWFIDYYL DKMKKASDEE GKRLLDVLDV HWYPEARGGG ERICFGADPR NIETNKARLQ 

      1090       1100       1110       1120       1130       1140 
APRTLWDPTY IEDSWIGQWK KDFLPILPNL LDSIEKYYPG TKLAITEYDY GGGNHITGGI 

      1150       1160       1170       1180       1190       1200 
AQADVLGIFG KYGVYLATFW GDASNNYTEA GINLYTNYDG KGGKFGDTSV KCETSDIEVS 

      1210       1220       1230       1240       1250       1260 
SAYASIVGED DSKLHIILLN KNYDQPTTFN FSIDSSKNYT IGNVWAFDRG SSNITQRTPI 

      1270       1280       1290       1300       1310       1320 
VNIKDNTFTY TVPALTACHI VLEAAEPVVY GDLNNDSKVN AVDIMMLKRY ILGIIDNINL 

      1330       1340       1350       1360       1370       1380 
TAADIYFDGV VNSSDYNIMK RYLLKAIEDI PYVPENQAPK AIFTFSPEDP VTDENVVFNA 

      1390       1400       1410       1420       1430       1440 
SNSIDEDGTI AYYAWDFGDG YEGTSTTPTI TYKYKNPGTY KVKLIVTDNQ GASSSFTATI 

      1450       1460       1470       1480       1490       1500 
KVTSATGDNS KFNFEDGTLG GFTTSGTNAT GVVVNTTEKA FKGERGLKWT VTSEGEGTAE 

      1510       1520       1530       1540       1550       1560 
LKLDGGTIVV PGTTMTFRIW IPSGAPIAAI QPYIMPHTPD WSEVLWNSTW KGYTMVKTDD 

      1570       1580       1590       1600 
WNEITLTLPE DVDPTWPQQM GIQVQTIDEG EFTIYVDAID W 
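
The sequence above is shown in display layout: ruler lines of positions, then residues in space-separated groups of ten. A minimal Python sketch for turning that layout back into a plain residue string follows. The function name `clean_displayed_sequence` is hypothetical, and the sketch assumes that ruler lines contain only digits and spaces. Only the first displayed row is used here for brevity.

```python
def clean_displayed_sequence(text: str) -> str:
    """Join a displayed protein sequence into one residue string.

    Ruler lines (digits and spaces only) are skipped, and the spaces
    that separate the 10-residue groups are removed.
    """
    residues = []
    for line in text.splitlines():
        stripped = line.strip()
        # Skip blank lines and position rulers such as "10   20   30".
        if not stripped or stripped.replace(" ", "").isdigit():
            continue
        residues.append(stripped.replace(" ", ""))
    return "".join(residues)


# First displayed row of P71140-1, copied from the entry.
first_row = """
        10         20         30         40         50         60
MAKRRLSLLL VLAIMFTMVV PQISASAETV APEGYRKLLD VQIFKDSPVV GWSGSGMGEL
"""

sequence = clean_displayed_sequence(first_row)
print(len(sequence))  # 60
```

Applied to the full displayed block, the resulting string length can be compared with the stated sequence length of 1,601 AA as a quick check that no row was lost in copying.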


References

[1] "Cloning, DNA sequencing, and expression of the gene encoding Clostridium thermocellum cellulase CelJ, the largest catalytic component of the cellulosome."
Ahsan M.M., Kimura T., Karita S., Sakka K., Ohmiya K.
J. Bacteriol. 178:5732-5740(1996) [PubMed: 8824619]
Cited for: NUCLEOTIDE SEQUENCE.
Strain: F1 (EMBL BAA12070.1).

Cross-references

Sequence databases

D83704 Genomic DNA. Translation: BAA12070.1.

3D structure databases

Entry   Method   Resolution (Å)   Chain   Positions
2C24    X-ray    2.27             A/B     24-220
2E0P    X-ray    1.60             A       773-1287
2E4T    X-ray    0.96             A       773-1287
2EEX    X-ray    2.00             A       773-1287
2EJ1    X-ray    1.80             A       773-1287
2EO7    X-ray    1.75             A       773-1287
2EQD    X-ray    2.80             A       773-1287

SMR: P71140. Positions 34-228.
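
The Positions column gives 1-based, inclusive residue ranges on the 1,601-residue sequence. As a rough illustration of how much of the protein each crystal structure covers, the following Python sketch computes the span of a range. The helper name `span_length` is hypothetical; the two ranges are taken from the table in this entry.

```python
def span_length(positions: str) -> int:
    """Number of residues in a 1-based inclusive range such as '24-220'."""
    start, end = (int(part) for part in positions.split("-"))
    return end - start + 1


# Distinct position ranges listed for the PDB entries of P71140.
structure_ranges = {
    "2C24": "24-220",
    "2E0P": "773-1287",
}

for entry, positions in structure_ranges.items():
    print(entry, positions, span_length(positions))
# 2C24 24-220 197
# 2E0P 773-1287 515
```

Because the ranges are inclusive, the span is `end - start + 1` rather than `end - start`.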

Protein family/group databases

CAZy:
    CBM30. Carbohydrate-Binding Module Family 30.
    CBM44. Carbohydrate-Binding Module Family 44.
    GH44. Glycoside Hydrolase Family 44.
    GH9. Glycoside Hydrolase Family 9.

Family and domain databases

InterPro:
    IPR012341. 6hp_glycosidase.
    IPR016134. Cellulos_enz_dockerin_1.
    IPR002105. Cellulos_enz_dockerin_1_Ca-bd.
    IPR018242. Dockerin_1.
    IPR001701. Glyco_hydro_9.
    IPR004197. Glyco_hydro_9_Ig-like.
    IPR000601. PKD.
Gene3D:
    G3DSA:1.50.10.10. CelA/Cel48F_cat. 1 hit.
    G3DSA:1.10.1330.10. Cellulos_enz_dockerin_1. 1 hit.
PANTHER:
    PTHR22298:SF3. Glyco_hydro_9. 1 hit.
Pfam:
    PF02927. CelD_N. 1 hit.
    PF00404. Dockerin_1. 2 hits.
    PF00759. Glyco_hydro_9. 1 hit.
    PF00801. PKD. 1 hit.
SMART:
    SM00089. PKD. 1 hit.
PROSITE:
    PS00448. CLOS_CELLULOSOME_RPT. 1 hit. Uncertain.
    PS50093. PKD. 1 hit.

Entry information

Entry name: P71140_CLOTM
Accession: Primary (citable) accession number: P71140
Entry history:
    Integrated into UniProtKB/TrEMBL: February 1, 1997
    Last sequence update: February 1, 1997
    Last modified: June 16, 2009
    This is version 56 of the entry and version 1 of the sequence.
Entry status: Unreviewed (UniProtKB/TrEMBL)