P71140 (P71140_CLOTM) Unreviewed, UniProtKB/TrEMBL

Last modified September 21, 2011. Version 72.


Names and origin

Protein names
Gene names: celJ (EMBL: BAA12070.1)
Organism: Clostridium thermocellum (EMBL: BAA12070.1)
Taxonomic identifier: 1515 [NCBI]
Taxonomic lineage: Bacteria > Firmicutes > Clostridia > Clostridiales > Clostridiaceae > Clostridium

Protein attributes

Sequence length: 1601 AA.
Sequence status: Complete.
Protein existence: Evidence at protein level

Sequence annotation (Features)

Feature key    Position(s)  Length  Description                      PDB entries

Sites

Metal binding  803          1       Zinc                             2E0P, 2E4T, 2EEX, 2EJ1, 2EO7, 2EQD
Metal binding  822          1       Calcium 1                        2E0P, 2E4T, 2EEX, 2EJ1, 2EO7, 2EQD
Metal binding  918          1       Calcium 1; via carbonyl oxygen   2E0P, 2E4T, 2EEX, 2EJ1, 2EO7, 2EQD
Metal binding  921          1       Calcium 1                        2E0P, 2E4T, 2EEX, 2EJ1, 2EO7, 2EQD
Metal binding  923          1       Calcium 1; via carbonyl oxygen   2E0P, 2E4T, 2EEX, 2EJ1, 2EO7, 2EQD
Metal binding  1163         1       Zinc                             2E0P, 2E4T, 2EEX, 2EJ1, 2EO7, 2EQD
Metal binding  1169         1       Zinc                             2E0P, 2E4T, 2EEX, 2EJ1, 2EO7, 2EQD
Metal binding  1356         1       Calcium 2                        2C4X
Metal binding  1357         1       Calcium 2; via carbonyl oxygen   2C4X
Metal binding  1385         1       Calcium 2                        2C4X
Metal binding  1387         1       Calcium 2                        2C4X
Metal binding  1428         1       Calcium 2                        2C4X
Metal binding  1448         1       Calcium 3                        2C4X
Metal binding  1453         1       Calcium 3; via carbonyl oxygen   2C4X
Metal binding  1455         1       Calcium 3                        2C4X
Metal binding  1482         1       Calcium 3; via carbonyl oxygen   2C4X
Metal binding  1485         1       Calcium 3; via carbonyl oxygen   2C4X
Metal binding  1597         1       Calcium 3                        2C4X

Sequences

Sequence: P71140 [UniParc]
Length: 1,601
Mass (Da): 178,059
Last modified February 1, 1997. Version 1.
Checksum: 31E85D77F8642565

        10         20         30         40         50         60 
MAKRRLSLLL VLAIMFTMVV PQISASAETV APEGYRKLLD VQIFKDSPVV GWSGSGMGEL 

        70         80         90        100        110        120 
ETIGDTLPVD TTVTYNGLPT LRLNVQTTVQ SGWWISLLTL RGWNTHDLSQ YVENGYLEFD 

       130        140        150        160        170        180 
IKGKEGGEDF VIGFRDKVYE RVYGLEIDVT TVISNYVTVT TDWQHVKIPL RDLMKINNGF 

       190        200        210        220        230        240 
DPSSVTCLVF SKRYADPFTV WFSDIKITSE DNEKSAPAIK VNQLGFIPEA EKYALVTGFA 

       250        260        270        280        290        300 
EELAVSEGDE FAVINAADNS VAYTGKLTLV TEYEPLDSGE KILKADFSDL TVPGKYYISI 

       310        320        330        340        350        360 
EGLDNSPKFE IGEGIYGPLV VDAARYFYYQ RQGIELEEPY AQGYPRKDVT PQDAYAVFAS 

       370        380        390        400        410        420 
GKKDPIDITK GWYDAGDFGK YVNAGATGVS DLFWAYEMFP SQFVDGQFNI PESGNGVPDI 

       430        440        450        460        470        480 
LDEARWELEW MLKMQDKESG GFYPRVQSDN DENIKSRIIR DQNGCTTDDT ACAAGILAHA 

       490        500        510        520        530        540 
YLIYKDIDPD FAQECLDAAI NAWKFLEKNP ENIVSPPGPY NVYDDSGDRL WAAASLYRAT 

       550        560        570        580        590        600 
GEEVYHTYFK QNYKSFAQKF ESPTAYAHTW GDMWLTAFLS YLKAENKDQE VVDWIDTEFG 

       610        620        630        640        650        660 
IWLENILTRY ENNPWKNAIV PGNYFWGINM QVMNVPMDAI IGSQLLGKYS DRIEKLGFGS 

       670        680        690        700        710        720 
LNWLLGTNPL RFSFVSGYGE DSVKGVFSNI YNTDGKQGIP KGYMPGGPNA YEGAGLSRFA 

       730        740        750        760        770        780 
AKCYTRSTGD WVANEHTVYW NSALVFMAAF ANQGSEVNPG PAPEPGVTPN PTEPAKVVDI 

       790        800        810        820        830        840 
RIDTSAERKP ISPYIYGSNQ ELDATVTAKR FGGNRTTGYN WENNFSNAGS DWLHYSDTYL 

       850        860        870        880        890        900 
LEDGGVPKGE WSTPASVVTT FHDKALSKNV PYTLITLQAA GYVSADGNGP VSQEETAPSS 

       910        920        930        940        950        960 
RWKEVKFEKG APFSLTPDTE DDYVYMDEFV NYLVNKYGNA STPTGIKGYS IDNEPALWSH 

       970        980        990       1000       1010       1020 
THPRIHPDNV TAKELIEKSV ALSKAVKKVD PYAEIFGPAL YGFAAYETLQ SAPDWGTEGE 

      1030       1040       1050       1060       1070       1080 
GYRWFIDYYL DKMKKASDEE GKRLLDVLDV HWYPEARGGG ERICFGADPR NIETNKARLQ 

      1090       1100       1110       1120       1130       1140 
APRTLWDPTY IEDSWIGQWK KDFLPILPNL LDSIEKYYPG TKLAITEYDY GGGNHITGGI 

      1150       1160       1170       1180       1190       1200 
AQADVLGIFG KYGVYLATFW GDASNNYTEA GINLYTNYDG KGGKFGDTSV KCETSDIEVS 

      1210       1220       1230       1240       1250       1260 
SAYASIVGED DSKLHIILLN KNYDQPTTFN FSIDSSKNYT IGNVWAFDRG SSNITQRTPI 

      1270       1280       1290       1300       1310       1320 
VNIKDNTFTY TVPALTACHI VLEAAEPVVY GDLNNDSKVN AVDIMMLKRY ILGIIDNINL 

      1330       1340       1350       1360       1370       1380 
TAADIYFDGV VNSSDYNIMK RYLLKAIEDI PYVPENQAPK AIFTFSPEDP VTDENVVFNA 

      1390       1400       1410       1420       1430       1440 
SNSIDEDGTI AYYAWDFGDG YEGTSTTPTI TYKYKNPGTY KVKLIVTDNQ GASSSFTATI 

      1450       1460       1470       1480       1490       1500 
KVTSATGDNS KFNFEDGTLG GFTTSGTNAT GVVVNTTEKA FKGERGLKWT VTSEGEGTAE 

      1510       1520       1530       1540       1550       1560 
LKLDGGTIVV PGTTMTFRIW IPSGAPIAAI QPYIMPHTPD WSEVLWNSTW KGYTMVKTDD 

      1570       1580       1590       1600 
WNEITLTLPE DVDPTWPQQM GIQVQTIDEG EFTIYVDAID W 
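
The sequence above is printed in groups of ten residues, with a ruler line of position numbers over each row. A minimal Python sketch, assuming the block has been pasted into a string, shows one way to collapse it into a single contiguous sequence. The function name `collapse_sequence_block` is hypothetical and not part of any UniProt tool.

```python
def collapse_sequence_block(block: str) -> str:
    """Join a ruler-annotated sequence block into one residue string."""
    residues = []
    for line in block.splitlines():
        tokens = line.split()
        # Ruler lines contain only position numbers (10, 20, ...); skip
        # them, along with blank lines.
        if not tokens or all(tok.isdigit() for tok in tokens):
            continue
        # Residue lines are space-separated groups of ten letters.
        residues.append("".join(tokens))
    return "".join(residues)


# First two rows of this entry, used as a small example.
example = """
        10         20         30         40         50         60
MAKRRLSLLL VLAIMFTMVV PQISASAETV APEGYRKLLD VQIFKDSPVV GWSGSGMGEL

        70         80         90        100        110        120
ETIGDTLPVD TTVTYNGLPT LRLNVQTTVQ SGWWISLLTL RGWNTHDLSQ YVENGYLEFD
"""
seq = collapse_sequence_block(example)
print(len(seq))  # 120
```

Applied to the full block, the result can be checked against the stated sequence length of 1601 AA. Feature positions in the entry are 1-based, so the residue at position 803 would be `seq[802]`.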

References

[1] "Cloning, DNA sequencing, and expression of the gene encoding Clostridium thermocellum cellulase CelJ, the largest catalytic component of the cellulosome."
Ahsan M.M., Kimura T., Karita S., Sakka K., Ohmiya K.
J. Bacteriol. 178:5732-5740(1996) [PubMed: 8824619]
Cited for: NUCLEOTIDE SEQUENCE.
Strain: F1 (EMBL: BAA12070.1).
[2] "Crystal Structure of Family 30 Carbohydrate Binding Module."
Horiguchi Y., Kono M., Suzuki A., Yamane T., Arai M., Sakka K., Omiya K.
Submitted (JUL-2004) to the PDB data bank
Cited for: X-RAY CRYSTALLOGRAPHY (2.00 ANGSTROMS) OF 31-235.
[3] "Crystal structure of Cel44A, a glycoside hydrolase family 44 endoglucanase from Clostridium thermocellum."
Kitago Y., Karita S., Watanabe N., Kamiya M., Aizawa T., Sakka K., Tanaka I.
J. Biol. Chem. 282:35703-35711(2007) [PubMed: 17905739]
Cited for: X-RAY CRYSTALLOGRAPHY (0.96 ANGSTROMS) OF 773-1287 IN COMPLEX WITH CALCIUM AND ZINC.
[4] "Xyloglucan is recognized by carbohydrate-binding modules that interact with beta-glucan chains."
Najmudin S., Guerreiro C.I., Carvalho A.L., Prates J.A., Correia M.A., Alves V.D., Ferreira L.M., Romao M.J., Gilbert H.J., Bolam D.N., Fontes C.M.
J. Biol. Chem. 281:8815-8828(2006) [PubMed: 16314409]
Cited for: X-RAY CRYSTALLOGRAPHY (2.00 ANGSTROMS) OF 1353-1601 IN COMPLEX WITH CALCIUM.

Cross-references

Sequence databases

EMBL / GenBank / DDBJ: D83704. Genomic DNA. Translation: BAA12070.1.

3D structure databases

PDBe / RCSB PDB / PDBj

Entry  Method  Resolution (Å)  Chain  Positions
1WMX   X-ray   2.00            A/B    31-235
2C24   X-ray   2.27            A/B    24-220
2C4X   X-ray   2.00            A      1353-1601
2E0P   X-ray   1.60            A      773-1287
2E4T   X-ray   0.96            A      773-1287
2EEX   X-ray   2.00            A      773-1287
2EJ1   X-ray   1.80            A      773-1287
2EO7   X-ray   1.75            A      773-1287
2EQD   X-ray   2.80            A      773-1287

ProteinModelPortal: P71140.
SMR: P71140. Positions 34-228, 775-1283, 1354-1601.

Protein family/group databases

CAZy: CBM30. Carbohydrate-Binding Module Family 30.
      CBM44. Carbohydrate-Binding Module Family 44.
      GH44. Glycoside Hydrolase Family 44.
      GH9. Glycoside Hydrolase Family 9.

Family and domain databases

InterPro: IPR008928. 6-hairpin_glycosidase-like.
          IPR012341. 6hp_glycosidase.
          IPR016134. Cellulos_enz_dockerin_1.
          IPR018242. Dockerin_1.
          IPR008979. Galactose-bd-like.
          IPR001701. Glyco_hydro_9.
          IPR004197. Glyco_hydro_9_Ig-like.
          IPR017853. Glycoside_hydrolase_SF.
          IPR013783. Ig-like_fold.
          IPR014756. Ig_E-set.
          IPR022409. PKD/Chitinase_dom.
          IPR000601. PKD_dom.
Gene3D: G3DSA:1.50.10.10. CelA/Cel48F_cat. 1 hit.
        G3DSA:1.10.1330.10. Cellulos_enz_dockerin_1. 1 hit.
        G3DSA:2.60.40.10. Ig-like_fold. 1 hit.
        G3DSA:2.60.40.670. PKD. 1 hit.
Pfam: PF02927. CelD_N. 1 hit.
      PF00404. Dockerin_1. 2 hits.
      PF00759. Glyco_hydro_9. 1 hit.
      PF00801. PKD. 1 hit.
SMART: SM00089. PKD. 1 hit.
SUPFAM: SSF63446. Cellulos_enz_dockerin_1. 1 hit.
        SSF49785. Gal_bind_like. 2 hits.
        SSF51445. Glyco_hydro_cat. 1 hit.
        SSF48208. Glyco_trans_6hp. 1 hit.
        SSF81296. Ig_E-set. 1 hit.
        SSF49299. PKD. 1 hit.
PROSITE: PS50093. PKD. 1 hit.

Entry information

Entry name: P71140_CLOTM
Accession: Primary (citable) accession number: P71140
Entry history:
  Integrated into UniProtKB/TrEMBL: February 1, 1997
  Last sequence update: February 1, 1997
  Last modified: September 21, 2011
  This is version 72 of the entry and version 1 of the sequence.
Entry status: Unreviewed (UniProtKB/TrEMBL)