Protein

Endoglucanase H

Gene

celH

Organism
Clostridium thermocellum (strain ATCC 27405 / DSM 1237 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) (Ruminiclostridium thermocellum)
Status
Reviewed - Experimental evidence at protein level

Function

This enzyme catalyzes the endohydrolysis of 1,4-beta-glucosidic linkages in cellulose, lichenin and cereal beta-D-glucans.

Catalytic activity

Endohydrolysis of (1->4)-beta-D-glucosidic linkages in cellulose, lichenin and cereal beta-D-glucans.

Sites

Feature key   Position(s)   Description                                 Length
Active site   131           Proton donor (PROSITE-ProRule annotation)   1
Active site   244           Nucleophile (PROSITE-ProRule annotation)    1
Active site   460           Proton donor (By similarity)                1
Active site   565           Nucleophile (By similarity)                 1

GO - Molecular function

  • cellulase activity Source: MENGO

GO - Biological process

Keywords

Molecular function: Glycosidase, Hydrolase
Biological process: Carbohydrate metabolism, Cellulose degradation, Polysaccharide degradation

Enzyme and pathway databases

BioCyc:
  CTHE203119:G1G86-1535-MONOMER
  MetaCyc:MONOMER-16422

Protein family/group databases

CAZy:
  CBM11 Carbohydrate-Binding Module Family 11
  GH26 Glycoside Hydrolase Family 26
  GH5 Glycoside Hydrolase Family 5

Names & Taxonomy

Protein names
Recommended name: Endoglucanase H (EC:3.2.1.4)
Alternative name(s): Cellulase H; Endo-1,4-beta-glucanase H
Short name: EgH
Gene names
Name: celH
Ordered Locus Names: Cthe_1472
Organism: Clostridium thermocellum (strain ATCC 27405 / DSM 1237 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) (Ruminiclostridium thermocellum)
Taxonomic identifier: 203119 [NCBI]
Taxonomic lineage: Bacteria > Firmicutes > Clostridia > Clostridiales > Ruminococcaceae > Ruminiclostridium
Proteomes
  • UP000002145 Component: Chromosome

Pathology & Biotech

Chemistry databases

DrugBank: DB08785 4-METHYL-2H-CHROMEN-2-ONE

PTM / Processing

Molecule processing

Feature key      Position(s)   Description                        Length
Signal peptide   1 – 44        -                                  44
Chain            45 – 900      Endoglucanase H (PRO_0000007854)   856

Proteomic databases

PRIDE: P16218

Interaction

Protein-protein interaction databases

STRING: 203119.Cthe_1472

Structure

3D structure databases

ProteinModelPortal: P16218
SMR: P16218

Miscellaneous databases

EvolutionaryTrace: P16218

Family & Domains

Domains and Repeats

Feature key   Position(s)   Description                             Length
Domain        45 – 298      GH26 (PROSITE-ProRule annotation)       254
Domain        655 – 900     CBM11                                   246
Domain        827 – 900     Dockerin (PROSITE-ProRule annotation)   74

Region

Feature key   Position(s)   Description                 Length
Region        300 – 630     Catalytic (By similarity)   331

Compositional bias

Feature key          Position(s)   Description             Length
Compositional bias   631 – 654     Pro/Thr-rich (linker)   24
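
The feature tables in this entry use 1-based, inclusive positions, so each listed length equals end - start + 1. The following Python sketch illustrates that arithmetic and shows which annotated segment each active site falls in. The `Feature` class and the `features_at` helper are hypothetical names introduced for illustration; the coordinates are copied from the tables of this entry.

```python
from typing import List, NamedTuple


class Feature(NamedTuple):
    kind: str
    start: int  # 1-based, inclusive
    end: int    # 1-based, inclusive
    label: str

    @property
    def length(self) -> int:
        # Inclusive coordinates: both end points count.
        return self.end - self.start + 1

    def contains(self, position: int) -> bool:
        return self.start <= position <= self.end


# Coordinates copied from the feature tables of this entry.
FEATURES = [
    Feature("Signal peptide", 1, 44, "Signal peptide"),
    Feature("Domain", 45, 298, "GH26"),
    Feature("Region", 300, 630, "Catalytic"),
    Feature("Compositional bias", 631, 654, "Pro/Thr-rich (linker)"),
    Feature("Domain", 655, 900, "CBM11"),
    Feature("Domain", 827, 900, "Dockerin"),
]

# Active site positions and roles from the Sites table.
ACTIVE_SITES = {
    131: "Proton donor",
    244: "Nucleophile",
    460: "Proton donor",
    565: "Nucleophile",
}


def features_at(position: int) -> List[str]:
    """Return the labels of all annotated features covering a position."""
    return [f.label for f in FEATURES if f.contains(position)]


for feature in FEATURES:
    print(feature.label, feature.start, feature.end, feature.length)

for position, role in ACTIVE_SITES.items():
    print(position, role, features_at(position))
```

Under these coordinates, residues 131 and 244 fall in the GH26 domain and residues 460 and 565 in the catalytic region. The sketch also makes visible that the annotated CBM11 range (655 – 900) includes the dockerin range (827 – 900).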

Sequence similarities

In the N-terminal section; belongs to the glycosyl hydrolase 26 family. (Curated)
In the C-terminal section; belongs to the glycosyl hydrolase 5 (cellulase A) family. (Curated)

Keywords - Domain

Signal

Phylogenomic databases

eggNOG:
  ENOG4108K2F Bacteria
  COG2730 LUCA
  COG4124 LUCA
KO: K01179
OMA: WRINSSP
OrthoDB: POG091H0FI3

Family and domain databases

InterPro:
  IPR005087 CBM_fam11
  IPR002105 Dockerin_1_rpt
  IPR016134 Dockerin_dom
  IPR036439 Dockerin_dom_sf
  IPR008979 Galactose-bd-like_sf
  IPR022790 GH26_dom
  IPR001547 Glyco_hydro_5
  IPR018087 Glyco_hydro_5_CS
  IPR017853 Glycoside_hydrolase_SF
Pfam:
  PF03425 CBM_11, 1 hit
  PF00150 Cellulase, 1 hit
  PF00404 Dockerin_1, 2 hits
  PF02156 Glyco_hydro_26, 1 hit
SUPFAM:
  SSF49785 SSF49785, 1 hit
  SSF51445 SSF51445, 2 hits
  SSF63446 SSF63446, 1 hit
PROSITE:
  PS00448 CLOS_CELLULOSOME_RPT, 2 hits
  PS51766 DOCKERIN, 1 hit
  PS00018 EF_HAND_1, 1 hit
  PS51764 GH26, 1 hit
  PS00659 GLYCOSYL_HYDROL_F5, 1 hit

Sequence

Sequence status: Complete.

Sequence processing: The displayed sequence is further processed into a mature form.

P16218-1
        10         20         30         40         50
MKKRLLVSFL VLSIIVGLLS FQSLGNYNSG LKIGAWVGTQ PSESAIKSFQ
        60         70         80         90        100
ELQGRKLDIV HQFINWSTDF SWVRPYADAV YNNGSILMIT WEPWEYNTVD
       110        120        130        140        150
IKNGKADAYI TRMAQDMKAY GKEIWLRPLH EANGDWYPWA IGYSSRVNTN
       160        170        180        190        200
ETYIAAFRHI VDIFRANGAT NVKWVFNVNC DNVGNGTSYL GHYPGDNYVD
       210        220        230        240        250
YTSIDGYNWG TTQSWGSQWQ SFDQVFSRAY QALASINKPI IIAEFASAEI
       260        270        280        290        300
GGNKARWITE AYNSIRTSYN KVIAAVWFHE NKETDWRINS SPEALAAYRE
       310        320        330        340        350
AIGAGSSNPT PTPTWTSTPP SSSPKAVDPF EMVRKMGMGT NLGNTLEAPY
       360        370        380        390        400
EGSWSKSAME YYFDDFKAAG YKNVRIPVRW DNHTMRTYPY TIDKAFLDRV
       410        420        430        440        450
EQVVDWSLSR GFVTIINSHH DDWIKEDYNG NIERFEKIWE QIAERFKNKS
       460        470        480        490        500
ENLLFEIMNE PFGNITDEQI DDMNSRILKI IRKTNPTRIV IIGGGYWNSY
       510        520        530        540        550
NTLVNIKIPD DPYLIGTFHY YDPYEFTHKW RGTWGTQEDM DTVVRVFDFV
       560        570        580        590        600
KSWSDRNNIP VYFGEFAVMA YADRTSRVKW YDFISDAALE RGFACSVWDN
       610        620        630        640        650
GVFGSLDNDM AIYNRDTRTF DTEILNALFN PGTYPSYSPK PSPTPRPTKP
       660        670        680        690        700
PVTPAVGEKM LDDFEGVLNW GSYSGEGAKV STKIVSGKTG NGMEVSYTGT
       710        720        730        740        750
TDGYWGTVYS LPDGDWSKWL KISFDIKSVD GSANEIRFMI AEKSINGVGD
       760        770        780        790        800
GEHWVYSITP DSSWKTIEIP FSSFRRRLDY QPPGQDMSGT LDLDNIDSIH
       810        820        830        840        850
FMYANNKSGK FVVDNIKLIG ATSDPTPSIK HGDLNFDNAV NSTDLLMLKR
       860        870        880        890        900
YILKSLELGT SEQEEKFKKA ADLNRDNKVD STDLTILKRY LLKAISEIPI
Length: 900
Mass (Da): 102,416
Last modified: April 1, 1990 - v1
Checksum: 973AFB1954FC246B
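
The sequence is displayed in rows of five 10-residue groups under a position ruler, and the entry states that it is further processed into a mature form (signal peptide 1 – 44, chain 45 – 900). The Python sketch below shows one way to turn such a display into a plain string and to split off the signal peptide. The function names `parse_sequence_block` and `split_signal_peptide` are hypothetical, and only the first display row (residues 1 to 50) is embedded to keep the example short.

```python
import re


def parse_sequence_block(block: str) -> str:
    """Join a ruler-annotated sequence display into one string.

    Lines made only of digits and spaces are treated as position rulers
    and skipped; whitespace inside residue lines is removed.
    """
    residues = []
    for line in block.splitlines():
        stripped = line.strip()
        if not stripped or re.fullmatch(r"[\d\s]+", stripped):
            continue
        residues.append(re.sub(r"\s+", "", stripped).upper())
    return "".join(residues)


def split_signal_peptide(sequence: str, signal_end: int):
    """Split at a 1-based, inclusive signal peptide end position."""
    return sequence[:signal_end], sequence[signal_end:]


# First display row of this entry only (residues 1-50).
first_row = """
        10         20         30         40         50
MKKRLLVSFL VLSIIVGLLS FQSLGNYNSG LKIGAWVGTQ PSESAIKSFQ
"""

seq = parse_sequence_block(first_row)
signal, mature_start = split_signal_peptide(seq, 44)
print(len(seq))       # 50
print(signal[-4:])    # PSES
print(mature_start)   # AIKSFQ
```

Applied to the full 900-residue display, the same split would leave a mature chain of 900 - 44 = 856 residues, matching the length given for the chain in the molecule processing table.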

Sequence databases

EMBL / GenBank / DDBJ:
  M31903 Genomic DNA. Translation: AAA23225.1
  CP000568 Genomic DNA. Translation: ABN52701.1
PIR: JH0157
RefSeq: WP_011838089.1, NC_009012.1

Genome annotation databases

EnsemblBacteria: ABN52701; ABN52701; Cthe_1472
GeneID: 35805724
KEGG: cth:Cthe_1472

Cross-references

3D structure databases

PDB entry   Method   Resolution (Å)   Chain       Positions
1V0A        X-ray    1.98             A           655-821
2BV9        X-ray    1.50             A           26-304
2BVD        X-ray    1.60             A           26-304
2CIP        X-ray    1.40             A           26-304
2CIT        X-ray    1.40             A           26-304
2LRO        NMR      -                A           655-821
2LRP        NMR      -                A           655-821
2V3G        X-ray    1.20             A           26-305
2VI0        X-ray    1.51             A           26-304
4U3A        X-ray    2.42             A/B         290-654
4U5I        X-ray    2.50             A/B         290-654
4U5K        X-ray    2.65             A/B         290-654
5BYW        X-ray    2.60             A/B/C/D/E   290-654
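
The position ranges of the PDB entries overlap, so the amount of sequence with structural coverage is not obvious from the list alone. As a rough illustration, the Python sketch below merges the listed ranges (treated as 1-based and inclusive) and counts the covered residues. The helper name `merge_ranges` is hypothetical, and the sequence length of 900 is taken from this entry.

```python
def merge_ranges(ranges):
    """Merge overlapping or directly adjacent 1-based inclusive ranges."""
    merged = []
    for start, end in sorted(ranges):
        if merged and start <= merged[-1][1] + 1:
            # Overlaps or touches the previous range: extend it.
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return [tuple(r) for r in merged]


# Positions copied from the PDB cross-reference list of this entry.
PDB_RANGES = {
    "1V0A": (655, 821),
    "2BV9": (26, 304),
    "2BVD": (26, 304),
    "2CIP": (26, 304),
    "2CIT": (26, 304),
    "2LRO": (655, 821),
    "2LRP": (655, 821),
    "2V3G": (26, 305),
    "2VI0": (26, 304),
    "4U3A": (290, 654),
    "4U5I": (290, 654),
    "4U5K": (290, 654),
    "5BYW": (290, 654),
}
SEQUENCE_LENGTH = 900

covered = merge_ranges(PDB_RANGES.values())
n_covered = sum(end - start + 1 for start, end in covered)
print(covered)                      # [(26, 821)]
print(n_covered, SEQUENCE_LENGTH)   # 796 900
```

With these ranges, the structures together span residues 26 to 821, leaving the first 25 residues and residues 822 to 900 without a listed structure.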

Entry information

Entry name: GUNH_CLOTH
Accession:
  Primary (citable) accession number: P16218
  Secondary accession number(s): A3DFH2
Entry history:
  Integrated into UniProtKB/Swiss-Prot: April 1, 1990
  Last sequence update: April 1, 1990
  Last modified: May 23, 2018
  This is version 149 of the entry and version 1 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Prokaryotic Protein Annotation Program

Miscellaneous

Keywords - Technical term

3D-structure, Complete proteome, Reference proteome

Documents

  1. Glycosyl hydrolases
    Classification of glycosyl hydrolase families and list of entries
  2. SIMILARITY comments
    Index of protein domains and families
  3. PDB cross-references
    Index of Protein Data Bank (PDB) cross-references
