Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein

Endoglucanase 1

Gene

celI

Organism
Clostridium thermocellum (strain ATCC 27405 / DSM 1237 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) (Ruminiclostridium thermocellum)
Status
Reviewed-Annotation score: Annotation score: 4 out of 5-Experimental evidence at protein leveli

Functioni

This enzyme catalyzes the endohydrolysis of 1,4-beta-glucosidic linkages in cellulose, lichenin and cereal beta-D-glucans. Principally active against barley beta-glucan.

Catalytic activityi

Endohydrolysis of (1->4)-beta-D-glucosidic linkages in cellulose, lichenin and cereal beta-D-glucans.

Pathwayi: cellulose degradation

This protein is involved in the pathway cellulose degradation, which is part of Glycan metabolism.
View all proteins of this organism that are known to be involved in the pathway cellulose degradation and in Glycan metabolism.

Sites

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Active sitei448By similarity1
Active sitei486By similarity1
Active sitei495By similarity1

GO - Molecular functioni

  • cellulase activity Source: UniProtKB-EC
  • cellulose binding Source: InterPro
  • identical protein binding Source: IntAct

GO - Biological processi

Complete GO annotation...

Keywords - Molecular functioni

Glycosidase, Hydrolase

Keywords - Biological processi

Carbohydrate metabolism, Cellulose degradation, Polysaccharide degradation

Enzyme and pathway databases

UniPathwayiUPA00696.

Protein family/group databases

CAZyiCBM3. Carbohydrate-Binding Module Family 3.
GH9. Glycoside Hydrolase Family 9.

Names & Taxonomyi

Protein namesi
Recommended name:
Endoglucanase 1 (EC:3.2.1.4)
Alternative name(s):
Cellulase I
Endo-1,4-beta-glucanase
Endoglucanase I
Short name:
EGI
Gene namesi
Name:celI
Ordered Locus Names:Cthe_0040
OrganismiClostridium thermocellum (strain ATCC 27405 / DSM 1237 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) (Ruminiclostridium thermocellum)
Taxonomic identifieri203119 [NCBI]
Taxonomic lineageiBacteriaFirmicutesClostridiaClostridialesRuminococcaceaeRuminiclostridium
Proteomesi
  • UP000002145 Componenti: Chromosome

PTM / Processingi

Molecule processing

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Signal peptidei1 – 551 PublicationAdd BLAST55
ChainiPRO_000000795156 – 887Endoglucanase 1Add BLAST832

Interactioni

Binary interactionsi

WithEntry#Exp.IntActNotes
itself4EBI-8601842,EBI-8601842

GO - Molecular functioni

  • identical protein binding Source: IntAct

Protein-protein interaction databases

MINTiMINT-6946608.
STRINGi203119.Cthe_0040.

Structurei

Secondary structure

1887
Legend: HelixTurnBeta strandPDB Structure known for this area
Show more details
Feature keyPosition(s)DescriptionActionsGraphical viewLength
Helixi78 – 91Combined sources14
Turni99 – 101Combined sources3
Beta strandi103 – 106Combined sources4
Turni112 – 115Combined sources4
Helixi116 – 118Combined sources3
Beta strandi129 – 131Combined sources3
Helixi136 – 152Combined sources17
Helixi154 – 159Combined sources6
Helixi163 – 179Combined sources17
Beta strandi186 – 192Combined sources7
Helixi194 – 198Combined sources5
Helixi204 – 206Combined sources3
Beta strandi213 – 220Combined sources8
Helixi223 – 239Combined sources17
Turni240 – 243Combined sources4
Helixi245 – 265Combined sources21
Turni273 – 277Combined sources5
Helixi285 – 299Combined sources15
Helixi302 – 310Combined sources9
Turni311 – 314Combined sources4
Beta strandi321 – 324Combined sources4
Helixi336 – 347Combined sources12
Helixi352 – 364Combined sources13
Beta strandi385 – 387Combined sources3
Helixi388 – 403Combined sources16
Helixi410 – 428Combined sources19
Turni429 – 431Combined sources3
Beta strandi439 – 442Combined sources4
Helixi450 – 453Combined sources4
Beta strandi456 – 458Combined sources3
Beta strandi462 – 465Combined sources4
Helixi491 – 494Combined sources4
Helixi498 – 515Combined sources18
Beta strandi533 – 544Combined sources12
Beta strandi547 – 556Combined sources10
Beta strandi565 – 575Combined sources11
Helixi577 – 580Combined sources4
Turni581 – 583Combined sources3
Helixi586 – 588Combined sources3
Beta strandi590 – 592Combined sources3
Beta strandi605 – 608Combined sources4
Beta strandi611 – 617Combined sources7
Beta strandi625 – 627Combined sources3
Turni628 – 630Combined sources3
Beta strandi631 – 640Combined sources10
Helixi649 – 651Combined sources3
Helixi653 – 655Combined sources3
Beta strandi671 – 673Combined sources3
Beta strandi676 – 680Combined sources5

3D structure databases

Select the link destinations:
PDBei
RCSB PDBi
PDBji
Links Updated
PDB entryMethodResolution (Å)ChainPositionsPDBsum
2XFGX-ray1.68A54-516[»]
B517-683[»]
ProteinModelPortaliQ02934.
SMRiQ02934.
ModBaseiSearch...
MobiDBiSearch...

Miscellaneous databases

EvolutionaryTraceiQ02934.

Family & Domainsi

Domains and Repeats

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Domaini529 – 684CBM3 1PROSITE-ProRule annotationAdd BLAST156
Domaini736 – 887CBM3 2PROSITE-ProRule annotationAdd BLAST152

Region

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Regioni56 – 518CatalyticAdd BLAST463

Sequence similaritiesi

Contains 2 CBM3 (carbohydrate binding type-3) domains.PROSITE-ProRule annotation

Keywords - Domaini

Repeat, Signal

Phylogenomic databases

eggNOGiENOG4105CYP. Bacteria.
ENOG410XPC9. LUCA.
HOGENOMiHOG000021032.
KOiK01179.
OMAiGEYSKEC.
OrthoDBiPOG091H0908.

Family and domain databases

Gene3Di1.50.10.10. 1 hit.
2.60.40.710. 2 hits.
InterProiIPR008928. 6-hairpin_glycosidase-like.
IPR012341. 6hp_glycosidase.
IPR008965. Carb-bd_dom.
IPR001956. CBD_3.
IPR001701. Glyco_hydro_9.
IPR033126. Glyco_hydro_9_Asp/Glu_AS.
IPR018221. Glyco_hydro_9_His_AS.
[Graphical view]
PfamiPF00942. CBM_3. 2 hits.
PF00759. Glyco_hydro_9. 1 hit.
[Graphical view]
SMARTiSM01067. CBM_3. 2 hits.
[Graphical view]
SUPFAMiSSF48208. SSF48208. 1 hit.
SSF49384. SSF49384. 2 hits.
PROSITEiPS51172. CBM3. 2 hits.
PS00592. GLYCOSYL_HYDROL_F9_1. 1 hit.
PS00698. GLYCOSYL_HYDROL_F9_2. 1 hit.
[Graphical view]

Sequencei

Sequence statusi: Complete.

Sequence processingi: The displayed sequence is further processed into a mature form.

Q02934-1 [UniParc]FASTAAdd to basket

« Hide

        10         20         30         40         50
MRLVNSLGRR KILLILAVIV AFSTVLLFAK LWGRKTSSTL DEVGSKTHGD
60 70 80 90 100
LTAENKNGGY LPEEEIPDQP PATGAFNYGE ALQKAIFFYE CQRSGKLDPS
110 120 130 140 150
TLRLNWRGDS GLDDGKDAGI DLTGGWYDAG DHVKFNLPMS YSAAMLGWAV
160 170 180 190 200
YEYEDAFKQS GQYNHILNNI KWACDYFIKC HPEKDVYYYQ VGDGHADHAW
210 220 230 240 250
WGPAEVMPME RPSYKVDRSS PGSTVVAETS AALAIASIIF KKVDGEYSKE
260 270 280 290 300
CLKHAKELFE FADTTKSDDG YTAANGFYNS WSGFYDELSW AAVWLYLATN
310 320 330 340 350
DSSYLDKAES YSDKWGYEPQ TNIPKYKWAQ CWDDVTYGTY LLLARIKNDN
360 370 380 390 400
GKYKEAIERH LDWWTTGYNG ERITYTPKGL AWLDQWGSLR YATTTAFLAC
410 420 430 440 450
VYSDWENGDK EKAKTYLEFA RSQADYALGS TGRSFVVGFG ENPPKRPHHR
460 470 480 490 500
TAHGSWADSQ MEPPEHRHVL YGALVGGPDS TDNYTDDISN YTCNEVACDY
510 520 530 540 550
NAGFVGLLAK MYKLYGGSPD PKFNGIEEVP EDEIFVEAGV NASGNNFIEI
560 570 580 590 600
KAIVNNKSGW PARVCENLSF RYFINIEEIV NAGKSASDLQ VSSSYNQGAK
610 620 630 640 650
LSDVKHYKDN IYYVEVDLSG TKIYPGGQSA YKKEVQFRIS APEGTVFNPE
660 670 680 690 700
NDYSYQGLSA GTVVKSEYIP VYDAGVLVFG REPGSASKST SKDNGLSKAT
710 720 730 740 750
PTVKTESQPT AKHTQNPASD FKTPANQNSV KKDQGIKGEV VLQYANGNAG
760 770 780 790 800
ATSNSINPRF KIINNGTKAI NLSDVKIRYY YTKEGGASQN FWCDWSSAGN
810 820 830 840 850
SNVTGNFFNL SSPKEGADTC LEVGFGSGAG TLDPGGSVEV QIRFSKEDWS
860 870 880
NYNQSNDYSF NPSASDYTDW NRVTLYISNK LVYGKEP
Length:887
Mass (Da):98,531
Last modified:April 17, 2007 - v2
Checksum:iE587626445BA7A95
GO

Experimental Info

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Sequence conflicti861 – 887NPSAS…YGKEP → KQACLRQRTLIYLYATWLR in AAA20892 (PubMed:8436949).CuratedAdd BLAST27

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
L04735 Genomic DNA. Translation: AAA20892.1.
CP000568 Genomic DNA. Translation: ABN51281.1.
PIRiA47704.
RefSeqiWP_011837740.1. NC_009012.1.

Genome annotation databases

EnsemblBacteriaiABN51281; ABN51281; Cthe_0040.
KEGGicth:Cthe_0040.
PATRICi19513685. VBICloThe47081_0045.

Cross-referencesi

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
L04735 Genomic DNA. Translation: AAA20892.1.
CP000568 Genomic DNA. Translation: ABN51281.1.
PIRiA47704.
RefSeqiWP_011837740.1. NC_009012.1.

3D structure databases

Select the link destinations:
PDBei
RCSB PDBi
PDBji
Links Updated
PDB entryMethodResolution (Å)ChainPositionsPDBsum
2XFGX-ray1.68A54-516[»]
B517-683[»]
ProteinModelPortaliQ02934.
SMRiQ02934.
ModBaseiSearch...
MobiDBiSearch...

Protein-protein interaction databases

MINTiMINT-6946608.
STRINGi203119.Cthe_0040.

Protein family/group databases

CAZyiCBM3. Carbohydrate-Binding Module Family 3.
GH9. Glycoside Hydrolase Family 9.

Protocols and materials databases

Structural Biology KnowledgebaseSearch...

Genome annotation databases

EnsemblBacteriaiABN51281; ABN51281; Cthe_0040.
KEGGicth:Cthe_0040.
PATRICi19513685. VBICloThe47081_0045.

Phylogenomic databases

eggNOGiENOG4105CYP. Bacteria.
ENOG410XPC9. LUCA.
HOGENOMiHOG000021032.
KOiK01179.
OMAiGEYSKEC.
OrthoDBiPOG091H0908.

Enzyme and pathway databases

UniPathwayiUPA00696.

Miscellaneous databases

EvolutionaryTraceiQ02934.

Family and domain databases

Gene3Di1.50.10.10. 1 hit.
2.60.40.710. 2 hits.
InterProiIPR008928. 6-hairpin_glycosidase-like.
IPR012341. 6hp_glycosidase.
IPR008965. Carb-bd_dom.
IPR001956. CBD_3.
IPR001701. Glyco_hydro_9.
IPR033126. Glyco_hydro_9_Asp/Glu_AS.
IPR018221. Glyco_hydro_9_His_AS.
[Graphical view]
PfamiPF00942. CBM_3. 2 hits.
PF00759. Glyco_hydro_9. 1 hit.
[Graphical view]
SMARTiSM01067. CBM_3. 2 hits.
[Graphical view]
SUPFAMiSSF48208. SSF48208. 1 hit.
SSF49384. SSF49384. 2 hits.
PROSITEiPS51172. CBM3. 2 hits.
PS00592. GLYCOSYL_HYDROL_F9_1. 1 hit.
PS00698. GLYCOSYL_HYDROL_F9_2. 1 hit.
[Graphical view]
ProtoNetiSearch...

Entry informationi

Entry nameiGUNI_CLOTH
AccessioniPrimary (citable) accession number: Q02934
Secondary accession number(s): A3DBF2
Entry historyi
Integrated into UniProtKB/Swiss-Prot: February 1, 1995
Last sequence update: April 17, 2007
Last modified: November 2, 2016
This is version 108 of the entry and version 2 of the sequence. [Complete history]
Entry statusiReviewed (UniProtKB/Swiss-Prot)
Annotation programProkaryotic Protein Annotation Program

Miscellaneousi

Keywords - Technical termi

3D-structure, Complete proteome, Direct protein sequencing, Reference proteome

Documents

  1. Glycosyl hydrolases
    Classification of glycosyl hydrolase families and list of entries
  2. PATHWAY comments
    Index of metabolic and biosynthesis pathways
  3. PDB cross-references
    Index of Protein Data Bank (PDB) cross-references
  4. SIMILARITY comments
    Index of protein domains and families

Similar proteinsi

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a seed sequence).
50%UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.