Q02934 - GUNI_CLOTH


Protein

Endoglucanase 1

Gene

celI

Organism

Clostridium thermocellum (strain ATCC 27405 / DSM 1237 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) (Ruminiclostridium thermocellum)

Status

Reviewed - Annotation score: 4 out of 5 - Experimental evidence at protein level

Function

This enzyme catalyzes the endohydrolysis of 1,4-beta-glucosidic linkages in cellulose, lichenin and cereal beta-D-glucans. Principally active against barley beta-glucan.

Catalytic activity

Endohydrolysis of (1->4)-beta-D-glucosidic linkages in cellulose, lichenin and cereal beta-D-glucans.

Pathway

Sites

Feature key  Position(s)  Length  Description
Active site  448 – 448    1       By similarity
Active site  486 – 486    1       By similarity
Active site  495 – 495    1       By similarity

GO - Molecular function

  1. cellulase activity Source: UniProtKB-EC
  2. cellulose binding Source: InterPro
  3. identical protein binding Source: IntAct

GO - Biological process

  1. cellulose catabolic process Source: UniProtKB-UniPathway

Keywords - Molecular function

Glycosidase, Hydrolase

Keywords - Biological process

Carbohydrate metabolism, Cellulose degradation, Polysaccharide degradation

Enzyme and pathway databases

BioCyc: CTHE203119:GIW8-39-MONOMER.
UniPathway: UPA00696.

Protein family/group databases

CAZy: CBM3. Carbohydrate-Binding Module Family 3.
    GH9. Glycoside Hydrolase Family 9.

Names & Taxonomy

Protein names
Recommended name:
Endoglucanase 1 (EC:3.2.1.4)
Alternative name(s):
Cellulase I
Endo-1,4-beta-glucanase
Endoglucanase I
Short name:
EGI
Gene names
Name: celI
Ordered Locus Names: Cthe_0040
Organism: Clostridium thermocellum (strain ATCC 27405 / DSM 1237 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) (Ruminiclostridium thermocellum)
Taxonomic identifier: 203119 [NCBI]
Taxonomic lineage: Bacteria > Firmicutes > Clostridia > Clostridiales > Ruminococcaceae > Ruminiclostridium
Proteomes: UP000002145: Chromosome

PTM / Processing

Molecule processing

Feature key     Position(s)  Length  Description      Feature identifier
Signal peptide  1 – 55       55      1 Publication
Chain           56 – 887     832     Endoglucanase 1  PRO_0000007951
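
The Length values in this table follow from the inclusive, 1-based positions: length = end - start + 1. A minimal Python sketch of that arithmetic is given here, assuming a hypothetical `Feature` tuple and `tiles` helper (illustrative names, not part of any UniProt tooling). It also checks that the signal peptide and the mature chain together cover the 887-residue precursor without gap or overlap.

```python
from typing import NamedTuple


class Feature(NamedTuple):
    """Hypothetical container for one feature row (1-based, inclusive)."""
    key: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive coordinates: both endpoints count.
        return self.end - self.start + 1


def tiles(features, total_length):
    """Return True if the features cover 1..total_length with no gap or overlap."""
    expected_start = 1
    for feature in sorted(features, key=lambda f: f.start):
        if feature.start != expected_start:
            return False
        expected_start = feature.end + 1
    return expected_start == total_length + 1


# Values copied from the Molecule processing table.
signal_peptide = Feature("Signal peptide", 1, 55)
chain = Feature("Chain", 56, 887)

print(signal_peptide.length, chain.length)
print(tiles([signal_peptide, chain], 887))
```

The same helper can be reused for any feature table in the entry that uses inclusive positions.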

Interaction

Binary interactions

With    Entry  #Exp.  IntAct                   Notes
itself         4      EBI-8601842,EBI-8601842

Protein-protein interaction databases

MINT: MINT-6946608.
STRING: 203119.Cthe_0040.

Structure

Secondary structure

Feature key  Position(s)  Length
Helix        78 – 91      14
Turn         99 – 101     3
Beta strand  103 – 106    4
Turn         112 – 115    4
Helix        116 – 118    3
Beta strand  129 – 131    3
Helix        136 – 152    17
Helix        154 – 159    6
Helix        163 – 179    17
Beta strand  186 – 192    7
Helix        194 – 198    5
Helix        204 – 206    3
Beta strand  213 – 220    8
Helix        223 – 239    17
Turn         240 – 243    4
Helix        245 – 265    21
Turn         273 – 277    5
Helix        285 – 299    15
Helix        302 – 310    9
Turn         311 – 314    4
Beta strand  321 – 324    4
Helix        336 – 347    12
Helix        352 – 364    13
Beta strand  385 – 387    3
Helix        388 – 403    16
Helix        410 – 428    19
Turn         429 – 431    3
Beta strand  439 – 442    4
Helix        450 – 453    4
Beta strand  456 – 458    3
Beta strand  462 – 465    4
Helix        491 – 494    4
Helix        498 – 515    18
Beta strand  533 – 544    12
Beta strand  547 – 556    10
Beta strand  565 – 575    11
Helix        577 – 580    4
Turn         581 – 583    3
Helix        586 – 588    3
Beta strand  590 – 592    3
Beta strand  605 – 608    4
Beta strand  611 – 617    7
Beta strand  625 – 627    3
Turn         628 – 630    3
Beta strand  631 – 640    10
Helix        649 – 651    3
Helix        653 – 655    3
Beta strand  671 – 673    3
Beta strand  676 – 680    5

3D structure databases

Entry  Method  Resolution (Å)  Chain  Positions
2XFG   X-ray   1.68            A      54-516
                               B      517-683
ProteinModelPortal: Q02934.
SMR: Q02934. Positions 76-683.

Miscellaneous databases

EvolutionaryTrace: Q02934.

Family & Domains

Domains and Repeats

Feature key  Position(s)  Length  Description
Domain       529 – 684    156     CBM3 1 (PROSITE-ProRule annotation)
Domain       736 – 887    152     CBM3 2 (PROSITE-ProRule annotation)

Region

Feature key  Position(s)  Length  Description
Region       56 – 518     463     Catalytic
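
The catalytic region and the two CBM3 domains do not cover the whole mature chain (56 – 887). As a rough illustration, the Python sketch below lists the stretches left unannotated by these three features. The `uncovered` helper is a hypothetical name introduced for this example, and the output makes no claim about what those stretches are structurally.

```python
def uncovered(features, start, end):
    """Return 1-based inclusive (first, last) ranges in start..end that no feature covers."""
    gaps = []
    cursor = start
    for feature_start, feature_end in sorted(features):
        if feature_start > cursor:
            gaps.append((cursor, feature_start - 1))
        cursor = max(cursor, feature_end + 1)
    if cursor <= end:
        gaps.append((cursor, end))
    return gaps


# Positions copied from the Region and Domains tables.
annotated = [(56, 518), (529, 684), (736, 887)]

# Mature chain boundaries from the Molecule processing table.
unannotated = uncovered(annotated, 56, 887)
for first, last in unannotated:
    print(f"{first} - {last} ({last - first + 1} residues)")
```

Under these coordinates, the gaps are residues 519 – 528 and 685 – 735.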

Sequence similarities

Contains 2 CBM3 (carbohydrate binding type-3) domains (PROSITE-ProRule annotation).

Keywords - Domain

Repeat, Signal

Phylogenomic databases

eggNOG: COG2730.
HOGENOM: HOG000021032.
KO: K01179.
    K01225.
OMA: RKEVQFR.
OrthoDB: EOG6KQ6BP.

Family and domain databases

Gene3D: 1.50.10.10. 1 hit.
    2.60.40.710. 2 hits.
InterPro: IPR008928. 6-hairpin_glycosidase-like.
    IPR012341. 6hp_glycosidase.
    IPR008965. Carb-bd_dom.
    IPR001956. CBD_3.
    IPR001701. Glyco_hydro_9.
    IPR018221. Glyco_hydro_9_AS.
Pfam: PF00942. CBM_3. 2 hits.
    PF00759. Glyco_hydro_9. 1 hit.
SMART: SM01067. CBM_3. 2 hits.
SUPFAM: SSF48208. SSF48208. 1 hit.
    SSF49384. SSF49384. 2 hits.
PROSITE: PS51172. CBM3. 2 hits.
    PS00592. GLYCOSYL_HYDROL_F9_1. 1 hit.
    PS00698. GLYCOSYL_HYDROL_F9_2. 1 hit.

Sequence

Sequence status: Complete.

Sequence processing: The displayed sequence is further processed into a mature form.

Q02934-1

        10         20         30         40         50
MRLVNSLGRR KILLILAVIV AFSTVLLFAK LWGRKTSSTL DEVGSKTHGD
60 70 80 90 100
LTAENKNGGY LPEEEIPDQP PATGAFNYGE ALQKAIFFYE CQRSGKLDPS
110 120 130 140 150
TLRLNWRGDS GLDDGKDAGI DLTGGWYDAG DHVKFNLPMS YSAAMLGWAV
160 170 180 190 200
YEYEDAFKQS GQYNHILNNI KWACDYFIKC HPEKDVYYYQ VGDGHADHAW
210 220 230 240 250
WGPAEVMPME RPSYKVDRSS PGSTVVAETS AALAIASIIF KKVDGEYSKE
260 270 280 290 300
CLKHAKELFE FADTTKSDDG YTAANGFYNS WSGFYDELSW AAVWLYLATN
310 320 330 340 350
DSSYLDKAES YSDKWGYEPQ TNIPKYKWAQ CWDDVTYGTY LLLARIKNDN
360 370 380 390 400
GKYKEAIERH LDWWTTGYNG ERITYTPKGL AWLDQWGSLR YATTTAFLAC
410 420 430 440 450
VYSDWENGDK EKAKTYLEFA RSQADYALGS TGRSFVVGFG ENPPKRPHHR
460 470 480 490 500
TAHGSWADSQ MEPPEHRHVL YGALVGGPDS TDNYTDDISN YTCNEVACDY
510 520 530 540 550
NAGFVGLLAK MYKLYGGSPD PKFNGIEEVP EDEIFVEAGV NASGNNFIEI
560 570 580 590 600
KAIVNNKSGW PARVCENLSF RYFINIEEIV NAGKSASDLQ VSSSYNQGAK
610 620 630 640 650
LSDVKHYKDN IYYVEVDLSG TKIYPGGQSA YKKEVQFRIS APEGTVFNPE
660 670 680 690 700
NDYSYQGLSA GTVVKSEYIP VYDAGVLVFG REPGSASKST SKDNGLSKAT
710 720 730 740 750
PTVKTESQPT AKHTQNPASD FKTPANQNSV KKDQGIKGEV VLQYANGNAG
760 770 780 790 800
ATSNSINPRF KIINNGTKAI NLSDVKIRYY YTKEGGASQN FWCDWSSAGN
810 820 830 840 850
SNVTGNFFNL SSPKEGADTC LEVGFGSGAG TLDPGGSVEV QIRFSKEDWS
860 870 880
NYNQSNDYSF NPSASDYTDW NRVTLYISNK LVYGKEP
Length: 887
Mass (Da): 98,531
Last modified: April 17, 2007 - v2
Checksum: E587626445BA7A95
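
The sequence listing interleaves position rulers with blocks of ten residues, so it has to be cleaned before any coordinate-based work. The Python sketch below shows one way to do that, assuming a hypothetical `parse_sequence_block` helper (an illustrative name, not a UniProt API). It is run on the first 100 residues copied from the listing and then slices out residues 56 – 69, the stretch immediately after the annotated signal peptide.

```python
def parse_sequence_block(text: str) -> str:
    """Join residue blocks into one string, skipping blank and ruler lines."""
    residues = []
    for line in text.splitlines():
        tokens = line.split()
        # Ruler lines contain only position numbers; blank lines have no tokens.
        if not tokens or all(token.isdigit() for token in tokens):
            continue
        residues.append("".join(tokens))
    return "".join(residues)


# First two residue lines of the entry, copied with their rulers.
block = """\
        10         20         30         40         50
MRLVNSLGRR KILLILAVIV AFSTVLLFAK LWGRKTSSTL DEVGSKTHGD
60 70 80 90 100
LTAENKNGGY LPEEEIPDQP PATGAFNYGE ALQKAIFFYE CQRSGKLDPS
"""

first_100 = parse_sequence_block(block)

# 1-based inclusive positions 56-69 map to the Python slice [55:69].
mature_start = first_100[55:69]
print(len(first_100), mature_start)
```

Applied to the full listing, the same function should return a string whose length matches the stated Length of 887.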

Experimental Info

Feature key        Position(s)  Length  Description
Sequence conflict  861 – 887    27      NPSAS…YGKEP → KQACLRQRTLIYLYATWLR in AAA20892 (PubMed:8436949). Curated.
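
To make the sequence conflict concrete, the Python sketch below applies it to the last 37 residues of the displayed sequence (positions 851 – 887). The `apply_conflict` helper and its `offset` parameter are hypothetical names introduced for this example. The sketch assumes the AAA20892 translation is otherwise identical to the displayed sequence, which the entry does not state for positions outside the conflict.

```python
def apply_conflict(fragment, offset, start, end, replacement):
    """Replace 1-based inclusive positions start..end of a fragment.

    `offset` is the protein position of the fragment's first residue.
    """
    first = start - offset
    last = end - offset + 1
    if first < 0 or last > len(fragment):
        raise ValueError("conflict range lies outside the fragment")
    return fragment[:first] + replacement + fragment[last:]


# Residues 851-887 copied from the last line of the sequence listing.
tail = "NYNQSNDYSFNPSASDYTDWNRVTLYISNKLVYGKEP"

variant_tail = apply_conflict(tail, 851, 861, 887, "KQACLRQRTLIYLYATWLR")

# Implied length of the conflicting translation under the stated assumption.
implied_length = 887 - (887 - 861 + 1) + len("KQACLRQRTLIYLYATWLR")
print(variant_tail, implied_length)
```

The replacement is shorter than the 27 residues it replaces, so a translation that differs only here would be 879 residues long.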

Sequence databases

EMBL / GenBank / DDBJ: L04735 Genomic DNA. Translation: AAA20892.1.
    CP000568 Genomic DNA. Translation: ABN51281.1.
PIR: A47704.
RefSeq: WP_011837740.1. NC_009012.1.
    YP_001036474.1. NC_009012.1.

Genome annotation databases

EnsemblBacteria: ABN51281; ABN51281; Cthe_0040.
GeneID: 4808805.
KEGG: cth:Cthe_0040.
PATRIC: 19513685. VBICloThe47081_0045.

Publications

  1. "Gene sequence and properties of CelI, a family E endoglucanase from Clostridium thermocellum."
    Hazlewood G.P., Davidson K., Laurie J.I., Huskisson N.S., Gilbert H.J.
    J. Gen. Microbiol. 139:307-316(1993)
    Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA], PROTEIN SEQUENCE OF 56-69.
  2. Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
    Strain: ATCC 27405 / DSM 1237 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372.

Entry information

Entry name: GUNI_CLOTH
Accession: Primary (citable) accession number: Q02934
Secondary accession number(s): A3DBF2
Entry history
Integrated into UniProtKB/Swiss-Prot: February 1, 1995
Last sequence update: April 17, 2007
Last modified: October 29, 2014
This is version 96 of the entry and version 2 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Prokaryotic Protein Annotation Program

Miscellaneous

Keywords - Technical term

3D-structure, Complete proteome, Direct protein sequencing, Reference proteome

Documents

  1. Glycosyl hydrolases
    Classification of glycosyl hydrolase families and list of entries
  2. PATHWAY comments
    Index of metabolic and biosynthesis pathways
  3. PDB cross-references
    Index of Protein Data Bank (PDB) cross-references
  4. SIMILARITY comments
    Index of protein domains and families
