UniProt

Q05332 - GUNG_CLOTH


Protein: Endoglucanase G
Gene: celG, Cthe_2872
Organism: Clostridium thermocellum (strain ATCC 27405 / DSM 1237)
Status: Reviewed - Annotation score: 4 out of 5 - Protein inferred from homology

Function

This enzyme catalyzes the endohydrolysis of 1,4-beta-glucosidic linkages in cellulose, lichenin and cereal beta-D-glucans.

Catalytic activity

Endohydrolysis of (1->4)-beta-D-glucosidic linkages in cellulose, lichenin and cereal beta-D-glucans.

Sites

Feature key    Position(s)    Length    Description
Active site    226 – 226      1         Proton donor (by similarity)
Active site    381 – 381      1         Nucleophile (by similarity)

GO - Molecular function

  1. cellulase activity Source: MENGO

GO - Biological process

  1. cellulose catabolic process Source: UniProtKB-KW

Keywords - Molecular function

Glycosidase, Hydrolase

Keywords - Biological process

Carbohydrate metabolism, Cellulose degradation, Polysaccharide degradation

Enzyme and pathway databases

BioCyc: CTHE203119:GIW8-2979-MONOMER.
        MetaCyc:MONOMER-16420.

Protein family/group databases

CAZy: GH5. Glycoside Hydrolase Family 5.

Names & Taxonomy

Protein names
Recommended name:
Endoglucanase G (EC:3.2.1.4)
Alternative name(s):
Cellulase G
Endo-1,4-beta-glucanase G
Short name:
EgG
Gene names
Name: celG
Ordered Locus Names: Cthe_2872
Organism: Clostridium thermocellum (strain ATCC 27405 / DSM 1237)
Taxonomic identifier: 203119 [NCBI]
Taxonomic lineage: Bacteria > Firmicutes > Clostridia > Clostridiales > Ruminococcaceae > Ruminiclostridium
Proteomes: UP000002145: Chromosome

Subcellular location

GO - Cellular component

  1. cellulosome Source: MENGO

PTM / Processing

Molecule processing

Feature key       Position(s)    Length    Description        Feature identifier
Signal peptide    1 – 30         30
Chain             31 – 566       536       Endoglucanase G    PRO_0000007853

Interaction

Protein-protein interaction databases

IntAct: Q05332. 1 interaction.
STRING: 203119.Cthe_2872.

Structure

3D structure databases

ProteinModelPortal: Q05332.
SMR: Q05332. Positions 498-565.

Family & Domains

Domains and Repeats

Feature key    Position(s)    Length    Description
Repeat         503 – 526      24        1
Repeat         536 – 549      14        2

Region

Feature key    Position(s)    Length    Description
Region         503 – 549      47        2 X 24 AA approximate repeats

Domain

A 24-residue domain is repeated twice in this enzyme as well as in other C. thermocellum cellulosome enzymes. This domain may function as the binding ligand for the SL component.
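
To make the "approximate repeats" wording concrete, the following minimal sketch compares the two annotated repeat segments position by position. The two strings are copied from the sequence in this entry at positions 503-526 and 536-549 (assumed here to be 1-based and inclusive). The helper name count_identities is hypothetical, and the comparison is a plain ungapped one over the shorter segment, not a proper alignment.

```python
# Repeat segments copied from the sequence shown in this entry
# (positions 503-526 and 536-549, assumed 1-based inclusive).
REPEAT_1 = "DINSDGNVNSTDLGILKRIIVKNP"
REPEAT_2 = "DVNADGKVNSTDYT"


def count_identities(a, b):
    """Compare two segments position by position (ungapped).

    Returns (identical_positions, compared_positions); the comparison
    stops at the end of the shorter segment.
    """
    span = min(len(a), len(b))
    matches = sum(1 for x, y in zip(a, b) if x == y)
    return matches, span


matches, span = count_identities(REPEAT_1, REPEAT_2)
print(f"{matches} of {span} compared positions are identical")
# prints: 9 of 14 compared positions are identical
```

The second annotated repeat is only 14 residues long, so the comparison covers just the first 14 positions of the first repeat.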

Sequence similarities

Keywords - Domain

Repeat, Signal

Phylogenomic databases

eggNOG: COG2730.
HOGENOM: HOG000225207.
OMA: KYKNDDT.
OrthoDB: EOG6PS5R7.

Family and domain databases

Gene3D: 1.10.1330.10. 1 hit.
        3.20.20.80. 1 hit.
InterPro: IPR016134. Cellulos_enz_dockerin_1.
          IPR002105. Cellulos_enz_dockerin_1_Ca-bd.
          IPR018242. Dockerin_1.
          IPR018247. EF_Hand_1_Ca_BS.
          IPR001547. Glyco_hydro_5.
          IPR018087. Glyco_hydro_5_CS.
          IPR013781. Glyco_hydro_catalytic_dom.
          IPR017853. Glycoside_hydrolase_SF.
Pfam: PF00150. Cellulase. 1 hit.
      PF00404. Dockerin_1. 2 hits.
SUPFAM: SSF51445. SSF51445. 1 hit.
        SSF63446. SSF63446. 1 hit.
PROSITE: PS00448. CLOS_CELLULOSOME_RPT. 2 hits.
         PS00018. EF_HAND_1. 2 hits.
         PS00659. GLYCOSYL_HYDROL_F5. 1 hit.

Sequence

Sequence status: Complete.

Sequence processing: The displayed sequence is further processed into a mature form.

Q05332-1 [UniParc]

MKKAKAIFSL VVALMVLAIF CFAQNTGSTA TTAAAAVDSN NDDWLHCKGN    50
KIYDMYGNEV WLTGANWFGF NCSENCFHGA WYDVKTILTS IADRGINLLR 100
IPISTELLYS WMIGKPNPVS SVTASNNPPY HVVNPDFYDP ETDDVKNSME 150
IFDIIMGYCK ELGIKVMIDI HSPDANNSGH NYELWYGKET STCGVVTTKM 200
WIDTLVWLAD KYKNDDTIIA FDLKNEPHGK RGYTAEVPKL LAKWDNSTDE 250
NNWKYAAETC AKAILEVNPK VLIVIEGVEQ YPKTEKGYTY DTPDIWGATG 300
DASPWYSAWW GGNLRGVKDY PIDLGPLNSQ IVYSPHDYGP SVYAQPWFEK 350
DFTMQTLLDD YWYDTWAYIH DQGIAPILIG EWGGHMDGGK NQKWMTLLRD 400
YIVQNRIHHT FWCINPNSGD TGGLLGNDWS TWDEAKYALL KPALWQTKDG 450
KFIGLDHKIP LGSKGISLGE YYGTPQASDP PATPTATPTK PAASSTPSFI 500
YGDINSDGNV NSTDLGILKR IIVKNPPASA NMDAADVNAD GKVNSTDYTV 550
LKRYLLRSID KLPHTT 566
Length: 566
Mass (Da): 63,199
Last modified: February 1, 1994 - v1
Checksum: 2CC9DE1AD87C3178
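
As a rough consistency check, the sketch below strips the position counters from the displayed sequence block and slices out the features annotated in this entry (signal peptide, mature chain, and the two active-site residues). The helper names parse_numbered_block and feature are hypothetical and exist only for this illustration. Feature coordinates are assumed to be 1-based and inclusive.

```python
import re

# Sequence block as displayed in this entry, including the running
# position counters at the end of each row.
RAW = """
MKKAKAIFSL VVALMVLAIF CFAQNTGSTA TTAAAAVDSN NDDWLHCKGN    50
KIYDMYGNEV WLTGANWFGF NCSENCFHGA WYDVKTILTS IADRGINLLR 100
IPISTELLYS WMIGKPNPVS SVTASNNPPY HVVNPDFYDP ETDDVKNSME 150
IFDIIMGYCK ELGIKVMIDI HSPDANNSGH NYELWYGKET STCGVVTTKM 200
WIDTLVWLAD KYKNDDTIIA FDLKNEPHGK RGYTAEVPKL LAKWDNSTDE 250
NNWKYAAETC AKAILEVNPK VLIVIEGVEQ YPKTEKGYTY DTPDIWGATG 300
DASPWYSAWW GGNLRGVKDY PIDLGPLNSQ IVYSPHDYGP SVYAQPWFEK 350
DFTMQTLLDD YWYDTWAYIH DQGIAPILIG EWGGHMDGGK NQKWMTLLRD 400
YIVQNRIHHT FWCINPNSGD TGGLLGNDWS TWDEAKYALL KPALWQTKDG 450
KFIGLDHKIP LGSKGISLGE YYGTPQASDP PATPTATPTK PAASSTPSFI 500
YGDINSDGNV NSTDLGILKR IIVKNPPASA NMDAADVNAD GKVNSTDYTV 550
LKRYLLRSID KLPHTT 566
"""


def parse_numbered_block(raw):
    """Drop whitespace and position counters, keeping only residue letters."""
    return re.sub(r"[^A-Za-z]", "", raw).upper()


def feature(seq, start, end):
    """Slice a feature given 1-based inclusive coordinates."""
    return seq[start - 1:end]


seq = parse_numbered_block(RAW)
signal_peptide = feature(seq, 1, 30)   # Signal peptide 1 - 30
mature_chain = feature(seq, 31, 566)   # Chain 31 - 566
proton_donor = feature(seq, 226, 226)  # Active site, proton donor
nucleophile = feature(seq, 381, 381)   # Active site, nucleophile

print(len(seq), len(signal_peptide), len(mature_chain))
print(proton_donor, nucleophile)
```

With the sequence as displayed, the parsed length matches the stated length of 566, and both annotated active-site positions fall on a glutamate (E).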

Sequence databases

EMBL / GenBank / DDBJ:
X69390 Genomic DNA. Translation: CAA49187.1.
CP000568 Genomic DNA. Translation: ABN54070.1.
PIR: A40589.
RefSeq: WP_003514548.1. NC_009012.1.
        YP_001039263.1. NC_009012.1.

Genome annotation databases

EnsemblBacteria: ABN54070; ABN54070; Cthe_2872.
GeneID: 4809152.
KEGG: cth:Cthe_2872.
PATRIC: 19519842. VBICloThe47081_3073.

Publications

  1. "Nucleotide sequence of the celG gene of Clostridium thermocellum and characterization of its product, endoglucanase CelG."
    Lemaire M., Beguin P.
    J. Bacteriol. 175:3353-3360(1993)
    Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA].
  2. Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
    Strain: ATCC 27405 / DSM 1237.

Entry information

Entry name: GUNG_CLOTH
Accession: Primary (citable) accession number: Q05332
Secondary accession number(s): A3DJE1
Entry history
Integrated into UniProtKB/Swiss-Prot: February 1, 1994
Last sequence update: February 1, 1994
Last modified: September 3, 2014
This is version 93 of the entry and version 1 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Prokaryotic Protein Annotation Program

Miscellaneous

Keywords - Technical term

Complete proteome

Documents

  1. Glycosyl hydrolases
    Classification of glycosyl hydrolase families and list of entries
  2. SIMILARITY comments
    Index of protein domains and families
