Protein

Endoglucanase G

Gene

celG

Organism
Clostridium thermocellum (strain ATCC 27405 / DSM 1237 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) (Ruminiclostridium thermocellum)
Status
Reviewed. Annotation score: -. Protein inferred from homology.

Function

This enzyme catalyzes the endohydrolysis of 1,4-beta-glucosidic linkages in cellulose, lichenin and cereal beta-D-glucans.

Catalytic activity

Endohydrolysis of (1->4)-beta-D-glucosidic linkages in cellulose, lichenin and cereal beta-D-glucans.

Sites

Feature key    Position(s)    Description                     Length
Active site    226            Proton donor (by similarity)    1
Active site    381            Nucleophile (by similarity)     1
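
As an illustrative check of the two active-site positions, the sketch below reads the residue at a 1-based position from a numbered fragment of the sequence. The helper name residue_at is hypothetical and not part of the UniProt entry; the two fragments are copied from the sequence shown in this entry (residues 201-250 and 351-400).

```python
# Fragments copied from this entry's sequence; start positions are 1-based.
FRAGMENT_201 = "WIDTLVWLADKYKNDDTIIAFDLKNEPHGKRGYTAEVPKLLAKWDNSTDE"  # residues 201-250
FRAGMENT_351 = "DFTMQTLLDDYWYDTWAYIHDQGIAPILIGEWGGHMDGGKNQKWMTLLRD"  # residues 351-400


def residue_at(fragment: str, fragment_start: int, position: int) -> str:
    """Return the one-letter residue at a 1-based position (hypothetical helper)."""
    offset = position - fragment_start
    if not 0 <= offset < len(fragment):
        raise ValueError(f"position {position} lies outside the fragment")
    return fragment[offset]


proton_donor = residue_at(FRAGMENT_201, 201, 226)
nucleophile = residue_at(FRAGMENT_351, 351, 381)
print(proton_donor, nucleophile)  # E E
```

Both annotated positions are glutamate (E) in the displayed sequence.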

GO - Molecular function

  • cellulase activity Source: MENGO

GO - Biological process

Keywords

Molecular function: Glycosidase, Hydrolase
Biological process: Carbohydrate metabolism, Cellulose degradation, Polysaccharide degradation

Enzyme and pathway databases

BioCyc:
CTHE203119:G1G86-3010-MONOMER
MetaCyc:MONOMER-16420

Protein family/group databases

CAZy: GH5 Glycoside Hydrolase Family 5

Names & Taxonomy

Protein names
Recommended name:
Endoglucanase G (EC:3.2.1.4)
Alternative name(s):
Cellulase G
Endo-1,4-beta-glucanase G
Short name:
EgG
Gene names
Name: celG
Ordered Locus Names: Cthe_2872
Organism: Clostridium thermocellum (strain ATCC 27405 / DSM 1237 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) (Ruminiclostridium thermocellum)
Taxonomic identifier: 203119 [NCBI]
Taxonomic lineage: Bacteria > Firmicutes > Clostridia > Clostridiales > Ruminococcaceae > Ruminiclostridium
Proteomes
  • UP000002145 Component: Chromosome

Subcellular location

GO - Cellular component

  • cellulosome Source: MENGO

PTM / Processing

Molecule processing

Feature key               Position(s)    Description        Length
Signal peptide            1 – 30                            30
Chain (PRO_0000007853)    31 – 566       Endoglucanase G    536
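
The lengths in the table follow from 1-based, inclusive coordinates (length = end - start + 1). A minimal sketch of that arithmetic, assuming a hypothetical Feature type defined only for this example, with the N-terminal 50 residues copied from the sequence in this entry:

```python
from typing import NamedTuple


class Feature(NamedTuple):
    """A sequence feature with 1-based, inclusive coordinates (illustrative type)."""
    key: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive coordinates, so add 1 to the difference.
        return self.end - self.start + 1

    def extract(self, sequence: str) -> str:
        # Convert 1-based inclusive coordinates to a Python slice.
        return sequence[self.start - 1:self.end]


PRECURSOR_LENGTH = 566
signal_peptide = Feature("Signal peptide", 1, 30)
chain = Feature("Chain", 31, 566)

# The two features should tile the precursor with no gap or overlap.
assert chain.start == signal_peptide.end + 1
assert signal_peptide.length + chain.length == PRECURSOR_LENGTH

# First 50 residues of the precursor, copied from this entry.
N_TERMINUS = "MKKAKAIFSLVVALMVLAIFCFAQNTGSTATTAAAAVDSNNDDWLHCKGN"
signal_sequence = signal_peptide.extract(N_TERMINUS)

print(signal_peptide.length, chain.length)  # 30 536
print(signal_sequence)
```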

Interaction

Protein-protein interaction databases

IntAct: Q05332, 1 interactor
STRING: 203119.Cthe_2872

Structure

3D structure databases

ProteinModelPortal: Q05332
SMR: Q05332

Family & Domains

Domains and Repeats

Feature key    Position(s)    Description                                 Length
Domain         497 – 564      Dockerin (PROSITE-ProRule annotation)       68

Sequence similarities

Keywords - Domain

Signal

Phylogenomic databases

eggNOG:
ENOG4106R7G Bacteria
COG2730 LUCA
HOGENOM: HOG000225207
KO: K01179
OMA: TWCCGAD
OrthoDB: POG091H1DNW

Family and domain databases

InterPro:
IPR002105 Dockerin_1_rpt
IPR016134 Dockerin_dom
IPR036439 Dockerin_dom_sf
IPR018247 EF_Hand_1_Ca_BS
IPR001547 Glyco_hydro_5
IPR018087 Glyco_hydro_5_CS
IPR017853 Glycoside_hydrolase_SF
Pfam:
PF00150 Cellulase, 1 hit
PF00404 Dockerin_1, 2 hits
SUPFAM:
SSF51445 SSF51445, 1 hit
SSF63446 SSF63446, 1 hit
PROSITE:
PS00448 CLOS_CELLULOSOME_RPT, 2 hits
PS51766 DOCKERIN, 1 hit
PS00018 EF_HAND_1, 2 hits
PS00659 GLYCOSYL_HYDROL_F5, 1 hit

Sequence

Sequence status: Complete.

Sequence processing: The displayed sequence is further processed into a mature form.

Q05332-1

        10         20         30         40         50
MKKAKAIFSL VVALMVLAIF CFAQNTGSTA TTAAAAVDSN NDDWLHCKGN
        60         70         80         90        100
KIYDMYGNEV WLTGANWFGF NCSENCFHGA WYDVKTILTS IADRGINLLR
       110        120        130        140        150
IPISTELLYS WMIGKPNPVS SVTASNNPPY HVVNPDFYDP ETDDVKNSME
       160        170        180        190        200
IFDIIMGYCK ELGIKVMIDI HSPDANNSGH NYELWYGKET STCGVVTTKM
       210        220        230        240        250
WIDTLVWLAD KYKNDDTIIA FDLKNEPHGK RGYTAEVPKL LAKWDNSTDE
       260        270        280        290        300
NNWKYAAETC AKAILEVNPK VLIVIEGVEQ YPKTEKGYTY DTPDIWGATG
       310        320        330        340        350
DASPWYSAWW GGNLRGVKDY PIDLGPLNSQ IVYSPHDYGP SVYAQPWFEK
       360        370        380        390        400
DFTMQTLLDD YWYDTWAYIH DQGIAPILIG EWGGHMDGGK NQKWMTLLRD
       410        420        430        440        450
YIVQNRIHHT FWCINPNSGD TGGLLGNDWS TWDEAKYALL KPALWQTKDG
       460        470        480        490        500
KFIGLDHKIP LGSKGISLGE YYGTPQASDP PATPTATPTK PAASSTPSFI
       510        520        530        540        550
YGDINSDGNV NSTDLGILKR IIVKNPPASA NMDAADVNAD GKVNSTDYTV
       560
LKRYLLRSID KLPHTT
Length: 566
Mass (Da): 63,199
Last modified: February 1, 1994 - v1
Checksum: 2CC9DE1AD87C3178
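
The displayed sequence mixes residues with position rulers and spacing, so a quick consistency check against the stated length can be useful. The sketch below strips everything that is not a letter and counts the residues; the function name parse_displayed_sequence is hypothetical, and the text block is copied from this entry. It does not attempt to reproduce the mass or checksum.

```python
import re

# Sequence block as displayed in this entry, including the position rulers.
DISPLAYED = """
        10         20         30         40         50
MKKAKAIFSL VVALMVLAIF CFAQNTGSTA TTAAAAVDSN NDDWLHCKGN
        60         70         80         90        100
KIYDMYGNEV WLTGANWFGF NCSENCFHGA WYDVKTILTS IADRGINLLR
       110        120        130        140        150
IPISTELLYS WMIGKPNPVS SVTASNNPPY HVVNPDFYDP ETDDVKNSME
       160        170        180        190        200
IFDIIMGYCK ELGIKVMIDI HSPDANNSGH NYELWYGKET STCGVVTTKM
       210        220        230        240        250
WIDTLVWLAD KYKNDDTIIA FDLKNEPHGK RGYTAEVPKL LAKWDNSTDE
       260        270        280        290        300
NNWKYAAETC AKAILEVNPK VLIVIEGVEQ YPKTEKGYTY DTPDIWGATG
       310        320        330        340        350
DASPWYSAWW GGNLRGVKDY PIDLGPLNSQ IVYSPHDYGP SVYAQPWFEK
       360        370        380        390        400
DFTMQTLLDD YWYDTWAYIH DQGIAPILIG EWGGHMDGGK NQKWMTLLRD
       410        420        430        440        450
YIVQNRIHHT FWCINPNSGD TGGLLGNDWS TWDEAKYALL KPALWQTKDG
       460        470        480        490        500
KFIGLDHKIP LGSKGISLGE YYGTPQASDP PATPTATPTK PAASSTPSFI
       510        520        530        540        550
YGDINSDGNV NSTDLGILKR IIVKNPPASA NMDAADVNAD GKVNSTDYTV
       560
LKRYLLRSID KLPHTT
"""


def parse_displayed_sequence(text: str) -> str:
    """Drop ruler digits and whitespace, keeping only residue letters."""
    return re.sub(r"[^A-Za-z]", "", text).upper()


sequence = parse_displayed_sequence(DISPLAYED)
print(len(sequence))  # 566
```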

Sequence databases

EMBL / GenBank / DDBJ:
X69390 Genomic DNA Translation: CAA49187.1
CP000568 Genomic DNA Translation: ABN54070.1
PIR: A40589
RefSeq: WP_003514548.1, NC_009012.1

Genome annotation databases

EnsemblBacteria: ABN54070; ABN54070; Cthe_2872
GeneID: 35804359
KEGG: cth:Cthe_2872

Similar proteins

Entry information

Entry name: GUNG_CLOTH
Accession:
Primary (citable) accession number: Q05332
Secondary accession number(s): A3DJE1
Entry history:
Integrated into UniProtKB/Swiss-Prot: February 1, 1994
Last sequence update: February 1, 1994
Last modified: April 25, 2018
This is version 112 of the entry and version 1 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Prokaryotic Protein Annotation Program

Miscellaneous

Keywords - Technical term

Complete proteome, Reference proteome
