Protein

Chitooligosaccharide deacetylase ChbG

Gene

chbG

Organism
Escherichia coli (strain K12)
Status
Reviewed. Annotation score: 5 out of 5. Experimental evidence at protein level.

Function

ChbG is essential for growth on the acetylated chitooligosaccharides chitobiose and chitotriose, but is dispensable for growth on cellobiose and on chitosan dimer, the deacetylated form of chitobiose. Deacetylation of chitobiose-6-P and chitotriose-6-P is necessary both for activation of the chb promoter by the regulatory protein ChbR and for hydrolysis of phosphorylated beta-glucosides by the phospho-beta-glucosidase ChbF. ChbG catalyzes the removal of only one acetyl group from chitobiose-6-P to yield monoacetylchitobiose-6-P, the inducer of ChbR and the substrate of ChbF. It can also use chitobiose and chitotriose as substrates. (1 publication)

Catalytic activity

2-(acetylamino)-4-O-(2-(acetylamino)-2-deoxy-beta-D-glucopyranosyl)-2-deoxy-beta-D-glucopyranose + H2O = 2-(acetylamino)-4-O-(2-amino-2-deoxy-beta-D-glucopyranosyl)-2-deoxy-beta-D-glucopyranose + acetate. (1 publication)
Diacetylchitobiose-6-phosphate + H2O = N-monoacetylchitobiose-6-phosphate + acetate. (1 publication)

Cofactor

Mg2+ (by similarity)

Pathway

Sites

Feature key      Position(s)   Length   Description
Metal binding    61            1        Magnesium; via tele nitrogen (by similarity)
Metal binding    125           1        Magnesium; via pros nitrogen (by similarity)

GO - Molecular function

  1. chitin disaccharide deacetylase activity Source: EcoCyc
  2. deacetylase activity Source: EcoCyc

GO - Biological process

  1. carbohydrate metabolic process Source: InterPro
  2. diacetylchitobiose catabolic process Source: EcoCyc

Keywords - Molecular function

Hydrolase

Keywords - Biological process

Carbohydrate metabolism, Chitin degradation, Polysaccharide degradation

Keywords - Ligand

Magnesium, Metal-binding

Enzyme and pathway databases

BioCyc: EcoCyc:EG12198-MONOMER.
ECOL316407:JW1722-MONOMER.
MetaCyc:EG12198-MONOMER.
UniPathway: UPA00349.

Names & Taxonomy

Protein names
Recommended name: Chitooligosaccharide deacetylase ChbG (EC 3.5.1.105) (1 publication)
Short name: COD (1 publication)
Alternative name(s):
Chitin disaccharide deacetylase (1 publication)
Chitobiose deacetylase (1 publication)
Chitobiose-6P deacetylase (1 publication)
Chitotriose deacetylase (1 publication)
Chitotriose-6P deacetylase (1 publication)
Gene names
Name: chbG (1 publication)
Synonyms: ydjC
Ordered locus names: b1733, JW1722
Organism: Escherichia coli (strain K12)
Taxonomic identifier: 83333 [NCBI]
Taxonomic lineage: Bacteria > Proteobacteria > Gammaproteobacteria > Enterobacteriales > Enterobacteriaceae > Escherichia
Proteomes: UP000000318: Chromosome, UP000000625: Chromosome

Organism-specific databases

EcoGene: EG12198. chbG.

Subcellular location

Cytoplasm (1 publication)

Keywords - Cellular component

Cytoplasm

Pathology & Biotech

Disruption phenotype

Cells lacking this gene are unable to grow on chitobiose and chitotriose. (1 publication)

Mutagenesis

Feature key    Position(s)   Length   Description
Mutagenesis    11            1        D → A: Unable to induce ChbR and to grow on chitobiose. (1 publication)
Mutagenesis    61            1        H → A: Unable to induce ChbR and to grow on chitobiose. (1 publication)
Mutagenesis    125           1        H → A: Unable to induce ChbR and to grow on chitobiose. (1 publication)
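
The mutagenesis positions above use 1-based residue numbering on the canonical sequence given in the Sequence section of this entry. As a minimal sketch (not part of the UniProt record), the following Python checks that the wild-type residues at positions 11, 61 and 125 match the annotation and builds the three alanine variants. The names `CHBG`, `MUTATIONS` and `apply_point_mutation` are hypothetical helpers introduced only for illustration.

```python
# Canonical P37794 sequence, copied from the Sequence section of this entry.
# Whitespace between the 10-residue blocks is stripped by split()/join().
CHBG = "".join("""
MERLLIVNAD DFGLSKGQNY GIIEACRNGI VTSTTALVNG QAIDHAVQLS
RDEPSLAIGM HFVLTMGKPL TAMPGLTRDG VLGKWIWQLA EEDALPLEEI
TQELVSQYLR FIELFGRKPT HLDSHHHVHM FPQIFPIVAR FAAEQGIALR
ADRQMAFDLP VNLRTTQGFS SAFYGEEISE SLFLQVLDDA GHRGDRSLEV
MCHPAFIDNT IRQSAYCFPR LTELDVLTSA SLKGAIAQRG YRLGSYRDV
""".split())

# (1-based position, wild-type residue, mutant residue) from the table above.
MUTATIONS = [(11, "D", "A"), (61, "H", "A"), (125, "H", "A")]


def apply_point_mutation(seq, position, wild_type, mutant):
    """Return seq with the residue at a 1-based position replaced.

    Raises ValueError if the residue found there is not the expected
    wild-type residue, which guards against off-by-one numbering errors.
    """
    found = seq[position - 1]
    if found != wild_type:
        raise ValueError(
            f"expected {wild_type} at position {position}, found {found}"
        )
    return seq[:position - 1] + mutant + seq[position:]


# Build one variant per annotated mutation, keyed as e.g. "D11A".
mutants = {
    f"{wt}{pos}{mt}": apply_point_mutation(CHBG, pos, wt, mt)
    for pos, wt, mt in MUTATIONS
}

for name in mutants:
    print(name, "built; length", len(mutants[name]))
```

Positions 61 and 125 are also the two residues annotated as magnesium-binding in the Sites table, so the same check covers both feature tables.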

PTM / Processing

Molecule processing

Feature key   Position(s)   Length   Description                              Feature identifier
Chain         1 – 249       249      Chitooligosaccharide deacetylase ChbG    PRO_0000051589

Expression

Induction

By N,N'-diacetylchitobiose. (1 publication)

Gene expression databases

Genevestigator: P37794.

Interaction

Subunit structure

Homodimer. (UniRule annotation)

Protein-protein interaction databases

IntAct: P37794. 4 interactions.
STRING: 511145.b1733.

Structure

3D structure databases

ProteinModelPortal: P37794.
SMR: P37794. Positions 5-249.

Family & Domains

Sequence similarities

Belongs to the YdjC deacetylase family. ChbG subfamily. (Curated)

Phylogenomic databases

eggNOG: COG3394.
HOGENOM: HOG000225034.
InParanoid: P37794.
KO: K03478.
OMA: EPTHIDS.
OrthoDB: EOG6Z0Q7C.
PhylomeDB: P37794.

Family and domain databases

Gene3D: 3.20.20.370. 1 hit.
HAMAP: MF_01246. COD.
InterPro: IPR011330. Glyco_hydro/deAcase_b/a-brl.
IPR002509. Polysac_deacetylase.
IPR006879. Uncharacterised_UPF0249/HpnK.
IPR022948. UPF0249.
Pfam: PF04794. YdjC. 1 hit.
SUPFAM: SSF88713. SSF88713. 1 hit.

Sequence

Sequence status: Complete.

P37794-1 [UniParc]

        10         20         30         40         50
MERLLIVNAD DFGLSKGQNY GIIEACRNGI VTSTTALVNG QAIDHAVQLS
        60         70         80         90        100
RDEPSLAIGM HFVLTMGKPL TAMPGLTRDG VLGKWIWQLA EEDALPLEEI
       110        120        130        140        150
TQELVSQYLR FIELFGRKPT HLDSHHHVHM FPQIFPIVAR FAAEQGIALR
       160        170        180        190        200
ADRQMAFDLP VNLRTTQGFS SAFYGEEISE SLFLQVLDDA GHRGDRSLEV
       210        220        230        240
MCHPAFIDNT IRQSAYCFPR LTELDVLTSA SLKGAIAQRG YRLGSYRDV
Length: 249
Mass (Da): 27,774
Last modified: July 15, 1998 - v2
Checksum: 1D4747904C974F11

Experimental Info

Feature key         Position(s)   Length   Description
Sequence conflict   6             1        I → L in CAA47260 (PubMed:8121401). (Curated)
Sequence conflict   6             1        I → L in X52890 (PubMed:2179047). (Curated)
Sequence conflict   16 – 40       25       KGQNY…ALVNG → QRTELRHYRGLSQWDCHCRRRHCEW in CAA47260 (PubMed:8121401). (Curated)
Sequence conflict   16 – 40       25       KGQNY…ALVNG → QRTELRHYRGLSQWDCHCRRRHCEW in X52890 (PubMed:2179047). (Curated)
Sequence conflict   48            1        Q → H in CAA47260 (PubMed:8121401). (Curated)
Sequence conflict   48            1        Q → H in X52890 (PubMed:2179047). (Curated)
Sequence conflict   51            1        R → C in CAA47260 (PubMed:8121401). (Curated)
Sequence conflict   51            1        R → C in X52890 (PubMed:2179047). (Curated)
Sequence conflict   55            1        S → I in CAA47260 (PubMed:8121401). (Curated)
Sequence conflict   55            1        S → I in X52890 (PubMed:2179047). (Curated)
Sequence conflict   65            1        T → I in CAA47260 (PubMed:8121401). (Curated)
Sequence conflict   65            1        T → I in X52890 (PubMed:2179047). (Curated)
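
Each sequence conflict above replaces a span of the canonical sequence with a span of equal length, so the conflicts can be applied in any order without shifting positions. The sketch below applies them to residues 1-76 of the canonical sequence (the region covered by the partial sequence cited under PubMed:2179047). It assumes the earlier submissions differ from the canonical sequence only at the listed positions, which this entry does not state. The names `CANONICAL_1_76`, `CONFLICTS` and `apply_conflicts` are hypothetical, and the elided original span `KGQNY…ALVNG` is checked only by its visible start and end.

```python
# Residues 1-76 of the canonical P37794 sequence, copied from this entry.
CANONICAL_1_76 = "".join("""
MERLLIVNAD DFGLSKGQNY GIIEACRNGI VTSTTALVNG QAIDHAVQLS
RDEPSLAIGM HFVLTMGKPL TAMPGL
""".split())

# (start, end, expected start of original, expected end of original, replacement)
# Positions are 1-based and inclusive, as in the feature table.
CONFLICTS = [
    (6, 6, "I", "I", "L"),
    (16, 40, "KGQNY", "ALVNG", "QRTELRHYRGLSQWDCHCRRRHCEW"),
    (48, 48, "Q", "Q", "H"),
    (51, 51, "R", "R", "C"),
    (55, 55, "S", "S", "I"),
    (65, 65, "T", "T", "I"),
]


def apply_conflicts(seq, conflicts):
    """Return seq with every conflict span replaced.

    Each span is validated against the canonical residues first, and the
    replacement must have the same length so later positions do not shift.
    """
    residues = list(seq)
    for start, end, head, tail, replacement in conflicts:
        original = seq[start - 1:end]
        if not (original.startswith(head) and original.endswith(tail)):
            raise ValueError(f"canonical span {start}-{end} is {original!r}")
        if len(replacement) != len(original):
            raise ValueError(f"length mismatch at {start}-{end}")
        residues[start - 1:end] = replacement
    return "".join(residues)


variant = apply_conflicts(CANONICAL_1_76, CONFLICTS)

# List the 1-based positions where the two readings disagree.
differing = [
    i + 1 for i, (a, b) in enumerate(zip(CANONICAL_1_76, variant)) if a != b
]
print(variant)
print("differing positions:", differing)
```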

Sequence databases

EMBL / GenBank / DDBJ:
X66725 Genomic DNA. Translation: CAA47260.1.
X66725 Genomic DNA. Translation: CAA47261.1.
U00096 Genomic DNA. Translation: AAC74803.1.
AP009048 Genomic DNA. Translation: BAA15514.1.
X52890 Genomic DNA. No translation available.
M55161 Genomic DNA. No translation available.
PIR: E64932.
RefSeq: NP_416247.1. NC_000913.3.
YP_489994.1. NC_007779.1.

Genome annotation databases

EnsemblBacteria: AAC74803; AAC74803; b1733.
BAA15514; BAA15514; BAA15514.
GeneID: 12931311.
946231.
KEGG: ecj:Y75_p1708.
eco:b1733.
PATRIC: 32118775. VBIEscCol129921_1805.

Cross-references

Organism-specific databases

EchoBASE: EB2115.
EcoGene: EG12198. chbG.

Miscellaneous databases

PRO: P37794.

Publications

  1. "A luxAB transcriptional fusion to the cryptic celF gene of Escherichia coli displays increased luminescence in the presence of nickel."
    Guzzo A., Dubow M.S.
    Mol. Gen. Genet. 242:455-460(1994)
    Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA].
    Strain: K12 / DH1 / ATCC 33849 / DSM 4235 / NCIB 12045.
  2. Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
    Strain: K12 / W3110 / ATCC 27325 / DSM 5911.
  3. Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
    Strain: K12 / MG1655 / ATCC 47076.
  4. "Highly accurate genome sequences of Escherichia coli K-12 strains MG1655 and W3110."
    Hayashi K., Morooka N., Yamamoto Y., Fujita K., Isono K., Choi S., Ohtsubo E., Baba T., Wanner B.L., Mori H., Horiuchi T.
    Mol. Syst. Biol. 2:E1-E5(2006)
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
    Strain: K12 / W3110 / ATCC 27325 / DSM 5911.
  5. "Characterization and nucleotide sequence of the cryptic cel operon of Escherichia coli K12."
    Parker L.L., Hall B.G.
    Genetics 124:455-471(1990)
    Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 1-76.
    Strain: K12.
  6. "Nucleotide sequence of Escherichia coli katE, which encodes catalase HPII."
    von Ossowski I., Mulvey M.R., Leco P.A., Borys A., Loewen P.C.
    J. Bacteriol. 173:514-520(1991)
    Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 207-249.
    Strain: K12.
  7. "Wild-type Escherichia coli grows on the chitin disaccharide, N,N'-diacetylchitobiose, by expressing the cel operon."
    Keyhani N.O., Roseman S.
    Proc. Natl. Acad. Sci. U.S.A. 94:14367-14371(1997)
    Cited for: INDUCTION, PATHWAY, GENE NAME.
  8. "The chbG gene of the chitobiose (chb) operon of Escherichia coli encodes a chitooligosaccharide deacetylase."
    Verma S.C., Mahadevan S.
    J. Bacteriol. 194:4959-4971(2012)
    Cited for: FUNCTION, CATALYTIC ACTIVITY, MUTAGENESIS OF ASP-11; HIS-61 AND HIS-125, DISRUPTION PHENOTYPE, SUBSTRATE SPECIFICITY, SUBCELLULAR LOCATION.

Entry information

Entry name: CHBG_ECOLI
Accession: Primary (citable) accession number: P37794
Secondary accession number(s): P77435
Entry history
Integrated into UniProtKB/Swiss-Prot: October 1, 1994
Last sequence update: July 15, 1998
Last modified: March 4, 2015
This is version 101 of the entry and version 2 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Prokaryotic Protein Annotation Program

Miscellaneous

Caution

Originally characterized (PubMed:8121401 and PubMed:2179047) as part of a cryptic cel operon for a cellobiose degradation system. The Cel+ phenotype is due to mutations that make expression chitobiose-independent and alter the substrate specificity. (Curated)

Keywords - Technical term

Complete proteome, Reference proteome

Documents

  1. Escherichia coli
    Escherichia coli (strain K12): entries and cross-references to EcoGene
  2. PATHWAY comments
    Index of metabolic and biosynthesis pathways
  3. SIMILARITY comments
    Index of protein domains and families

Similar proteins

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%: UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into a single UniRef entry.
90%: UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a. the seed sequence).
50%: UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.