Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein

Periplasmic beta-glucosidase

Gene

bglX

Organism
Escherichia coli (strain K12)
Status
Reviewed-Annotation score: Annotation score: 4 out of 5-Experimental evidence at protein leveli

Functioni

Catalytic activityi

Hydrolysis of terminal, non-reducing beta-D-glucosyl residues with release of beta-D-glucose.

Sites

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Active sitei287 – 2871By similarity

GO - Molecular functioni

  1. beta-glucosidase activity Source: EcoliWiki
  2. glucosidase activity Source: EcoliWiki

GO - Biological processi

  1. carbohydrate metabolic process Source: InterPro
Complete GO annotation...

Keywords - Molecular functioni

Glycosidase, Hydrolase

Enzyme and pathway databases

BioCyciEcoCyc:EG12013-MONOMER.
ECOL316407:JW2120-MONOMER.
MetaCyc:EG12013-MONOMER.

Protein family/group databases

CAZyiGH3. Glycoside Hydrolase Family 3.

Names & Taxonomyi

Protein namesi
Recommended name:
Periplasmic beta-glucosidase (EC:3.2.1.21)
Alternative name(s):
Beta-D-glucoside glucohydrolase
Cellobiase
Gentiobiase
Gene namesi
Name:bglX
Synonyms:yohA
Ordered Locus Names:b2132, JW2120
OrganismiEscherichia coli (strain K12)
Taxonomic identifieri83333 [NCBI]
Taxonomic lineageiBacteriaProteobacteriaGammaproteobacteriaEnterobacterialesEnterobacteriaceaeEscherichia
ProteomesiUP000000318 Componenti: Chromosome UP000000625 Componenti: Chromosome

Organism-specific databases

EcoGeneiEG12013. bglX.

Subcellular locationi

GO - Cellular componenti

  1. outer membrane-bounded periplasmic space Source: EcoCyc
  2. periplasmic space Source: EcoliWiki
Complete GO annotation...

Keywords - Cellular componenti

Periplasm

PTM / Processingi

Molecule processing

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Signal peptidei1 – 2020Sequence AnalysisAdd
BLAST
Chaini21 – 765745Periplasmic beta-glucosidasePRO_0000011781Add
BLAST

Proteomic databases

PaxDbiP33363.
PRIDEiP33363.

Expressioni

Gene expression databases

GenevestigatoriP33363.

Interactioni

Binary interactionsi

WithEntry#Exp.IntActNotes
nrdAP004521EBI-1114600,EBI-370018

Protein-protein interaction databases

DIPiDIP-9218N.
IntActiP33363. 16 interactions.
STRINGi511145.b2132.

Structurei

3D structure databases

ProteinModelPortaliP33363.
SMRiP33363. Positions 35-765.
ModBaseiSearch...
MobiDBiSearch...

Family & Domainsi

Sequence similaritiesi

Belongs to the glycosyl hydrolase 3 family.Curated

Keywords - Domaini

Signal

Phylogenomic databases

eggNOGiCOG1472.
HOGENOMiHOG000031217.
InParanoidiP33363.
KOiK05349.
OMAiLPPYKAS.
OrthoDBiEOG6DNT6C.
PhylomeDBiP33363.

Family and domain databases

Gene3Di3.20.20.300. 1 hit.
3.40.50.1700. 1 hit.
InterProiIPR026891. Fn3-like.
IPR026892. Glyco_hydro_3.
IPR019800. Glyco_hydro_3_AS.
IPR002772. Glyco_hydro_3_C.
IPR001764. Glyco_hydro_3_N.
IPR017853. Glycoside_hydrolase_SF.
[Graphical view]
PANTHERiPTHR30620. PTHR30620. 1 hit.
PfamiPF14310. Fn3-like. 1 hit.
PF00933. Glyco_hydro_3. 1 hit.
PF01915. Glyco_hydro_3_C. 1 hit.
[Graphical view]
PRINTSiPR00133. GLHYDRLASE3.
SUPFAMiSSF51445. SSF51445. 1 hit.
SSF52279. SSF52279. 2 hits.
PROSITEiPS00775. GLYCOSYL_HYDROL_F3. 1 hit.
[Graphical view]

Sequencei

Sequence statusi: Complete.

Sequence processingi: The displayed sequence is further processed into a mature form.

P33363-1 [UniParc]FASTAAdd to basket

« Hide

        10         20         30         40         50
MKWLCSVGIA VSLALQPALA DDLFGNHPLT PEARDAFVTE LLKKMTVDEK
60 70 80 90 100
IGQLRLISVG PDNPKEAIRE MIKDGQVGAI FNTVTRQDIR AMQDQVMELS
110 120 130 140 150
RLKIPLFFAY DVLHGQRTVF PISLGLASSF NLDAVKTVGR VSAYEAADDG
160 170 180 190 200
LNMTWAPMVD VSRDPRWGRA SEGFGEDTYL TSTMGKTMVE AMQGKSPADR
210 220 230 240 250
YSVMTSVKHF AAYGAVEGGK EYNTVDMSPQ RLFNDYMPPY KAGLDAGSGA
260 270 280 290 300
VMVALNSLNG TPATSDSWLL KDVLRDQWGF KGITVSDHGA IKELIKHGTA
310 320 330 340 350
ADPEDAVRVA LKSGINMSMS DEYYSKYLPG LIKSGKVTMA ELDDAARHVL
360 370 380 390 400
NVKYDMGLFN DPYSHLGPKE SDPVDTNAES RLHRKEAREV ARESLVLLKN
410 420 430 440 450
RLETLPLKKS ATIAVVGPLA DSKRDVMGSW SAAGVADQSV TVLTGIKNAV
460 470 480 490 500
GENGKVLYAK GANVTSDKGI IDFLNQYEEA VKVDPRSPQE MIDEAVQTAK
510 520 530 540 550
QSDVVVAVVG EAQGMAHEAS SRTDITIPQS QRDLIAALKA TGKPLVLVLM
560 570 580 590 600
NGRPLALVKE DQQADAILET WFAGTEGGNA IADVLFGDYN PSGKLPMSFP
610 620 630 640 650
RSVGQIPVYY SHLNTGRPYN ADKPNKYTSR YFDEANGALY PFGYGLSYTT
660 670 680 690 700
FTVSDVKLSA PTMKRDGKVT ASVQVTNTGK REGATVVQMY LQDVTASMSR
710 720 730 740 750
PVKQLKGFEK ITLKPGETQT VSFPIDIEAL KFWNQQMKYD AEPGKFNVFI
760
GTDSARVKKG EFELL
Length:765
Mass (Da):83,460
Last modified:February 1, 1995 - v2
Checksum:i0E89B0AB42B8F8F3
GO

Sequence cautioni

The sequence AAA60495.1 differs from that shown. Reason: Erroneous initiation. Curated

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
U15049 Genomic DNA. Translation: AAB38487.1.
U00007 Genomic DNA. Translation: AAA60495.1. Different initiation.
U00096 Genomic DNA. Translation: AAC75193.1.
AP009048 Genomic DNA. Translation: BAE76609.1.
PIRiC64981.
RefSeqiNP_416636.1. NC_000913.3.
YP_490371.1. NC_007779.1.

Genome annotation databases

EnsemblBacteriaiAAC75193; AAC75193; b2132.
BAE76609; BAE76609; BAE76609.
GeneIDi12931457.
946682.
KEGGiecj:Y75_p2094.
eco:b2132.
PATRICi32119603. VBIEscCol129921_2212.

Cross-referencesi

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
U15049 Genomic DNA. Translation: AAB38487.1.
U00007 Genomic DNA. Translation: AAA60495.1. Different initiation.
U00096 Genomic DNA. Translation: AAC75193.1.
AP009048 Genomic DNA. Translation: BAE76609.1.
PIRiC64981.
RefSeqiNP_416636.1. NC_000913.3.
YP_490371.1. NC_007779.1.

3D structure databases

ProteinModelPortaliP33363.
SMRiP33363. Positions 35-765.
ModBaseiSearch...
MobiDBiSearch...

Protein-protein interaction databases

DIPiDIP-9218N.
IntActiP33363. 16 interactions.
STRINGi511145.b2132.

Protein family/group databases

CAZyiGH3. Glycoside Hydrolase Family 3.

Proteomic databases

PaxDbiP33363.
PRIDEiP33363.

Protocols and materials databases

Structural Biology KnowledgebaseSearch...

Genome annotation databases

EnsemblBacteriaiAAC75193; AAC75193; b2132.
BAE76609; BAE76609; BAE76609.
GeneIDi12931457.
946682.
KEGGiecj:Y75_p2094.
eco:b2132.
PATRICi32119603. VBIEscCol129921_2212.

Organism-specific databases

EchoBASEiEB1951.
EcoGeneiEG12013. bglX.

Phylogenomic databases

eggNOGiCOG1472.
HOGENOMiHOG000031217.
InParanoidiP33363.
KOiK05349.
OMAiLPPYKAS.
OrthoDBiEOG6DNT6C.
PhylomeDBiP33363.

Enzyme and pathway databases

BioCyciEcoCyc:EG12013-MONOMER.
ECOL316407:JW2120-MONOMER.
MetaCyc:EG12013-MONOMER.

Miscellaneous databases

PROiP33363.

Gene expression databases

GenevestigatoriP33363.

Family and domain databases

Gene3Di3.20.20.300. 1 hit.
3.40.50.1700. 1 hit.
InterProiIPR026891. Fn3-like.
IPR026892. Glyco_hydro_3.
IPR019800. Glyco_hydro_3_AS.
IPR002772. Glyco_hydro_3_C.
IPR001764. Glyco_hydro_3_N.
IPR017853. Glycoside_hydrolase_SF.
[Graphical view]
PANTHERiPTHR30620. PTHR30620. 1 hit.
PfamiPF14310. Fn3-like. 1 hit.
PF00933. Glyco_hydro_3. 1 hit.
PF01915. Glyco_hydro_3_C. 1 hit.
[Graphical view]
PRINTSiPR00133. GLHYDRLASE3.
SUPFAMiSSF51445. SSF51445. 1 hit.
SSF52279. SSF52279. 2 hits.
PROSITEiPS00775. GLYCOSYL_HYDROL_F3. 1 hit.
[Graphical view]
ProtoNetiSearch...

Publicationsi

« Hide 'large scale' publications
  1. Yang M., Luoh S., Goddard A., Reilly D., Henzel W., Bass S.
    Submitted (AUG-1994) to the EMBL/GenBank/DDBJ databases
    Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA].
    Strain: K12 / W3110 / ATCC 27325 / DSM 5911.
  2. "Automated multiplex sequencing of the E.coli genome."
    Richterich P., Lakey N., Gryan G., Jaehn L., Mintz L., Robison K., Church G.M.
    Submitted (SEP-1993) to the EMBL/GenBank/DDBJ databases
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
    Strain: K12 / BHB2600.
  3. Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
    Strain: K12 / MG1655 / ATCC 47076.
  4. "Highly accurate genome sequences of Escherichia coli K-12 strains MG1655 and W3110."
    Hayashi K., Morooka N., Yamamoto Y., Fujita K., Isono K., Choi S., Ohtsubo E., Baba T., Wanner B.L., Mori H., Horiuchi T.
    Mol. Syst. Biol. 2:E1-E5(2005) [PubMed] [Europe PMC] [Abstract]
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
    Strain: K12 / W3110 / ATCC 27325 / DSM 5911.

Entry informationi

Entry nameiBGLX_ECOLI
AccessioniPrimary (citable) accession number: P33363
Secondary accession number(s): Q2MAU7
Entry historyi
Integrated into UniProtKB/Swiss-Prot: February 1, 1994
Last sequence update: February 1, 1995
Last modified: January 7, 2015
This is version 112 of the entry and version 2 of the sequence. [Complete history]
Entry statusiReviewed (UniProtKB/Swiss-Prot)
Annotation programProkaryotic Protein Annotation Program

Miscellaneousi

Keywords - Technical termi

Complete proteome, Reference proteome

Documents

  1. Escherichia coli
    Escherichia coli (strain K12): entries and cross-references to EcoGene
  2. Glycosyl hydrolases
    Classification of glycosyl hydrolase families and list of entries
  3. SIMILARITY comments
    Index of protein domains and families

External Data

Dasty 3

Similar proteinsi

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into Uniref entry.
90%UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a seed sequence).
50%UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.