Protein

Beta-galactosidase

Gene

lacZ

Organism
Escherichia coli (strain SMS-3-5 / SECEC)
Status
Reviewed. Annotation score: 3 out of 5. Protein inferred from homology.

Function

Catalytic activity

Hydrolysis of terminal non-reducing beta-D-galactose residues in beta-D-galactosides. (UniRule annotation)

Cofactor

Protein has several cofactor binding sites:
  • Mg(2+) (UniRule annotation). Note: Binds 2 magnesium ions per monomer. (UniRule annotation)
  • Na(+) (UniRule annotation). Note: Binds 1 sodium ion per monomer. (UniRule annotation)

Sites

Feature key    Position(s)  Length  Description                  Evidence
Binding site   103 – 103    1       Substrate                    UniRule annotation
Metal binding  202 – 202    1       Sodium                       UniRule annotation
Binding site   202 – 202    1       Substrate                    UniRule annotation
Site           358 – 358    1       Transition state stabilizer  UniRule annotation
Site           392 – 392    1       Transition state stabilizer  UniRule annotation
Metal binding  417 – 417    1       Magnesium 1                  UniRule annotation
Metal binding  419 – 419    1       Magnesium 1                  UniRule annotation
Active site    462 – 462    1       Proton donor                 UniRule annotation
Metal binding  462 – 462    1       Magnesium 1                  UniRule annotation
Binding site   462 – 462    1       Substrate                    UniRule annotation
Active site    538 – 538    1       Nucleophile                  UniRule annotation
Metal binding  598 – 598    1       Magnesium 2                  UniRule annotation
Metal binding  602 – 602    1       Sodium; via carbonyl oxygen  UniRule annotation
Metal binding  605 – 605    1       Sodium                       UniRule annotation
Binding site   605 – 605    1       Substrate                    UniRule annotation
Binding site   1000 – 1000  1       Substrate                    UniRule annotation
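
Several residues in the Sites table carry more than one annotation (for example, position 462 is listed as an active site, a metal-binding site and a substrate-binding site). The following is a minimal sketch of how these rows could be held in a small data structure and grouped by position. The names `SiteFeature`, `features_by_position` and `multi_role_positions` are hypothetical and chosen only for illustration; they are not part of any UniProt tool.

```python
from collections import defaultdict
from typing import NamedTuple


class SiteFeature(NamedTuple):
    """One row of the Sites table (all rows here span a single residue)."""
    key: str
    position: int
    description: str


# Rows transcribed from the Sites table of this entry.
SITES = [
    SiteFeature("Binding site", 103, "Substrate"),
    SiteFeature("Metal binding", 202, "Sodium"),
    SiteFeature("Binding site", 202, "Substrate"),
    SiteFeature("Site", 358, "Transition state stabilizer"),
    SiteFeature("Site", 392, "Transition state stabilizer"),
    SiteFeature("Metal binding", 417, "Magnesium 1"),
    SiteFeature("Metal binding", 419, "Magnesium 1"),
    SiteFeature("Active site", 462, "Proton donor"),
    SiteFeature("Metal binding", 462, "Magnesium 1"),
    SiteFeature("Binding site", 462, "Substrate"),
    SiteFeature("Active site", 538, "Nucleophile"),
    SiteFeature("Metal binding", 598, "Magnesium 2"),
    SiteFeature("Metal binding", 602, "Sodium; via carbonyl oxygen"),
    SiteFeature("Metal binding", 605, "Sodium"),
    SiteFeature("Binding site", 605, "Substrate"),
    SiteFeature("Binding site", 1000, "Substrate"),
]


def features_by_position(sites):
    """Group site features by residue position."""
    grouped = defaultdict(list)
    for site in sites:
        grouped[site.position].append(site)
    return dict(grouped)


def multi_role_positions(sites):
    """Return the sorted positions that carry more than one annotation."""
    grouped = features_by_position(sites)
    return sorted(pos for pos, feats in grouped.items() if len(feats) > 1)


if __name__ == "__main__":
    for pos in multi_role_positions(SITES):
        roles = ", ".join(f"{f.key} ({f.description})"
                          for f in features_by_position(SITES)[pos])
        print(pos, roles)
```

Grouping by position rather than by feature key makes it easy to see which residues play several roles at once, which is how the table itself is ordered.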

GO - Molecular function

  1. beta-galactosidase activity Source: UniProtKB-EC
  2. carbohydrate binding Source: InterPro
  3. magnesium ion binding Source: UniProtKB-HAMAP

GO - Biological process

  1. carbohydrate metabolic process Source: InterPro

Keywords - Molecular function

Glycosidase, Hydrolase

Keywords - Ligand

Magnesium, Metal-binding, Sodium

Enzyme and pathway databases

BioCyc: ECOL439855:GHHB-373-MONOMER.

Protein family/group databases

CAZy: GH2. Glycoside Hydrolase Family 2.

Names & Taxonomy

Protein names
Recommended name: Beta-galactosidase (EC 3.2.1.23) (UniRule annotation)
Short name: Beta-gal (UniRule annotation)
Alternative name(s): Lactase (UniRule annotation)
Gene names
Name: lacZ (UniRule annotation)
Ordered Locus Names: EcSMS35_0375
Organism: Escherichia coli (strain SMS-3-5 / SECEC)
Taxonomic identifier: 439855 [NCBI]
Taxonomic lineage: Bacteria > Proteobacteria > Gammaproteobacteria > Enterobacteriales > Enterobacteriaceae > Escherichia
Proteomes: UP000007011. Component: Chromosome

Subcellular location

GO - Cellular component

  1. beta-galactosidase complex Source: InterPro

PTM / Processing

Molecule processing

Feature key  Position(s)  Length  Description         Feature identifier
Chain        1 – 1024     1024    Beta-galactosidase  PRO_0000366990

Interaction

Subunit structure

Homotetramer. (UniRule annotation)

Protein-protein interaction databases

STRING: 439855.EcSMS35_0375.

Structure

3D structure databases

ProteinModelPortal: B1LIM9.
SMR: B1LIM9. Positions 14-1024.

Family & Domains

Region

Feature key  Position(s)  Length  Description        Evidence
Region       538 – 541    4       Substrate binding  UniRule annotation

Sequence similarities

Belongs to the glycosyl hydrolase 2 family. (UniRule annotation)

Phylogenomic databases

eggNOG: COG3250.
HOGENOM: HOG000252443.
KO: K01190.
OMA: TNRHEHH.
OrthoDB: EOG6XWV0T.

Family and domain databases

Gene3D: 2.60.120.260. 1 hit.
    2.60.40.320. 2 hits.
    2.70.98.10. 1 hit.
    3.20.20.80. 1 hit.
HAMAP: MF_01687. Beta_gal.
InterPro: IPR004199. B-gal_small/dom_5.
    IPR011013. Gal_mutarotase_SF_dom.
    IPR008979. Galactose-bd-like.
    IPR014718. Glyco_hydro-type_carb-bd_sub.
    IPR006101. Glyco_hydro_2.
    IPR013812. Glyco_hydro_2/20_Ig-like.
    IPR023232. Glyco_hydro_2_AS.
    IPR023933. Glyco_hydro_2_beta_Galsidase.
    IPR023230. Glyco_hydro_2_CS.
    IPR006102. Glyco_hydro_2_Ig-like.
    IPR006104. Glyco_hydro_2_N.
    IPR006103. Glyco_hydro_2_TIM.
    IPR013781. Glyco_hydro_catalytic_dom.
    IPR017853. Glycoside_hydrolase_SF.
Pfam: PF02929. Bgal_small_N. 1 hit.
    PF00703. Glyco_hydro_2. 1 hit.
    PF02836. Glyco_hydro_2_C. 1 hit.
    PF02837. Glyco_hydro_2_N. 1 hit.
PRINTS: PR00132. GLHYDRLASE2.
SMART: SM01038. Bgal_small_N. 1 hit.
SUPFAM: SSF49303. SSF49303. 2 hits.
    SSF49785. SSF49785. 1 hit.
    SSF51445. SSF51445. 1 hit.
    SSF74650. SSF74650. 1 hit.
PROSITE: PS00719. GLYCOSYL_HYDROL_F2_1. 1 hit.
    PS00608. GLYCOSYL_HYDROL_F2_2. 1 hit.

Sequence

Sequence status: Complete.

B1LIM9-1

        10         20         30         40         50
MTMITDSLAV VLQRRDWENP GVTQLNRLAA HPHFASWRNS EEARTDRPSQ
        60         70         80         90        100
QLRSLNGEWR FAWFPAPEAV PESWLDCDLP DADTVVVPSN WQMHGYDAPI
       110        120        130        140        150
YTNVTYPITV NPPFVPAENP TGCYSLTFNI DECWLQKGQT RIIFDGVNSA
       160        170        180        190        200
FHLWCNGRWV GYGQDSRLPS EFDLSAFLRA GKNRLAVMVL RWSDGSYLED
       210        220        230        240        250
QDMWRMSGIF RDVSLLHKPT TQISDFHVAT RFNDDFSRAV LEAEVQMCGE
       260        270        280        290        300
LRDELRVTVS LWQGETQVAS GTTPFGGEII DERGGYADRV TLRLNVENPA
       310        320        330        340        350
LWSAEIPNLY RAVVELHTAD GTLIEAEACD VGFREVRIEN GLLLLNGKPV
       360        370        380        390        400
LIRGVNRHEH HPLHGQVMDE QTMVQDILLM KQNNFNAVRC SHYPNHPLWY
       410        420        430        440        450
TLCDRYGLYV VDEANIETHG MVPMNRLTDD PRWLPAMSER VTRMVQRDRN
       460        470        480        490        500
HPSVIIWSLG NESGHGANHD ALYRWIKSVD PSRPVQYEGG GADTTATDII
       510        520        530        540        550
CPMYARVDED QPFPAVPKWS IKKWLSLPGE LRPLILCEYA HAMGNSLGGF
       560        570        580        590        600
AKYWQAFRQY PRLQGGFVWD WVDQSLIKYD ENGNPWSAYG GDFGDTPNDR
       610        620        630        640        650
QFCMNGLVFA DRTPHPALTE AKHQQQFFQF RLSGRTIEVT SEYLFRHSDN
       660        670        680        690        700
ELLHWSVALD GKPLASGEMP LDVAPQDKQL IELPELPQPE STGQLWLTVH
       710        720        730        740        750
VVQPNATAWS EAGHISAWQQ WRLAENLSVA LPSAPHAIPQ LTTSEMDFCI
       760        770        780        790        800
ELGNKRWQFN RQSGFLSQMW IGDEKQLLTP LRDQFIRAPL DNDIGVSEAT
       810        820        830        840        850
RIDPNAWVER WKAAGHYQAE VALLQCTADI LADAVLITTA HAWQHQGKTL
       860        870        880        890        900
FISRKTYRID GSGQMAITVD VEVASDTPHP ARIGLTCQLA QVAERVNWLG
       910        920        930        940        950
LGPQENYPDR LTAACFDRWD LPLSDMYTPY VFPSENGLRC GTRELNYGPH
       960        970        980        990       1000
QWRGDFQFNI SRYSQQQLME TSHRHLLHAE EGTWLNIDGF HMGIGGDDSW
      1010       1020
SPSVSAEFQL SAGRYHYQLV WCQK
Length: 1,024
Mass (Da): 116,542
Last modified: April 28, 2008 - v1
Checksum: A600DCF909D6DBFF
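
The sequence above is displayed in blocks of ten residues with position rulers between the rows, so it has to be flattened before residues can be counted or looked up by the 1-based positions used in the feature tables. The following is a minimal sketch of that step. The helper names `flatten_sequence` and `residue_at` are hypothetical, and the example uses only the first two rows of the display rather than the full 1,024-residue chain.

```python
def flatten_sequence(display_lines):
    """Join a grouped sequence display into one contiguous string.

    Ruler lines (tokens made only of digits) and blank lines are skipped;
    whitespace between ten-residue blocks is removed.
    """
    rows = []
    for line in display_lines:
        tokens = line.split()
        if not tokens or all(tok.isdigit() for tok in tokens):
            continue  # blank line or position ruler
        rows.append("".join(tokens))
    return "".join(rows)


def residue_at(sequence, position):
    """Return the residue at a 1-based position, as used in feature tables."""
    if not 1 <= position <= len(sequence):
        raise IndexError(f"position {position} outside 1..{len(sequence)}")
    return sequence[position - 1]


# First two rows of the display, copied from this entry.
fragment = flatten_sequence([
    "        10         20         30         40         50",
    "MTMITDSLAV VLQRRDWENP GVTQLNRLAA HPHFASWRNS EEARTDRPSQ",
    "60 70 80 90 100",
    "QLRSLNGEWR FAWFPAPEAV PESWLDCDLP DADTVVVPSN WQMHGYDAPI",
])

if __name__ == "__main__":
    print(len(fragment))             # number of residues in the fragment
    print(residue_at(fragment, 1))   # first residue of the chain
```

Applied to every row of the display, the same function should yield a string whose length can be compared against the stated Length of 1,024.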

Sequence databases

EMBL / GenBank / DDBJ: CP000970 Genomic DNA. Translation: ACB17296.1.
RefSeq: YP_001742478.1. NC_010498.1.

Genome annotation databases

EnsemblBacteria: ACB17296; ACB17296; EcSMS35_0375.
KEGG: ecm:EcSMS35_0375.
PATRIC: 18429593. VBIEscCol6161_0548.

Publications

  1. "Insights into the environmental resistance gene pool from the genome sequence of the multidrug-resistant environmental isolate Escherichia coli SMS-3-5."
    Fricke W.F., Wright M.S., Lindell A.H., Harkins D.M., Baker-Austin C., Ravel J., Stepanauskas R.
    J. Bacteriol. 190:6779-6794(2008)
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
    Strain: SMS-3-5 / SECEC.

Entry information

Entry name: BGAL_ECOSM
Accession: Primary (citable) accession number: B1LIM9
Entry history
Integrated into UniProtKB/Swiss-Prot: March 23, 2009
Last sequence update: April 28, 2008
Last modified: March 31, 2015
This is version 44 of the entry and version 1 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Prokaryotic Protein Annotation Program

Miscellaneous

Keywords - Technical term

Complete proteome

Documents

  1. Glycosyl hydrolases
    Classification of glycosyl hydrolase families and list of entries
  2. SIMILARITY comments
    Index of protein domains and families

Similar proteins

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%: UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into a single UniRef entry.
90%: UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a. the seed sequence).
50%: UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.
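
The UniRef90 and UniRef50 membership criteria described above combine an identity threshold with an overlap threshold against the seed sequence. The following is a minimal sketch of that two-part test, not the actual UniRef clustering procedure. It assumes that identity is measured as identical positions divided by aligned positions and that overlap is measured as aligned positions divided by the seed length; both definitions, and the names `ClusterRule` and `joins_cluster`, are assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ClusterRule:
    """Thresholds for joining a cluster, expressed as fractions."""
    name: str
    min_identity: float
    min_overlap: float


# Thresholds taken from the descriptions in this entry.
UNIREF90 = ClusterRule("UniRef90", min_identity=0.90, min_overlap=0.80)
UNIREF50 = ClusterRule("UniRef50", min_identity=0.50, min_overlap=0.80)


def joins_cluster(identical, aligned, seed_length, rule):
    """Check a pairwise alignment against a cluster rule.

    Assumed definitions (not confirmed by the source):
      identity = identical positions / aligned positions
      overlap  = aligned positions / length of the seed sequence
    """
    if aligned <= 0 or seed_length <= 0:
        return False
    identity = identical / aligned
    overlap = aligned / seed_length
    return identity >= rule.min_identity and overlap >= rule.min_overlap


if __name__ == "__main__":
    # Hypothetical alignment against a 1,024-residue seed.
    print(joins_cluster(950, 1000, 1024, UNIREF90))
    print(joins_cluster(600, 1000, 1024, UNIREF50))
```

Keeping the two thresholds in a single rule object makes it explicit that a short but highly identical fragment does not qualify on identity alone.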