Skip Header

You are using a version of Internet Explorer that may not display all features of this website. Please upgrade to a modern browser.
Contribute Send feedback
Read comments (?) or add your own

B1J0T5 (BGAL_ECOLC) Reviewed, UniProtKB/Swiss-Prot

Last modified February 19, 2014. Version 41. Feed History...

Clusters with 100%, 90%, 50% identity | Documents (2) | Third-party data text xml rdf/xml gff fasta
to top of pageNames·Attributes·General annotation·Ontologies·Sequence annotation·Sequences·References·Cross-refs·Entry info·DocumentsCustomize order

Names and origin

Protein namesRecommended name:
Beta-galactosidase

Short name=Beta-gal
EC=3.2.1.23
Alternative name(s):
Lactase
Gene names
Name:lacZ
Ordered Locus Names:EcolC_3281
OrganismEscherichia coli (strain ATCC 8739 / DSM 1576 / Crooks) [Complete proteome] [HAMAP]
Taxonomic identifier481805 [NCBI]
Taxonomic lineageBacteriaProteobacteriaGammaproteobacteriaEnterobacterialesEnterobacteriaceaeEscherichia

Protein attributes

Sequence length1024 AA.
Sequence statusComplete.
Protein existenceInferred from homology

General annotation (Comments)

Catalytic activity

Hydrolysis of terminal non-reducing beta-D-galactose residues in beta-D-galactosides. HAMAP-Rule MF_01687

Cofactor

Binds 2 magnesium ions per monomer By similarity. HAMAP-Rule MF_01687

Binds 1 sodium ion per monomer By similarity. HAMAP-Rule MF_01687

Subunit structure

Homotetramer By similarity. HAMAP-Rule MF_01687

Sequence similarities

Belongs to the glycosyl hydrolase 2 family.

Ontologies

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Chain1 – 10241024Beta-galactosidase HAMAP-Rule MF_01687
PRO_0000366989

Regions

Region538 – 5414Substrate binding By similarity

Sites

Active site4621Proton donor By similarity
Active site5381Nucleophile By similarity
Metal binding2021Sodium By similarity
Metal binding4171Magnesium 1 By similarity
Metal binding4191Magnesium 1 By similarity
Metal binding4621Magnesium 1 By similarity
Metal binding5981Magnesium 2 By similarity
Metal binding6021Sodium; via carbonyl oxygen By similarity
Metal binding6051Sodium By similarity
Binding site1031Substrate By similarity
Binding site2021Substrate By similarity
Binding site4621Substrate By similarity
Binding site6051Substrate By similarity
Binding site10001Substrate By similarity
Site3581Transition state stabilizer By similarity
Site3921Transition state stabilizer By similarity

Sequences

Sequence LengthMass (Da)Tools
B1J0T5 [UniParc].

Last modified April 29, 2008. Version 1.
Checksum: 52915C880FC17DC6

FASTA1,024116,512
        10         20         30         40         50         60 
MTMITDSLAV VLQRRDWENP GVTQLNRLAA HPPFASWRNS EEARTDRPSQ QLRSLNGEWR 

        70         80         90        100        110        120 
FAWFPAPEAV PESWLECDLP EADTVVVPSN WQMHGYDAPI YTNVTYPITV NPPFVPTENP 

       130        140        150        160        170        180 
TSCYSLTFNV DESWLQEGQT RIIFDGVNSA FHLWCNGRWV GYGQDSRLPS EFDLSAFLRA 

       190        200        210        220        230        240 
GKNRLAVMVL RWSDGSYLED QDMWRMSGIF RDVSLLHKPT TQISDFHVAT RFNDDFSRAV 

       250        260        270        280        290        300 
LEAEVQMCGE LRDYLRVTVS LWQGETQVAS GTAPFGGEII DERGGYADRV TLRLNVENPK 

       310        320        330        340        350        360 
LWSAEIPNLY RAVVELHTAD GTLIEAEACD VGFREVRIEN GLLLLNGKPL LIRGVNRHEH 

       370        380        390        400        410        420 
HPLHGQVMDE QTMVQDILLM KQNNFNAVRC SHYPNHPLWY TLCDRYGLYV VDEANIETHG 

       430        440        450        460        470        480 
MVPMNRLTDD PRWLPAMSER VTRMVQRDRN HPSVIIWSLG NESGHGANHD ALYRWIKSVD 

       490        500        510        520        530        540 
PSRPVQYEGG GADTTATDII CPMYARVDED QPFPAVPKWS IKKWLSLPGE TRPLILCEYA 

       550        560        570        580        590        600 
HAMGNSLGGF AKYWQAFRQY PRLQGGFVWD WVDQSLIKYD ENGNPWSAYG GDFGDTPNDR 

       610        620        630        640        650        660 
QFCMNGLVFA DRTPHPALTE AKHQQQFFQF RLSGQTIEVT SEYLFRHSDN ELLHWMVALD 

       670        680        690        700        710        720 
GKPLASGEVP LDVAPQGKQL IELPELPQPE SAGQLWLTVR VVQPNATAWS EAGHISAWQQ 

       730        740        750        760        770        780 
WRLAENLSVT LPAASHAIPH LTTSEMDFCI ELGNKRWQFN RQSGFLSQMW IGDKKQLLTP 

       790        800        810        820        830        840 
LRDQFTRAPL DNDIGVSEAT RIDPNAWVER WKAAGHYQAE AALLQCTADT LADAVLITTA 

       850        860        870        880        890        900 
HAWQHQGKTL FISRKTYRID GSGQMAITVD VEVASDTPHP ARIGLNCQLA QVAERVNWLG 

       910        920        930        940        950        960 
LGPQENYPDR LTAACFDRWD LPLSDMYTPY VFPSENGLRC GTRELNYGPH QWRGDFQFNI 

       970        980        990       1000       1010       1020 
SRYSQQQLME TSHRHLLHAE EGTWLNIDGF HMGIGGDDSW SPSVSAEFQL SAGRYHYQLV 


WCQK 

« Hide

References

[1]"Complete sequence of Escherichia coli C str. ATCC 8739."
Copeland A., Lucas S., Lapidus A., Glavina del Rio T., Dalin E., Tice H., Bruce D., Goodwin L., Pitluck S., Kiss H., Brettin T., Detter J.C., Han C., Kuske C.R., Schmutz J., Larimer F., Land M., Hauser L. expand/collapse author list , Kyrpides N., Mikhailova N., Ingram L., Richardson P.
Submitted (FEB-2008) to the EMBL/GenBank/DDBJ databases
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: ATCC 8739 / DSM 1576 / Crooks.

Cross-references

Sequence databases

EMBL
GenBank
DDBJ
CP000946 Genomic DNA. Translation: ACA78903.1.
RefSeqYP_001726230.1. NC_010468.1.

3D structure databases

ProteinModelPortalB1J0T5.
SMRB1J0T5. Positions 14-1024.
ModBaseSearch...
MobiDBSearch...

Protein-protein interaction databases

STRING481805.EcolC_3281.

Protein family/group databases

CAZyGH2. Glycoside Hydrolase Family 2.

Protocols and materials databases

StructuralBiologyKnowledgebaseSearch...

Genome annotation databases

EnsemblBacteriaACA78903; ACA78903; EcolC_3281.
GeneID6066946.
KEGGecl:EcolC_3281.
PATRIC18229602. VBIEscCol82905_3504.

Organism-specific databases

CMRSearch...

Phylogenomic databases

eggNOGCOG3250.
HOGENOMHOG000252443.
KOK01190.
OMAHRCAWDQ.
OrthoDBEOG6XWV0T.
ProtClustDBPRK09525.

Enzyme and pathway databases

BioCycECOL481805:GI3G-3353-MONOMER.

Family and domain databases

Gene3D2.60.120.260. 1 hit.
2.60.40.320. 2 hits.
2.70.98.10. 1 hit.
3.20.20.80. 1 hit.
HAMAPMF_01687. Beta_gal.
InterProIPR004199. B-gal_small/dom_5.
IPR011013. Gal_mutarotase_SF_dom.
IPR008979. Galactose-bd-like.
IPR014718. Glyco_hydro-type_carb-bd_sub.
IPR006101. Glyco_hydro_2.
IPR013812. Glyco_hydro_2/20_Ig-like.
IPR023232. Glyco_hydro_2_AS.
IPR023933. Glyco_hydro_2_beta_Galsidase.
IPR023230. Glyco_hydro_2_CS.
IPR006102. Glyco_hydro_2_Ig-like.
IPR006104. Glyco_hydro_2_N.
IPR006103. Glyco_hydro_2_TIM.
IPR013781. Glyco_hydro_catalytic_dom.
IPR017853. Glycoside_hydrolase_SF.
[Graphical view]
PfamPF02929. Bgal_small_N. 1 hit.
PF00703. Glyco_hydro_2. 1 hit.
PF02836. Glyco_hydro_2_C. 1 hit.
PF02837. Glyco_hydro_2_N. 1 hit.
[Graphical view]
PRINTSPR00132. GLHYDRLASE2.
SMARTSM01038. Bgal_small_N. 1 hit.
[Graphical view]
SUPFAMSSF49303. SSF49303. 2 hits.
SSF49785. SSF49785. 1 hit.
SSF51445. SSF51445. 1 hit.
SSF74650. SSF74650. 1 hit.
PROSITEPS00719. GLYCOSYL_HYDROL_F2_1. 1 hit.
PS00608. GLYCOSYL_HYDROL_F2_2. 1 hit.
[Graphical view]
ProtoNetSearch...

Entry information

Entry nameBGAL_ECOLC
AccessionPrimary (citable) accession number: B1J0T5
Entry history
Integrated into UniProtKB/Swiss-Prot: March 24, 2009
Last sequence update: April 29, 2008
Last modified: February 19, 2014
This is version 41 of the entry and version 1 of the sequence. [Complete history]
Entry statusReviewed (UniProtKB/Swiss-Prot)
Annotation programProkaryotic Protein Annotation Program

Relevant documents

SIMILARITY comments

Index of protein domains and families

Glycosyl hydrolases

Classification of glycosyl hydrolase families and list of entries