Skip Header

 
Contribute Send feedback
Read comments (0) or add your own

Reviewed, UniProtKB/Swiss-Prot B5Z2P7 (BGAL_ECO5E)

Last modified November 3, 2009. Version 8. Feed History...

Clusters with 100%, 90%, 50% identity | Documents (2) | Third-party data | Customize display text xml rdf/xml gff fasta
Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Sequence annotation (Features) · Sequences · References · Cross-references · Entry information · Relevant documents

Names and origin

Protein namesRecommended name:
    Beta-galactosidase
      Short name=Beta-gal
    EC=3.2.1.23
Alternative name(s):
    Lactase
Gene names
Name: lacZ
Ordered Locus Names: ECH74115_0417
OrganismEscherichia coli O157:H7 (strain EC4115 / EHEC) [Complete proteome] [HAMAP]
Taxonomic identifier444450 [NCBI]
Taxonomic lineageBacteriaProteobacteriaGammaproteobacteriaEnterobacterialesEnterobacteriaceaeEscherichia

Protein attributes

Sequence length1024 AA.
Sequence statusComplete.
Sequence processingThe displayed sequence is not processed.
Protein existenceInferred from homology.

General annotation (Comments)

Catalytic activity

Hydrolysis of terminal non-reducing beta-D-galactose residues in beta-D-galactosides. HAMAP MF_01687

Cofactor

Binds 2 magnesium ions per monomer By similarity.

Binds 1 sodium ion per monomer By similarity.

Subunit structure

Homotetramer By similarity.

Sequence similarities

Belongs to the glycosyl hydrolase 2 family.

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Chain1 – 10241024Beta-galactosidase HAMAP MF_01687
PRO_0000366994

Regions

Region538 – 5414Substrate binding By similarity

Sites

Active site4621Proton donor By similarity
Active site5381Nucleophile By similarity
Metal binding2021Sodium By similarity
Metal binding4171Magnesium 1 By similarity
Metal binding4191Magnesium 1 By similarity
Metal binding4621Magnesium 1 By similarity
Metal binding5981Magnesium 2 By similarity
Metal binding6021Sodium; via carbonyl oxygen By similarity
Metal binding6051Sodium By similarity
Binding site1031Substrate By similarity
Binding site2021Substrate By similarity
Binding site4621Substrate By similarity
Binding site6051Substrate By similarity
Binding site10001Substrate By similarity
Site3581Transition state stabilizer By similarity
Site3921Transition state stabilizer By similarity

Sequences

Sequence LengthMass (Da)Tools
B5Z2P7-1 [UniParc].

Last modified November 25, 2008. Version 1.
Checksum: BCA3922388694911

FASTA1,024116,391
        10         20         30         40         50         60 
MTMITDSLAV VLQRRDWENP GVTQLNRLAA HPPFASWRNS EEARTNRPSQ QLRSLNGEWQ 

        70         80         90        100        110        120 
FVWFPAPEAV PESWLECDLP DADTVVVPSN WQMHGYDAPI YTNVTYPITV NPPFVPTENP 

       130        140        150        160        170        180 
TGCYSLTFNV DESWLQEGQT RIIFDGVNSA FHLWCNGRWV GYGQDSRLLS EFDLSAFLRA 

       190        200        210        220        230        240 
GENRLAVMVL RWSDGSYLED QDMWRMSGIF RDVSLLHKPT TQISDFHVAT LFNDDFSRAV 

       250        260        270        280        290        300 
LEAEVQMYGE LRDELRVTVS LWQGETQVAS GTAPFGGEII DERGGYADRV TLGLNVENPK 

       310        320        330        340        350        360 
LWSAEIPNIY RAVVELHTAD GTLIEAEACD VGFREVRIEN GLLLLNGKPL LIRGVNRHEH 

       370        380        390        400        410        420 
HPLHGQVMDE QTMVQDILLM KQNNFNAVRC SHYPNHPLWY TLCDRYGLYV VDEANIETHG 

       430        440        450        460        470        480 
MVPMNRLTDD PRWLPAMSER VTRMVQRDRN HPSVIIWSLG NESGHGANHD ALYRWIKSVD 

       490        500        510        520        530        540 
PSRPVQYEGG GADTSATDII CPMYARVDED QPFPAVPKWS IKKWLSLPGE MRPLILCEYA 

       550        560        570        580        590        600 
HAMGNSLGGF AKYWQAFRQY PRLQGGFVWD LVDQSLIKYD ENGNPWSAYG GDFGDTPNDR 

       610        620        630        640        650        660 
QFCMNGLVFA DRTPHPALTE AKHQQQFFQF RLSGRTIEVT SEYLFHHSDN ELLHWTVALD 

       670        680        690        700        710        720 
GKPLASGEVP LDVAPQGKQV IELPELPRLE STGQLWLTVH VVQPNATAWS EAGHISAWQQ 

       730        740        750        760        770        780 
WRLAENLSVT LPSAPHAIPQ LTTSETDFCI ELDNKRWQFN RQSGFLSQMW IGDKKQLLTP 

       790        800        810        820        830        840 
LRDQFTRAPL DNDIGVSEAT RIDPNAWVER WKAAGHYQAE AALLQCTADT LADAVLITTV 

       850        860        870        880        890        900 
HAWQHQGKTL FISRKTYRID GSGQMAITVD VEVASDTPHP ARIGLTCQLA QVAERVNWLG 

       910        920        930        940        950        960 
LGPQENYPDR LTAACFDRWD LPLSDMYTPY VFPSENGLRC GTRELNYGPH QWRGDFQFNI 

       970        980        990       1000       1010       1020 
SRYSQQQLME TSHRHLLHAE EGTWLNIDGF HMGIGGDDSW SPSVSAEFQL SAGRYHYQLV 


WCQK 

« Hide

References

[1]"Complete genome sequence of Escherichia coli O157:H7 str. EC4115."
Eppinger M., Sebastian Y., Ravel J.
Submitted (SEP-2008) to the EMBL/GenBank/DDBJ databases
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].

Cross-references

Sequence databases

CP001164 Genomic DNA. Translation: ACI39312.1.
RefSeqYP_002268983.1.

3D structure databases

ModBaseSearch...

Genome annotation databases

GeneID6971849.
GenomeReviewsGene locus ECH74115_0417 in contig CP001164_GR.
KEGGecf:ECH74115_0417.

Organism-specific databases

CMRSearch...

Phylogenomic databases

OMALDACYRD.

Family and domain databases

HAMAPMF_01687.
[Tree]
InterProIPR014718. Glyco_hydro-type_carb-bd_sub.
IPR006101. Glyco_hydro_2.
IPR013812. Glyco_hydro_2/20_Ig-like.
IPR006104. Glyco_hydro_2_carb-bd.
IPR006102. Glyco_hydro_2_Ig-like.
IPR006103. Glyco_hydro_2_TIM.
IPR004199. Glyco_hydro_42_D5.
IPR013781. Glyco_hydro_sg_catalytic.
[Graphical view]
Gene3DG3DSA:2.60.40.320. Glyco_hydro_2/20_Ig-like. 2 hits.
G3DSA:2.70.98.10. Glyco_hydro_42_D5. 1 hit.
G3DSA:3.20.20.80. Glyco_hydro_cat. 1 hit.
PfamPF02929. Bgal_small_N. 1 hit.
PF00703. Glyco_hydro_2. 1 hit.
PF02836. Glyco_hydro_2_C. 1 hit.
PF02837. Glyco_hydro_2_N. 1 hit.
[Graphical view]
PRINTSPR00132. GLHYDRLASE2.
PROSITEPS00719. GLYCOSYL_HYDROL_F2_1. 1 hit.
PS00608. GLYCOSYL_HYDROL_F2_2. 1 hit.
[Graphical view]
ProtoNetSearch...

Entry information

Entry nameBGAL_ECO5E
AccessionPrimary (citable) accession number: B5Z2P7
Entry history
Integrated into UniProtKB/Swiss-Prot: March 24, 2009
Last sequence update: November 25, 2008
Last modified: November 3, 2009
This is version 8 of the entry and version 1 of the sequence. [Complete history]
Entry statusReviewed (UniProtKB/Swiss-Prot)
Annotation projectHAMAP (High-quality Automated and Manual Annotation of microbial Proteomes)

Relevant documents

Glycosyl hydrolases

Classification of glycosyl hydrolase families and list of entries

SIMILARITY comments

Index of protein domains and families

Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Sequence annotation (Features) · Sequences · References · Cross-references · Entry information · Relevant documents