Q5WI64 (Q5WI64_BACSK) Unreviewed, UniProtKB/TrEMBL

Last modified December 14, 2011. Version 45.

Names and origin

Protein names: Recommended name: Beta-galactosidase PIRNR PIRNR001084
    Short name=Beta-gal PIRNR PIRNR001084
    EC=3.2.1.23 PIRNR PIRNR001084
Gene names: Ordered Locus Names: ABC1405
Organism: Bacillus clausii (strain KSM-K16) [Complete proteome] [HAMAP]
Taxonomic identifier: 66692 [NCBI]
Taxonomic lineage: Bacteria > Firmicutes > Bacillales > Bacillaceae > Bacillus

Protein attributes

Sequence length: 684 AA.
Sequence status: Complete.
Protein existence: Inferred from homology

General annotation (Comments)

Catalytic activity

Hydrolysis of terminal non-reducing beta-D-galactose residues in beta-D-galactosides. PIRNR PIRNR001084

Sequence similarities

Belongs to the glycosyl hydrolase 42 family. PIRNR PIRNR001084

Sequence annotation (Features)

Feature key     Position(s)   Length   Description

Sites

Active site     150           1        Proton donor By similarity PIRSR PIRSR001084-1
Active site     313           1        Nucleophile By similarity PIRSR PIRSR001084-1
Metal binding   115           1        Zinc By similarity PIRSR PIRSR001084-3
Metal binding   157           1        Zinc By similarity PIRSR PIRSR001084-3
Metal binding   159           1        Zinc By similarity PIRSR PIRSR001084-3
Metal binding   162           1        Zinc By similarity PIRSR PIRSR001084-3
Binding site    111           1        Substrate By similarity PIRSR PIRSR001084-2
Binding site    149           1        Substrate By similarity PIRSR PIRSR001084-2
Binding site    321           1        Substrate By similarity PIRSR PIRSR001084-2

Sequences

Sequence: Q5WI64 [UniParc]
Length: 684 AA
Mass: 77,711 Da
Last modified November 23, 2004. Version 1.
Checksum: CB208ECB529FC8AA

        10         20         30         40         50         60 
MINAKKPKIW YGGDYNPDQW DSSIWDEDLR MFKLAGIDVV TLNVFAWAKN QPDENTYDFG 

        70         80         90        100        110        120 
WLDTMMDKLH EDGIGVCLAT STAAHPAWMA RKYPDVLQVD FYGRKRKFGG RHNSCPNSPT 

       130        140        150        160        170        180 
YRKYAVRMAE KLAERYKDHP ALLIWHINNE YGGAGNCYCD NCETAFREWA KARYGSLDEV 

       190        200        210        220        230        240 
NRAWNTGFWG HTFHSWEDIV LPSGLSEEWT GANGRTETNF QGISLDYMRF HSDSLLECYK 

       250        260        270        280        290        300 
LEYEAVKKHT PSIPVTTNLM GAFKKLDYHK WAKHMDVVSW DNYPRFDTPH SYTGMMHDLM 

       310        320        330        340        350        360 
RGLKHGQPFM LMEQTPSQQN WQPYNSLKRP GVMRLWSYQA AARGADTILF FQLRRSIGAC 

       370        380        390        400        410        420 
EKYHGAVIEH VGHEHTRVFR ECAALGQELG QLGDTLLDAN VQAKVALLFD WENWWAVEMS 

       430        440        450        460        470        480 
SGPSIDLRYV DEVHKYYDAL YRLGISVDVV GVDADFSRYD LVIAPVMYMV KAGIADKLEK 

       490        500        510        520        530        540 
YVHGGGSLIT TFFSGIVDEN DLVKTGGYPG ELRKLLGIWA EEIDALLPSQ RNRIVLSDQA 

       550        560        570        580        590        600 
PGLKSEYECG ILCDLIHSEG AEVKAVYGND FYRGMPVLTV NKFGAGQAWY VATSPEASFL 

       610        620        630        640        650        660 
QDWLSQLCSS IGIHPLINDM PVGVETTLRS KEGQSYLFVL NHNENPVNIK LGERAGVELL 

       670        680 
SGRTLDGEQE ELAGRDVWII KQKP 
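
The feature positions listed under "Sequence annotation (Features)" are 1-based offsets into the sequence above, while the sequence itself is printed in blocks of ten residues under a position ruler. The following is a minimal sketch, not part of the entry, of how the blocked text might be flattened and the annotated residues looked up. The names parse_block_sequence, residue_at, RAW and FEATURES are illustrative assumptions, and the sequence text is copied from the listing above.

```python
# Sketch: flatten a ruler-and-block sequence listing and look up
# residues at 1-based feature positions. All names are illustrative.

RAW = """
        10         20         30         40         50         60
MINAKKPKIW YGGDYNPDQW DSSIWDEDLR MFKLAGIDVV TLNVFAWAKN QPDENTYDFG
        70         80         90        100        110        120
WLDTMMDKLH EDGIGVCLAT STAAHPAWMA RKYPDVLQVD FYGRKRKFGG RHNSCPNSPT
       130        140        150        160        170        180
YRKYAVRMAE KLAERYKDHP ALLIWHINNE YGGAGNCYCD NCETAFREWA KARYGSLDEV
       190        200        210        220        230        240
NRAWNTGFWG HTFHSWEDIV LPSGLSEEWT GANGRTETNF QGISLDYMRF HSDSLLECYK
       250        260        270        280        290        300
LEYEAVKKHT PSIPVTTNLM GAFKKLDYHK WAKHMDVVSW DNYPRFDTPH SYTGMMHDLM
       310        320        330        340        350        360
RGLKHGQPFM LMEQTPSQQN WQPYNSLKRP GVMRLWSYQA AARGADTILF FQLRRSIGAC
       370        380        390        400        410        420
EKYHGAVIEH VGHEHTRVFR ECAALGQELG QLGDTLLDAN VQAKVALLFD WENWWAVEMS
       430        440        450        460        470        480
SGPSIDLRYV DEVHKYYDAL YRLGISVDVV GVDADFSRYD LVIAPVMYMV KAGIADKLEK
       490        500        510        520        530        540
YVHGGGSLIT TFFSGIVDEN DLVKTGGYPG ELRKLLGIWA EEIDALLPSQ RNRIVLSDQA
       550        560        570        580        590        600
PGLKSEYECG ILCDLIHSEG AEVKAVYGND FYRGMPVLTV NKFGAGQAWY VATSPEASFL
       610        620        630        640        650        660
QDWLSQLCSS IGIHPLINDM PVGVETTLRS KEGQSYLFVL NHNENPVNIK LGERAGVELL
       670        680
SGRTLDGEQE ELAGRDVWII KQKP
"""

# (feature key, 1-based position, description) as listed in the entry.
FEATURES = [
    ("Active site", 150, "Proton donor"),
    ("Active site", 313, "Nucleophile"),
    ("Metal binding", 115, "Zinc"),
    ("Metal binding", 157, "Zinc"),
    ("Metal binding", 159, "Zinc"),
    ("Metal binding", 162, "Zinc"),
    ("Binding site", 111, "Substrate"),
    ("Binding site", 149, "Substrate"),
    ("Binding site", 321, "Substrate"),
]


def parse_block_sequence(text: str) -> str:
    """Keep only letters, dropping ruler digits, spaces and newlines."""
    return "".join(ch for ch in text if ch.isalpha())


def residue_at(seq: str, pos: int) -> str:
    """Return the residue at a 1-based position, as used in feature tables."""
    if not 1 <= pos <= len(seq):
        raise IndexError(f"position {pos} outside 1..{len(seq)}")
    return seq[pos - 1]


seq = parse_block_sequence(RAW)

if __name__ == "__main__":
    print(f"length: {len(seq)} AA")
    for key, pos, desc in FEATURES:
        print(f"{key:<14}{pos:>4}  {residue_at(seq, pos)}  {desc}")
```

Run against the listing above, this sketch gives a length of 684 and places glutamate (E) at both active-site positions (150 and 313) and cysteine (C) at all four zinc-binding positions (115, 157, 159, 162).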


References

[1] "The complete genome sequence of the alkaliphilic Bacillus clausii KSM-K16."
Takaki Y., Kageyama Y., Shimamura S., Suzuki H., Nishi S., Hatada Y., Kawai S., Ito S., Horikoshi K.
Submitted (OCT-2003) to the EMBL/GenBank/DDBJ databases.
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].

Cross-references

Sequence databases

EMBL/GenBank/DDBJ: AP006627 Genomic DNA. Translation: BAD63941.1.
RefSeq: YP_174902.1. NC_006582.1.

3D structure databases

ProteinModelPortal: Q5WI64.

Protein-protein interaction databases

STRING: Q5WI64.

Protein family/group databases

CAZy: GH42. Glycoside Hydrolase Family 42.

Genome annotation databases

EnsemblBacteria: EBBACT00000046486; EBBACP00000045247; EBBACG00000046477.
GeneID: 3203277.
GenomeReviews: Gene locus ABC1405 in contig AP006627_GR.
KEGG: bcl:ABC1405.
NMPDR: fig|66692.3.peg.1468.
PATRIC: 18922246. VBIBacCla58185_1491.

Phylogenomic databases

eggNOG: COG1874.
GeneTree: EBGT00050000006111.
HOGENOM: HBG476453.
OMA: MFFQLRQ.
ProtClustDB: CLSK2502546.

Family and domain databases

InterPro: IPR013739. Beta_galactosidase_C.
          IPR013738. Beta_galactosidase_Trimer.
          IPR003476. Glyco_hydro_42.
          IPR013529. Glyco_hydro_42_N.
          IPR013781. Glyco_hydro_subgr_catalytic.
          IPR017853. Glycoside_hydrolase_SF.
Gene3D: G3DSA:3.20.20.80. Glyco_hydro_cat. 1 hit.
KO: K12308.
Pfam: PF02449. Glyco_hydro_42. 1 hit.
      PF08533. Glyco_hydro_42C. 1 hit.
      PF08532. Glyco_hydro_42M. 1 hit.
PIRSF: PIRSF001084. B-galactosidase. 1 hit.
SUPFAM: SSF51445. Glyco_hydro_cat. 1 hit.

Entry information

Entry name: Q5WI64_BACSK
Accession: Primary (citable) accession number: Q5WI64
Entry history:
    Integrated into UniProtKB/TrEMBL: November 23, 2004
    Last sequence update: November 23, 2004
    Last modified: December 14, 2011
    This is version 45 of the entry and version 1 of the sequence.
Entry status: Unreviewed (UniProtKB/TrEMBL)