E8W331 (E8W331_STRFA) Unreviewed, UniProtKB/TrEMBL

Last modified February 19, 2014. Version 22.

Names and origin

Protein attributes

Sequence length: 681 AA.
Sequence status: Complete.
Protein existence: Inferred from homology.

General annotation (Comments)

Catalytic activity

Hydrolysis of terminal non-reducing beta-D-galactose residues in beta-D-galactosides. PIRNR PIRNR001084

Sequence similarities

Belongs to the glycosyl hydrolase 42 family. PIRNR PIRNR001084

Sequence annotation (Features)

Feature key     Position(s)  Length  Description

Sites

Active site     157          1       Proton donor (by similarity; PIRSR PIRSR001084-1)
Active site     316          1       Nucleophile (by similarity; PIRSR PIRSR001084-1)
Metal binding   122          1       Zinc (by similarity; PIRSR PIRSR001084-3)
Metal binding   165          1       Zinc (by similarity; PIRSR PIRSR001084-3)
Metal binding   167          1       Zinc (by similarity; PIRSR PIRSR001084-3)
Metal binding   170          1       Zinc (by similarity; PIRSR PIRSR001084-3)
Binding site    118          1       Substrate (by similarity; PIRSR PIRSR001084-2)
Binding site    156          1       Substrate (by similarity; PIRSR PIRSR001084-2)
Binding site    324          1       Substrate (by similarity; PIRSR PIRSR001084-2)
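
The feature positions listed above are 1-based indices into the protein sequence given in the Sequences section of this entry. As a minimal sketch (not part of the UniProt record), the following Python looks up the residue found at each annotated position. The names `SEQ_ROWS`, `FEATURES`, `residue_at`, and `annotate` are hypothetical and chosen only for illustration; only the first 360 residues are copied in, which is enough because every listed site falls within that range.

```python
# First 360 residues of E8W331, copied from the Sequences section of this
# entry in blocks of ten. The spaces are display formatting and are stripped.
SEQ_ROWS = [
    "MPHSESAPYP VGLHKLAFGG DYNPEQWPEE VWHEDVRLMR EAGVTMVSVG IFSWALLEPE",
    "PGTYDFGWLD RLLDLLHVNG IRADLGTPTV VPPAWFYRAH PEALPVSRDG VRYAFGSRGA",
    "ICHSSAPYRA AAADITEQLA RRYGNHPALA MWHVHNEYGV PVSACYCDSC AAHFRRWLAA",
    "RHGSADAVNA AWGTAFWGQR YRSLEDIDPP RTTPTVGNPA QQLDYARFAD ATMRENFTAE",
    "RDILHRLAPG IPVTTNFMTA LSQCESVDYW AWGREVDIVS NDHYLITDGR RTHVNLAMAA",
    "DLTRSVAGGA PWLLLEHSTS GVNWQPRNPA KRPGEMARNS LAHVARGSEG AMFFQWRQSR",
]
SEQ = "".join(SEQ_ROWS).replace(" ", "")

# (feature key, 1-based position, description) as annotated in this entry.
FEATURES = [
    ("Active site", 157, "Proton donor"),
    ("Active site", 316, "Nucleophile"),
    ("Metal binding", 122, "Zinc"),
    ("Metal binding", 165, "Zinc"),
    ("Metal binding", 167, "Zinc"),
    ("Metal binding", 170, "Zinc"),
    ("Binding site", 118, "Substrate"),
    ("Binding site", 156, "Substrate"),
    ("Binding site", 324, "Substrate"),
]


def residue_at(seq, position):
    """Return the one-letter residue at a 1-based sequence position."""
    if not 1 <= position <= len(seq):
        raise IndexError(f"position {position} outside 1..{len(seq)}")
    return seq[position - 1]


def annotate(seq, features):
    """Attach the residue letter to each (key, position, description) tuple."""
    return [(key, pos, residue_at(seq, pos), desc) for key, pos, desc in features]


if __name__ == "__main__":
    for key, pos, aa, desc in annotate(SEQ, FEATURES):
        print(f"{key:<14}{pos:>4}  {aa}  {desc}")
```

For this entry the lookup gives glutamate (E) at both active-site positions and cysteine (C) at all four zinc-binding positions.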

Sequences

Sequence: E8W331
Length: 681 AA
Mass: 74,827 Da

Last modified April 5, 2011. Version 1.
Checksum: A9C3C0E8F32B7364

        10         20         30         40         50         60 
MPHSESAPYP VGLHKLAFGG DYNPEQWPEE VWHEDVRLMR EAGVTMVSVG IFSWALLEPE 

        70         80         90        100        110        120 
PGTYDFGWLD RLLDLLHVNG IRADLGTPTV VPPAWFYRAH PEALPVSRDG VRYAFGSRGA 

       130        140        150        160        170        180 
ICHSSAPYRA AAADITEQLA RRYGNHPALA MWHVHNEYGV PVSACYCDSC AAHFRRWLAA 

       190        200        210        220        230        240 
RHGSADAVNA AWGTAFWGQR YRSLEDIDPP RTTPTVGNPA QQLDYARFAD ATMRENFTAE 

       250        260        270        280        290        300 
RDILHRLAPG IPVTTNFMTA LSQCESVDYW AWGREVDIVS NDHYLITDGR RTHVNLAMAA 

       310        320        330        340        350        360 
DLTRSVAGGA PWLLLEHSTS GVNWQPRNPA KRPGEMARNS LAHVARGSEG AMFFQWRQSR 

       370        380        390        400        410        420 
RGAEKFHSAM VPHAGTDSRI WREVSALGAG LGLLEGIRGT RTVADVAMIW DWQSWWAQGL 

       430        440        450        460        470        480 
EWRPSEEHDA RERADTFYAS LFDRHLTVDF AHPDADLSGY PLVVVPALYL ATEETGRNLR 

       490        500        510        520        530        540 
RYVEQGGTLV VSYFSGIVDA DDAVHPGAYP GALRDVLGLT VEEFSPLGEG GTVRLITPEG 

       550        560        570        580        590        600 
APEGPELTGD LWSDVVIPRG AETVWSYADG IPAGRPAVTR NRLGEGTAWY LSTRLSGPDL 

       610        620        630        640        650        660 
DAVLDRAAAD ARIEPRTSLP YDVEVVRRTG DSGSYLFVIN HTDAEAVVAL DTPGTELLTG 

       670        680 
EPAVDKLAVP AGAVRVVRLD G 
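
The listing above interleaves ruler lines (position numbers) with residue lines split into blocks of ten. As a hedged sketch of how such a listing could be turned back into a plain sequence and written in FASTA layout, the Python below drops the ruler lines and the spacing. The function names `parse_numbered_block` and `to_fasta` are hypothetical, the sample covers only the last two rows of the listing, and the header string is an assumption modelled on the usual `tr|accession|entry name` convention rather than something stated in this entry.

```python
def parse_numbered_block(text):
    """Strip ruler lines and spacing from a numbered sequence listing."""
    residues = []
    for line in text.splitlines():
        stripped = line.strip()
        # Skip blank lines and ruler lines, which hold only digits and spaces.
        if not stripped or stripped.replace(" ", "").isdigit():
            continue
        residues.append(stripped.replace(" ", ""))
    return "".join(residues)


def to_fasta(header, seq, width=60):
    """Format a sequence as FASTA text with fixed-width lines."""
    lines = [seq[i:i + width] for i in range(0, len(seq), width)]
    return ">" + header + "\n" + "\n".join(lines) + "\n"


# Last two rows of the listing (positions 601-681), copied verbatim.
SAMPLE = """\
       610        620        630        640        650        660
DAVLDRAAAD ARIEPRTSLP YDVEVVRRTG DSGSYLFVIN HTDAEAVVAL DTPGTELLTG

       670        680
EPAVDKLAVP AGAVRVVRLD G
"""

if __name__ == "__main__":
    tail = parse_numbered_block(SAMPLE)
    # Ten full rows of 60 residues precede this excerpt.
    print(10 * 60 + len(tail))
    # Assumed header layout; the excerpt is labelled as a fragment.
    print(to_fasta("tr|E8W331|E8W331_STRFA (residues 601-681)", tail), end="")
```

The ten preceding rows of 60 residues plus the 81 residues parsed here add up to the stated sequence length of 681.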

References

[1] "Complete sequence of chromosome of Streptomyces flavogriseus ATCC 33331."
US DOE Joint Genome Institute
Lucas S., Copeland A., Lapidus A., Cheng J.-F., Goodwin L., Pitluck S., Davenport K., Detter J.C., Han C., Tapia R., Land M., Hauser L., Kyrpides N., Ivanova N., Ovchinnikova G., Pagani I., Brumm P., Mead D., Woyke T.
Submitted (JAN-2011) to the EMBL/GenBank/DDBJ databases
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: ATCC 33331 / DSM 40990 / IAF-45CD.

Cross-references

Sequence databases

EMBL/GenBank/DDBJ: CP002475 Genomic DNA. Translation: ADW06353.1.
RefSeq: YP_004925870.1. NC_016114.1.

3D structure databases

ProteinModelPortal: E8W331.

Genome annotation databases

EnsemblBacteria: ADW06353; ADW06353; Sfla_4952.
GeneID: 11368625.
KEGG: sfa:Sfla_4952.

Phylogenomic databases

KO: K12308.

Enzyme and pathway databases

BioCyc: SFLA591167:GI5Y-5026-MONOMER.

Family and domain databases

Gene3D: 3.20.20.80. 1 hit.
InterPro: IPR013739. Beta_galactosidase_C.
          IPR013738. Beta_galactosidase_Trimer.
          IPR003476. Glyco_hydro_42.
          IPR013529. Glyco_hydro_42_N.
          IPR013781. Glyco_hydro_catalytic_dom.
          IPR017853. Glycoside_hydrolase_SF.
Pfam: PF02449. Glyco_hydro_42. 1 hit.
      PF08533. Glyco_hydro_42C. 1 hit.
      PF08532. Glyco_hydro_42M. 1 hit.
PIRSF: PIRSF001084. B-galactosidase. 1 hit.
SUPFAM: SSF51445. SSF51445. 1 hit.

Entry information

Entry name: E8W331_STRFA
Accession: Primary (citable) accession number: E8W331
Entry history:
  Integrated into UniProtKB/TrEMBL: April 5, 2011
  Last sequence update: April 5, 2011
  Last modified: February 19, 2014
  This is version 22 of the entry and version 1 of the sequence.
Entry status: Unreviewed (UniProtKB/TrEMBL)