
P32138 (YIHQ_ECOLI) Reviewed, UniProtKB/Swiss-Prot

Last modified June 11, 2014. Version 110.


Names and origin

Protein names: Recommended name: Alpha-glucosidase YihQ
EC=3.2.1.20
Gene names: Name: yihQ
Ordered Locus Names: b3878, JW3849
Organism: Escherichia coli (strain K12) [Reference proteome] [HAMAP]
Taxonomic identifier: 83333 [NCBI]
Taxonomic lineage: Bacteria > Proteobacteria > Gammaproteobacteria > Enterobacteriales > Enterobacteriaceae > Escherichia

Protein attributes

Sequence length: 678 AA.
Sequence status: Complete.
Protein existence: Evidence at protein level

General annotation (Comments)

Function

Exhibits hydrolysis activity against alpha-glucosyl fluoride, although natural substrates, such as alpha-glucobioses, are scarcely hydrolyzed. Ref.4

Catalytic activity

Hydrolysis of terminal, non-reducing (1->4)-linked alpha-D-glucose residues with release of alpha-D-glucose.

Induction

Induced during growth with sulfoquinovose. Ref.5

Sequence similarities

Belongs to the glycosyl hydrolase 31 family.

Ontologies

Sequence annotation (Features)

Feature key        Position(s)  Length  Description                     Feature identifier

Molecule processing

Chain              1 – 678      678     Alpha-glucosidase YihQ          PRO_0000185372

Sites

Active site        405          1       Nucleophile (by similarity)
Active site        408          1       (by similarity)
Active site        472          1       Proton donor (by similarity)

Experimental info

Sequence conflict  358          1       A → R in AAB03011. Ref.1
Sequence conflict  517          1       S → T in AAB03011. Ref.1

Sequences

Sequence: P32138 [UniParc].
Length: 678 AA
Mass (Da): 77,275
Last modified July 15, 1998. Version 3.
Checksum: 85869F52BB72FEE7
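
The entry lists a 16-hex-digit checksum but does not say how it is derived. The sketch below assumes it is the CRC64 that UniProt documents for sequence checksums (generator polynomial x^64 + x^4 + x^3 + x + 1, computed in reflected form with an initial value of zero). The function names `crc64` and `_build_table` are illustrative, and the result has not been checked here against the value listed above.

```python
# Minimal CRC64 sketch (assumed to be the variant used for sequence checksums).
# Reflected form of the polynomial x^64 + x^4 + x^3 + x + 1.
POLY_REFLECTED = 0xD800000000000000


def _build_table():
    """Precompute the 256-entry lookup table for byte-wise CRC updates."""
    table = []
    for i in range(256):
        crc = i
        for _ in range(8):
            if crc & 1:
                crc = (crc >> 1) ^ POLY_REFLECTED
            else:
                crc >>= 1
        table.append(crc)
    return table


_TABLE = _build_table()


def crc64(sequence):
    """Return the CRC64 of an ASCII sequence string as 16 uppercase hex digits."""
    crc = 0
    for byte in sequence.encode("ascii"):
        crc = (crc >> 8) ^ _TABLE[(crc ^ byte) & 0xFF]
    return f"{crc:016X}"


if __name__ == "__main__":
    # First ten residues of the entry, used only to show the call.
    print(crc64("MDTPRPQLLD"))
```

To compare against the listed checksum, pass the full 678-residue sequence as a single string with the position rulers and spaces removed.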
        10         20         30         40         50         60 
MDTPRPQLLD FQFHQNNDSF TLHFQQRLIL THSKDNPCLW IGSGIADIDM FRGNFSIKDK 

        70         80         90        100        110        120 
LQEKIALTDA IVSQSPDGWL IHFSRGSDIS ATLNISADDQ GRLLLELQND NLNHNRIWLR 

       130        140        150        160        170        180 
LAAQPEDHIY GCGEQFSYFD LRGKPFPLWT SEQGVGRNKQ TYVTWQADCK ENAGGDYYWT 

       190        200        210        220        230        240 
FFPQPTFVST QKYYCHVDNS CYMNFDFSAP EYHELALWED KATLRFECAD TYISLLEKLT 

       250        260        270        280        290        300 
ALLGRQPELP DWIYDGVTLG IQGGTEVCQK KLDTMRNAGV KVNGIWAQDW SGIRMTSFGK 

       310        320        330        340        350        360 
RVMWNWKWNS ENYPQLDSRI KQWNQEGVQF LAYINPYVAS DKDLCEEAAQ HGYLAKDASG 

       370        380        390        400        410        420 
GDYLVEFGEF YGGVVDLTNP EAYAWFKEVI KKNMIELGCG GWMADFGEYL PTDTYLHNGV 

       430        440        450        460        470        480 
SAEIMHNAWP ALWAKCNYEA LEETGKLGEI LFFMRAGSTG SQKYSTMMWA GDQNVDWSLD 

       490        500        510        520        530        540 
DGLASVVPAA LSLAMTGHGL HHSDIGGYTT LFEMKRSKEL LLRWCDFSAF TPMMRTHEGN 

       550        560        570        580        590        600 
RPGDNWQFDG DAETIAHFAR MTTVFTTLKP YLKEAVALNA KSGLPVMRPL FLHYEDDAHT 

       610        620        630        640        650        660 
YTLKYQYLLG RDILVAPVHE EGRSDWTLYL PEDNWVHAWT GEAFRGGEVT VNAPIGKPPV 

       670 
FYRADSEWAA LFASLKSI 
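
The numbered display above can be turned back into a plain sequence string and checked against the annotated features. The following is a minimal sketch, not part of the entry: it strips the position rulers and spacing, confirms the stated length, looks up the residues at the annotated active-site positions, and applies the two sequence conflicts to reconstruct the residues reported in AAB03011. The helper names (`parse_display`, `residue_at`, `apply_substitutions`) are illustrative.

```python
import re

# Sequence display copied from the entry, including the position rulers.
DISPLAY = """
        10         20         30         40         50         60
MDTPRPQLLD FQFHQNNDSF TLHFQQRLIL THSKDNPCLW IGSGIADIDM FRGNFSIKDK
        70         80         90        100        110        120
LQEKIALTDA IVSQSPDGWL IHFSRGSDIS ATLNISADDQ GRLLLELQND NLNHNRIWLR
       130        140        150        160        170        180
LAAQPEDHIY GCGEQFSYFD LRGKPFPLWT SEQGVGRNKQ TYVTWQADCK ENAGGDYYWT
       190        200        210        220        230        240
FFPQPTFVST QKYYCHVDNS CYMNFDFSAP EYHELALWED KATLRFECAD TYISLLEKLT
       250        260        270        280        290        300
ALLGRQPELP DWIYDGVTLG IQGGTEVCQK KLDTMRNAGV KVNGIWAQDW SGIRMTSFGK
       310        320        330        340        350        360
RVMWNWKWNS ENYPQLDSRI KQWNQEGVQF LAYINPYVAS DKDLCEEAAQ HGYLAKDASG
       370        380        390        400        410        420
GDYLVEFGEF YGGVVDLTNP EAYAWFKEVI KKNMIELGCG GWMADFGEYL PTDTYLHNGV
       430        440        450        460        470        480
SAEIMHNAWP ALWAKCNYEA LEETGKLGEI LFFMRAGSTG SQKYSTMMWA GDQNVDWSLD
       490        500        510        520        530        540
DGLASVVPAA LSLAMTGHGL HHSDIGGYTT LFEMKRSKEL LLRWCDFSAF TPMMRTHEGN
       550        560        570        580        590        600
RPGDNWQFDG DAETIAHFAR MTTVFTTLKP YLKEAVALNA KSGLPVMRPL FLHYEDDAHT
       610        620        630        640        650        660
YTLKYQYLLG RDILVAPVHE EGRSDWTLYL PEDNWVHAWT GEAFRGGEVT VNAPIGKPPV
       670
FYRADSEWAA LFASLKSI
"""


def parse_display(text):
    """Drop ruler lines and spacing, keeping only the residue letters."""
    rows = []
    for line in text.splitlines():
        stripped = line.strip()
        # Ruler lines contain only digits and spaces; skip them and blank lines.
        if not stripped or re.fullmatch(r"[\d ]+", stripped):
            continue
        rows.append(stripped.replace(" ", ""))
    return "".join(rows)


def residue_at(seq, position):
    """Return the residue at a 1-based position, as used in the feature table."""
    return seq[position - 1]


def apply_substitutions(seq, substitutions):
    """Apply (position, expected, replacement) changes, verifying each expected residue."""
    chars = list(seq)
    for position, expected, replacement in substitutions:
        if chars[position - 1] != expected:
            raise ValueError(f"position {position}: expected {expected}")
        chars[position - 1] = replacement
    return "".join(chars)


SEQ = parse_display(DISPLAY)
ACTIVE_SITES = {405: "Nucleophile", 408: "(role not stated)", 472: "Proton donor"}
CONFLICTS = [(358, "A", "R"), (517, "S", "T")]  # as reported for AAB03011 (Ref.1)

if __name__ == "__main__":
    print("length:", len(SEQ))
    for position, role in ACTIVE_SITES.items():
        print(position, residue_at(SEQ, position), role)
    variant = apply_substitutions(SEQ, CONFLICTS)
    print("conflict residues:", variant[357], variant[516])
```

With the sequence as displayed, the parsed length is 678, and positions 405, 408 and 472 hold D, E and D respectively. Positions 358 and 517 hold A and S, matching the left-hand side of the two listed conflicts.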

References

[1] "Analysis of the Escherichia coli genome. III. DNA sequence of the region from 87.2 to 89.2 minutes."
Plunkett G. III, Burland V., Daniels D.L., Blattner F.R.
Nucleic Acids Res. 21:3391-3398(1993)
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: K12 / MG1655 / ATCC 47076.
[2] "The complete genome sequence of Escherichia coli K-12."
Blattner F.R., Plunkett G. III, Bloch C.A., Perna N.T., Burland V., Riley M., Collado-Vides J., Glasner J.D., Rode C.K., Mayhew G.F., Gregor J., Davis N.W., Kirkpatrick H.A., Goeden M.A., Rose D.J., Mau B., Shao Y.
Science 277:1453-1462(1997)
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA], SEQUENCE REVISION TO 358 AND 517.
Strain: K12 / MG1655 / ATCC 47076.
[3] "Highly accurate genome sequences of Escherichia coli K-12 strains MG1655 and W3110."
Hayashi K., Morooka N., Yamamoto Y., Fujita K., Isono K., Choi S., Ohtsubo E., Baba T., Wanner B.L., Mori H., Horiuchi T.
Mol. Syst. Biol. 2:E1-E5(2006)
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: K12 / W3110 / ATCC 27325 / DSM 5911.
[4] "Overexpression and characterization of two unknown proteins, YicI and YihQ, originated from Escherichia coli."
Okuyama M., Mori H., Chiba S., Kimura A.
Protein Expr. Purif. 37:170-179(2004)
Cited for: FUNCTION.
[5] "Sulphoglycolysis in Escherichia coli K-12 closes a gap in the biogeochemical sulphur cycle."
Denger K., Weiss M., Felux A.K., Schneider A., Mayer C., Spiteller D., Huhn T., Cook A.M., Schleheck D.
Nature 507:114-117(2014)
Cited for: IDENTIFICATION BY MASS SPECTROMETRY, INDUCTION.
Strain: K12.

Cross-references

Sequence databases

EMBL / GenBank / DDBJ:
L19201 Genomic DNA. Translation: AAB03011.1.
U00096 Genomic DNA. Translation: AAC76875.1.
AP009048 Genomic DNA. Translation: BAE77431.1.
PIR: A65193.
RefSeq: NP_418314.1. NC_000913.3.
YP_491572.1. NC_007779.1.

3D structure databases

ProteinModelPortal: P32138.
SMR: P32138. Positions 309-635.

Protein-protein interaction databases

DIP: DIP-12498N.
IntAct: P32138. 1 interaction.
MINT: MINT-1249337.
STRING: 511145.b3878.

Protein family/group databases

CAZy: GH31. Glycoside Hydrolase Family 31.

Genome annotation databases

EnsemblBacteria: AAC76875; AAC76875; b3878.
BAE77431; BAE77431; BAE77431.
GeneID: 12931872.
948376.
KEGG: ecj:Y75_p3308.
eco:b3878.
PATRIC: 32123259. VBIEscCol129921_3990.

Organism-specific databases

EchoBASE: EB1789.
EcoGene: EG11843. yihQ.

Phylogenomic databases

eggNOG: COG1501.
HOGENOM: HOG000064244.
KO: K15922.
OMA: KYQYLFG.
OrthoDB: EOG6Z99WN.
PhylomeDB: P32138.

Enzyme and pathway databases

BioCyc: EcoCyc:EG11843-MONOMER.
ECOL316407:JW3849-MONOMER.

Gene expression databases

Genevestigator: P32138.

Family and domain databases

InterPro: IPR011013. Gal_mutarotase_SF_dom.
IPR000322. Glyco_hydro_31.
IPR017853. Glycoside_hydrolase_SF.
Pfam: PF01055. Glyco_hydro_31. 1 hit.
SUPFAM: SSF51445. SSF51445. 1 hit.
SSF74650. SSF74650. 1 hit.

Other

PRO: P32138.

Entry information

Entry name: YIHQ_ECOLI
Accession: Primary (citable) accession number: P32138
Secondary accession number(s): P76775, Q2M8H5
Entry history:
Integrated into UniProtKB/Swiss-Prot: October 1, 1993
Last sequence update: July 15, 1998
Last modified: June 11, 2014
This is version 110 of the entry and version 3 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Prokaryotic Protein Annotation Program
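
The primary and secondary accession numbers above share a fixed six-character shape. As a small illustration, the sketch below checks them against the accession pattern published in the UniProt help pages. The pattern is quoted from memory of that documentation rather than from this entry, and the function name `is_accession` is illustrative.

```python
import re

# Accession-number pattern as documented by UniProt (assumption: quoted from
# the UniProt help pages, not stated in this entry).
ACCESSION_RE = re.compile(
    r"[OPQ][0-9][A-Z0-9]{3}[0-9]"
    r"|[A-NR-Z][0-9](?:[A-Z][A-Z0-9]{2}[0-9]){1,2}"
)


def is_accession(text):
    """Return True if the whole string has the shape of a UniProtKB accession."""
    return ACCESSION_RE.fullmatch(text) is not None


if __name__ == "__main__":
    # Primary and secondary accessions from this entry, plus the entry name,
    # which is a different kind of identifier and should not match.
    for candidate in ("P32138", "P76775", "Q2M8H5", "YIHQ_ECOLI"):
        print(candidate, is_accession(candidate))
```

A shape check like this only catches malformed identifiers; it does not confirm that an accession exists or that it belongs to this entry.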

Relevant documents

SIMILARITY comments

Index of protein domains and families

Glycosyl hydrolases

Classification of glycosyl hydrolase families and list of entries

Escherichia coli

Escherichia coli (strain K12): entries and cross-references to EcoGene