Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Entry version 132 (07 Apr 2021)
Sequence version 1 (01 Jan 1998)
Previous versions | rss
Add a publicationFeedback
Protein

Alpha-galactosidase

Gene

melA

Organism
Bacillus subtilis (strain 168)
Status
Reviewed-Annotation score:

Annotation score:4 out of 5

<p>The annotation score provides a heuristic measure of the annotation content of a UniProtKB entry or proteome. This score <strong>cannot</strong> be used as a measure of the accuracy of the annotation as we cannot define the 'correct annotation' for any given protein.<p><a href='/help/annotation_score' target='_top'>More...</a></p>
-Experimental evidence at protein leveli <p>This indicates the type of evidence that supports the existence of the protein. Note that the 'protein existence' evidence does not give information on the accuracy or correctness of the sequence(s) displayed.<p><a href='/help/protein_existence' target='_top'>More...</a></p>

<p>This section provides any useful information about the protein, mostly biological knowledge.<p><a href='/help/function_section' target='_top'>More...</a></p>Functioni

Catalyzes the hydrolysis of melibiose and alpha-galactosides of the raffinose family of oligosaccharides (RFOs) such as raffinose and stachyose. Cannot act on polymeric substrates such as locust bean gum.1 Publication

<p>This subsection of the <a href="http://www.uniprot.org/help/function%5Fsection">Function</a> section describes the catalytic activity of an enzyme, i.e. a chemical reaction that the enzyme catalyzes.<p><a href='/help/catalytic_activity' target='_top'>More...</a></p>Catalytic activityi

  • Hydrolysis of terminal, non-reducing alpha-D-galactose residues in alpha-D-galactosides, including galactose oligosaccharides, galactomannans and galactolipids.1 Publication EC:3.2.1.22

<p>This subsection of the 'Function' section provides information relevant to cofactors. A cofactor is any non-protein substance required for a protein to be catalytically active. Some cofactors are inorganic, such as the metal atoms zinc, iron, and copper in various oxidation states. Others, such as most vitamins, are organic.<p><a href='/help/cofactor' target='_top'>More...</a></p>Cofactori

Protein has several cofactor binding sites:

<p>This subsection of the 'Function' section describes biophysical and chemical properties, such as maximal absorption, kinetic parameters, pH dependence, redox potentials and temperature dependence.<p><a href='/help/biophysicochemical_properties' target='_top'>More...</a></p>Kineticsi

  1. KM=10 mM for melibiose1 Publication
  2. KM=25 mM for raffinose1 Publication

    Sites

    Feature keyPosition(s)DescriptionActionsGraphical viewLength
    <p>This subsection of the <a href="http://www.uniprot.org/help/function%5Fsection">Function</a> section describes the interaction between a single amino acid and another chemical entity. Priority is given to the annotation of physiological ligands.<p><a href='/help/binding' target='_top'>More...</a></p>Binding sitei148SubstrateBy similarity1
    <p>This subsection of the <a href="http://www.uniprot.org/help/function%5Fsection">Function</a> section indicates at which position the protein binds a given metal ion. The nature of the metal is indicated in the 'Description' field.<p><a href='/help/metal' target='_top'>More...</a></p>Metal bindingi169ManganeseBy similarity1
    <p>This subsection of the <a href="http://www.uniprot.org/help/function%5Fsection">Function</a> section is used for enzymes and indicates the residues directly involved in catalysis.<p><a href='/help/act_site' target='_top'>More...</a></p>Active sitei170Proton donorBy similarity1
    Metal bindingi199ManganeseBy similarity1

    Regions

    Feature keyPosition(s)DescriptionActionsGraphical viewLength
    <p>This subsection of the <a href="http://www.uniprot.org/help/function%5Fsection">Function</a> section describes a region in the protein which binds nucleotide phosphates. It always involves more than one amino acid and includes all residues involved in nucleotide-binding.<p><a href='/help/np_bind' target='_top'>More...</a></p>Nucleotide bindingi2 – 68NADBy similarityAdd BLAST67

    <p>The <a href="http://www.geneontology.org/">Gene Ontology (GO)</a> project provides a set of hierarchical controlled vocabulary split into 3 categories:<p><a href='/help/gene_ontology' target='_top'>More...</a></p>GO - Molecular functioni

    GO - Biological processi

    <p>UniProtKB Keywords constitute a <a href="http://www.uniprot.org/keywords">controlled vocabulary</a> with a hierarchical structure. Keywords summarise the content of a UniProtKB entry and facilitate the search for proteins of interest.<p><a href='/help/keywords' target='_top'>More...</a></p>Keywordsi

    Molecular functionGlycosidase, Hydrolase
    Biological processCarbohydrate metabolism
    LigandManganese, Metal-binding, NAD

    Enzyme and pathway databases

    BioCyc Collection of Pathway/Genome Databases

    More...
    BioCyci
    BSUB:BSU30300-MONOMER

    Protein family/group databases

    Carbohydrate-Active enZymes

    More...
    CAZyi
    GH4, Glycoside Hydrolase Family 4

    <p>This section provides information about the protein and gene name(s) and synonym(s) and about the organism that is the source of the protein sequence.<p><a href='/help/names_and_taxonomy_section' target='_top'>More...</a></p>Names & Taxonomyi

    <p>This subsection of the <a href="http://www.uniprot.org/help/names%5Fand%5Ftaxonomy%5Fsection">Names and taxonomy</a> section provides an exhaustive list of all names of the protein, from commonly used to obsolete, to allow unambiguous identification of a protein.<p><a href='/help/protein_names' target='_top'>More...</a></p>Protein namesi
    Recommended name:
    Alpha-galactosidaseCurated (EC:3.2.1.221 Publication)
    Alternative name(s):
    Melibiase
    <p>This subsection of the <a href="http://www.uniprot.org/help/names%5Fand%5Ftaxonomy%5Fsection">Names and taxonomy</a> section indicates the name(s) of the gene(s) that code for the protein sequence(s) described in the entry. Four distinct tokens exist: 'Name', 'Synonyms', 'Ordered locus names' and 'ORF names'.<p><a href='/help/gene_name' target='_top'>More...</a></p>Gene namesi
    Name:melA1 Publication
    Ordered Locus Names:BSU30300
    <p>This subsection of the <a href="http://www.uniprot.org/help/names%5Fand%5Ftaxonomy%5Fsection">Names and taxonomy</a> section provides information on the name(s) of the organism that is the source of the protein sequence.<p><a href='/help/organism-name' target='_top'>More...</a></p>OrganismiBacillus subtilis (strain 168)
    <p>This subsection of the <a href="http://www.uniprot.org/help/names%5Fand%5Ftaxonomy%5Fsection">Names and taxonomy</a> section shows the unique identifier assigned by the NCBI to the source organism of the protein. This is known as the 'taxonomic identifier' or 'taxid'.<p><a href='/help/taxonomic_identifier' target='_top'>More...</a></p>Taxonomic identifieri224308 [NCBI]
    <p>This subsection of the <a href="http://www.uniprot.org/help/names%5Fand%5Ftaxonomy%5Fsection">Names and taxonomy</a> section contains the taxonomic hierarchical classification lineage of the source organism. It lists the nodes as they appear top-down in the taxonomic tree, with the more general grouping listed first.<p><a href='/help/taxonomic_lineage' target='_top'>More...</a></p>Taxonomic lineageiBacteriaFirmicutesBacilliBacillalesBacillaceaeBacillus
    <p>This subsection of the <a href="http://www.uniprot.org/help/names%5Fand%5Ftaxonomy%5Fsection">Names and taxonomy</a> section is present for entries that are part of a <a href="http://www.uniprot.org/proteomes">proteome</a>, i.e. of a set of proteins thought to be expressed by organisms whose genomes have been completely sequenced.<p><a href='/help/proteomes_manual' target='_top'>More...</a></p>Proteomesi
    • UP000001570 <p>A UniProt <a href="http://www.uniprot.org/manual/proteomes%5Fmanual">proteome</a> can consist of several components.<br></br>The component name refers to the genomic component encoding a set of proteins.<p><a href='/help/proteome_component' target='_top'>More...</a></p> Componenti: Chromosome

    <p>This section provides information on the location and the topology of the mature protein in the cell.<p><a href='/help/subcellular_location_section' target='_top'>More...</a></p>Subcellular locationi

    Keywords - Cellular componenti

    Cytoplasm

    <p>This section describes post-translational modifications (PTMs) and/or processing events.<p><a href='/help/ptm_processing_section' target='_top'>More...</a></p>PTM / Processingi

    Molecule processing

    Feature keyPosition(s)DescriptionActionsGraphical viewLength
    <p>This subsection of the 'PTM / Processing' section describes the extent of a polypeptide chain in the mature protein following processing or proteolytic cleavage.<p><a href='/help/chain' target='_top'>More...</a></p>ChainiPRO_00001698511 – 432Alpha-galactosidaseAdd BLAST432

    Proteomic databases

    jPOST - Japan Proteome Standard Repository/Database

    More...
    jPOSTi
    O34645

    PaxDb, a database of protein abundance averages across all three domains of life

    More...
    PaxDbi
    O34645

    PRoteomics IDEntifications database

    More...
    PRIDEi
    O34645

    <p>This section provides information on the expression of a gene at the mRNA or protein level in cells or in tissues of multicellular organisms.<p><a href='/help/expression_section' target='_top'>More...</a></p>Expressioni

    <p>This subsection of the 'Expression' section reports the experimentally proven effects of inducers and repressors (usually chemical compounds or environmental factors) on the level of protein (or mRNA) expression (up-regulation, down-regulation, constitutive expression).<p><a href='/help/induction' target='_top'>More...</a></p>Inductioni

    Repressed by the transcriptional regulator MelR. Induced by melibiose and raffinose.1 Publication

    <p>This section provides information on the quaternary structure of a protein and on interaction(s) with other proteins or protein complexes.<p><a href='/help/interaction_section' target='_top'>More...</a></p>Interactioni

    <p>This subsection of the <a href="http://www.uniprot.org/help/interaction%5Fsection">'Interaction'</a> section provides information about the protein quaternary structure and interaction(s) with other proteins or protein complexes (with the exception of physiological receptor-ligand interactions which are annotated in the <a href="http://www.uniprot.org/help/function%5Fsection">'Function'</a> section).<p><a href='/help/subunit_structure' target='_top'>More...</a></p>Subunit structurei

    Homodimer.

    By similarity

    Protein-protein interaction databases

    STRING: functional protein association networks

    More...
    STRINGi
    224308.BSU30300

    <p>This section provides information on the tertiary and secondary structure of a protein.<p><a href='/help/structure_section' target='_top'>More...</a></p>Structurei

    3D structure databases

    SWISS-MODEL Repository - a database of annotated 3D protein structure models

    More...
    SMRi
    O34645

    Database of comparative protein structure models

    More...
    ModBasei
    Search...

    <p>This section provides information on sequence similarities with other proteins and the domain(s) present in a protein.<p><a href='/help/family_and_domains_section' target='_top'>More...</a></p>Family & Domainsi

    <p>This subsection of the 'Family and domains' section provides information about the sequence similarity with other proteins.<p><a href='/help/sequence_similarities' target='_top'>More...</a></p>Sequence similaritiesi

    Belongs to the glycosyl hydrolase 4 family.Curated

    Phylogenomic databases

    evolutionary genealogy of genes: Non-supervised Orthologous Groups

    More...
    eggNOGi
    COG1486, Bacteria

    InParanoid: Eukaryotic Ortholog Groups

    More...
    InParanoidi
    O34645

    Identification of Orthologs from Complete Genome Data

    More...
    OMAi
    LQYVNPM

    Database for complete collections of gene phylogenies

    More...
    PhylomeDBi
    O34645

    Family and domain databases

    Integrated resource of protein families, domains and functional sites

    More...
    InterProi
    View protein in InterPro
    IPR019802, GlycHydrolase_4_CS
    IPR001088, Glyco_hydro_4
    IPR022616, Glyco_hydro_4_C
    IPR015955, Lactate_DH/Glyco_Ohase_4_C
    IPR036291, NAD(P)-bd_dom_sf

    The PANTHER Classification System

    More...
    PANTHERi
    PTHR32092, PTHR32092, 1 hit

    Pfam protein domain database

    More...
    Pfami
    View protein in Pfam
    PF02056, Glyco_hydro_4, 1 hit
    PF11975, Glyco_hydro_4C, 1 hit

    Protein Motif fingerprint database; a protein domain database

    More...
    PRINTSi
    PR00732, GLHYDRLASE4

    Superfamily database of structural and functional annotation

    More...
    SUPFAMi
    SSF51735, SSF51735, 1 hit
    SSF56327, SSF56327, 1 hit

    PROSITE; a protein domain and family database

    More...
    PROSITEi
    View protein in PROSITE
    PS01324, GLYCOSYL_HYDROL_F4, 1 hit

    <p>This section displays by default the canonical protein sequence and upon request all isoforms described in the entry. It also includes information pertinent to the sequence(s), including <a href="http://www.uniprot.org/help/sequence%5Flength">length</a> and <a href="http://www.uniprot.org/help/sequences">molecular weight</a>. The information is filed in different subsections. The current subsections and their content are listed below:<p><a href='/help/sequences_section' target='_top'>More...</a></p>Sequencei

    <p>This subsection of the <a href="http://www.uniprot.org/help/sequences%5Fsection">Sequence</a> section indicates if the <a href="http://www.uniprot.org/help/canonical%5Fand%5Fisoforms">canonical sequence</a> displayed by default in the entry is complete or not.<p><a href='/help/sequence_status' target='_top'>More...</a></p>Sequence statusi: Complete.

    O34645-1 [UniParc]FASTAAdd to basket
    « Hide
            10         20         30         40         50
    MKKITFIGAG STIFAKNVLG DCLLTEALNG FEFALYDIDP KRLQESQLML
    60 70 80 90 100
    ENLRDRYNPS VAINSYDDRK LALQNAGYVI NAIQVGGYKP STVIDFEIPK
    110 120 130 140 150
    RYGLRQTIAD TVGIGGIFRS LRTIPVLFDI AKDMEEMCPD AWFLNYTNPM
    160 170 180 190 200
    ATLTGAMLRY TNIKTIGLCH SVQVCTKDLF KALGMEHDGI EERIAGINHM
    210 220 230 240 250
    AWLLEVKKDG TDLYPEIKRR AKEKQKTKHH DMVRFELMDK FGYYVTESSE
    260 270 280 290 300
    HNAEYHPYFI KRNYPELISE LQIPLDEYPR RCVKQIENWE KMRDDIVNNK
    310 320 330 340 350
    NLTHERSKEY GSRIIEAMET NEPFTFGGNV LNTGLITNLP SKAVVEVTCV
    360 370 380 390 400
    ADRKKITPCF AGELPEQLAA LNRTNINTQL MTIEAAVTRK KEAVYQAAML
    410 420 430
    DPHTSAELSM KDIISMCDDL FAAHGDWLPE YK
    Length:432
    Mass (Da):49,264
    Last modified:January 1, 1998 - v1
    <p>The checksum is a form of redundancy check that is calculated from the sequence. It is useful for tracking sequence updates.</p> <p>It should be noted that while, in theory, two different sequences could have the same checksum value, the likelihood that this would happen is extremely low.</p> <p>However UniProtKB may contain entries with identical sequences in case of multiple genes (paralogs).</p> <p>The checksum is computed as the sequence 64-bit Cyclic Redundancy Check value (CRC64) using the generator polynomial: x<sup>64</sup> + x<sup>4</sup> + x<sup>3</sup> + x + 1. The algorithm is described in the ISO 3309 standard. </p> <p class="publication">Press W.H., Flannery B.P., Teukolsky S.A. and Vetterling W.T.<br /> <strong>Cyclic redundancy and other checksums</strong><br /> <a href="http://www.nrbook.com/b/bookcpdf.php">Numerical recipes in C 2nd ed., pp896-902, Cambridge University Press (1993)</a>)</p> Checksum:iE773A01018E65562
    GO

    Sequence databases

    Select the link destinations:

    EMBL nucleotide sequence database

    More...
    EMBLi

    GenBank nucleotide sequence database

    More...
    GenBanki

    DNA Data Bank of Japan; a nucleotide sequence database

    More...
    DDBJi
    Links Updated
    AF008220 Genomic DNA Translation: AAC00383.1
    AL009126 Genomic DNA Translation: CAB15008.1

    Protein sequence database of the Protein Information Resource

    More...
    PIRi
    E69656

    NCBI Reference Sequences

    More...
    RefSeqi
    NP_390908.1, NC_000964.3
    WP_003246000.1, NZ_JNCM01000036.1

    Genome annotation databases

    Ensembl bacterial and archaeal genome annotation project

    More...
    EnsemblBacteriai
    CAB15008; CAB15008; BSU_30300

    Database of genes from NCBI RefSeq genomes

    More...
    GeneIDi
    937255

    KEGG: Kyoto Encyclopedia of Genes and Genomes

    More...
    KEGGi
    bsu:BSU30300

    Pathosystems Resource Integration Center (PATRIC)

    More...
    PATRICi
    fig|224308.179.peg.3286

    <p>This section provides links to proteins that are similar to the protein sequence(s) described in this entry at different levels of sequence identity thresholds (100%, 90% and 50%) based on their membership in UniProt Reference Clusters (<a href="http://www.uniprot.org/help/uniref">UniRef</a>).<p><a href='/help/similar_proteins_section' target='_top'>More...</a></p>Similar proteinsi

    <p>This section is used to point to information related to entries and found in data collections other than UniProtKB.<p><a href='/help/cross_references_section' target='_top'>More...</a></p>Cross-referencesi

    Sequence databases

    Select the link destinations:
    EMBLi
    GenBanki
    DDBJi
    Links Updated
    AF008220 Genomic DNA Translation: AAC00383.1
    AL009126 Genomic DNA Translation: CAB15008.1
    PIRiE69656
    RefSeqiNP_390908.1, NC_000964.3
    WP_003246000.1, NZ_JNCM01000036.1

    3D structure databases

    SMRiO34645
    ModBaseiSearch...

    Protein-protein interaction databases

    STRINGi224308.BSU30300

    Protein family/group databases

    CAZyiGH4, Glycoside Hydrolase Family 4

    Proteomic databases

    jPOSTiO34645
    PaxDbiO34645
    PRIDEiO34645

    Genome annotation databases

    EnsemblBacteriaiCAB15008; CAB15008; BSU_30300
    GeneIDi937255
    KEGGibsu:BSU30300
    PATRICifig|224308.179.peg.3286

    Phylogenomic databases

    eggNOGiCOG1486, Bacteria
    InParanoidiO34645
    OMAiLQYVNPM
    PhylomeDBiO34645

    Enzyme and pathway databases

    BioCyciBSUB:BSU30300-MONOMER

    Family and domain databases

    InterProiView protein in InterPro
    IPR019802, GlycHydrolase_4_CS
    IPR001088, Glyco_hydro_4
    IPR022616, Glyco_hydro_4_C
    IPR015955, Lactate_DH/Glyco_Ohase_4_C
    IPR036291, NAD(P)-bd_dom_sf
    PANTHERiPTHR32092, PTHR32092, 1 hit
    PfamiView protein in Pfam
    PF02056, Glyco_hydro_4, 1 hit
    PF11975, Glyco_hydro_4C, 1 hit
    PRINTSiPR00732, GLHYDRLASE4
    SUPFAMiSSF51735, SSF51735, 1 hit
    SSF56327, SSF56327, 1 hit
    PROSITEiView protein in PROSITE
    PS01324, GLYCOSYL_HYDROL_F4, 1 hit

    ProtoNet; Automatic hierarchical classification of proteins

    More...
    ProtoNeti
    Search...

    MobiDB: a database of protein disorder and mobility annotations

    More...
    MobiDBi
    Search...

    <p>This section provides general information on the entry.<p><a href='/help/entry_information_section' target='_top'>More...</a></p>Entry informationi

    <p>This subsection of the 'Entry information' section provides a mnemonic identifier for a UniProtKB entry, but it is not a stable identifier. Each reviewed entry is assigned a unique entry name upon integration into UniProtKB/Swiss-Prot.<p><a href='/help/entry_name' target='_top'>More...</a></p>Entry nameiAGAL_BACSU
    <p>This subsection of the 'Entry information' section provides one or more accession number(s). These are stable identifiers and should be used to cite UniProtKB entries. Upon integration into UniProtKB, each entry is assigned a unique accession number, which is called 'Primary (citable) accession number'.<p><a href='/help/accession_numbers' target='_top'>More...</a></p>AccessioniPrimary (citable) accession number: O34645
    <p>This subsection of the 'Entry information' section shows the date of integration of the entry into UniProtKB, the date of the last sequence update and the date of the last annotation modification ('Last modified'). The version number for both the entry and the <a href="http://www.uniprot.org/help/canonical%5Fand%5Fisoforms">canonical sequence</a> are also displayed.<p><a href='/help/entry_history' target='_top'>More...</a></p>Entry historyiIntegrated into UniProtKB/Swiss-Prot: December 15, 1998
    Last sequence update: January 1, 1998
    Last modified: April 7, 2021
    This is version 132 of the entry and version 1 of the sequence. See complete history.
    <p>This subsection of the 'Entry information' section indicates whether the entry has been manually annotated and reviewed by UniProtKB curators or not, in other words, if the entry belongs to the Swiss-Prot section of UniProtKB (<strong>reviewed</strong>) or to the computer-annotated TrEMBL section (<strong>unreviewed</strong>).<p><a href='/help/entry_status' target='_top'>More...</a></p>Entry statusiReviewed (UniProtKB/Swiss-Prot)
    Annotation programProkaryotic Protein Annotation Program

    <p>This section contains any relevant information that doesn't fit in any other defined sections<p><a href='/help/miscellaneous_section' target='_top'>More...</a></p>Miscellaneousi

    Keywords - Technical termi

    Reference proteome

    Documents

    1. Glycosyl hydrolases
      Classification of glycosyl hydrolase families and list of entries
    2. SIMILARITY comments
      Index of protein domains and families
    UniProt is an ELIXIR core data resource
    Main funding by: National Institutes of Health

    We'd like to inform you that we have updated our Privacy Notice to comply with Europe’s new General Data Protection Regulation (GDPR) that applies since 25 May 2018.

    Do not show this banner again