Protein

Alpha-N-acetylgalactosaminidase

Gene

BF0874

Organism
Bacteroides fragilis (strain ATCC 25285 / DSM 2151 / JCM 11019 / NCTC 9343)
Status
Reviewed. Annotation score: 3 out of 5. Protein inferred from homology.

Function

Glycosidase that has specific alpha-N-acetylgalactosaminidase activity.

Catalytic activity

Cleavage of non-reducing alpha-(1->3)-N-acetylgalactosamine residues from human blood group A and AB mucin glycoproteins, Forssman hapten and blood group A lacto series glycolipids.

Cofactor

NAD+ (by similarity). Note: Binds 1 NAD+ per subunit. The NAD+ cannot dissociate (by similarity).

Sites

Feature key    Position(s)   Description                 Length
Binding site   51            NAD (by similarity)         1
Binding site   148           NAD (by similarity)         1
Binding site   177           Substrate (by similarity)   1
Binding site   211           NAD (by similarity)         1
Binding site   293           Substrate (by similarity)   1

Regions

Feature key          Position(s)   Description           Length
Nucleotide binding   29 – 30       NAD (by similarity)   2
Nucleotide binding   99 – 102      NAD (by similarity)   4
Nucleotide binding   119 – 120     NAD (by similarity)   2
Nucleotide binding   194 – 198     NAD (by similarity)   5

Keywords - Molecular function

Glycosidase, Hydrolase

Keywords - Ligand

NAD

Enzyme and pathway databases

BioCyc: BFRA272559:GKF0-840-MONOMER.

Protein family/group databases

CAZy: GH109. Glycoside Hydrolase Family 109.

Names & Taxonomy

Protein names
Recommended name:
Alpha-N-acetylgalactosaminidase (EC:3.2.1.49)
Alternative name(s):
Glycosyl hydrolase family 109 protein
Gene names
Ordered Locus Names: BF0874
Organism: Bacteroides fragilis (strain ATCC 25285 / DSM 2151 / JCM 11019 / NCTC 9343)
Taxonomic identifier: 272559 [NCBI]
Taxonomic lineage: Bacteria > Bacteroidetes > Bacteroidia > Bacteroidales > Bacteroidaceae > Bacteroides
Proteomes
  • UP000006731 Component: Chromosome

PTM / Processing

Molecule processing

Feature key   Position(s)   Description                       Feature identifier   Length
Chain         1 – 425       Alpha-N-acetylgalactosaminidase   PRO_0000348547       425

Interaction

Protein-protein interaction databases

STRING: 272559.BF0874.

Structure

3D structure databases

ProteinModelPortal: Q5LGW9.

Family & Domains

Region

Feature key   Position(s)   Description                         Length
Region        211 – 214     Substrate binding (by similarity)   4

Sequence similarities

Phylogenomic databases

eggNOG: ENOG4105EPC. Bacteria.
eggNOG: ENOG410XP5M. LUCA.
HOGENOM: HOG000252553.
OMA: WSCITEL.

Family and domain databases

Gene3D: 3.40.50.720. 2 hits.
InterPro: IPR016040. NAD(P)-bd_dom.
InterPro: IPR000683. Oxidoreductase_N.
Pfam: PF01408. GFO_IDH_MocA. 1 hit.
SUPFAM: SSF51735. SSF51735. 1 hit.

Sequence

Sequence status: Complete.

Q5LGW9-1

        10         20         30         40         50
MKTPSQTHVL GLAHPPLPMV RLAFIGLGNR GVLTLQRYLQ IEGVEIKALC
        60         70         80         90        100
EIREGNLVKA QKILREAGYP QPDGYTGPDG WKRMCERDDI DLVFICTDWL
       110        120        130        140        150
THTPMAVYSM EHGKHVAIEV PAAMTVEECW KLVDTAEKTR QHCMMLENCC
       160        170        180        190        200
YDPFALTTLN MAQQGVFGEI THVEGAYIHD LRSIYFADES KGGFHNHWGK
       210        220        230        240        250
KYSIEHTGNP YPTHGLGPVC QILNIHRGDR MNYLVSLSSL QAGMTEYARK
       260        270        280        290        300
NFGADSPEAR QKYLLGDMNT TLIQTVKGKS IMIQYNVVTP RPYSRLHTVC
       310        320        330        340        350
GTKGFAQKYP VPSIALEPDA GSPLEGKALE EIMERYKHPF TATFGTEAHR
       360        370        380        390        400
RNLPNEMNYV MDCRLIYCLR NGLPLDMDVY DAAEWSCITE LSEQSVLNGS
       410        420
IPVEIPDFTR GAWKKCHISR TSDLY
Length: 425
Mass (Da): 47,952
Last modified: June 21, 2005 - v1
Checksum: FB747658272705FF
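
The sequence block and the binding-site positions listed under Sites can be cross-checked with a few lines of Python. This is a minimal sketch, not part of the entry: the names RAW_SEQUENCE, BINDING_SITES, clean_sequence and residue_at are hypothetical helpers introduced for illustration, and the sequence and positions are copied from the entry text. It strips the display spacing, then reports the sequence length (the entry states 425) and the residue found at each annotated position.

```python
# Sequence of Q5LGW9-1 copied from the entry; the spaces and line breaks
# are display grouping only and are stripped before use.
RAW_SEQUENCE = """
MKTPSQTHVL GLAHPPLPMV RLAFIGLGNR GVLTLQRYLQ IEGVEIKALC
EIREGNLVKA QKILREAGYP QPDGYTGPDG WKRMCERDDI DLVFICTDWL
THTPMAVYSM EHGKHVAIEV PAAMTVEECW KLVDTAEKTR QHCMMLENCC
YDPFALTTLN MAQQGVFGEI THVEGAYIHD LRSIYFADES KGGFHNHWGK
KYSIEHTGNP YPTHGLGPVC QILNIHRGDR MNYLVSLSSL QAGMTEYARK
NFGADSPEAR QKYLLGDMNT TLIQTVKGKS IMIQYNVVTP RPYSRLHTVC
GTKGFAQKYP VPSIALEPDA GSPLEGKALE EIMERYKHPF TATFGTEAHR
RNLPNEMNYV MDCRLIYCLR NGLPLDMDVY DAAEWSCITE LSEQSVLNGS
IPVEIPDFTR GAWKKCHISR TSDLY
"""

# Binding-site positions from the Sites table (1-based, as in the entry).
BINDING_SITES = {
    51: "NAD",
    148: "NAD",
    177: "Substrate",
    211: "NAD",
    293: "Substrate",
}


def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a display-formatted sequence."""
    return "".join(raw.split())


def residue_at(sequence: str, position: int) -> str:
    """Return the residue at a 1-based position, as used in feature tables."""
    if not 1 <= position <= len(sequence):
        raise ValueError(f"position {position} is outside 1..{len(sequence)}")
    return sequence[position - 1]


if __name__ == "__main__":
    seq = clean_sequence(RAW_SEQUENCE)
    print("length:", len(seq))
    for pos, ligand in sorted(BINDING_SITES.items()):
        print(pos, residue_at(seq, pos), ligand)
```

Feature positions in the entry are 1-based, so the helper subtracts one before indexing the Python string.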

Sequence databases

EMBL / GenBank / DDBJ:
AM039447 Genomic DNA. Translation: CAJ01379.1.
CR626927 Genomic DNA. Translation: CAH06617.1.
RefSeq: WP_005813119.1. NC_003228.3.

Genome annotation databases

EnsemblBacteria: CAH06617; CAH06617; BF9343_0836.
KEGG: bfs:BF9343_0836.
PATRIC: 21038216. VBIBacFra29119_0864.

Entry information

Entry name: G1092_BACFN
Accession: Primary (citable) accession number: Q5LGW9
Secondary accession number(s): A4Q8G0
Entry history
Integrated into UniProtKB/Swiss-Prot: September 2, 2008
Last sequence update: June 21, 2005
Last modified: November 2, 2016
This is version 66 of the entry and version 1 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Prokaryotic Protein Annotation Program

Miscellaneous

Keywords - Technical term

Complete proteome, Reference proteome

Documents

  1. Glycosyl hydrolases
    Classification of glycosyl hydrolase families and list of entries
  2. SIMILARITY comments
    Index of protein domains and families

Similar proteins

Similar proteins are grouped in the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%: UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%: UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a. seed sequence).
50%: UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.
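
The membership rules described above can be written down as simple predicates. The following is an illustrative sketch only: it encodes the stated thresholds and is not the clustering procedure UniRef actually runs. The names same_uniref100 and meets_threshold are hypothetical, and identity and overlap are assumed to be supplied as fractions between 0 and 1 by some external alignment step.

```python
# Thresholds taken from the cluster descriptions: (minimum identity,
# minimum overlap with the seed sequence), both expressed as fractions.
THRESHOLDS = {
    "UniRef90": (0.90, 0.80),
    "UniRef50": (0.50, 0.80),
}

# UniRef100 merges sub-fragments only if they have 11 or more residues.
MIN_FRAGMENT_LENGTH = 11


def same_uniref100(a: str, b: str) -> bool:
    """True if two sequences are identical, or one is a sub-fragment of
    the other with at least MIN_FRAGMENT_LENGTH residues."""
    if a == b:
        return True
    shorter, longer = sorted((a, b), key=len)
    return len(shorter) >= MIN_FRAGMENT_LENGTH and shorter in longer


def meets_threshold(level: str, identity: float, overlap: float) -> bool:
    """True if a sequence satisfies the identity and overlap cut-offs
    for the given cluster level relative to the seed sequence."""
    min_identity, min_overlap = THRESHOLDS[level]
    return identity >= min_identity and overlap >= min_overlap


if __name__ == "__main__":
    print(same_uniref100("MKTPSQTHVLGLAHPP", "SQTHVLGLAHP"))
    print(meets_threshold("UniRef90", identity=0.95, overlap=0.85))
    print(meets_threshold("UniRef50", identity=0.55, overlap=0.70))
```

Both conditions must hold at once: a highly identical match that covers less than 80% of the seed does not qualify under these rules.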