Protein

Probable beta-galactosidase C

Gene

lacC

Organism
Penicillium marneffei (strain ATCC 18224 / CBS 334.59 / QM 7333)
Status
Reviewed. Annotation score: 3 out of 5. Protein inferred from homology.

Function

Cleaves beta-linked terminal galactosyl residues from gangliosides, glycoproteins, and glycosaminoglycans (by similarity).

Catalytic activity

Hydrolysis of terminal non-reducing beta-D-galactose residues in beta-D-galactosides.

Sites

Feature key   Position(s)  Length  Description   Evidence
Binding site  80 – 80      1       Substrate     By similarity
Binding site  125 – 125    1       Substrate     By similarity
Binding site  127 – 127    1       Substrate     By similarity
Binding site  185 – 185    1       Substrate     By similarity
Active site   186 – 186    1       Proton donor  Sequence analysis
Binding site  249 – 249    1       Substrate     By similarity
Active site   285 – 285    1       Nucleophile   Sequence analysis
Binding site  351 – 351    1       Substrate     By similarity
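
The positions in the table above are 1-based, so looking them up in a programming language with 0-based strings needs an offset of one. The following is a minimal sketch of that lookup, using the first 300 residues copied from the Sequence section of this entry; the helper name residue_at is hypothetical and not part of any UniProt tool.

```python
# First 300 residues of B6QHA9, copied row by row from the Sequence section.
# The spaces between blocks of ten are kept for easy comparison and removed below.
ROWS = [
    "MFFFRFLTTV LLLFNAKLLV AQSSNTSSPV HWDKYSLSIN GERLFVFAGE",
    "FHYIRLPVPE LWLDVFQKLK ANGFNAISVY FYWNHHSASE GVYDFETGGH",
    "NVQRLFDYAK QAGVYIIARP GPYANGELSA GGYALWAANG RLGGERTRDS",
    "QYYDLWSPWM TKIGKIIAAN QITEGGPVIL VQHENELQET THRANNTLVL",
    "YMEQITQILD AAGIVVPSTH NEKGMRSMSW SMDYEDVGGA VNIYGLDSYP",
    "GGLSCTNPNA GFNLIRTYYQ WFQNYSYTQP EYLAEFQGGY FTPWGGVFYD",
]
SEQ_1_300 = "".join(ROWS).replace(" ", "")


def residue_at(seq: str, position: int) -> str:
    """Return the residue at a 1-based position (hypothetical helper)."""
    if not 1 <= position <= len(seq):
        raise IndexError(f"position {position} outside 1..{len(seq)}")
    return seq[position - 1]


# Annotated active-site positions from the Sites table.
active_sites = {186: "Proton donor", 285: "Nucleophile"}
for pos, role in active_sites.items():
    print(pos, role, residue_at(SEQ_1_300, pos))
```

Both annotated active-site positions resolve to glutamate (E) in this fragment. The binding site at position 351 lies outside the 300 residues used here and would need the full sequence.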

GO - Molecular function

  1. beta-galactosidase activity Source: UniProtKB-EC

GO - Biological process

  1. polysaccharide catabolic process Source: UniProtKB-KW

Keywords - Molecular function

Glycosidase, Hydrolase

Keywords - Biological process

Carbohydrate metabolism, Polysaccharide degradation

Names & Taxonomy

Protein names
Recommended name: Probable beta-galactosidase C (EC:3.2.1.23)
Alternative name(s): Lactase C
Gene names
Name: lacC
ORF names: PMAA_093700
Organism: Penicillium marneffei (strain ATCC 18224 / CBS 334.59 / QM 7333)
Taxonomic identifier: 441960 [NCBI]
Taxonomic lineage: Eukaryota > Fungi > Dikarya > Ascomycota > Pezizomycotina > Eurotiomycetes > Eurotiomycetidae > Eurotiales > Trichocomaceae > Talaromyces
Proteomes: UP000001294. Component: Unassembled WGS sequence

Subcellular location

Secreted (by similarity)

GO - Cellular component

  1. extracellular region Source: UniProtKB-SubCell

Keywords - Cellular component

Secreted

PTM / Processing

Molecule processing

Feature key     Position(s)  Length  Description                    Feature identifier  Evidence
Signal peptide  1 – 21       21                                                         Sequence analysis
Chain           22 – 999     978     Probable beta-galactosidase C  PRO_0000395240
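
The Length column follows from the inclusive 1-based coordinates: length = end - start + 1. A minimal sketch of that arithmetic, with a hypothetical Feature type that is not part of any UniProt API, is:

```python
from typing import NamedTuple


class Feature(NamedTuple):
    """A sequence feature with 1-based, inclusive coordinates (hypothetical type)."""
    key: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Both endpoints are counted, hence the +1.
        return self.end - self.start + 1

    def as_slice(self) -> slice:
        # Convert to a 0-based, end-exclusive Python slice.
        return slice(self.start - 1, self.end)


signal_peptide = Feature("Signal peptide", 1, 21)
chain = Feature("Chain", 22, 999)

print(signal_peptide.length)  # 21
print(chain.length)           # 978
print(signal_peptide.length + chain.length)  # 999, the full precursor length
```

Applying chain.as_slice() to the full precursor sequence would give the mature chain after removal of the predicted signal peptide.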

Amino acid modifications

Feature key     Position(s)  Length  Description           Evidence
Glycosylation   25 – 25      1       N-linked (GlcNAc...)  Sequence analysis
Glycosylation   195 – 195    1       N-linked (GlcNAc...)  Sequence analysis
Disulfide bond  255 ↔ 302                                  By similarity
Glycosylation   274 – 274    1       N-linked (GlcNAc...)  Sequence analysis
Glycosylation   389 – 389    1       N-linked (GlcNAc...)  Sequence analysis
Glycosylation   441 – 441    1       N-linked (GlcNAc...)  Sequence analysis
Glycosylation   512 – 512    1       N-linked (GlcNAc...)  Sequence analysis
Glycosylation   519 – 519    1       N-linked (GlcNAc...)  Sequence analysis
Glycosylation   600 – 600    1       N-linked (GlcNAc...)  Sequence analysis
Glycosylation   675 – 675    1       N-linked (GlcNAc...)  Sequence analysis
Glycosylation   713 – 713    1       N-linked (GlcNAc...)  Sequence analysis
Glycosylation   757 – 757    1       N-linked (GlcNAc...)  Sequence analysis
Glycosylation   808 – 808    1       N-linked (GlcNAc...)  Sequence analysis
Glycosylation   897 – 897    1       N-linked (GlcNAc...)  Sequence analysis
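
The glycosylation sites above are predictions from sequence analysis. The entry does not state which rule produced them, so the sketch below is an assumption: it scans for the common N-X-S/T sequon (X being any residue except proline) in the first 300 residues copied from the Sequence section. The function name find_sequons is hypothetical.

```python
import re

# First 300 residues of B6QHA9, copied row by row from the Sequence section.
ROWS = [
    "MFFFRFLTTV LLLFNAKLLV AQSSNTSSPV HWDKYSLSIN GERLFVFAGE",
    "FHYIRLPVPE LWLDVFQKLK ANGFNAISVY FYWNHHSASE GVYDFETGGH",
    "NVQRLFDYAK QAGVYIIARP GPYANGELSA GGYALWAANG RLGGERTRDS",
    "QYYDLWSPWM TKIGKIIAAN QITEGGPVIL VQHENELQET THRANNTLVL",
    "YMEQITQILD AAGIVVPSTH NEKGMRSMSW SMDYEDVGGA VNIYGLDSYP",
    "GGLSCTNPNA GFNLIRTYYQ WFQNYSYTQP EYLAEFQGGY FTPWGGVFYD",
]
SEQ_1_300 = "".join(ROWS).replace(" ", "")


def find_sequons(seq: str) -> list[int]:
    """Return 1-based positions of Asn in N-X-S/T motifs, X != Pro.

    The lookahead keeps X and S/T unconsumed, so adjacent or
    overlapping candidate sites are all tested.
    """
    return [m.start() + 1 for m in re.finditer(r"N(?=[^P][ST])", seq)]


print(find_sequons(SEQ_1_300))  # [25, 195, 274]
```

Within this fragment the simple motif scan reproduces the three annotated sites at positions 25, 195 and 274. Whether it reproduces the remaining ten sites across the full sequence is not checked here.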

Keywords - PTM

Disulfide bond, Glycoprotein

Structure

3D structure databases

ProteinModelPortal: B6QHA9.

Family & Domains

Sequence similarities

Belongs to the glycosyl hydrolase 35 family. (Curated)

Keywords - Domain

Signal

Phylogenomic databases

OrthoDB: EOG7ZGXBD.

Family and domain databases

Gene3D: 2.102.20.10. 1 hit.
        2.60.120.260. 2 hits.
        2.60.390.10. 1 hit.
        3.20.20.80. 1 hit.
InterPro: IPR018954. Betagal_dom2.
          IPR025972. BetaGal_dom3.
          IPR025300. BetaGal_jelly_roll_dom.
          IPR008979. Galactose-bd-like.
          IPR013781. Glyco_hydro_catalytic_dom.
          IPR001944. Glycoside_Hdrlase_35.
          IPR017853. Glycoside_hydrolase_SF.
PANTHER: PTHR23421. PTHR23421. 1 hit.
Pfam: PF10435. BetaGal_dom2. 1 hit.
      PF13363. BetaGal_dom3. 1 hit.
      PF13364. BetaGal_dom4_5. 2 hits.
      PF01301. Glyco_hydro_35. 1 hit.
PRINTS: PR00742. GLHYDRLASE35.
SUPFAM: SSF117100. SSF117100. 1 hit.
        SSF49785. SSF49785. 2 hits.
        SSF51445. SSF51445. 1 hit.

Sequence

Sequence status: Complete.

Sequence processing: The displayed sequence is further processed into a mature form.

B6QHA9-1 [UniParc]

        10         20         30         40         50
MFFFRFLTTV LLLFNAKLLV AQSSNTSSPV HWDKYSLSIN GERLFVFAGE
        60         70         80         90        100
FHYIRLPVPE LWLDVFQKLK ANGFNAISVY FYWNHHSASE GVYDFETGGH
       110        120        130        140        150
NVQRLFDYAK QAGVYIIARP GPYANGELSA GGYALWAANG RLGGERTRDS
       160        170        180        190        200
QYYDLWSPWM TKIGKIIAAN QITEGGPVIL VQHENELQET THRANNTLVL
       210        220        230        240        250
YMEQITQILD AAGIVVPSTH NEKGMRSMSW SMDYEDVGGA VNIYGLDSYP
       260        270        280        290        300
GGLSCTNPNA GFNLIRTYYQ WFQNYSYTQP EYLAEFQGGY FTPWGGVFYD
       310        320        330        340        350
DCASMLQPEY ADVFYKNNIG NRVTLQSLYM AYGGTNWGHI AAPVVYTSYD
       360        370        380        390        400
YSAPLRETRE IRDKLKQTKL LGLFTRVSPD LLQTEMEGNG TSYTTGANIF
       410        420        430        440        450
TWALRNPETN AGFYVVAQDD SSSTTDVVFD LEVETSAGSV NITNIGLDGR
       460        470        480        490        500
QSKIITTDYK VGDTTLLYCS ADILTYATLD VDVLALYLNK GQTGTFVLAN
       510        520        530        540        550
AASHLKYTVY GNSTVTSSNS SQGTIYTYTQ GQGISAIKFS NRFLVYLLDK
       560        570        580        590        600
YTAWDFFAPP LQLSDPNVKP NEHIFVIGPY LVREATIKGR TLELTGDNQN
       610        620        630        640        650
TTSIEIYHGN PFITSITWNG KHLSTKRTAY GSLTATIPGA EAITITLPKL
       660        670        680        690        700
TSWKSHDMIP EIDPEYDDSN WVVCNKTTSF NAIAPLSLPV LYSGDYGYHA
       710        720        730        740        750
GPKIYRGRFG STNATGVTVT AQNGNAAGWS AWLNGIYIGG VTGDPSIEAT
       760        770        780        790        800
SAVLKFNSST TLKQEGSENV LTVLVDYTGH DEDNVKPARA QNPRGLLGVI
       810        820        830        840        850
FEGSTSTNFT SWKLQGNAGG EKNIDALRGP MNEGGFYGER LGWHLPGFEP
       860        870        880        890        900
STKSGWDTRA PSDGVDGGSH RFYITEFTLD LGPNSHALDV PIGIHLNASS
       910        920        930        940        950
TSGPAVAYVW LNGYKFAHYL PHIGPQTVFP FQPGVLNIQG SEGHKRKNTL
       960        970        980        990
AVSLWALTDQ PAALDVVELV AYGKYTSSFD FARDWSYLQP RWVDRSKYA
Length: 999
Mass (Da): 110,430
Last modified: December 16, 2008 - v1
Checksum: CD1A02A994EB2050
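
The sequence is displayed in blocks of ten residues under position rulers, which has to be undone before the sequence can be used programmatically. A minimal sketch of that clean-up follows; it assumes ruler lines contain only digits and whitespace, the function name parse_sequence_block is hypothetical, and only the first two rows of the entry are used as sample input.

```python
def parse_sequence_block(text: str) -> str:
    """Join a ruler-and-blocks sequence display into one residue string.

    Lines made up only of numbers are treated as position rulers and
    skipped; all other non-empty lines contribute their blocks.
    """
    residues = []
    for line in text.splitlines():
        tokens = line.split()
        if not tokens or all(tok.isdigit() for tok in tokens):
            continue  # blank line or position ruler
        residues.extend(tokens)
    return "".join(residues)


# First two rows of the display, copied from this entry.
SAMPLE = """
        10         20         30         40         50
MFFFRFLTTV LLLFNAKLLV AQSSNTSSPV HWDKYSLSIN GERLFVFAGE
        60         70         80         90        100
FHYIRLPVPE LWLDVFQKLK ANGFNAISVY FYWNHHSASE GVYDFETGGH
"""

sequence = parse_sequence_block(SAMPLE)
print(len(sequence))  # 100
print(sequence[:10])  # MFFFRFLTTV
```

Run over the whole display, the resulting string should have the stated length of 999, which is a quick sanity check on the copy.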

Sequence databases

EMBL/GenBank/DDBJ: DS995902 Genomic DNA. Translation: EEA22754.1.
RefSeq: XP_002148921.1. XM_002148885.1.

Genome annotation databases

GeneID: 7026488.

Publications

  1. "The genome sequence of Penicillium marneffei strain ATCC 18224."
    Fedorova N.D., Joardar V.S., Maiti R., Schobel S., Amedeo P., Galens K., Inman J.M., White O.R., Whitty B.R., Wortman J.R., Nierman W.C.
    Submitted (SEP-2007) to the EMBL/GenBank/DDBJ databases
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
    Strain: ATCC 18224 / CBS 334.59 / QM 7333.

Entry information

Entry name: BGALC_PENMQ
Accession: Primary (citable) accession number: B6QHA9
Entry history:
Integrated into UniProtKB/Swiss-Prot: July 13, 2010
Last sequence update: December 16, 2008
Last modified: January 7, 2015
This is version 36 of the entry and version 1 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Fungal Protein Annotation Program

Miscellaneous

Keywords - Technical term

Complete proteome, Reference proteome

Documents

  1. Glycosyl hydrolases
    Classification of glycosyl hydrolase families and list of entries
  2. SIMILARITY comments
    Index of protein domains and families

Similar proteins

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%: UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into a single UniRef entry.
90%: UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a. the seed sequence).
50%: UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.