Protein

Probable endo-beta-1,4-glucanase D

Gene

eglD

Organism
Aspergillus kawachii (strain NBRC 4308) (White koji mold) (Aspergillus awamori var. kawachi)
Status
Reviewed. Annotation score: 4 out of 5. Protein inferred from homology.

Function

Has endoglucanase activity on substrates containing beta-1,4 glycosidic bonds, such as carboxymethylcellulose (CMC), hydroxyethylcellulose (HEC) and beta-glucan. Involved in the degradation of complex natural cellulosic substrates (By similarity).

Catalytic activity

Endohydrolysis of (1->4)-beta-D-glucosidic linkages in cellulose, lichenin and cereal beta-D-glucans.

Sites

Feature key   Position(s)   Length   Description
Active site   167           1        Proton donor (By similarity)
Active site   213           1        Nucleophile (Sequence analysis)

GO - Molecular function

  1. cellulase activity Source: UniProtKB-EC
  2. cellulose binding Source: InterPro

GO - Biological process

  1. cellulose catabolic process Source: UniProtKB-KW

Keywords - Molecular function

Glycosidase, Hydrolase

Keywords - Biological process

Carbohydrate metabolism, Cellulose degradation, Polysaccharide degradation

Protein family/group databases

CAZy: CBM1. Carbohydrate-Binding Module Family 1.
GH61. Glycoside Hydrolase Family 61.

Names & Taxonomy

Protein names
Recommended name:
Probable endo-beta-1,4-glucanase D (EC:3.2.1.4)
Short name:
Endoglucanase D
Alternative name(s):
Carboxymethylcellulase D
Cellulase 61A
Cellulase D
Gene names
Name: eglD
Synonyms: cel61A
ORF Names: AKAW_08531
Organism: Aspergillus kawachii (strain NBRC 4308) (White koji mold) (Aspergillus awamori var. kawachi)
Taxonomic identifier: 1033177 [NCBI]
Taxonomic lineage: Eukaryota > Fungi > Dikarya > Ascomycota > Pezizomycotina > Eurotiomycetes > Eurotiomycetidae > Eurotiales > Aspergillaceae > Aspergillus
Proteomes: UP000006812. Component: Unassembled WGS sequence

Subcellular location

Secreted (By similarity)

GO - Cellular component

  1. extracellular region Source: UniProtKB-SubCell

Keywords - Cellular component

Secreted

PTM / Processing

Molecule processing

Feature key      Position(s)   Length   Description                          Feature identifier
Signal peptide   1 – 20        20       Sequence analysis
Chain            21 – 408      388      Probable endo-beta-1,4-glucanase D   PRO_0000394064

Amino acid modifications

Feature key      Position(s)   Length   Description
Glycosylation    151           1        N-linked (GlcNAc...) (Sequence analysis)
Glycosylation    331           1        N-linked (GlcNAc...) (Sequence analysis)
Disulfide bond   377 ↔ 394              By similarity
Glycosylation    381           1        N-linked (GlcNAc...) (Sequence analysis)
Disulfide bond   388 ↔ 404              By similarity

Keywords - PTM

Disulfide bond, Glycoprotein

Family & Domains

Domains and Repeats

Feature key   Position(s)   Length   Description
Domain        369 – 405     37       CBM1 (PROSITE-ProRule annotation)

Region

Feature key   Position(s)   Length   Description
Region        21 – 237      217      Catalytic
Region        238 – 254     17       Ser/Thr-rich linker

Domain

Has a modular structure: an endo-beta-1,4-glucanase catalytic module at the N-terminus, a linker rich in serines and threonines, and a C-terminal carbohydrate-binding module (CBM). The genes for catalytic modules and CBMs seem to have evolved separately and have been linked by gene fusion.
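The modular layout described above can be laid out from the coordinates given in the Molecule processing, Domains and Region tables. The following is a minimal sketch, not part of the entry itself: the `Feature` type and the `unannotated_gaps` helper are hypothetical names introduced for illustration, and the coordinates are copied from this entry (1-based, inclusive).

```python
from typing import NamedTuple


class Feature(NamedTuple):
    """A sequence feature with 1-based, inclusive coordinates (hypothetical type)."""
    name: str
    start: int
    end: int

    @property
    def length(self) -> int:
        return self.end - self.start + 1


# Coordinates copied from the feature tables of this entry.
FEATURES = [
    Feature("signal peptide", 1, 20),
    Feature("catalytic region", 21, 237),
    Feature("Ser/Thr-rich linker", 238, 254),
    Feature("CBM1 domain", 369, 405),
]
PROTEIN_LENGTH = 408


def unannotated_gaps(features, total_length):
    """Return (start, end) ranges that no listed feature covers."""
    gaps = []
    cursor = 1  # next position not yet accounted for
    for feature in sorted(features, key=lambda f: f.start):
        if feature.start > cursor:
            gaps.append((cursor, feature.start - 1))
        cursor = max(cursor, feature.end + 1)
    if cursor <= total_length:
        gaps.append((cursor, total_length))
    return gaps


for feature in FEATURES:
    print(f"{feature.name}: {feature.start}-{feature.end} ({feature.length} aa)")
print("not covered by the listed features:", unannotated_gaps(FEATURES, PROTEIN_LENGTH))
```

The computed lengths can be compared with the Length column of the tables. The gap report only reflects the features listed here; it says nothing about what the uncovered stretches do.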

Sequence similarities

Belongs to the glycosyl hydrolase 61 family. (Curated)
Contains 1 CBM1 (fungal-type carbohydrate-binding) domain. (PROSITE-ProRule annotation)

Keywords - Domain

Signal

Phylogenomic databases

InParanoid: Q96WQ9.
OrthoDB: EOG7KM64H.

Family and domain databases

InterPro: IPR000254. Cellulose-bd_dom_fun.
IPR005103. Glyco_hydro_61.
Pfam: PF00734. CBM_1. 1 hit.
PF03443. Glyco_hydro_61. 1 hit.
ProDom: PD001821. CBD_fun. 1 hit.
SMART: SM00236. fCBD. 1 hit.
SUPFAM: SSF57180. SSF57180. 1 hit.
PROSITE: PS00562. CBM1_1. 1 hit.
PS51164. CBM1_2. 1 hit.

Sequence

Sequence status: Complete.

Sequence processing: The displayed sequence is further processed into a mature form.

Q96WQ9-1

        10         20         30         40         50
MKTTTYSLLA LAAASKLASA HTTVQAVWIN GEDQGLGNTD DGYIRSPPSN
        60         70         80         90        100
SPVTDVTSTD MTCNVNGDQA ASKTLSVKAG DVVTFEWHHS DRSDSDDIIA
       110        120        130        140        150
SSHKGPVQVY MAPTAKGSNG NNWVKIAEDG YHKSSDEWAT DILIANKGKH
       160        170        180        190        200
NITVPDVPAG NYLFRPEIIA LHEGNREGGA QFYMECVQFK VTSDGSNELP
       210        220        230        240        250
SGVSIPGVYT ATDPGILFDI YNSFDSYPIP GPDVWDGSSS GSSSSGSSSA
       260        270        280        290        300
AVSSAAAAAT TSAVAATTPA TQAAVEVSSS AAAATTEAAA PVVSSAAPVQ
       310        320        330        340        350
QATSAVTSQA QAAPTTFATS SKKSSKTACK NKTKSNSQVA AATSSVVAPA
       360        370        380        390        400
ATSSVVPVVS ASASASAGGV AKQYERCGGI NHTGPTTCES GSVCKKWNPY

YYQCVASQ
Length: 408
Mass (Da): 41,650
Last modified: November 30, 2001 - v1
Checksum: B7CA86C9019F0089
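
The displayed sequence can be cross-checked against the positional annotations in this entry (signal peptide 1-20, chain 21-408, N-glycosylation at 151, 331 and 381, disulfide bonds 377-394 and 388-404, active sites 167 and 213). The sketch below is illustrative only. The helper names `parse_rows`, `find_sequons` and `residue_at` are hypothetical, and the N-X-S/T rule (X not proline) is a common textbook heuristic assumed here; it is not confirmed to be the method behind the entry's "Sequence analysis" evidence.

```python
import re

# Sequence rows as displayed in this entry, with the ruler lines left out.
DISPLAYED_ROWS = """
MKTTTYSLLA LAAASKLASA HTTVQAVWIN GEDQGLGNTD DGYIRSPPSN
SPVTDVTSTD MTCNVNGDQA ASKTLSVKAG DVVTFEWHHS DRSDSDDIIA
SSHKGPVQVY MAPTAKGSNG NNWVKIAEDG YHKSSDEWAT DILIANKGKH
NITVPDVPAG NYLFRPEIIA LHEGNREGGA QFYMECVQFK VTSDGSNELP
SGVSIPGVYT ATDPGILFDI YNSFDSYPIP GPDVWDGSSS GSSSSGSSSA
AVSSAAAAAT TSAVAATTPA TQAAVEVSSS AAAATTEAAA PVVSSAAPVQ
QATSAVTSQA QAAPTTFATS SKKSSKTACK NKTKSNSQVA AATSSVVAPA
ATSSVVPVVS ASASASAGGV AKQYERCGGI NHTGPTTCES GSVCKKWNPY
YYQCVASQ
"""


def parse_rows(text):
    """Join the 10-residue blocks into one string by dropping all whitespace."""
    return "".join(text.split())


def residue_at(seq, position):
    """Return the residue at a 1-based position."""
    return seq[position - 1]


def find_sequons(seq):
    """Return 1-based positions of N in N-X-[S/T] motifs where X is not proline.

    A lookahead is used so that overlapping motifs are all reported.
    """
    return [m.start() + 1 for m in re.finditer(r"N(?=[^P][ST])", seq)]


seq = parse_rows(DISPLAYED_ROWS)
signal_peptide = seq[:20]   # positions 1-20
mature_chain = seq[20:]     # positions 21-408

print("length:", len(seq))
print("signal peptide:", signal_peptide)
print("mature chain length:", len(mature_chain))
print("candidate N-glycosylation sites:", find_sequons(seq))

# Each annotated disulfide bond should join two cysteines.
for a, b in [(377, 394), (388, 404)]:
    print(f"disulfide {a}-{b}:", residue_at(seq, a), residue_at(seq, b))

# Residues at the annotated active-site positions.
for position in (167, 213):
    print(f"active site {position}:", residue_at(seq, position))
```

Run on the rows above, the motif scan returns exactly the three annotated glycosylation positions, and all four disulfide positions are cysteines. A motif match only marks a candidate site; it is not evidence that the site is glycosylated.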

Sequence databases

EMBL/GenBank/DDBJ: AB055432 Genomic DNA. Translation: BAB62318.1.
DF126473 Genomic DNA. Translation: GAA90417.1.

Publications

  1. "Cloning and sequence analysis of endoglucanase genes from an industrial fungus, Aspergillus kawachii."
    Hara Y., Hinoki Y., Shimoi H., Ito K.
    Biosci. Biotechnol. Biochem. 67:2010-2013(2003)
    Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA].
    Strain: NBRC 4308.
  2. "Genome sequence of the white koji mold Aspergillus kawachii IFO 4308, used for brewing the Japanese distilled spirit shochu."
    Futagami T., Mori K., Yamashita A., Wada S., Kajiwara Y., Takashita H., Omori T., Takegawa K., Tashiro K., Kuhara S., Goto M.
    Eukaryot. Cell 10:1586-1587(2011)
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
    Strain: NBRC 4308.

Entry information

Entry name: EGLD_ASPKW
Accession: Primary (citable) accession number: Q96WQ9
Secondary accession number(s): G7XU08
Entry history
Integrated into UniProtKB/Swiss-Prot: May 17, 2010
Last sequence update: November 30, 2001
Last modified: March 31, 2015
This is version 48 of the entry and version 1 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Fungal Protein Annotation Program

Miscellaneous

Keywords - Technical term

Complete proteome, Reference proteome

Documents

  1. Glycosyl hydrolases
    Classification of glycosyl hydrolase families and list of entries
  2. SIMILARITY comments
    Index of protein domains and families

Similar proteins

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%: UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into a single UniRef entry.
90%: UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (also known as the seed sequence).
50%: UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.