Protein

Endoglucanase

Gene

bglC

Organism
Bacillus subtilis
Status
Reviewed - Annotation score: 3 out of 5 - Protein inferred from homology

Function

Catalytic activity

Endohydrolysis of (1->4)-beta-D-glucosidic linkages in cellulose, lichenin and cereal beta-D-glucans.

Sites

Feature key   Position(s)  Description   Evidence       Length
Binding site  65           Substrate     By similarity  1
Binding site  96           Substrate     By similarity  1
Binding site  131          Substrate     By similarity  1
Active site   169          Proton donor  By similarity  1
Binding site  231          Substrate     By similarity  1
Active site   257          Nucleophile   By similarity  1
Binding site  291          Substrate     By similarity  1
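
The positions in this table are 1-based indices into the precursor sequence given in the Sequence section of this entry. As a minimal sketch, assuming that sequence is copied into a Python string, the residue at each annotated site can be read off as follows. The names `SEQUENCE`, `SITES` and `residue_at` are illustrative and are not part of any UniProt tool or API.

```python
# Precursor sequence of P23549, copied from the Sequence section of this entry.
# split()/join() removes the spaces and line breaks between 10-residue blocks.
SEQUENCE = "".join("""
MKRSISIFIT CLLITLLTMG GMLASPASAA GTKTPVAKNG QLSIKGTQLV
NRDGKAVQLK GISSHGLQWY GEYVNKDSLK WLRDDWGITV FRAAMYTADG
GIIDNPSVKN KMKEAVEAAK ELGIYVIIDW HILNDGNPNQ NKEKAKEFFK
EMSSLYGNTP NVIYEIANEP NGDVNWKRDI KPYAEEVISV IRKNDPDNII
IVGTGTWSQD VNDAADDQLK DANVMDALHF YAGTHGQFLR DKANYALSKG
APIFVTEWGT SDASGNGGVF LDQSREWLKY LDSKTISWVN WNLSDKQESS
SALKPGASKT GGWRLSDLSA SGTFVRENIL GTKDSTKDIP ETPAKDKPTQ
ENGISVQYRA GDGSMNSNQI RPQLQIKNNG NTTVDLKDVT ARYWYNAKNK
GQNVDCDYAQ LGCGNVTYKF VTLHKPKQGA DTYLELGFKN GTLAPGASTG
NIQLRLHNDD WSNYAQSGDY SFFKSNTFKT TKKITLYDQG KLIWGTEPN
""".split())

# Annotated sites from the table above (1-based position -> role).
SITES = {
    65: "Binding site (substrate)",
    96: "Binding site (substrate)",
    131: "Binding site (substrate)",
    169: "Active site (proton donor)",
    231: "Binding site (substrate)",
    257: "Active site (nucleophile)",
    291: "Binding site (substrate)",
}


def residue_at(sequence: str, position: int) -> str:
    """Return the one-letter residue at a 1-based sequence position."""
    if not 1 <= position <= len(sequence):
        raise ValueError(f"position {position} is outside 1..{len(sequence)}")
    return sequence[position - 1]


if __name__ == "__main__":
    for position, role in sorted(SITES.items()):
        print(f"{position:>4}  {residue_at(SEQUENCE, position)}  {role}")
```

With the sequence as displayed in this entry, both annotated active-site positions (169 and 257) hold a glutamate (E).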

GO - Molecular function

GO - Biological process

Keywords

Molecular function: Glycosidase, Hydrolase
Biological process: Carbohydrate metabolism, Cellulose degradation, Polysaccharide degradation

Protein family/group databases

CAZy:
  CBM3. Carbohydrate-Binding Module Family 3.
  GH5. Glycoside Hydrolase Family 5.

Names & Taxonomy

Protein names
Recommended name: Endoglucanase (EC 3.2.1.4)
Alternative name(s):
  Carboxymethyl-cellulase (short names: CMCase, Cellulase)
  Endo-1,4-beta-glucanase
Gene names
Name: bglC
Organism: Bacillus subtilis
Taxonomic identifier: 1423 [NCBI]
Taxonomic lineage: Bacteria > Firmicutes > Bacilli > Bacillales > Bacillaceae > Bacillus

PTM / Processing

Molecule processing

Feature key     Position(s)  Description                     Length
Signal peptide  1 – 29       -                               29
Chain           30 – 499     Endoglucanase (PRO_0000007841)  470
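
Feature positions are 1-based and inclusive, so the length of a feature is `end - start + 1`. A short sketch of that arithmetic, using illustrative names (`feature_length`, `PRECURSOR_LENGTH`) that are not part of any UniProt API, checks that the signal peptide and the mature chain together account for the whole 499-residue precursor:

```python
PRECURSOR_LENGTH = 499  # length reported for the full precursor sequence

# (start, end) pairs taken from the table above; both ends are inclusive.
SIGNAL_PEPTIDE = (1, 29)
CHAIN = (30, 499)


def feature_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive start and end positions."""
    if start < 1 or end < start:
        raise ValueError(f"invalid feature range {start}-{end}")
    return end - start + 1


signal_length = feature_length(*SIGNAL_PEPTIDE)
chain_length = feature_length(*CHAIN)

# The two features are adjacent and together cover the precursor exactly.
assert CHAIN[0] == SIGNAL_PEPTIDE[1] + 1
assert signal_length + chain_length == PRECURSOR_LENGTH

if __name__ == "__main__":
    print(f"signal peptide: {signal_length} residues")
    print(f"mature chain:   {chain_length} residues")
```

In zero-based Python slicing, the mature chain of a precursor string `seq` would be `seq[29:]`, since position 30 corresponds to index 29.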

Proteomic databases

PaxDb: P23549.

Structure

3D structure databases

ProteinModelPortal: P23549.
SMR: P23549.

Family & Domains

Domains and Repeats

Feature key  Position(s)  Description  Evidence                    Length
Domain       350 – 499    CBM3         PROSITE-ProRule annotation  150

Region

Feature key  Position(s)  Description        Evidence       Length
Region       69 – 70      Substrate binding  By similarity  2
Region       263 – 264    Substrate binding  By similarity  2
Region       296 – 298    Substrate binding  By similarity  3

Sequence similarities

Keywords - Domain

Signal

Phylogenomic databases

eggNOG:
  ENOG4107QWR. Bacteria.
  COG2730. LUCA.

Family and domain databases

Gene3D:
  2.60.40.710. 1 hit.
InterPro:
  IPR008965. Carb-bd_dom.
  IPR001956. CBD_3.
  IPR001547. Glyco_hydro_5.
  IPR018087. Glyco_hydro_5_CS.
  IPR017853. Glycoside_hydrolase_SF.
Pfam:
  PF00942. CBM_3. 1 hit.
  PF00150. Cellulase. 1 hit.
SMART:
  SM01067. CBM_3. 1 hit.
SUPFAM:
  SSF49384. SSF49384. 1 hit.
  SSF51445. SSF51445. 1 hit.
PROSITE:
  PS51172. CBM3. 1 hit.
  PS00659. GLYCOSYL_HYDROL_F5. 1 hit.

Sequence

Sequence status: Complete.

Sequence processing: The displayed sequence is further processed into a mature form.

P23549-1 [UniParc]

        10         20         30         40         50
MKRSISIFIT CLLITLLTMG GMLASPASAA GTKTPVAKNG QLSIKGTQLV
        60         70         80         90        100
NRDGKAVQLK GISSHGLQWY GEYVNKDSLK WLRDDWGITV FRAAMYTADG
       110        120        130        140        150
GIIDNPSVKN KMKEAVEAAK ELGIYVIIDW HILNDGNPNQ NKEKAKEFFK
       160        170        180        190        200
EMSSLYGNTP NVIYEIANEP NGDVNWKRDI KPYAEEVISV IRKNDPDNII
       210        220        230        240        250
IVGTGTWSQD VNDAADDQLK DANVMDALHF YAGTHGQFLR DKANYALSKG
       260        270        280        290        300
APIFVTEWGT SDASGNGGVF LDQSREWLKY LDSKTISWVN WNLSDKQESS
       310        320        330        340        350
SALKPGASKT GGWRLSDLSA SGTFVRENIL GTKDSTKDIP ETPAKDKPTQ
       360        370        380        390        400
ENGISVQYRA GDGSMNSNQI RPQLQIKNNG NTTVDLKDVT ARYWYNAKNK
       410        420        430        440        450
GQNVDCDYAQ LGCGNVTYKF VTLHKPKQGA DTYLELGFKN GTLAPGASTG
       460        470        480        490
NIQLRLHNDD WSNYAQSGDY SFFKSNTFKT TKKITLYDQG KLIWGTEPN
Length: 499
Mass (Da): 55,169
Last modified: November 1, 1991 - v1
Checksum: 2E821E3D8BBACA04
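
The sequence above is displayed in blocks of 10 residues, with ruler lines giving the position of the last residue in each block. As a rough sketch of how such a display could be turned back into a plain sequence string and a FASTA record, the Python below parses a two-row excerpt. The function names `parse_display` and `to_fasta` are illustrative, and the header string is an assumption modelled on the usual `sp|accession|entry name` pattern rather than the exact header UniProt would emit.

```python
import textwrap

# First two rows of the display above (residues 1-100), rulers included.
DISPLAY_EXCERPT = """\
        10         20         30         40         50
MKRSISIFIT CLLITLLTMG GMLASPASAA GTKTPVAKNG QLSIKGTQLV
        60         70         80         90        100
NRDGKAVQLK GISSHGLQWY GEYVNKDSLK WLRDDWGITV FRAAMYTADG
"""


def parse_display(text: str) -> str:
    """Join residue blocks into one string, skipping numeric ruler lines."""
    residues = []
    for line in text.splitlines():
        tokens = line.split()
        # Blank lines and ruler lines (numbers only) carry no residues.
        if not tokens or all(token.isdigit() for token in tokens):
            continue
        residues.extend(tokens)
    return "".join(residues)


def to_fasta(header: str, sequence: str, width: int = 60) -> str:
    """Format a sequence as a FASTA record wrapped at `width` characters."""
    lines = [">" + header] + textwrap.wrap(sequence, width)
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    excerpt = parse_display(DISPLAY_EXCERPT)
    # Hypothetical header; "(excerpt)" marks that only 100 residues are used.
    print(to_fasta("sp|P23549|GUN3_BACIU (excerpt)", excerpt), end="")
```

Applied to all ten rows of the display, `parse_display` should return a string whose length matches the reported value of 499.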

Sequence databases

EMBL / GenBank / DDBJ: D01057 Genomic DNA. Translation: BAA00859.1.
PIR: JN0111.

Similar proteins

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%: UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%: UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a. seed sequence).
50%: UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.
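
To make the identity and overlap criteria above concrete, here is a small Python sketch that evaluates them for two rows of an existing pairwise alignment. It is not the UniRef clustering procedure. It assumes the alignment is already given with `-` for gaps, that identity is counted over columns where both sequences have a residue, and that overlap is the fraction of the seed sequence covered by those columns. The function names and these definitions are assumptions made for illustration.

```python
def identity_and_overlap(seed_aln: str, member_aln: str) -> tuple[float, float]:
    """Return (identity, overlap) for two equal-length gapped alignment rows."""
    if len(seed_aln) != len(member_aln):
        raise ValueError("alignment rows must have the same length")
    # Columns in which both rows carry a residue (no gap on either side).
    aligned = [(s, m) for s, m in zip(seed_aln, member_aln)
               if s != "-" and m != "-"]
    if not aligned:
        return 0.0, 0.0
    matches = sum(1 for s, m in aligned if s == m)
    seed_length = sum(1 for s in seed_aln if s != "-")
    identity = matches / len(aligned)      # assumed definition of identity
    overlap = len(aligned) / seed_length   # assumed definition of overlap
    return identity, overlap


def meets_threshold(seed_aln: str, member_aln: str,
                    min_identity: float, min_overlap: float = 0.80) -> bool:
    """Check a pair against an identity threshold and an overlap threshold."""
    identity, overlap = identity_and_overlap(seed_aln, member_aln)
    return identity >= min_identity and overlap >= min_overlap


if __name__ == "__main__":
    seed = "MKRSISIFIT"      # first 10 residues of this entry
    variant = "MKRSISIFLT"   # hypothetical sequence with one substitution
    print(identity_and_overlap(seed, variant))
    print(meets_threshold(seed, variant, min_identity=0.90))
```

Under these assumptions, a fragment that matches the seed perfectly but covers less than 80% of it would still fail the overlap criterion.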

Entry information

Entry name: GUN3_BACIU
Accession: Primary (citable) accession number: P23549
Entry history:
  Integrated into UniProtKB/Swiss-Prot: November 1, 1991
  Last sequence update: November 1, 1991
  Last modified: July 5, 2017
  This is version 96 of the entry and version 1 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Prokaryotic Protein Annotation Program

Miscellaneous

Documents

  1. Glycosyl hydrolases
    Classification of glycosyl hydrolase families and list of entries
  2. SIMILARITY comments
    Index of protein domains and families