Protein

Endoglucanase

Gene

E1

Organism
Thermobifida fusca (Thermomonospora fusca)
Status
Unreviewed - Annotation score: 2 out of 5 - Protein inferred from homology

Function

Catalytic activity

Endohydrolysis of (1->4)-beta-D-glucosidic linkages in cellulose, lichenin and cereal beta-D-glucans. (UniRule annotation)

Keywords - Molecular function

Glycosidase (UniRule annotation), Hydrolase

Keywords - Biological process

Carbohydrate metabolism, Cellulose degradation (UniRule annotation), Polysaccharide degradation

Protein family/group databases

CAZy:
  CBM2. Carbohydrate-Binding Module Family 2.
  CBM4. Carbohydrate-Binding Module Family 4.
  GH9. Glycoside Hydrolase Family 9.

Names & Taxonomy

Protein names
Recommended name: Endoglucanase (EC 3.2.1.4) (UniRule annotation)
Gene names
Name: E1 (Imported)
Organism: Thermobifida fusca (Thermomonospora fusca) (Imported)
Taxonomic identifier: 2021 [NCBI]
Taxonomic lineage: Bacteria > Actinobacteria > Streptosporangiales > Nocardiopsaceae > Thermobifida

PTM / Processing

Molecule processing

Feature key      Position(s)   Length   Description                          Feature identifier
Signal peptide   1 – 32        32       (UniRule annotation)
Chain            33 – 974      942      Endoglucanase (UniRule annotation)   PRO_5005142371
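The Length column follows from the Position(s) column if the coordinates are read as 1-based and inclusive at both ends. That reading is an assumption here, though it is consistent with the values listed. A minimal sketch of the arithmetic, where the helper name `feature_length` and the `features` dictionary are illustrative only:

```python
def feature_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive start and end positions."""
    if start < 1 or end < start:
        raise ValueError("positions must satisfy 1 <= start <= end")
    return end - start + 1


# Positions copied from the Molecule processing table of this entry.
features = {"Signal peptide": (1, 32), "Chain": (33, 974)}

for name, (start, end) in features.items():
    print(f"{name}: {feature_length(start, end)} residues")

# The signal peptide and the mature chain together should cover the
# full-length precursor (974 residues according to this entry).
total = sum(feature_length(start, end) for start, end in features.values())
print(f"Total: {total} residues")
```

Under this convention the two features are contiguous (32 ends where 33 begins), so their lengths add up to the precursor length reported in the Sequence section.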

Interaction

Protein-protein interaction databases

STRING: 269800.Tfu_1627.

Structure

3D structure databases

ProteinModelPortal: Q08166.

Family & Domains

Domains and Repeats

Feature key   Position(s)   Length   Description
Domain        773 – 862     90       Fibronectin type-III (InterPro annotation)
Domain        864 – 974     111      CBM2 (carbohydrate binding type-2) (InterPro annotation)

Sequence similarities

Belongs to the glycosyl hydrolase 9 (cellulase E) family. (UniRule annotation)
Contains 1 fibronectin type-III domain. (UniRule annotation)

Keywords - Domain

Signal (UniRule annotation)

Phylogenomic databases

eggNOG:
  ENOG4105E08. Bacteria.
  ENOG410XNTA. LUCA.

Family and domain databases

Gene3D:
  1.50.10.10. 1 hit.
  2.60.120.260. 1 hit.
  2.60.40.10. 2 hits.
  2.60.40.290. 1 hit.
InterPro:
  IPR008928. 6-hairpin_glycosidase-like.
  IPR012341. 6hp_glycosidase.
  IPR008965. Carb-bd_dom.
  IPR001919. CBD2.
  IPR012291. CBD_carb-bd_dom.
  IPR004197. Cellulase_Ig-like.
  IPR003305. CenC_carb-bd.
  IPR003961. FN3_dom.
  IPR008979. Galactose-bd-like.
  IPR001701. Glyco_hydro_9.
  IPR033126. Glyco_hydro_9_Asp/Glu_AS.
  IPR013783. Ig-like_fold.
  IPR014756. Ig_E-set.
Pfam:
  PF00553. CBM_2. 1 hit.
  PF02018. CBM_4_9. 1 hit.
  PF02927. CelD_N. 1 hit.
  PF00041. fn3. 1 hit.
  PF00759. Glyco_hydro_9. 1 hit.
SMART:
  SM00637. CBD_II. 1 hit.
  SM00060. FN3. 1 hit.
SUPFAM:
  SSF48208. SSF48208. 1 hit.
  SSF49265. SSF49265. 1 hit.
  SSF49384. SSF49384. 1 hit.
  SSF49785. SSF49785. 1 hit.
  SSF81296. SSF81296. 1 hit.
PROSITE:
  PS51173. CBM2. 1 hit.
  PS50853. FN3. 1 hit.
  PS00698. GLYCOSYL_HYDROL_F9_2. 1 hit.

Sequence

Sequence status: Complete.

Q08166-1 [UniParc]

        10         20         30         40         50
MLRRPRSRSP LVALTAATCR VALGGTAVPA QADEVNQIRN GDFSSGTAPW
        60         70         80         90        100
WGTENIQLNV TDGMLCVDVP GGTVNPWDVI IGQDDIPLIE GESYAFSFTA
       110        120        130        140        150
SSTVPVSIRA LVQEPVEPWT TQMDERALLG PEAETYEFVF TSNVDWDDAQ
       160        170        180        190        200
VAFQIGGSDE PWTFCLDDVA LLGRAEPPVY EPDTGPRVRV NQVGYLPHGP
       210        220        230        240        250
KKATVVTDAT SALTWELADA DGNVVASGQT KPHGADSSSG LNVHTVDFSS
       260        270        280        290        300
YTTKGSDYTL TVDGETSYPF DIDESVYEEL RVDALSFYYP QRSGIEILDS
       310        320        330        340        350
IAPGYGRPAG HIGVPPNQGD TDVPCAPGTC DYSLDVSGGW YDAGDHGKYV
       360        370        380        390        400
VNGGISVHQI MSIYERSQLA DTAQPDKLAD STLRLPETGN GVPDVLDEAR
       410        420        430        440        450
WEMEFLLKMQ VPEGEPLAGM AHHKIHDEQW TGLPLLPSAD PQPRYLQPPS
       460        470        480        490        500
TAATLNLAAT AAQCARVFEP FDEDFAAECL AAAETAWDAA KANPNIYAPA
       510        520        530        540        550
FGEGGGPYND NNVTDEFYWA AAELFLTTGK EEYRDAVTSS PLHTDDEEVF
       560        570        580        590        600
RDGAFDWGWT AALARLQLAT IPNDLADRDR VRQSVVDAAD MYLANVETSP
       610        620        630        640        650
WGLAYKPNNG VFVWGSNSAV LNNMVILAVA FDLTGDTKYR DGVLEGMDYI
       660        670        680        690        700
FGRNALNQSY VTGYGDKDSR NQHSRWYAHQ LDPRLPNPPK GTLAGGPNSD
       710        720        730        740        750
STTWDPVAQS KLTGCAPQMC YIDHIESWST NELTINWNAP LSWIASFIAD
       760        770        780        790        800
QDDAGEPGGE EPGPGDDETP PSKPGNLKAS DITATSATLT WDASTDNVGV
       810        820        830        840        850
VGYKVSLVRD GDAEEVGTTA QTSYTLTGLS ADQEYTVQVV AYDAAGNLST
       860        870        880        890        900
PATVTFTTEK EDETPTPSAS CAVTYQTNDW PGGFTASVTL TNTGSTPWDS
       910        920        930        940        950
WELRFTFPSG QTVSHGWSAN WQQSGSDVTA TSLPWNGSVP PGGGSVNIGF
       960        970
NGTWGGSNTK PEKFTVNGAV CSIG
Length: 974
Mass (Da): 104,578
Last modified: November 1, 1996 - v1
Checksum: 17FEE7330404A83C
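
The sequence above is displayed in blocks of 10 residues with position rulers between rows. To work with it as one string, the rulers and spacing have to be removed first. The following is a minimal sketch of that cleanup, assuming rulers are lines made up only of digits and whitespace. The function name `parse_sequence_block` is illustrative and is not part of any UniProt tooling.

```python
def parse_sequence_block(text: str) -> str:
    """Join a block-formatted protein sequence into one contiguous string.

    Lines that contain only digits and whitespace are treated as position
    rulers and skipped; the spaces between 10-residue blocks are removed.
    """
    residues = []
    for line in text.splitlines():
        stripped = line.strip()
        # Skip blank lines and ruler lines such as "960        970".
        if not stripped or stripped.replace(" ", "").isdigit():
            continue
        residues.append("".join(stripped.split()))
    return "".join(residues)


# Last row of the sequence display in this entry (positions 951 to 974).
tail = """
       960        970
NGTWGGSNTK PEKFTVNGAV CSIG
"""

tail_sequence = parse_sequence_block(tail)
print(tail_sequence)
print(len(tail_sequence))
```

Applying the same function to the full display should give a string whose length can be compared with the Length field of the entry as a quick integrity check.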

Sequence databases

EMBL / GenBank / DDBJ: L20094 Genomic DNA. Translation: AAC06387.1.

Publications

  1. "Identification of a celE binding protein and its potential role in induction of the celE gene in Thermomonospora fusca."
    Lin E., Wilson D.B.
    J. Bacteriol. 170:3843-3846(1988)
    Cited for: NUCLEOTIDE SEQUENCE.
    Strain: YX (Imported).
  2. "DNA sequences of three beta-1,4-endoglucanase genes from Thermomonospora fusca."
    Lao G., Ghangas G.S., Jung E.D., Wilson D.B.
    J. Bacteriol. 173:3397-3407(1991)
    Cited for: NUCLEOTIDE SEQUENCE.
    Strain: YX (Imported).
  3. "DNA sequences and expression in Streptomyces lividans of an exoglucanase gene and an endoglucanase gene from Thermomonospora fusca."
    Jung E.D., Lao G., Irwin D., Barr B.K., Benjamin A., Wilson D.B.
    Appl. Environ. Microbiol. 59:3032-3043(1993)
    Cited for: NUCLEOTIDE SEQUENCE.
    Strain: YX (Imported).
  4. "Activity studies of eight purified cellulases: Specificity, synergism, and binding domain effects."
    Irwin D.C., Spezio M., Walker L.P., Wilson D.B.
    Biotechnol. Bioeng. 42:1002-1013(1993)
    Cited for: NUCLEOTIDE SEQUENCE.
    Strain: YX (Imported).

Entry information

Entry name: Q08166_THEFU
Accession: Primary (citable) accession number: Q08166
Entry history:
Integrated into UniProtKB/TrEMBL: November 1, 1996
Last sequence update: November 1, 1996
Last modified: May 11, 2016
This is version 96 of the entry and version 1 of the sequence.
Entry status: Unreviewed (UniProtKB/TrEMBL)

Similar proteins

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%: UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%: UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a. the seed sequence).
50%: UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.
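
The UniRef90 and UniRef50 rules above each combine two thresholds: a minimum sequence identity and a minimum overlap with the seed sequence. The sketch below only illustrates that threshold check. It is not the clustering procedure UniRef actually runs, and the names `Alignment`, `UNIREF_MIN_IDENTITY`, `MIN_OVERLAP` and `joins_cluster` are hypothetical. Identity and overlap are assumed to be precomputed fractions between 0 and 1.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Alignment:
    """Precomputed comparison of a candidate sequence against a cluster seed."""
    identity: float  # fraction of identical residues, 0.0 to 1.0
    overlap: float   # fraction of the seed sequence covered, 0.0 to 1.0


# Thresholds as stated in the descriptions above.
UNIREF_MIN_IDENTITY = {"UniRef90": 0.90, "UniRef50": 0.50}
MIN_OVERLAP = 0.80


def joins_cluster(alignment: Alignment, level: str) -> bool:
    """Return True if the alignment meets both thresholds for the given level."""
    min_identity = UNIREF_MIN_IDENTITY[level]
    return alignment.identity >= min_identity and alignment.overlap >= MIN_OVERLAP


# Made-up alignment values, for illustration only.
close_homolog = Alignment(identity=0.93, overlap=0.85)
short_fragment = Alignment(identity=0.93, overlap=0.70)
distant_homolog = Alignment(identity=0.60, overlap=0.90)

print(joins_cluster(close_homolog, "UniRef90"))
print(joins_cluster(short_fragment, "UniRef90"))
print(joins_cluster(distant_homolog, "UniRef50"))
```

Both conditions must hold, so a highly identical but short fragment fails the overlap requirement even though it passes the identity requirement.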