Skip Header

 
Contribute Send feedback
Read comments (0) or add your own

Unreviewed, UniProtKB/TrEMBL Q08166 (Q08166_THEFU)

Last modified April 14, 2009. Version 61. Feed History...

Clusters with 100%, 90%, 50% identity | Third-party data | Customize display text xml rdf/xml gff fasta
Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Sequence annotation (Features) · Sequences · References · Cross-references · Entry information

Names and origin

Protein namesSubmitted name:
    Beta-1,4-endoglucanase EMBL AAC06387.1
Gene names
Name: E1 EMBL AAC06387.1
OrganismThermomonospora fusca EMBL AAC06387.1
Taxonomic identifier2021 [NCBI]
Taxonomic lineageBacteriaActinobacteriaActinobacteridaeActinomycetalesStreptosporangineaeNocardiopsaceaeThermobifida

Protein attributes

Sequence length974 AA.
Sequence statusComplete.
Sequence processingThe displayed sequence is further processed into a mature form.
Protein existenceInferred from homology.

General annotation (Comments)

Sequence similarities

Contains 1 fibronectin type-III domain. RuleBase RU000718V1

Ontologies

Keywords
   DomainSignal
Gene Ontology (GO)
   Biological processcarbohydrate metabolic process

Inferred from electronic annotation. Source: InterPro

   Molecular functioncellulase activity

Inferred from electronic annotation. Source: InterPro

polysaccharide binding

Inferred from electronic annotation. Source: InterPro

Complete GO annotation...

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Signal peptide1 – 3232 Potential EMBL AAC06387.1
Chain33 – 974942beta-1,4-endoglucanase EMBL AAC06387.1
PRO_5000142289

Sequences

Sequence LengthMass (Da)Tools
Q08166-1 [UniParc].

Last modified November 1, 1996. Version 1.
Checksum: 17FEE7330404A83C

FASTA974104,578
        10         20         30         40         50         60 
MLRRPRSRSP LVALTAATCR VALGGTAVPA QADEVNQIRN GDFSSGTAPW WGTENIQLNV 

        70         80         90        100        110        120 
TDGMLCVDVP GGTVNPWDVI IGQDDIPLIE GESYAFSFTA SSTVPVSIRA LVQEPVEPWT 

       130        140        150        160        170        180 
TQMDERALLG PEAETYEFVF TSNVDWDDAQ VAFQIGGSDE PWTFCLDDVA LLGRAEPPVY 

       190        200        210        220        230        240 
EPDTGPRVRV NQVGYLPHGP KKATVVTDAT SALTWELADA DGNVVASGQT KPHGADSSSG 

       250        260        270        280        290        300 
LNVHTVDFSS YTTKGSDYTL TVDGETSYPF DIDESVYEEL RVDALSFYYP QRSGIEILDS 

       310        320        330        340        350        360 
IAPGYGRPAG HIGVPPNQGD TDVPCAPGTC DYSLDVSGGW YDAGDHGKYV VNGGISVHQI 

       370        380        390        400        410        420 
MSIYERSQLA DTAQPDKLAD STLRLPETGN GVPDVLDEAR WEMEFLLKMQ VPEGEPLAGM 

       430        440        450        460        470        480 
AHHKIHDEQW TGLPLLPSAD PQPRYLQPPS TAATLNLAAT AAQCARVFEP FDEDFAAECL 

       490        500        510        520        530        540 
AAAETAWDAA KANPNIYAPA FGEGGGPYND NNVTDEFYWA AAELFLTTGK EEYRDAVTSS 

       550        560        570        580        590        600 
PLHTDDEEVF RDGAFDWGWT AALARLQLAT IPNDLADRDR VRQSVVDAAD MYLANVETSP 

       610        620        630        640        650        660 
WGLAYKPNNG VFVWGSNSAV LNNMVILAVA FDLTGDTKYR DGVLEGMDYI FGRNALNQSY 

       670        680        690        700        710        720 
VTGYGDKDSR NQHSRWYAHQ LDPRLPNPPK GTLAGGPNSD STTWDPVAQS KLTGCAPQMC 

       730        740        750        760        770        780 
YIDHIESWST NELTINWNAP LSWIASFIAD QDDAGEPGGE EPGPGDDETP PSKPGNLKAS 

       790        800        810        820        830        840 
DITATSATLT WDASTDNVGV VGYKVSLVRD GDAEEVGTTA QTSYTLTGLS ADQEYTVQVV 

       850        860        870        880        890        900 
AYDAAGNLST PATVTFTTEK EDETPTPSAS CAVTYQTNDW PGGFTASVTL TNTGSTPWDS 

       910        920        930        940        950        960 
WELRFTFPSG QTVSHGWSAN WQQSGSDVTA TSLPWNGSVP PGGGSVNIGF NGTWGGSNTK 

       970 
PEKFTVNGAV CSIG 

« Hide

References

[1]"DNA sequences of three beta-1,4-endoglucanase genes from Thermomonospora fusca."
Lao G., Ghangas G.S., Jung E.D., Wilson D.B.
J. Bacteriol. 173:3397-3407(1991) [PubMed: 1904434] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE.
Strain: YX EMBL AAC06387.1.
[2]"Identification of a celE binding protein and its potential role in induction of the celE gene in Thermomonospora fusca."
Lin E., Wilson D.B.
J. Bacteriol. 1701:3843-3846(1988)
Cited for: NUCLEOTIDE SEQUENCE.
Strain: YX EMBL AAC06387.1.
[3]"Activity studies of eight purified cellulases: specificty, synergism, and binding domain effects."
Irwin D.C., Spezio M., Walker L.P., Wilson D.B.
Biotechnol. Bioeng. 42:1002-1013(1993) [PubMed: 18613149] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE.
Strain: YX EMBL AAC06387.1.
[4]"DNA sequences and expression in Streptomyces lividans of an exoglucanase gene and an endoglucanase gene from Thermomonospora fusca."
Jung E.D., Lao G., Irwin D., Barr B.K., Benjamin A., Wilson D.B.
Appl. Environ. Microbiol. 59:3032-3043(1993) [PubMed: 8215374] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE.
Strain: YX EMBL AAC06387.1.

Cross-references

Sequence databases

L20094 Genomic DNA. Translation: AAC06387.1.

3D structure databases

HSSPHSSP built from PDB template 1K85 based on UniProtKB P20533.
ModBaseSearch...

Protein family/group databases

CAZyCBM2. Carbohydrate-Binding Module Family 2.
CBM4. Carbohydrate-Binding Module Family 4.
GH9. Glycoside Hydrolase Family 9.

Family and domain databases

InterProIPR012341. 6hp_glycosidase.
IPR001919. CBD_bac.
IPR012291. CBD_carb_bd.
IPR003305. CenC_carb_bd.
IPR008957. Fibronectin_typ-III-like_fold.
IPR003961. FN_III.
IPR001701. Glyco_hydro_9.
IPR018221. Glyco_hydro_9_AS.
IPR004197. Glyco_hydro_9_Ig-like.
IPR013783. Ig-like_fold.
[Graphical view]
Gene3DG3DSA:2.60.40.290. CBD_carb_bd. 1 hit.
G3DSA:1.50.10.10. CelA/Cel48F_cat. 1 hit.
G3DSA:2.60.40.30. FN_III-like. 1 hit.
G3DSA:2.60.40.10. Ig-like_fold. 1 hit.
PANTHERPTHR22298:SF3. Glyco_hydro_9. 1 hit.
PfamPF00553. CBM_2. 1 hit.
PF02018. CBM_4_9. 1 hit.
PF02927. CelD_N. 1 hit.
PF00041. fn3. 1 hit.
PF00759. Glyco_hydro_9. 1 hit.
[Graphical view]
SMARTSM00637. CBD_II. 1 hit.
SM00060. FN3. 1 hit.
[Graphical view]
PROSITEPS51173. CBM2. 1 hit.
PS50853. FN3. 1 hit.
PS00698. GLYCOSYL_HYDROL_F9_2. 1 hit.
[Graphical view]
ProtoNetSearch...

Entry information

Entry nameQ08166_THEFU
AccessionPrimary (citable) accession number: Q08166
Entry history
Integrated into UniProtKB/TrEMBL: November 1, 1996
Last sequence update: November 1, 1996
Last modified: April 14, 2009
This is version 61 of the entry and version 1 of the sequence. [Complete history]
Entry statusUnreviewed (UniProtKB/TrEMBL)
Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Sequence annotation (Features) · Sequences · References · Cross-references · Entry information