Skip Header

 
Contribute Send feedback
Read comments (1) or add your own

Reviewed, UniProtKB/Swiss-Prot P26221 (GUN4_THEFU)

Last modified June 16, 2009. Version 77. Feed History...

Clusters with 100%, 90%, 50% identity | Documents (4) | Third-party data | Customize display text xml rdf/xml gff fasta
Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Sequence annotation (Features) · Sequences · References · Cross-references · Entry information · Relevant documents

Names and origin

Protein namesRecommended name:
    Endoglucanase E-4
    EC=3.2.1.4
Alternative name(s):
    Endo-1,4-beta-glucanase E-4
    Cellulase E-4
    Cellulase E4
Gene names
Name: celD
OrganismThermomonospora fusca
Taxonomic identifier2021 [NCBI]
Taxonomic lineageBacteriaActinobacteriaActinobacteridaeActinomycetalesStreptosporangineaeNocardiopsaceaeThermobifida

Protein attributes

Sequence length880 AA.
Sequence statusComplete.
Sequence processingThe displayed sequence is further processed into a mature form.
Protein existenceEvidence at protein level.

General annotation (Comments)

Catalytic activity

Endohydrolysis of (1->4)-beta-D-glucosidic linkages in cellulose, lichenin and cereal beta-D-glucans.

Pathway

Glycan metabolism; cellulose degradation.

Sequence similarities

Belongs to the glycosyl hydrolase 9 (cellulase E) family.

Contains 1 CBM2 (carbohydrate binding type-2) domain.

Contains 1 CBM3 (carbohydrate binding type-3) domain.

Contains 1 fibronectin type-III domain.

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Signal peptide1 – 4646 Ref.4
Chain47 – 880834Endoglucanase E-4
PRO_0000007959

Regions

Domain504 – 652149CBM3
Domain675 – 76692Fibronectin type-III
Domain771 – 880110CBM2

Sites

Active site4271 By similarity
Active site4611 By similarity
Active site4701 By similarity

Secondary structure

............................................................................................ 880
Helix Strand Turn

Details...

Sequences

Sequence LengthMass (Da)Tools
P26221-1 [UniParc].

Last modified November 1, 1997. Version 2.
Checksum: 5EA9A6ABF45A4D9A

FASTA88095,203
        10         20         30         40         50         60 
MSVTEPPPRR RGRHSRARRF LTSLGATAAL TAGMLGVPLA TGTAHAEPAF NYAEALQKSM 

        70         80         90        100        110        120 
FFYEAQRSGK LPENNRVSWR GDSGLNDGAD VGLDLTGGWY DAGDHVKFGF PMAFTATMLA 

       130        140        150        160        170        180 
WGAIESPEGY IRSGQMPYLK DNLRWVNDYF IKAHPSPNVL YVQVGDGDAD HKWWGPAEVM 

       190        200        210        220        230        240 
PMERPSFKVD PSCPGSDVAA ETAAAMAASS IVFADDDPAY AATLVQHAKQ LYTFADTYRG 

       250        260        270        280        290        300 
VYSDCVPAGA FYNSWSGYQD ELVWGAYWLY KATGDDSYLA KAEYEYDFLS TEQQTDLRSY 

       310        320        330        340        350        360 
RWTIAWDDKS YGTYVLLAKE TGKQKYIDDA NRWLDYWTVG VNGQRVPYSP GGMAVLDTWG 

       370        380        390        400        410        420 
ALRYAANTAF VALVYAKVID DPVRKQRYHD FAVRQINYAL GDNPRNSSYV VGFGNNPPRN 

       430        440        450        460        470        480 
PHHRTAHGSW TDSIASPAEN RHVLYGALVG GPGSPNDAYT DDRQDYVANE VATDYNAGFS 

       490        500        510        520        530        540 
SALAMLVEEY GGTPLADFPP TEEPDGPEIF VEAQINTPGT TFTEIKAMIR NQSGWPARML 

       550        560        570        580        590        600 
DKGTFRYWFT LDEGVDPADI TVSSAYNQCA TPEDVHHVSG DLYYVEIDCT GEKIFPGGQS 

       610        620        630        640        650        660 
EHRREVQFRI AGGPGWDPSN DWSFQGIGNE LAPAPYIVLY DDGVPVWGTA PEEGEEPGGG 

       670        680        690        700        710        720 
EGPGGGEEPG EDVTPPSAPG SPAVRDVTST SAVLTWSASS DTGGSGVAGY DVFLRAGTGQ 

       730        740        750        760        770        780 
EQKVGSTTRT SFTLTGLEPD TTYIAAVVAR DNAGNVSQRS TVSFTTLAEN GGGPDASCTV 

       790        800        810        820        830        840 
GYSTNDWDSG FTASIRITYH GTAPLSSWEL SFTFPAGQQV THGWNATWRQ DGAAVTATPM 

       850        860        870        880 
SWNSSLAPGA TVEVGFNGSW SGSNTPPTDF TLNGEPCALA 

« Hide

References

[1]"DNA sequences and expression in Streptomyces lividans of an exoglucanase gene and an endoglucanase gene from Thermomonospora fusca."
Jung E.D., Lao G., Irwin D., Barr B.K., Benjamin A., Wilson D.B.
Appl. Environ. Microbiol. 59:3032-3043(1993) [PubMed: 8215374] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA].
Strain: YX.
[2]Wilson D.B.
Submitted (FEB-1997) to the EMBL/GenBank/DDBJ databases
Cited for: SEQUENCE REVISION.
[3]"DNA sequences of three beta-1,4-endoglucanase genes from Thermomonospora fusca."
Lao G., Ghangas G.S., Jung E.D., Wilson D.B.
J. Bacteriol. 173:3397-3407(1991) [PubMed: 1904434] [Abstract]
Cited for: PARTIAL NUCLEOTIDE SEQUENCE [GENOMIC DNA].
Strain: YX.
[4]"Cellulases of Thermomonospora fusca."
Wilson D.B.
Methods Enzymol. 160:314-323(1988)
Cited for: PROTEIN SEQUENCE OF 47-67.
[5]"Structure and mechanism of endo/exocellulase E4 from Thermomonospora fusca."
Sakon J., Irwin D., Wilson D.B., Karplus P.A.
Nat. Struct. Biol. 4:810-818(1997) [PubMed: 9334746] [Abstract]
Cited for: X-RAY CRYSTALLOGRAPHY (1.9 ANGSTROMS) OF 47-651.

Cross-references

Sequence databases

L20093 Genomic DNA. Translation: AAB42155.1.
PIRB42360.

3D structure databases

EntryMethodResolution (Å)ChainPositionsPDBsum
1JS4X-ray2.00A/B47-651[»]
1TF4X-ray1.90A/B47-651[»]
3TF4X-ray2.20A/B47-651[»]
4TF4X-ray2.00A/B47-651[»]
ModBaseSearch...

Protein family/group databases

CAZyCBM2. Carbohydrate-Binding Module Family 2.
CBM3. Carbohydrate-Binding Module Family 3.
GH9. Glycoside Hydrolase Family 9.

Family and domain databases

InterProIPR012341. 6hp_glycosidase.
IPR001956. CBD_3.
IPR001919. CBD_bac.
IPR018366. CBM2_CS.
IPR003961. FN_III.
IPR001701. Glyco_hydro_9.
IPR018221. Glyco_hydro_9_AS.
[Graphical view]
Gene3DG3DSA:1.50.10.10. CelA/Cel48F_cat. 1 hit.
PANTHERPTHR22298:SF3. Glyco_hydro_9. 1 hit.
PfamPF00553. CBM_2. 1 hit.
PF00942. CBM_3. 1 hit.
PF00041. fn3. 1 hit.
PF00759. Glyco_hydro_9. 1 hit.
[Graphical view]
ProDomPD001947. CBD_3. 1 hit.
[Graphical view] [Entries sharing at least one domain]
SMARTSM00637. CBD_II. 1 hit.
SM00060. FN3. 1 hit.
[Graphical view]
PROSITEPS51173. CBM2. 1 hit.
PS00561. CBM2_A. 1 hit.
PS51172. CBM3. 1 hit.
PS50853. FN3. 1 hit.
PS00592. GLYCOSYL_HYDROL_F9_1. 1 hit.
PS00698. GLYCOSYL_HYDROL_F9_2. 1 hit.
[Graphical view]
ProtoNetSearch...

Entry information

Entry nameGUN4_THEFU
AccessionPrimary (citable) accession number: P26221
Secondary accession number(s): Q08167
Entry history
Integrated into UniProtKB/Swiss-Prot: May 1, 1992
Last sequence update: November 1, 1997
Last modified: June 16, 2009
This is version 77 of the entry and version 2 of the sequence. [Complete history]
Entry statusReviewed (UniProtKB/Swiss-Prot)
Annotation projectHAMAP (High-quality Automated and Manual Annotation of microbial Proteomes)

Relevant documents

Glycosyl hydrolases

Classification of glycosyl hydrolase families and list of entries

PATHWAY comments

Index of metabolic and biosynthesis pathways

PDB cross-references

Index of Protein Data Bank (PDB) cross-references

SIMILARITY comments

Index of protein domains and families

Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Sequence annotation (Features) · Sequences · References · Cross-references · Entry information · Relevant documents