Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Basket 0
(max 400 entries)x

Your basket is currently empty.

Select item(s) and click on "Add to basket" to create your own collection here
(400 entries max)

P26224

- GUNF_CLOTH

UniProt

P26224 - GUNF_CLOTH

Protein

Endoglucanase F

Gene

celF

Organism
Clostridium thermocellum (strain ATCC 27405 / DSM 1237 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) (Ruminiclostridium thermocellum)
Status
Reviewed - Annotation score: 4 out of 5- Protein inferred from homologyi
    • BLAST
    • Align
    • Format
    • Add to basket
    • History
      Entry version 108 (01 Oct 2014)
      Sequence version 1 (01 May 1992)
      Previous versions | rss
    • Help video
    • Feedback
    • Comment

    Functioni

    This enzyme catalyzes the endohydrolysis of 1,4-beta-glucosidic linkages in cellulose, lichenin and cereal beta-D-glucans.

    Catalytic activityi

    Endohydrolysis of (1->4)-beta-D-glucosidic linkages in cellulose, lichenin and cereal beta-D-glucans.

    Cofactori

    Calcium.

    Sites

    Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
    Active sitei400 – 4001By similarity
    Active sitei438 – 4381By similarity
    Active sitei447 – 4471By similarity

    GO - Molecular functioni

    1. cellulase activity Source: UniProtKB-EC
    2. cellulose binding Source: InterPro

    GO - Biological processi

    1. cellulose catabolic process Source: UniProtKB-KW

    Keywords - Molecular functioni

    Glycosidase, Hydrolase

    Keywords - Biological processi

    Carbohydrate metabolism, Cellulose degradation, Polysaccharide degradation

    Keywords - Ligandi

    Calcium

    Enzyme and pathway databases

    BioCyciCTHE203119:GIW8-561-MONOMER.
    MetaCyc:MONOMER-16419.

    Protein family/group databases

    CAZyiCBM3. Carbohydrate-Binding Module Family 3.
    GH9. Glycoside Hydrolase Family 9.

    Names & Taxonomyi

    Protein namesi
    Recommended name:
    Endoglucanase F (EC:3.2.1.4)
    Short name:
    EGF
    Alternative name(s):
    Cellulase F
    Endo-1,4-beta-glucanase
    Gene namesi
    Name:celF
    Ordered Locus Names:Cthe_0543
    OrganismiClostridium thermocellum (strain ATCC 27405 / DSM 1237 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) (Ruminiclostridium thermocellum)
    Taxonomic identifieri203119 [NCBI]
    Taxonomic lineageiBacteriaFirmicutesClostridiaClostridialesRuminococcaceaeRuminiclostridium
    ProteomesiUP000002145: Chromosome

    PTM / Processingi

    Molecule processing

    Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
    Signal peptidei1 – 2727Add
    BLAST
    Chaini28 – 739712Endoglucanase FPRO_0000007950Add
    BLAST

    Interactioni

    Protein-protein interaction databases

    STRINGi203119.Cthe_0543.

    Structurei

    3D structure databases

    ProteinModelPortaliP26224.
    SMRiP26224. Positions 30-638.
    ModBaseiSearch...
    MobiDBiSearch...

    Family & Domainsi

    Domains and Repeats

    Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
    Domaini480 – 639160CBM3PROSITE-ProRule annotationAdd
    BLAST
    Repeati670 – 693241Add
    BLAST
    Repeati709 – 732242Add
    BLAST

    Region

    Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
    Regioni28 – 470443CatalyticAdd
    BLAST
    Regioni670 – 732632 X 24 AA approximate repeatsAdd
    BLAST

    Domaini

    A 24 residue domain is repeated twice in this enzyme as well as in other C.thermocellum cellulosome enzymes. This domain may function as the binding ligand for the SL component.

    Sequence similaritiesi

    Contains 1 CBM3 (carbohydrate binding type-3) domain.PROSITE-ProRule annotation

    Keywords - Domaini

    Repeat, Signal

    Phylogenomic databases

    eggNOGiNOG05134.
    HOGENOMiHOG000021032.
    OMAiAINCAND.
    OrthoDBiEOG6KQ6BP.

    Family and domain databases

    Gene3Di1.10.1330.10. 1 hit.
    1.50.10.10. 1 hit.
    2.60.40.710. 1 hit.
    InterProiIPR008928. 6-hairpin_glycosidase-like.
    IPR012341. 6hp_glycosidase.
    IPR008965. Carb-bd_dom.
    IPR001956. CBD_3.
    IPR016134. Cellulos_enz_dockerin_1.
    IPR002105. Cellulos_enz_dockerin_1_Ca-bd.
    IPR018242. Dockerin_1.
    IPR018247. EF_Hand_1_Ca_BS.
    IPR001701. Glyco_hydro_9.
    IPR018221. Glyco_hydro_9_AS.
    [Graphical view]
    PfamiPF00942. CBM_3. 1 hit.
    PF00404. Dockerin_1. 2 hits.
    PF00759. Glyco_hydro_9. 1 hit.
    [Graphical view]
    SMARTiSM01067. CBM_3. 1 hit.
    [Graphical view]
    SUPFAMiSSF48208. SSF48208. 1 hit.
    SSF49384. SSF49384. 1 hit.
    SSF63446. SSF63446. 1 hit.
    PROSITEiPS51172. CBM3. 1 hit.
    PS00448. CLOS_CELLULOSOME_RPT. 2 hits.
    PS00018. EF_HAND_1. 1 hit.
    PS00592. GLYCOSYL_HYDROL_F9_1. 1 hit.
    PS00698. GLYCOSYL_HYDROL_F9_2. 1 hit.
    [Graphical view]

    Sequencei

    Sequence statusi: Complete.

    Sequence processingi: The displayed sequence is further processed into a mature form.

    P26224-1 [UniParc]FASTAAdd to Basket

    « Hide

    MKKILAFLLT VALVAVVAIP QAVVSFAADF NYGEALQKAI MFYEFQRSGK    50
    LPENKRNNWR GDSALNDGAD NGLDLTGGWY DAGDHVKFNL PMAYAVTMLA 100
    WSVYESRDAY VQSGQLPYIL DNIKWATDYF IKCHPSPNVY YYQVGDGALD 150
    HSWWGPAEVM QMPRPSFKVD LTNPGSTVVA ETAAAMAASS IVFKPTDPEY 200
    AATLLRHAKE LFTFADTTRS DAGYRAAEGY YSSHSGFYDE LTWASIWLYL 250
    ATGDQSYLDK AESYEPHWER ERGTTLISYS WAHCWDNKLY GSLLLLAKIT 300
    GKSYYKQCIE NHLDYWTVGF NGSRVQYTPK GLAYLDRWGS LRYATTQAFL 350
    ASVYADWSGC DPAKAAVYKE FAKKQVDYAL GSTGRSFVVG FGKNPPRNPH 400
    HRTAHSSWSA LMTEPAECRH ILVGALVGGP DGSDSYVDRL DDYQCNEVAN 450
    DYNAGFVGAL AKMYEKYGGE PIPNFVAFET PGEEFYVEAA VNAAGPGFVN 500
    IKASIINKSG WPARGSDKLS AKYFVDISEA VAKGITLDQI TVQSTTNGGA 550
    KVSQLLPWDP DNHIYYVNID FTGINIFPGG INEYKRDVYF TITAPYGEGN 600
    WDNTNDFSFQ GLEQGFTSKK TEYIPLYDGN VRVWGKVPDG GSEPDPTPTI 650
    TVGPTPSVTP TSVPGIMLGD VNFDGRINST DYSRLKRYVI KSLEFTDPEE 700
    HQKFIAAADV DGNGRINSTD LYVLNRYILK LIEKFPAEQ 739
    Length:739
    Mass (Da):82,089
    Last modified:May 1, 1992 - v1
    Checksum:i0CD69EEFC6D4AEEF
    GO

    Sequence databases

    Select the link destinations:
    EMBL
    GenBank
    DDBJ
    Links Updated
    X60545 Genomic DNA. Translation: CAA43035.1.
    CP000568 Genomic DNA. Translation: ABN51779.1.
    PIRiI40804.
    S15727.
    RefSeqiWP_003517595.1. NC_009012.1.
    YP_001036972.1. NC_009012.1.

    Genome annotation databases

    EnsemblBacteriaiABN51779; ABN51779; Cthe_0543.
    GeneIDi4808292.
    KEGGicth:Cthe_0543.
    PATRICi19514745. VBICloThe47081_0565.

    Cross-referencesi

    Sequence databases

    Select the link destinations:
    EMBL
    GenBank
    DDBJ
    Links Updated
    X60545 Genomic DNA. Translation: CAA43035.1 .
    CP000568 Genomic DNA. Translation: ABN51779.1 .
    PIRi I40804.
    S15727.
    RefSeqi WP_003517595.1. NC_009012.1.
    YP_001036972.1. NC_009012.1.

    3D structure databases

    ProteinModelPortali P26224.
    SMRi P26224. Positions 30-638.
    ModBasei Search...
    MobiDBi Search...

    Protein-protein interaction databases

    STRINGi 203119.Cthe_0543.

    Protein family/group databases

    CAZyi CBM3. Carbohydrate-Binding Module Family 3.
    GH9. Glycoside Hydrolase Family 9.

    Protocols and materials databases

    Structural Biology Knowledgebase Search...

    Genome annotation databases

    EnsemblBacteriai ABN51779 ; ABN51779 ; Cthe_0543 .
    GeneIDi 4808292.
    KEGGi cth:Cthe_0543.
    PATRICi 19514745. VBICloThe47081_0565.

    Phylogenomic databases

    eggNOGi NOG05134.
    HOGENOMi HOG000021032.
    OMAi AINCAND.
    OrthoDBi EOG6KQ6BP.

    Enzyme and pathway databases

    BioCyci CTHE203119:GIW8-561-MONOMER.
    MetaCyc:MONOMER-16419.

    Family and domain databases

    Gene3Di 1.10.1330.10. 1 hit.
    1.50.10.10. 1 hit.
    2.60.40.710. 1 hit.
    InterProi IPR008928. 6-hairpin_glycosidase-like.
    IPR012341. 6hp_glycosidase.
    IPR008965. Carb-bd_dom.
    IPR001956. CBD_3.
    IPR016134. Cellulos_enz_dockerin_1.
    IPR002105. Cellulos_enz_dockerin_1_Ca-bd.
    IPR018242. Dockerin_1.
    IPR018247. EF_Hand_1_Ca_BS.
    IPR001701. Glyco_hydro_9.
    IPR018221. Glyco_hydro_9_AS.
    [Graphical view ]
    Pfami PF00942. CBM_3. 1 hit.
    PF00404. Dockerin_1. 2 hits.
    PF00759. Glyco_hydro_9. 1 hit.
    [Graphical view ]
    SMARTi SM01067. CBM_3. 1 hit.
    [Graphical view ]
    SUPFAMi SSF48208. SSF48208. 1 hit.
    SSF49384. SSF49384. 1 hit.
    SSF63446. SSF63446. 1 hit.
    PROSITEi PS51172. CBM3. 1 hit.
    PS00448. CLOS_CELLULOSOME_RPT. 2 hits.
    PS00018. EF_HAND_1. 1 hit.
    PS00592. GLYCOSYL_HYDROL_F9_1. 1 hit.
    PS00698. GLYCOSYL_HYDROL_F9_2. 1 hit.
    [Graphical view ]
    ProtoNeti Search...

    Publicationsi

    1. "Nucleotide sequence of the cellulase gene celF of Clostridium thermocellum."
      Navarro A., Chebrou M.-C., Beguin P., Aubert J.-P.
      Res. Microbiol. 142:927-936(1991) [PubMed] [Europe PMC] [Abstract]
      Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA].
    2. Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
      Strain: ATCC 27405 / DSM 1237 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372.
    3. "Transcription of Clostridium thermocellum endoglucanase genes celF and celD."
      Mishra S., Beguin P., Aubert J.-P.
      J. Bacteriol. 173:80-85(1991) [PubMed] [Europe PMC] [Abstract]
      Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 1-4.

    Entry informationi

    Entry nameiGUNF_CLOTH
    AccessioniPrimary (citable) accession number: P26224
    Secondary accession number(s): A3DCV0
    Entry historyi
    Integrated into UniProtKB/Swiss-Prot: May 1, 1992
    Last sequence update: May 1, 1992
    Last modified: October 1, 2014
    This is version 108 of the entry and version 1 of the sequence. [Complete history]
    Entry statusiReviewed (UniProtKB/Swiss-Prot)
    Annotation programProkaryotic Protein Annotation Program

    Miscellaneousi

    Keywords - Technical termi

    Complete proteome, Reference proteome

    Documents

    1. Glycosyl hydrolases
      Classification of glycosyl hydrolase families and list of entries
    2. SIMILARITY comments
      Index of protein domains and families

    External Data

    Dasty 3