UniProt

P26224 - GUNF_CLOTH

Protein

Endoglucanase F

Gene

celF

Organism
Clostridium thermocellum (strain ATCC 27405 / DSM 1237 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) (Ruminiclostridium thermocellum)
Status
Reviewed - Annotation score: 3 out of 5 - Protein inferred from homology

Function

This enzyme catalyzes the endohydrolysis of 1,4-beta-glucosidic linkages in cellulose, lichenin and cereal beta-D-glucans.

Catalytic activity

Endohydrolysis of (1->4)-beta-D-glucosidic linkages in cellulose, lichenin and cereal beta-D-glucans.

Cofactor

Sites

Feature key   Position(s)   Length   Description
Active site   400 – 400     1        By similarity
Active site   438 – 438     1        By similarity
Active site   447 – 447     1        By similarity

GO - Molecular function

  1. cellulase activity Source: UniProtKB-EC
  2. cellulose binding Source: InterPro

GO - Biological process

  1. cellulose catabolic process Source: UniProtKB-KW

Keywords - Molecular function

Glycosidase, Hydrolase

Keywords - Biological process

Carbohydrate metabolism, Cellulose degradation, Polysaccharide degradation

Keywords - Ligand

Calcium

Enzyme and pathway databases

BioCyc: CTHE203119:GIW8-561-MONOMER.
    MetaCyc:MONOMER-16419.

Protein family/group databases

CAZy: CBM3. Carbohydrate-Binding Module Family 3.
    GH9. Glycoside Hydrolase Family 9.

Names & Taxonomy

Protein names
Recommended name:
Endoglucanase F (EC:3.2.1.4)
Short name:
EGF
Alternative name(s):
Cellulase F
Endo-1,4-beta-glucanase
Gene names
Name: celF
Ordered Locus Names: Cthe_0543
Organism: Clostridium thermocellum (strain ATCC 27405 / DSM 1237 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) (Ruminiclostridium thermocellum)
Taxonomic identifier: 203119 [NCBI]
Taxonomic lineage: Bacteria > Firmicutes > Clostridia > Clostridiales > Ruminococcaceae > Ruminiclostridium
Proteomes: UP000002145: Chromosome

PTM / Processing

Molecule processing

Feature key      Position(s)   Length   Description       Feature identifier
Signal peptide   1 – 27        27
Chain            28 – 739      712      Endoglucanase F   PRO_0000007950
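The lengths in the table above follow from the 1-based, inclusive coordinates used for sequence features: a feature spanning start to end covers end - start + 1 residues. A minimal sketch of that arithmetic, where the `Feature` class and its field names are illustrative and not part of any UniProt tooling:

```python
from typing import NamedTuple


class Feature(NamedTuple):
    """A sequence feature with 1-based, inclusive coordinates (hypothetical helper)."""
    key: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Both endpoints are counted, hence the +1.
        return self.end - self.start + 1


# Coordinates copied from the Molecule processing table of this entry.
signal_peptide = Feature("Signal peptide", 1, 27)
mature_chain = Feature("Chain", 28, 739)

for feature in (signal_peptide, mature_chain):
    print(f"{feature.key}: {feature.start}-{feature.end} ({feature.length} residues)")

# The two features are contiguous, so together they account for the full precursor.
precursor_length = signal_peptide.length + mature_chain.length
```

Under these coordinates the signal peptide and the mature chain together cover the whole 739-residue precursor listed in the Sequence section.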

Interaction

Protein-protein interaction databases

STRING: 203119.Cthe_0543.

Structure

3D structure databases

ProteinModelPortal: P26224.
SMR: P26224. Positions 30-638.

Family & Domains

Domains and Repeats

Feature key   Position(s)   Length   Description
Domain        480 – 639     160      CBM3 (PROSITE-ProRule annotation)
Repeat        670 – 693     24       1
Repeat        709 – 732     24       2

Region

Feature key   Position(s)   Length   Description
Region        28 – 470      443      Catalytic
Region        670 – 732     63       2 X 24 AA approximate repeats

Domain

A 24-residue domain is repeated twice in this enzyme, as it is in other C. thermocellum cellulosome enzymes. This domain may function as the binding ligand for the SL component.
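To make the two repeats concrete, the sketch below slices them out of the C-terminal part of the sequence using the Repeat coordinates from this entry (670-693 and 709-732). It is an illustration only: the fragment covers residues 651-739 as given in the Sequence section, and the helper name `subsequence` and the constants are assumptions introduced here.

```python
# Residues 651-739 of the precursor, copied from the Sequence section of this entry.
C_TERM_START = 651
C_TERM = (
    "TVGPTPSVTPTSVPGIMLGDVNFDGRINSTDYSRLKRYVIKSLEFTDPEE"
    "HQKFIAAADVDGNGRINSTDLYVLNRYILKLIEKFPAEQ"
)


def subsequence(start: int, end: int) -> str:
    """Return residues start..end (1-based, inclusive, full-length numbering)."""
    last = C_TERM_START + len(C_TERM) - 1
    if not (C_TERM_START <= start <= end <= last):
        raise ValueError(f"range {start}-{end} is outside {C_TERM_START}-{last}")
    return C_TERM[start - C_TERM_START : end - C_TERM_START + 1]


repeat_1 = subsequence(670, 693)
repeat_2 = subsequence(709, 732)

# Count positions where the two equally long repeats carry the same residue.
identical = sum(a == b for a, b in zip(repeat_1, repeat_2))

print(repeat_1)
print(repeat_2)
print(f"{identical} of {len(repeat_1)} positions identical")
```

Printing the two segments one above the other shows why the entry describes them as approximate repeats: they are the same length and share a conserved core, but they are not identical.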

Sequence similarities

Contains 1 CBM3 (carbohydrate binding type-3) domain. (PROSITE-ProRule annotation)

Keywords - Domain

Repeat, Signal

Phylogenomic databases

eggNOG: NOG05134.
HOGENOM: HOG000021032.
OMA: AINCAND.
OrthoDB: EOG6KQ6BP.

Family and domain databases

Gene3D: 1.10.1330.10. 1 hit.
    1.50.10.10. 1 hit.
    2.60.40.710. 1 hit.
InterPro: IPR008928. 6-hairpin_glycosidase-like.
    IPR012341. 6hp_glycosidase.
    IPR008965. Carb-bd_dom.
    IPR001956. CBD_3.
    IPR016134. Cellulos_enz_dockerin_1.
    IPR002105. Cellulos_enz_dockerin_1_Ca-bd.
    IPR018242. Dockerin_1.
    IPR018247. EF_Hand_1_Ca_BS.
    IPR001701. Glyco_hydro_9.
    IPR018221. Glyco_hydro_9_AS.
Pfam: PF00942. CBM_3. 1 hit.
    PF00404. Dockerin_1. 2 hits.
    PF00759. Glyco_hydro_9. 1 hit.
SMART: SM01067. CBM_3. 1 hit.
SUPFAM: SSF48208. SSF48208. 1 hit.
    SSF49384. SSF49384. 1 hit.
    SSF63446. SSF63446. 1 hit.
PROSITE: PS51172. CBM3. 1 hit.
    PS00448. CLOS_CELLULOSOME_RPT. 2 hits.
    PS00018. EF_HAND_1. 1 hit.
    PS00592. GLYCOSYL_HYDROL_F9_1. 1 hit.
    PS00698. GLYCOSYL_HYDROL_F9_2. 1 hit.

Sequence

Sequence status: Complete.

Sequence processing: The displayed sequence is further processed into a mature form.

P26224-1

        10         20         30         40         50
MKKILAFLLT VALVAVVAIP QAVVSFAADF NYGEALQKAI MFYEFQRSGK
        60         70         80         90        100
LPENKRNNWR GDSALNDGAD NGLDLTGGWY DAGDHVKFNL PMAYAVTMLA
       110        120        130        140        150
WSVYESRDAY VQSGQLPYIL DNIKWATDYF IKCHPSPNVY YYQVGDGALD
       160        170        180        190        200
HSWWGPAEVM QMPRPSFKVD LTNPGSTVVA ETAAAMAASS IVFKPTDPEY
       210        220        230        240        250
AATLLRHAKE LFTFADTTRS DAGYRAAEGY YSSHSGFYDE LTWASIWLYL
       260        270        280        290        300
ATGDQSYLDK AESYEPHWER ERGTTLISYS WAHCWDNKLY GSLLLLAKIT
       310        320        330        340        350
GKSYYKQCIE NHLDYWTVGF NGSRVQYTPK GLAYLDRWGS LRYATTQAFL
       360        370        380        390        400
ASVYADWSGC DPAKAAVYKE FAKKQVDYAL GSTGRSFVVG FGKNPPRNPH
       410        420        430        440        450
HRTAHSSWSA LMTEPAECRH ILVGALVGGP DGSDSYVDRL DDYQCNEVAN
       460        470        480        490        500
DYNAGFVGAL AKMYEKYGGE PIPNFVAFET PGEEFYVEAA VNAAGPGFVN
       510        520        530        540        550
IKASIINKSG WPARGSDKLS AKYFVDISEA VAKGITLDQI TVQSTTNGGA
       560        570        580        590        600
KVSQLLPWDP DNHIYYVNID FTGINIFPGG INEYKRDVYF TITAPYGEGN
       610        620        630        640        650
WDNTNDFSFQ GLEQGFTSKK TEYIPLYDGN VRVWGKVPDG GSEPDPTPTI
       660        670        680        690        700
TVGPTPSVTP TSVPGIMLGD VNFDGRINST DYSRLKRYVI KSLEFTDPEE
       710        720        730
HQKFIAAADV DGNGRINSTD LYVLNRYILK LIEKFPAEQ
Length: 739
Mass (Da): 82,089
Last modified: May 1, 1992 - v1
Checksum: 0CD69EEFC6D4AEEF
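
The grouped layout above (ten residues per group, with position rulers) is meant for reading, so a script usually needs the sequence as one plain string first. The sketch below is one way to do that under simple assumptions: it keeps only letters, so ruler numbers and spacing are ignored if they are pasted in, then checks the result against the stated length and looks up the residues at the active-site positions listed under Sites. The names `parse_sequence_block` and `residue_at` are illustrative.

```python
# Sequence rows copied from this entry; ruler lines may be included or omitted.
RAW_BLOCK = """
MKKILAFLLT VALVAVVAIP QAVVSFAADF NYGEALQKAI MFYEFQRSGK
LPENKRNNWR GDSALNDGAD NGLDLTGGWY DAGDHVKFNL PMAYAVTMLA
WSVYESRDAY VQSGQLPYIL DNIKWATDYF IKCHPSPNVY YYQVGDGALD
HSWWGPAEVM QMPRPSFKVD LTNPGSTVVA ETAAAMAASS IVFKPTDPEY
AATLLRHAKE LFTFADTTRS DAGYRAAEGY YSSHSGFYDE LTWASIWLYL
ATGDQSYLDK AESYEPHWER ERGTTLISYS WAHCWDNKLY GSLLLLAKIT
GKSYYKQCIE NHLDYWTVGF NGSRVQYTPK GLAYLDRWGS LRYATTQAFL
ASVYADWSGC DPAKAAVYKE FAKKQVDYAL GSTGRSFVVG FGKNPPRNPH
HRTAHSSWSA LMTEPAECRH ILVGALVGGP DGSDSYVDRL DDYQCNEVAN
DYNAGFVGAL AKMYEKYGGE PIPNFVAFET PGEEFYVEAA VNAAGPGFVN
IKASIINKSG WPARGSDKLS AKYFVDISEA VAKGITLDQI TVQSTTNGGA
KVSQLLPWDP DNHIYYVNID FTGINIFPGG INEYKRDVYF TITAPYGEGN
WDNTNDFSFQ GLEQGFTSKK TEYIPLYDGN VRVWGKVPDG GSEPDPTPTI
TVGPTPSVTP TSVPGIMLGD VNFDGRINST DYSRLKRYVI KSLEFTDPEE
HQKFIAAADV DGNGRINSTD LYVLNRYILK LIEKFPAEQ
"""


def parse_sequence_block(text: str) -> str:
    """Keep residue letters only, dropping whitespace and ruler digits."""
    return "".join(ch for ch in text if ch.isalpha())


sequence = parse_sequence_block(RAW_BLOCK)


def residue_at(position: int) -> str:
    """Return the residue at a 1-based position."""
    if not 1 <= position <= len(sequence):
        raise IndexError(f"position {position} is outside 1-{len(sequence)}")
    return sequence[position - 1]


print(f"Parsed length: {len(sequence)}")
for position in (400, 438, 447):  # active-site positions from the Sites table
    print(f"Active site {position}: {residue_at(position)}")
```

Comparing the parsed length with the Length value above is a cheap way to catch a row that was dropped or duplicated when copying the block.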

Sequence databases

EMBL/GenBank/DDBJ: X60545 Genomic DNA. Translation: CAA43035.1.
    CP000568 Genomic DNA. Translation: ABN51779.1.
PIR: I40804.
    S15727.
RefSeq: WP_003517595.1. NC_009012.1.
    YP_001036972.1. NC_009012.1.

Genome annotation databases

EnsemblBacteria: ABN51779; ABN51779; Cthe_0543.
GeneID: 4808292.
KEGG: cth:Cthe_0543.
PATRIC: 19514745. VBICloThe47081_0565.

Publications

  1. "Nucleotide sequence of the cellulase gene celF of Clostridium thermocellum."
    Navarro A., Chebrou M.-C., Beguin P., Aubert J.-P.
    Res. Microbiol. 142:927-936(1991)
    Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA].
  2. Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
    Strain: ATCC 27405 / DSM 1237 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372.
  3. "Transcription of Clostridium thermocellum endoglucanase genes celF and celD."
    Mishra S., Beguin P., Aubert J.-P.
    J. Bacteriol. 173:80-85(1991)
    Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 1-4.

Entry information

Entry name: GUNF_CLOTH
Accession: Primary (citable) accession number: P26224
Secondary accession number(s): A3DCV0
Entry history
Integrated into UniProtKB/Swiss-Prot: May 1, 1992
Last sequence update: May 1, 1992
Last modified: November 26, 2014
This is version 109 of the entry and version 1 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Prokaryotic Protein Annotation Program

Miscellaneous

Keywords - Technical term

Complete proteome, Reference proteome

Documents

  1. Glycosyl hydrolases
    Classification of glycosyl hydrolase families and list of entries
  2. SIMILARITY comments
    Index of protein domains and families
