Skip Header

Contribute Send feedback
Read comments (?) or add your own

P94622 (P94622_CLOCL) Unreviewed, UniProtKB/TrEMBL

Last modified May 31, 2011. Version 61. Feed History...

Clusters with 100%, 90%, 50% identity | Third-party data text xml rdf/xml gff fasta
to top of pageNames·Attributes·Ontologies·Sequence annotation·Sequences·References·Cross-refs·Entry infoCustomize order

Names and origin

Protein names
EC=3.2.1.4 EMBL AAB40891.1
Gene names
Name:engF EMBL AAB40891.1
OrganismClostridium cellulovorans EMBL AAB40891.1
Taxonomic identifier1493 [NCBI]
Taxonomic lineageBacteriaFirmicutesClostridiaClostridialesClostridiaceaeClostridium

Protein attributes

Sequence length557 AA.
Sequence statusComplete.
Protein existenceEvidence at protein level

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Sites

Metal binding3911Calcium; via carbonyl oxygen PDB 1J84 PDB 1J83
Metal binding3931Calcium PDB 1J84 PDB 1J83
Metal binding4181Calcium; via carbonyl oxygen PDB 1J84 PDB 1J83
Metal binding5511Calcium PDB 1J84 PDB 1J83

Sequences

Sequence LengthMass (Da)Tools
P94622 [UniParc].

Last modified May 1, 1997. Version 1.
Checksum: D186EC88EB504EED

FASTA55760,131
        10         20         30         40         50         60 
MFNNVKKKIL SIVAAGAMLM ALVPNINVAA ETTYSNLTGN ANVKKPSVGG KLQLLNKNGI 

        70         80         90        100        110        120 
KTLCDKDGNP IQLRGMSTHG LQWFPGVVNN NAFAALSNDW NSNVIRLAMY VAEGGYATNP 

       130        140        150        160        170        180 
SVKQTVINGI NYAIANDMYV IVDWHMMNPG DPNASVYSGA QSFFNDISTL YPNNKNIIYE 

       190        200        210        220        230        240 
LCNEPNGENG GVTNDATGWA QVKSYATPIV QLLRNKGNEN LIIVGNPFWS QRPDLAADNP 

       250        260        270        280        290        300 
INDSNTMYSV HFYSGTNPIS TVDTNRDNAM SNVRYALNHG AAVFATEWGT SLATGTTGPY 

       310        320        330        340        350        360 
LAKADAWLDF LNGNNISWCN FSISNKDEKA AALNSLTSLD PGSDKLWADN ELTTSGQYVR 

       370        380        390        400        410        420 
ARIKGAYYAT PVDPVTNQPT APKDFSSGFW DFNDGTTQGF GVNPDSPITA INVENANNAL 

       430        440        450        460        470        480 
KISNLNSKGS NDLSEGNFWA NVRISADIWG QSINIYGDTK LTMDVIAPTP VNVSIAAIPQ 

       490        500        510        520        530        540 
SSTHGWGNPT RAIRVWTNNF VAQTDGTYKA TLTISTNDSP NFNTIATDAA DSVVTNMILF 

       550 
VGSNSDNISL DNIKFTK 

« Hide

References

[1]"Characterization of engF, a gene for a non-cellulosomal Clostridium cellulovorans endoglucanase."
Sheweita S.A., Ichi-ishi A., Park J.S., Liu C., Malburg L.M. Jr, Doi R.H.
Gene 182:163-167(1996) [PubMed: 8982083] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE.
[2]Sheweita S., Park J.-S., Doi R.H.
Submitted (SEP-1995) to the EMBL/GenBank/DDBJ databases
Cited for: NUCLEOTIDE SEQUENCE.
[3]"Recognition of cello-oligosaccharides by a family 17 carbohydrate-binding module: an X-ray crystallographic, thermodynamic and mutagenic study."
Notenboom V., Boraston A.B., Chiu P., Freelove A.C., Kilburn D.G., Rose D.R.
J. Mol. Biol. 314:797-806(2001) [PubMed: 11733998] [Abstract]
Cited for: X-RAY CRYSTALLOGRAPHY (1.70 ANGSTROMS) OF 378-557 IN COMPLEX WITH CALCIUM.
+Additional computationally mapped references.

Cross-references

Sequence databases

EMBL
GenBank
DDBJ
U37056 Genomic DNA. Translation: AAB40891.1.
PIRJC5487.

3D structure databases

PDBe
RCSB PDB
PDBj
EntryMethodResolution (Å)ChainPositionsPDBsum
1J83X-ray1.70A/B378-557[»]
1J84X-ray2.02A378-557[»]
ProteinModelPortalP94622.
SMRP94622. Positions 38-374, 378-557.
ModBaseSearch...

Protein family/group databases

CAZyCBM17. Carbohydrate-Binding Module Family 17.
GH5. Glycoside Hydrolase Family 5.

Protocols and materials databases

StructuralBiologyKnowledgebaseSearch...

Family and domain databases

InterProIPR005086. CBM_fam_17/28.
IPR008979. Galactose-bd-like.
IPR001547. Glyco_hydro_5.
IPR018087. Glyco_hydro_5_CS.
IPR013781. Glyco_hydro_subgr_catalytic.
IPR017853. Glycoside_hydrolase_SF.
[Graphical view]
Gene3DG3DSA:3.20.20.80. Glyco_hydro_cat. 1 hit.
PfamPF03424. CBM_17_28. 1 hit.
PF00150. Cellulase. 1 hit.
[Graphical view]
SUPFAMSSF49785. Gal_bind_like. 1 hit.
SSF51445. Glyco_hydro_cat. 1 hit.
PROSITEPS00659. GLYCOSYL_HYDROL_F5. 1 hit.
[Graphical view]
ProtoNetSearch...

Entry information

Entry nameP94622_CLOCL
AccessionPrimary (citable) accession number: P94622
Entry history
Integrated into UniProtKB/TrEMBL: May 1, 1997
Last sequence update: May 1, 1997
Last modified: May 31, 2011
This is version 61 of the entry and version 1 of the sequence. [Complete history]
Entry statusUnreviewed (UniProtKB/TrEMBL)