Skip Header

Contribute Send feedback
Read comments (?) or add your own

Q59325 (Q59325_CLOTM) Unreviewed, UniProtKB/TrEMBL

Last modified January 25, 2012. Version 70. Feed History...

Clusters with 100%, 90%, 50% identity | Third-party data text xml rdf/xml gff fasta
to top of pageNames·Attributes·Ontologies·Sequence annotation·Sequences·References·Cross-refs·Entry infoCustomize order

Names and origin

Protein names
EC=3.2.1.91 EMBL CAA56918.1
Gene names
Name:cbhA EMBL CAA56918.1
OrganismClostridium thermocellum EMBL CAA56918.1
Taxonomic identifier1515 [NCBI]
Taxonomic lineageBacteriaFirmicutesClostridiaClostridialesClostridiaceaeClostridium

Protein attributes

Sequence length1230 AA.
Sequence statusComplete.
Protein existenceEvidence at protein level

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Sites

Metal binding4211Calcium 1 PDB 1RQ5
Metal binding4281Calcium 1; via carbonyl oxygen PDB 1RQ5
Metal binding4311Calcium 1 PDB 1RQ5
Metal binding4331Calcium 1; via carbonyl oxygen PDB 1RQ5
Metal binding4351Calcium 1 PDB 1RQ5
Metal binding4381Calcium 1 PDB 1RQ5
Metal binding5571Calcium 2 PDB 1RQ5
Metal binding5591Calcium 2; via carbonyl oxygen PDB 1RQ5
Metal binding5621Calcium 2 PDB 1RQ5
Metal binding5631Calcium 2 PDB 1RQ5
Metal binding6171Calcium 2; via carbonyl oxygen PDB 1RQ5

Sequences

Sequence LengthMass (Da)Tools
Q59325 [UniParc].

Last modified November 1, 1996. Version 1.
Checksum: A398D9814B5D6A0E

FASTA1,230138,078
        10         20         30         40         50         60 
MKFRRSICTA VLLAVLLTLL VPTSVFALED NSSTLPPYKN DLLYERTFDE GLCYPWHTCE 

        70         80         90        100        110        120 
DSGGKCSFDV VDVPGQPGNK AFAVTVLDKG QNRWRVQMRH RGLTLEQGHT YRVRLKIWAD 

       130        140        150        160        170        180 
ASCKVYIKIG QMAEPYAEYW NNKWSPYTLT AGKVLEIDET FVMDKPTDDT CEFTFHLGGE 

       190        200        210        220        230        240 
LAATPPYTVY LDDVSLYDPE YTKPVEYILP QPDVRVNQVG YLPEGKKVAT VVCNSTQPVK 

       250        260        270        280        290        300 
WQLKNAAGVV VLEGYTEPKG LDKDSQDYVH WLDFSDFATE GIGYYFELPT VNSPTNYSHP 

       310        320        330        340        350        360 
FDIRKDIYTQ MKYDALAFFY HKRSGIPIEM PYAGGEQWTR PAGHIGIEPN KGDTNVPTWP 

       370        380        390        400        410        420 
QDDEYAGIPQ KNYTKDVTGG WYDAGDHGKY VVNGGIAVWT LMNMYERAKI RGLDNWGPYR 

       430        440        450        460        470        480 
DGGMNIPEQN NGYPDILDEA RWEIEFFKKM QVTEKEDPSI AGMVHHKIHD FRWTALGMLP 

       490        500        510        520        530        540 
HEDPQPRYLR PVSTAATLNF AATLAQSARL WKDYDPTFAA DCLEKAEIAW QAALKHPDIY 

       550        560        570        580        590        600 
AEYTPGSGGP GGGPYNDDYV GDEFYWAACE LYVTTGKDEY KNYLMNSPHY LEMPAKMGEN 

       610        620        630        640        650        660 
GGANGEDNGL WGCFTWGTTQ GLGTITLALV ENGLPATDIQ KARNNIAKAA DRWLENIEEQ 

       670        680        690        700        710        720 
GYRLPIKRAE DERAGYPWGS NSLHFEPDDL VMGYAYDFTG DSNISMECLT GISYLLGRNA 

       730        740        750        760        770        780 
MDQSYVTGYG ERPLQNPHDR FWTPQTSKRF PAPPPGIISG RPNSRFEDPT INAAVKKDTP 

       790        800        810        820        830        840 
PQKCFIDHTD SWSTNEITVN WNAPFAWVTA YLDEQYTDSE TDKVTIDSPV AGERFEAGKD 

       850        860        870        880        890        900 
INIRTVKSKT PVSKVEFYNG DTLISSDTTA PYTAKITGAA VGAYNLKAVA VLSDGRRIES 

       910        920        930        940        950        960 
PVTPVLVKVI VKPTVKLTAP KSNVVAYGNE FLKITATASD SDGKISRVDF LVDGEVIGSD 

       970        980        990       1000       1010       1020 
REAPYEYEWK AVEGNHEISV IAYDDDDAAS TPDSVKIFVK QARDVKVQYL CENTQTSTQE 

      1030       1040       1050       1060       1070       1080 
IKGKFNIVNT GNRDYSLKDI VLRYYFTKEH NSQLQFICYY TPIGSGNLIP SFGGSGDEHY 

      1090       1100       1110       1120       1130       1140 
LQLEFKDVKL PAGGQTGEIQ FVIRYADNSF HDQSNDYSFD PTIKAFQDYG KVTLYKNGEL 

      1150       1160       1170       1180       1190       1200 
VWGTPPGGTE PEEPEEPEEP EEPAIVYGDC NDDGKVNSTD VAVMKRYLKK ENVNINLDNA 

      1210       1220       1230 
DVNADGKVNS TDFSILKRYV MKNIEELPYR 

« Hide

References

[1]"Multidomain structure and cellulosomal localization of the Clostridium thermocellum cellobiohydrolase CbhA."
Zverlov V.V., Velikodvorskaya G.V., Schwarz W.H., Bronnenmeier K., Kellermann J., Staudenbauer W.L.
J. Bacteriol. 180:3091-3099(1998) [PubMed: 9620957] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE.
Strain: F7 EMBL CAA56918.1.
[2]"Structural basis for the exocellulase activity of the cellobiohydrolase CbhA from Clostridium thermocellum."
Schubot F.D., Kataeva I.A., Chang J., Shah A.K., Ljungdahl L.G., Rose J.P., Wang B.C.
Biochemistry 43:1163-1170(2004) [PubMed: 14756552] [Abstract]
Cited for: X-RAY CRYSTALLOGRAPHY (2.40 ANGSTROMS) OF 208-818 IN COMPLEX WITH CALCIUM.
+Additional computationally mapped references.

Cross-references

Sequence databases

EMBL
GenBank
DDBJ
X80993 Genomic DNA. Translation: CAA56918.1.
PIRS47466.

3D structure databases

PDBe
RCSB PDB
PDBj
EntryMethodResolution (Å)ChainPositionsPDBsum
1RQ5X-ray2.40A208-818[»]
ProteinModelPortalQ59325.
SMRQ59325. Positions 1166-1230.
ModBaseSearch...

Protein family/group databases

CAZyCBM3. Carbohydrate-Binding Module Family 3.
CBM4. Carbohydrate-Binding Module Family 4.
GH9. Glycoside Hydrolase Family 9.

Protocols and materials databases

StructuralBiologyKnowledgebaseSearch...

Enzyme and pathway databases

BRENDA3.2.1.91. 97464.

Family and domain databases

InterProIPR008928. 6-hairpin_glycosidase-like.
IPR012341. 6hp_glycosidase.
IPR008965. Carb-bd_dom.
IPR001956. CBD_3.
IPR016134. Cellulos_enz_dockerin_1.
IPR002105. Cellulos_enz_dockerin_1_Ca-bd.
IPR003305. CenC_carb-bd.
IPR018242. Dockerin_1.
IPR018247. EF_Hand_1_Ca_BS.
IPR008979. Galactose-bd-like.
IPR001701. Glyco_hydro_9.
IPR018221. Glyco_hydro_9_AS.
IPR004197. Glyco_hydro_9_Ig-like.
IPR013783. Ig-like_fold.
IPR014756. Ig_E-set.
[Graphical view]
Gene3DG3DSA:2.60.40.710. CBD_3. 1 hit.
G3DSA:1.50.10.10. CelA/Cel48F_cat. 1 hit.
G3DSA:1.10.1330.10. Cellulos_enz_dockerin_1. 1 hit.
G3DSA:2.60.40.10. Ig-like_fold. 1 hit.
PfamPF00942. CBM_3. 1 hit.
PF02018. CBM_4_9. 1 hit.
PF02927. CelD_N. 1 hit.
PF00404. Dockerin_1. 2 hits.
PF00759. Glyco_hydro_9. 1 hit.
[Graphical view]
SMARTSM01067. CBM_3. 1 hit.
[Graphical view]
SUPFAMSSF49384. Cellul_bind. 1 hit.
SSF63446. Cellulos_enz_dockerin_1. 1 hit.
SSF49785. Gal_bind_like. 1 hit.
SSF48208. Glyco_trans_6hp. 1 hit.
SSF81296. Ig_E-set. 1 hit.
PROSITEPS51172. CBM3. 1 hit.
PS00448. CLOS_CELLULOSOME_RPT. 1 hit.
PS00018. EF_HAND_1. 2 hits.
PS00592. GLYCOSYL_HYDROL_F9_1. 1 hit.
PS00698. GLYCOSYL_HYDROL_F9_2. 1 hit.
[Graphical view]
ProtoNetSearch...

Entry information

Entry nameQ59325_CLOTM
AccessionPrimary (citable) accession number: Q59325
Entry history
Integrated into UniProtKB/TrEMBL: November 1, 1996
Last sequence update: November 1, 1996
Last modified: January 25, 2012
This is version 70 of the entry and version 1 of the sequence. [Complete history]
Entry statusUnreviewed (UniProtKB/TrEMBL)