Status: Reference proteome
Proteins: 3,105 (number of protein entries associated with this proteome: UniProtKB entries for regular proteomes or UniParc entries for redundant proteomes)
Gene count: - (the total number of unique genes found in the proteome set, algorithmically computed. For each gene, a single representative protein sequence is chosen from the proteome. Where possible, reviewed (Swiss-Prot) protein sequences are chosen as the representatives.)
Proteome ID: UP000002145 (the proteome identifier (UPID) is the unique identifier assigned to the set of proteins that constitute the proteome. It consists of the characters 'UP' followed by 9 digits, is stable across releases and can therefore be used to cite a UniProt proteome.)
Taxonomy: 203119 - Hungateiclostridium thermocellum (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372)
Strain: ATCC 27405 / DSM 1237 / JCM 9322 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372
Last modified: November 5, 2019
Genome assembly and annotation: GCA_000015865.1 from ENA/EMBL full (identifier for the genome assembly)
Pan proteome: This proteome is part of the Hungateiclostridium thermocellum (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) (Clostridium thermocellum) pan proteome. A pan proteome is the full set of proteins thought to be expressed by a group of highly related organisms (e.g. multiple strains of the same bacterial species).
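
A UniProt proteome identifier (UPID) is described in this record as the characters 'UP' followed by 9 digits. As a minimal sketch of that rule, the check below tests whether a string has that shape. The function name `is_valid_upid` and the constant `UPID_PATTERN` are hypothetical names chosen for illustration and are not part of any UniProt tool; the check only covers the format and says nothing about whether the proteome actually exists.

```python
import re

# Format rule as stated in this record: the literal "UP" followed by exactly 9 digits.
# UPID_PATTERN and is_valid_upid are illustrative names, not UniProt APIs.
UPID_PATTERN = re.compile(r"UP[0-9]{9}")


def is_valid_upid(candidate: str) -> bool:
    """Return True if the whole string has the shape of a UniProt proteome ID."""
    return UPID_PATTERN.fullmatch(candidate) is not None


if __name__ == "__main__":
    # The identifier of this proteome, plus two malformed strings.
    for text in ("UP000002145", "UP00000214", "up000002145"):
        print(text, is_valid_upid(text))
```

Using `fullmatch` rather than `search` rejects strings that merely contain a UPID-shaped substring, which matters when identifiers are pulled out of free text.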

Clostridium thermocellum is a Gram-positive, anaerobic, thermophilic, plant cell wall-degrading bacterium capable of converting cellulosic substrates into ethanol. Degradation of cellulose occurs extracellularly in a large complex called the cellulosome. This nanomachine contains about 20 catalytic components, and can attach to the surface of the host organism. The complex is built on a "scaffoldin" subunit onto which the catalytic subunits are added through interactions of the "dockerin" domains of the enzymes with the "cohesin" domains on the scaffoldin. The C. thermocellum cellulosome is the best-characterized cellulase complex and thus serves as a paradigm. Continuing research on the cellulosome and its producer will provide crucial information for better understanding the cellulolytic reaction, a key process in biomass conversion (adapted from

Components (genomic components encoding the proteome)

Component name | Genome Accession(s)
Component representation | Proteins


  1. "Global transcriptome analysis of Clostridium thermocellum ATCC 27405 during growth on dilute acid pretreated Populus and switchgrass."
    Wilson C.M., Rodriguez M. Jr., Johnson C.M., Martin S.L., Chu T.M., Wolfinger R.D., Hauser L.J., Land M.L., Klingeman D.M., Syed M.H., Ragauskas A.J., Tschaplinski T.J., Mielenz J.R., Brown S.D.
    Biotechnol Biofuels 6:179 (2013)