Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.

Overview

StatusReference proteome
Proteinsi <p>Number of protein entries associated with this proteome: UniProtKB entries for regular proteomes or UniParc entries for redundant proteomes (<a href="/help/proteome%5Fredundancy">more...</a>)</p> 40,615
Gene counti <p>This is the total number of unique genes found in the proteome set, algorithmically computed. For each gene, a single representative protein sequence is chosen from the proteome. Where possible, reviewed (Swiss-Prot) protein sequences are chosen as the representatives.</p> - Download one protein sequence per gene (FASTA)
Proteome IDi <p>The proteome identifier (UPID) is the unique identifier assigned to the set of proteins that constitute the <a href="http://www.uniprot.org/manual/proteomes%5Fmanual">proteome</a>. It consists of the characters 'UP' followed by 9 digits, is stable across releases and can therefore be used to cite a UniProt proteome.<p><a href='/help/proteome_id' target='_top'>More...</a></p>UP000026915
Taxonomy3641 - Theobroma cacao
Last modifiedDecember 1, 2019
Genome assembly and annotationi <p>Identifier for the genome assembly (<a href="https://www.ensembl.org/Help/Faq?id=216">more...</a>)</p> GCA_000403535.1 from ENA/EMBL full
BuscoC:98.1%[S:59.4%,D:38.7%],F:0.7%,M:1.2%,n:1440
CompletenessStandard

Componentsi <p>Genomic components encoding the proteome</p>

Component nameGenome Accession(s)
Component representationProteins
Chromosome 102549
Chromosome 25001
Chromosome 72363
Chromosome 95166
Chromosome 15716
Chromosome 44327
Chromosome 63346
Chromosome 82890
Chromosome 54503
Unassembled WGS sequence257
Chromosome 34511

Publications

  1. "The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color."
    Motamayor J.C., Mockaitis K., Schmutz J., Haiminen N., Iii D.L., Cornejo O., Findley S.D., Zheng P., Utro F., Royaert S., Saski C., Jenkins J., Podicheti R., Zhao M., Scheffler B.E., Stack J.C., Feltus F.A., Mustiga G.M.
    Kuhn D.N.
    Genome Biol. 14:R53-R53(2013) [PubMed] [Europe PMC] [Abstract]
UniProt is an ELIXIR core data resource
Main funding by: National Institutes of Health

We'd like to inform you that we have updated our Privacy Notice to comply with Europe’s new General Data Protection Regulation (GDPR) that applies since 25 May 2018.

Do not show this banner again