Status: Reference proteome
Proteins: 41,380 (number of protein entries associated with this proteome: UniProtKB entries for regular proteomes or UniParc entries for redundant proteomes)
Gene count: - (the total number of unique genes found in the proteome set, algorithmically computed. For each gene, a single representative protein sequence is chosen from the proteome. Where possible, reviewed (Swiss-Prot) protein sequences are chosen as the representatives.)
Proteome ID: UP000000768 (the proteome identifier (UPID) is the unique identifier assigned to the set of proteins that constitute the proteome. It consists of the characters 'UP' followed by 9 digits, is stable across releases and can therefore be used to cite a UniProt proteome.)
Taxonomy: 4558 - Sorghum bicolor
Strain: cv. BTx623
Last modified: January 15, 2020
Genome assembly and annotation: GCA_000003195.3 from ENA/EMBL full
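
The proteome identifier for this entry, UP000000768, has the shape 'UP' followed by 9 digits. As a minimal sketch, a check for that shape could look like the following. The function name `is_valid_upid` is hypothetical and not part of any UniProt tool, and the check only tests the format, not whether the proteome exists.

```python
import re

# Assumption: the format is exactly 'UP' followed by 9 ASCII digits,
# as stated in the identifier description on this page.
UPID_PATTERN = re.compile(r"UP[0-9]{9}")


def is_valid_upid(identifier: str) -> bool:
    """Return True if the string has the shape of a proteome identifier (UPID)."""
    return UPID_PATTERN.fullmatch(identifier) is not None


if __name__ == "__main__":
    # The identifier of this Sorghum bicolor proteome.
    print(is_valid_upid("UP000000768"))
```

Using `fullmatch` rather than `search` rejects strings that merely contain an identifier-like fragment, such as one with extra trailing digits.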

Sorghum bicolor (sorghum) is a food and feed cereal crop that originated in Africa and is adapted to heat and drought. It is a staple food for 500 million of the world's poorest people. Sorghum bicolor belongs to the Panicoideae, a subfamily of the Poaceae (formerly known as the Gramineae). Maize and sugar cane are members of the same subfamily.

Sorghum has a small diploid genome and wide phenotypic diversity, which make it an attractive plant for genomic studies. Sorghum carries out carbon fixation through the C4 pathway and is an ideal grass model for comparison with rice, which uses the C3 pathway.

Sorghum bicolor has a haploid chromosome number of 10; the genome contains 530 Mb with 33,000 protein-coding genes. It also contains a high level of repeats (61%). Despite this, a high-quality genome sequence was assembled from the homozygous sorghum genotype BTx623 using the whole-genome shotgun technique.

Components (genomic components encoding the proteome)

Component name              Proteins
Unassembled WGS sequence    138
Chromosome 1                7036
Chromosome 2                5375
Chromosome 3                5696
Chromosome 4                4557
Chromosome 5                2839
Chromosome 6                3618
Chromosome 7                2842
Chromosome 8                2445
Chromosome 9                3269
Chromosome 10               3543
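
The per-component protein counts listed above can be tallied with a few lines of Python. This is only an illustrative sketch: the dictionary is typed in by hand from the component list, and the variable names are hypothetical. Note that the component sum differs slightly from the 41,380 proteins reported for the proteome as a whole; the page does not state the reason.

```python
# Protein counts per genomic component, copied from the component list.
protein_counts = {
    "Unassembled WGS sequence": 138,
    "Chromosome 1": 7036,
    "Chromosome 2": 5375,
    "Chromosome 3": 5696,
    "Chromosome 4": 4557,
    "Chromosome 5": 2839,
    "Chromosome 6": 3618,
    "Chromosome 7": 2842,
    "Chromosome 8": 2445,
    "Chromosome 9": 3269,
    "Chromosome 10": 3543,
}

# Sum over all components.
total = sum(protein_counts.values())

# Component with the most proteins.
largest = max(protein_counts, key=protein_counts.get)

if __name__ == "__main__":
    print(f"Total across components: {total}")
    print(f"Largest component: {largest} ({protein_counts[largest]} proteins)")
```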
