Status: Reference proteome
Proteins: 47,180
  Number of protein entries associated with this proteome: UniProtKB entries for regular proteomes or UniParc entries for redundant proteomes.
Gene count: -
  The total number of unique genes found in the proteome set, algorithmically computed. For each gene, a single representative protein sequence is chosen from the proteome. Where possible, reviewed (Swiss-Prot) protein sequences are chosen as the representatives.
Proteome ID: UP000075243
  The proteome identifier (UPID) is the unique identifier assigned to the set of proteins that constitute the proteome. It consists of the characters 'UP' followed by 9 digits, is stable across releases and can therefore be used to cite a UniProt proteome.
Taxonomy: 3821 - Cajanus cajan
Strain: cv. Asha
Last modified: July 4, 2020
Genome assembly and annotation: GCA_000340665.1 from ENA/EMBL full
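
The page describes a proteome identifier as the characters 'UP' followed by 9 digits. A minimal sketch of a format check based only on that description is shown here; the function name `is_valid_upid` and the pattern constant are hypothetical names introduced for illustration, and the check says nothing about whether an identifier actually exists in UniProt.

```python
import re

# Assumption: the format is exactly 'UP' plus 9 ASCII digits, as the page states.
UPID_PATTERN = re.compile(r"UP[0-9]{9}")


def is_valid_upid(identifier: str) -> bool:
    """Return True if the string matches the described UPID format."""
    return UPID_PATTERN.fullmatch(identifier) is not None


if __name__ == "__main__":
    # The proteome ID listed on this page.
    print(is_valid_upid("UP000075243"))
    # The genome assembly accession is a different kind of identifier.
    print(is_valid_upid("GCA_000340665.1"))
```

Using `fullmatch` rather than `search` rejects strings that merely contain a UPID-like substring, which seems the safer choice when the identifier is used for citation.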

Cajanus cajan (pigeon pea) is the sixth most important food legume crop and is grown mainly in semi-arid tropical regions of the world. It is the main source of protein for vegetarian populations in developing countries and provides nutrition that cereals do not. Genetic improvement of pigeon pea is constrained by (a) a lack of genetic resources and (b) a low level of genetic diversity in the primary gene pool. Sequencing the pigeon pea genome is a first step towards understanding the genomics of this immensely important yet scientifically neglected crop. There is an almost three-fold gap between the potential yield and the actual yield of this crop, and exploring its genomics is the first step towards bridging that gap.

The pigeon pea genome is 833 Mb in size and is predicted to code for ~48,000 genes. More than fifty percent of the genome is repetitive DNA, a figure that excludes low-complexity regions and sequences that could not be associated with known transposable element (TE) families.

Components (genomic components encoding the proteome)

Component name              Proteins
Chromosome 1                1638
Chromosome 2                3443
Chromosome 3                2924
Chromosome 4                1207
Chromosome 5                460
Chromosome 6                2260
Chromosome 7                1834
Chromosome 8                1907
Chromosome 9                954
Chromosome 10               1976
Chromosome 11               4423
Unassembled WGS sequence    24197
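
As a rough illustration of how the component counts listed above can be tallied, the sketch below sums the per-component protein counts and reports the share attributed to unassembled sequence. The dictionary and variable names are hypothetical. The component sum is not assumed to equal the 47,180 entries reported for the proteome; one possible reason for a difference, which the page does not state, is that an entry may be attributed to more than one component.

```python
# Protein counts per genomic component, copied from the component list.
component_proteins = {
    "Chromosome 1": 1638,
    "Chromosome 2": 3443,
    "Chromosome 3": 2924,
    "Chromosome 4": 1207,
    "Chromosome 5": 460,
    "Chromosome 6": 2260,
    "Chromosome 7": 1834,
    "Chromosome 8": 1907,
    "Chromosome 9": 954,
    "Chromosome 10": 1976,
    "Chromosome 11": 4423,
    "Unassembled WGS sequence": 24197,
}

# Entry count reported in the proteome summary.
reported_entries = 47180

component_total = sum(component_proteins.values())
chromosome_total = sum(
    count
    for name, count in component_proteins.items()
    if name.startswith("Chromosome")
)
unassembled_share = component_proteins["Unassembled WGS sequence"] / component_total

if __name__ == "__main__":
    print(f"Sum over components: {component_total}")
    print(f"Assigned to chromosomes: {chromosome_total}")
    print(f"Share on unassembled sequence: {unassembled_share:.1%}")
    print(f"Difference from reported entries: {component_total - reported_entries}")
```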
UniProt is an ELIXIR core data resource
Main funding by: National Institutes of Health
