Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.


StatusReference proteome
Proteinsi <p>Number of protein entries associated with this proteome: UniProtKB entries for regular proteomes or UniParc entries for redundant proteomes (<a href="/help/proteome%5Fredundancy">more...</a>)</p> 74,863
Gene counti <p>This is the total number of unique genes found in the proteome set, algorithmically computed. For each gene, a single representative protein sequence is chosen from the proteome. Where possible, reviewed (Swiss-Prot) protein sequences are chosen as the representatives.</p> - Download one protein sequence per gene (FASTA)
Proteome IDi <p>The proteome identifier (UPID) is the unique identifier assigned to the set of proteins that constitute the <a href="">proteome</a>. It consists of the characters 'UP' followed by 9 digits, is stable across releases and can therefore be used to cite a UniProt proteome.<p><a href='/help/proteome_id' target='_top'>More...</a></p>UP000008827
Taxonomy3847 - Glycine max
Straincv. Williams 82
Last modifiedJanuary 9, 2020
Genome assembly and annotationi <p>Identifier for the genome assembly (<a href="">more...</a>)</p> GCA_000004515.4 from EnsemblPlants full
CompletenessClose to Standard

Glycine max (soybean) is one of the most important crop plants for seed protein and oil content. As a member of the plant family Leguminosae, soybean also has the capacity to fix atmospheric nitrogen through symbioses with soil-borne microorganisms.

The species originated in South East Asia, with the main areas of production today being in North America, South America and China. It is the world's most important legume crop and ranks sixth of all cultivated crops in terms of total harvest.

The reference proteome for Glycine max is derived from the genome published in 2010. Glycine max has a haploid chromosome number of 10 and is an ancient polyploid (palaeopolyploid) with over 50% more protein-coding genes than Arabidopsis, and 75% of the genes occurring as multiple copies. About 80% of the predicted genes are found in chromosome ends, which comprise less than one-half of the genome but account for nearly all of the genetic recombination. The soybean genome contains 1 Gb with 64,000 protein-coding genes, which is eight times larger than the Arabidopsis genome.

Componentsi <p>Genomic components encoding the proteome</p>

Component nameGenome Accession(s)
Component representationProteins
Chromosome 13237
Chromosome 24231
Chromosome 33516
Chromosome 43491
Chromosome 53406
Chromosome 64327
Chromosome 73663
Chromosome 85015
Chromosome 93810
Chromosome 103974
Chromosome 113394
Chromosome 123201
Chromosome 134980
Chromosome 143011
Chromosome 153641
Chromosome 162953
Chromosome 173565
Chromosome 183856
Chromosome 193494
Chromosome 203411
UniProt is an ELIXIR core data resource
Main funding by: National Institutes of Health

We'd like to inform you that we have updated our Privacy Notice to comply with Europe’s new General Data Protection Regulation (GDPR) that applies since 25 May 2018.

Do not show this banner again