Status: Reference proteome
Proteome ID: UP000008827
Taxonomy: 3847 - Glycine max
Strain: cv. Williams 82
Last modified: March 13, 2018
Genome assembly and annotation: GCA_000004515.3 from EnsemblPlants

Glycine max (soybean) is one of the most important crop plants for seed protein and oil content. As a member of the plant family Leguminosae, soybean also has the capacity to fix atmospheric nitrogen through symbioses with soil-borne microorganisms.

The species originated in South East Asia, with the main areas of production today being in North America, South America and China. It is the world's most important legume crop and ranks sixth of all cultivated crops in terms of total harvest.

The reference proteome for Glycine max is derived from the genome published in 2010. Glycine max has a haploid chromosome number of 20 and is an ancient polyploid (palaeopolyploid) with over 50% more protein-coding genes than Arabidopsis; 75% of its genes occur as multiple copies. About 80% of the predicted genes are found in chromosome ends, which comprise less than one-half of the genome but account for nearly all of the genetic recombination. The soybean genome is about 1 Gb in size, roughly eight times the size of the Arabidopsis genome, and contains 64,000 protein-coding genes.


Component name    Genome Accession(s)    Proteins
Chromosome 1                             3284
Chromosome 2                             4264
Chromosome 3                             3559
Chromosome 4                             3519
Chromosome 5                             3469
Chromosome 6                             4351
Chromosome 7                             3685
Chromosome 8                             5077
Chromosome 9                             3851
Chromosome 10                            4005
Chromosome 11                            3417
Chromosome 12                            3222
Chromosome 13                            5038
Chromosome 14                            3035
Chromosome 15                            3675
Chromosome 16                            2988
Chromosome 17                            3592
Chromosome 18                            3894
Chromosome 19                            3520
Chromosome 20                            3633
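
The per-chromosome figures in the component table can be totalled with a short script. This is only an illustrative sketch: it assumes the number listed after each chromosome name is that chromosome's protein count (the column label is not preserved on this page), and the names `protein_counts` and `summarize` are hypothetical, not part of any UniProt tool.

```python
# Values transcribed from the component table on this page.
# Assumption: the number after each chromosome name is its protein count.
protein_counts = {
    1: 3284, 2: 4264, 3: 3559, 4: 3519, 5: 3469,
    6: 4351, 7: 3685, 8: 5077, 9: 3851, 10: 4005,
    11: 3417, 12: 3222, 13: 5038, 14: 3035, 15: 3675,
    16: 2988, 17: 3592, 18: 3894, 19: 3520, 20: 3633,
}


def summarize(counts):
    """Return (total, chromosome with most entries, chromosome with fewest)."""
    total = sum(counts.values())
    largest = max(counts, key=counts.get)
    smallest = min(counts, key=counts.get)
    return total, largest, smallest


total, largest, smallest = summarize(protein_counts)
print(f"chromosomes listed: {len(protein_counts)}")
print(f"total entries: {total}")
print(f"most entries: chromosome {largest} ({protein_counts[largest]})")
print(f"fewest entries: chromosome {smallest} ({protein_counts[smallest]})")
```

Under that assumption the table covers 20 chromosomes and 75,078 entries in total. A proteome entry count can exceed the number of protein-coding genes, since a gene may be represented by more than one entry.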
