Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.

Overview

StatusReference proteome
Proteinsi74,864
Gene counti <p>This is the total number of unique genes found in the proteome set, algorithmically computed. For each gene, a single representative protein sequence is chosen from the proteome. Where possible, reviewed (Swiss-Prot) protein sequences are chosen as the representatives.</p> - Download one protein sequence per gene (FASTA)
Proteome IDi <p>The proteome identifier (UPID) is the unique identifier assigned to the set of proteins that constitute the <a href="http://www.uniprot.org/manual/proteomes_manual">proteome</a>. It consists of the characters ‘UP’ followed by 9 digits, is stable across releases and can therefore be used to cite a UniProt proteome.<p><a href='/help/proteome_id' target='_top'>More...</a></p>UP000008827
Taxonomy3847 - Glycine max
Straincv. Williams 82
Last modifiedMarch 9, 2019
Genome assembly and annotationi GCA_000004515.4 from EnsemblPlants

Glycine max (soybean) is one of the most important crop plants for seed protein and oil content. As a member of the plant family Leguminosae, soybean also has the capacity to fix atmospheric nitrogen through symbioses with soil-borne microorganisms.

The species originated in South East Asia, with the main areas of production today being in North America, South America and China. It is the world's most important legume crop and ranks sixth of all cultivated crops in terms of total harvest.

The reference proteome for Glycine max is derived from the genome published in 2010. Glycine max has a haploid chromosome number of 10 and is an ancient polyploid (palaeopolyploid) with over 50% more protein-coding genes than Arabidopsis, and 75% of the genes occurring as multiple copies. About 80% of the predicted genes are found in chromosome ends, which comprise less than one-half of the genome but account for nearly all of the genetic recombination. The soybean genome contains 1 Gb with 64,000 protein-coding genes, which is eight times larger than the Arabidopsis genome.

Componentsi <p>Genomic components encoding the proteome</p>

Component nameGenome Accession(s)
Proteins
Chromosome 13237
Chromosome 24234
Chromosome 33520
Chromosome 43494
Chromosome 53409
Chromosome 64324
Chromosome 73662
Chromosome 85013
Chromosome 93812
Chromosome 103970
Chromosome 113392
Chromosome 123203
Chromosome 134978
Chromosome 143012
Chromosome 153643
Chromosome 162954
Chromosome 173565
Chromosome 183854
Chromosome 193490
Chromosome 203413
Unplaced
545
Chloroplast79
Mitochondrion78
UniProt is an ELIXIR core data resource
Main funding by: National Institutes of Health

We'd like to inform you that we have updated our Privacy Notice to comply with Europe’s new General Data Protection Regulation (GDPR) that applies since 25 May 2018.

Do not show this banner again