Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.

Pan proteomes

A pan proteome is the full set of proteins thought to be expressed by a group of highly related organisms (e.g. multiple strains of the same bacterial species).

Pan proteomes provide a representative set of all the sequences within a taxonomic group and capture unique sequences not found in the group’s reference proteome. UniProtKB pan proteomes encompass all non-redundant proteomes and are aimed at users interested in phylogenetic comparisons and the study of genome evolution and gene diversity.

When a proteome has proteins that are part of a larger pan proteome, you will see it indicated on the proteome page in the ‘Pan proteome’ row. You will also see a link to download the full fasta sequence set, e.g. Bacillus subtilis 168.

Pan proteome sets can be downloaded from the UniProt FTP site through the Pan proteomes subdirectory.

See also:
Proteomes
Pan proteomes in UniProt (UniProt blog)
Pan proteomes, technical documentation

We'd like to inform you that we have updated our Privacy Notice to comply with Europe’s new General Data Protection Regulation (GDPR) that applies since 25 May 2018.

Do not show this banner again
UniProt is an ELIXIR core data resource
Main funding by: National Institutes of Health