Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.

Chordata protein annotation project

Statistics

UniProt release 2020_02 - Apr-22, 2020 contains a total of 562,253 reviewed entries, which includes 85,963 entries from 3,143 species of Chordata.

Homo sapiens (Human) - 20,365 reviewed entries.

Number of canonical and isoform protein sequences: organism:9606 reviewed:yes (download data in FASTA format)

Evidence for the existence of protein Percentage of entries
at protein level 79.9%
at transcript level 12.7%
inferred from homology 3.7%
predicted 0.7%
uncertain 2.9%

Annotation categories Entries with Number of annotations Coverage
PubMed citations 20,365 77,230 unique 100%
Alternative products 10,631 0 52.2%
General annotation 19,775 116,188 97.1%
Function 16,441 17,547 80.7%
Catalytic activity 3,645 6,940 17.9%
Subcellular location 16,672 55,319 N/A
Sequence annotation 19,870 563,587 97.6%
Amino acid modifications 13,592 96,775 66.7%
Natural Variant 12,966 80,877 63.7%
Cross-references 20,365 0 100%
EMBL 20,351 0 99.9%
InterPro 19,610 0 96.3%
PDB 6,833 0 33.6%
RefSeq 18,992 0 93.3%
MIM 15,241 0 74.8%
HGNC 20,173 0 99.1%

100% of reviewed human entries are annotated with at least one keyword .

93.9% of reviewed human entries are annotated with at least one GO (Gene Ontology ) term.

Mus musculus (Mouse) - 17,038 reviewed entries.

Number of canonical and isoform protein sequences: organism:10090 reviewed:yes (download data in FASTA format)

Evidence for the existence of protein Percentage of entries
at protein level 74.4%
at transcript level 23%
inferred from homology 2.2%
predicted 0.4%
uncertain 0.1%

Annotation categories Entries with Number of annotations Coverage
PubMed citations 17,038 31,031 unique 100%
Alternative products 4,920 0 28.9%
General annotation 16,560 86,390 97.2%
Function 14,438 14,896 84.7%
Catalytic activity 3,508 6,471 20.6%
Subcellular location 14,521 49,347 85.2%
Sequence annotation 16,750 293,857 98.3%
Amino acid modifications 12,193 85,268 71.6%
Natural Variant 385 1,174 2.3%
Cross-references 17,038 0 100%
EMBL 16,910 0 99.2%
InterPro 16,874 0 99%
PDB 1,891 0 11.1%
RefSeq 16,058 0 94.2%
MGI 16,872 0 99%

100% of reviewed mouse entries are annotated with at least one keyword .

97% of reviewed mouse entries are annotated with at least one GO (Gene Ontology ) term.

Rattus norvegicus (Rat) - 8,094 reviewed entries.

Number of canonical and isoform protein sequences: organism:10116 reviewed:yes (download data in FASTA format)

Evidence for the existence of protein Percentage of entries
at protein level 57%
at transcript level 38.2%
inferred from homology 4.7%
predicted 0.1%
uncertain 0%

Annotation categories Entries with Number of annotations Coverage
PubMed citations 8,094 12,496 unique 100%
Alternative products 991 0 12.2%
General annotation 7,926 38,165 97.9%
Function 7,281 7,556 90%
Catalytic activity 2,011 3,896 24.8%
Subcellular location 7,200 26,757 89%
Sequence annotation 7,938 129,762 98.1%
Amino acid modifications 6,295 44,508 77.8%
Natural Variant 112 277 1.4%
Cross-references 8,094 0 100%
EMBL 7,987 0 98.7%
InterPro 8,007 0 98.9%
PDB 635 0 7.8%
RefSeq 7,199 0 88.9%
RGD 8,016 0 99%

100% of reviewed rat entries are annotated with at least one keyword .

98.2% of reviewed rat entries are annotated with at least one GO (Gene Ontology ) term.

UniProt is an ELIXIR core data resource
Main funding by: National Institutes of Health

We'd like to inform you that we have updated our Privacy Notice to comply with Europe’s new General Data Protection Regulation (GDPR) that applies since 25 May 2018.

Do not show this banner again