Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.

Chordata protein annotation project

Statistics

UniProt release 2019_02 - Feb-13, 2019 contains a total of 559,228 reviewed entries, which includes 85,714 entries from 3,121 species of Chordata.

Homo sapiens (Human) - 20,417 reviewed entries.

Number of canonical and isoform protein sequences: organism:9606 reviewed:yes (download data in FASTA format)

Evidence for the existence of protein Percentage of entries
at protein level 75.4%
at transcript level 17.1%
inferred from homology 4%
predicted 0.7%
uncertain 2.8%

Annotation categories Entries with Number of annotations Coverage
PubMed citations 20,417 74,954 unique 100%
Alternative products 10,632 0 52.1%
General annotation 19,810 110,272 97%
Function 16,335 17,277 80%
Catalytic activity 3,487 4,870 17.1%
Subcellular location 16,628 53,889 N/A
Sequence annotation 19,916 552,010 97.5%
Amino acid modifications 13,647 96,500 66.8%
Natural Variant 12,943 80,179 63.4%
Cross-references 20,417 0 100%
EMBL 20,403 0 99.9%
InterPro 19,645 0 96.2%
PDB 6,559 0 32.1%
RefSeq 18,987 0 93%
MIM 15,107 0 74%
HGNC 20,217 0 99%

100% of reviewed human entries are annotated with at least one keyword .

93.7% of reviewed human entries are annotated with at least one GO (Gene Ontology ) term.

Mus musculus (Mouse) - 17,009 reviewed entries.

Number of canonical and isoform protein sequences: organism:10090 reviewed:yes (download data in FASTA format)

Evidence for the existence of protein Percentage of entries
at protein level 73.7%
at transcript level 23.6%
inferred from homology 2.2%
predicted 0.4%
uncertain 0.1%

Annotation categories Entries with Number of annotations Coverage
PubMed citations 17,009 29,715 unique 100%
Alternative products 4,905 0 28.8%
General annotation 16,510 82,022 97.1%
Function 14,272 14,690 83.9%
Catalytic activity 3,352 4,647 19.7%
Subcellular location 14,410 47,670 84.7%
Sequence annotation 16,714 288,915 98.3%
Amino acid modifications 12,168 84,597 71.5%
Natural Variant 381 1,183 2.2%
Cross-references 17,009 0 100%
EMBL 16,881 0 99.2%
InterPro 16,846 0 99%
PDB 1,817 0 10.7%
RefSeq 16,004 0 94.1%
MGI 16,843 0 99%

100% of reviewed mouse entries are annotated with at least one keyword .

96.7% of reviewed mouse entries are annotated with at least one GO (Gene Ontology ) term.

Rattus norvegicus (Rat) - 8,060 reviewed entries.

Number of canonical and isoform protein sequences: organism:10116 reviewed:yes (download data in FASTA format)

Evidence for the existence of protein Percentage of entries
at protein level 56.3%
at transcript level 38.9%
inferred from homology 4.7%
predicted 0.1%
uncertain 0%

Annotation categories Entries with Number of annotations Coverage
PubMed citations 8,060 12,181 unique 100%
Alternative products 989 0 12.3%
General annotation 7,881 35,977 97.8%
Function 7,199 7,446 89.3%
Catalytic activity 1,935 2,691 24%
Subcellular location 7,140 25,592 88.6%
Sequence annotation 7,903 127,363 98.1%
Amino acid modifications 6,256 43,812 77.6%
Natural Variant 112 277 1.4%
Cross-references 8,054 0 100%
EMBL 7,954 0 98.7%
InterPro 7,977 0 99%
PDB 623 0 7.7%
RefSeq 7,163 0 88.9%
RGD 7,983 0 99%

100% of reviewed rat entries are annotated with at least one keyword .

98% of reviewed rat entries are annotated with at least one GO (Gene Ontology ) term.

UniProt is an ELIXIR core data resource
Main funding by: National Institutes of Health

We'd like to inform you that we have updated our Privacy Notice to comply with Europe’s new General Data Protection Regulation (GDPR) that applies since 25 May 2018.

Do not show this banner again