Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.

Chordata protein annotation project

Statistics

UniProt release 2018_08 - Sep-12, 2018 contains a total of 558,125 reviewed entries, which includes 85,542 entries from 3,118 species of Chordata.

Homo sapiens (Human) - 20,394 reviewed entries.

Number of canonical and isoform protein sequences: organism:9606 reviewed:yes (download data in FASTA format)

Evidence for the existence of protein Percentage of entries
at protein level 75.1%
at transcript level 17.5%
inferred from homology 3.9%
predicted 0.7%
uncertain 2.8%

Annotation categories Entries with Number of annotations Coverage
PubMed citations 20,394 73,736 unique 100%
Alternative products 10,633 0 52.1%
General annotation 19,769 108,312 96.9%
Function 16,226 17,110 79.6%
Catalytic activity 3,438 4,025 16.9%
Subcellular location 16,554 52,873 N/A
Sequence annotation 19,895 545,707 97.6%
Amino acid modifications 13,617 96,248 66.8%
Natural Variant 12,908 79,580 63.3%
Cross-references 20,394 0 100%
EMBL 20,380 0 99.9%
InterPro 19,557 0 95.9%
PDB 6,424 0 31.5%
RefSeq 18,984 0 93.1%
MIM 15,024 0 73.7%
HGNC 20,192 0 99%

100% of reviewed human entries are annotated with at least one keyword .

93.4% of reviewed human entries are annotated with at least one GO (Gene Ontology ) term.

Mus musculus (Mouse) - 16,991 reviewed entries.

Number of canonical and isoform protein sequences: organism:10090 reviewed:yes (download data in FASTA format)

Evidence for the existence of protein Percentage of entries
at protein level 73.4%
at transcript level 23.9%
inferred from homology 2.3%
predicted 0.4%
uncertain 0.1%

Annotation categories Entries with Number of annotations Coverage
PubMed citations 16,991 29,146 unique 100%
Alternative products 4,898 0 28.8%
General annotation 16,483 80,222 97%
Function 14,183 14,570 83.5%
Catalytic activity 3,304 3,826 19.4%
Subcellular location 14,342 46,761 84.4%
Sequence annotation 16,697 286,790 98.3%
Amino acid modifications 12,136 84,272 71.4%
Natural Variant 381 1,183 2.2%
Cross-references 16,991 0 100%
EMBL 16,863 0 99.2%
InterPro 16,780 0 98.8%
PDB 1,794 0 10.6%
RefSeq 15,980 0 94%
MGI 16,819 0 99%

100% of reviewed mouse entries are annotated with at least one keyword .

96.6% of reviewed mouse entries are annotated with at least one GO (Gene Ontology ) term.

Rattus norvegicus (Rat) - 8,036 reviewed entries.

Number of canonical and isoform protein sequences: organism:10116 reviewed:yes (download data in FASTA format)

Evidence for the existence of protein Percentage of entries
at protein level 56.1%
at transcript level 39.1%
inferred from homology 4.7%
predicted 0.1%
uncertain 0%

Annotation categories Entries with Number of annotations Coverage
PubMed citations 8,036 12,057 unique 100%
Alternative products 985 0 12.3%
General annotation 7,853 35,136 97.7%
Function 7,157 7,386 89.1%
Catalytic activity 1,917 2,249 23.9%
Subcellular location 7,103 25,048 88.4%
Sequence annotation 7,874 126,190 98%
Amino acid modifications 6,232 43,620 77.6%
Natural Variant 112 277 1.4%
Cross-references 8,030 0 100%
EMBL 7,930 0 98.7%
InterPro 7,942 0 98.8%
PDB 615 0 7.7%
RefSeq 7,135 0 88.8%
RGD 7,959 0 99%

100% of reviewed rat entries are annotated with at least one keyword .

98% of reviewed rat entries are annotated with at least one GO (Gene Ontology ) term.

UniProt is an ELIXIR core data resource
Main funding by: National Institutes of Health

We'd like to inform you that we have updated our Privacy Notice to comply with Europe’s new General Data Protection Regulation (GDPR) that applies since 25 May 2018.

Do not show this banner again