
Chordata protein annotation program

Statistics

UniProt release 2013_06 (29 May 2013) contains a total of 540,261 reviewed entries, including 83,836 entries from 3,121 species of Chordata.

Homo sapiens (Human) - 20,257 reviewed entries.

Number of canonical and isoform protein sequences: 38,529

Evidence for the existence of protein   Percentage of entries
at protein level                        67%
at transcript level                     28.3%
inferred from homology                  1%
predicted                               0.5%
uncertain                               3.1%

Annotation categories         Entries with annotation  Number of annotations  Coverage
PubMed citations              20,173                   60,916 unique          99.6%
Alternative products          9,046                    27,551                 44.7%
General annotation            19,887                   154,013                98.2%
  Function                    14,828                   15,173                 73.2%
  Catalytic activity          3,065                    3,494                  15.1%
  Subcellular location        15,495                   32,036                 76.5%
Sequence annotation           19,639                   439,747                96.9%
  Amino acid modifications    11,358                   65,515                 56.1%
  Natural variant             12,489                   68,454                 61.7%
Cross-references              20,257                   1,197,513              100%
  EMBL                        20,141                   144,635                99.4%
  InterPro                    18,337                   63,404                 90.5%
  PDB                         4,822                    26,060                 23.8%
  RefSeq                      18,774                   32,796                 92.7%
  MIM                         13,686                   18,059                 67.6%
  HGNC                        19,686                   19,854                 97.2%

100% of reviewed human entries are annotated with at least one keyword.

90.4% of reviewed human entries are annotated with at least one GO (Gene Ontology) term.
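
The Coverage figures in the human table appear to be the number of entries carrying an annotation divided by the total number of reviewed entries for the species. The short sketch below checks that reading against a few rows copied from the table. The function name `coverage` and the one-decimal rounding are assumptions made for illustration; they are not part of any UniProt tool.

```python
def coverage(entries_with: int, total_entries: int) -> float:
    """Percentage of entries carrying an annotation, rounded to one decimal.

    Assumption: coverage = entries with annotation / total reviewed entries.
    """
    return round(100 * entries_with / total_entries, 1)


# Total reviewed human entries, as stated in the human section.
HUMAN_TOTAL = 20_257

# A few "entries with annotation" counts copied from the human table.
human_rows = {
    "PubMed citations": 20_173,
    "Alternative products": 9_046,
    "Function": 14_828,
    "PDB": 4_822,
}

for category, entries_with in human_rows.items():
    print(f"{category}: {coverage(entries_with, HUMAN_TOTAL)}%")
```

Under that assumption the computed values match the published column for these rows (99.6%, 44.7%, 73.2% and 23.8%), and the same calculation can be repeated for the mouse and rat tables using their own totals.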

Mus musculus (Mouse) - 16,613 reviewed entries.

Number of canonical and isoform protein sequences: 24,503

Evidence for the existence of protein   Percentage of entries
at protein level                        47.7%
at transcript level                     51%
inferred from homology                  0.8%
predicted                               0.4%
uncertain                               0.1%

Annotation categories         Entries with annotation  Number of annotations  Coverage
PubMed citations              16,538                   23,253 unique          99.5%
Alternative products          4,739                    12,786                 28.5%
General annotation            16,394                   115,852                98.7%
  Function                    12,892                   13,092                 77.6%
  Catalytic activity          2,902                    3,279                  17.5%
  Subcellular location        13,329                   27,538                 80.2%
Sequence annotation           16,208                   233,125                97.6%
  Amino acid modifications    10,101                   56,628                 60.8%
  Natural variant             367                      1,160                  2.2%
Cross-references              16,613                   685,860                100%
  EMBL                        16,482                   74,978                 99.2%
  InterPro                    15,614                   54,285                 94%
  PDB                         1,315                    3,957                  7.9%
  RefSeq                      15,378                   19,735                 92.6%
  MGI                         16,444                   16,490                 99%

100% of reviewed mouse entries are annotated with at least one keyword.

93.7% of reviewed mouse entries are annotated with at least one GO (Gene Ontology) term.

Rattus norvegicus (Rat) - 7,854 reviewed entries.

Number of canonical and isoform protein sequences: 9,387

Evidence for the existence of protein   Percentage of entries
at protein level                        40.2%
at transcript level                     55.5%
inferred from homology                  4%
predicted                               0.3%
uncertain                               0%

Annotation categories         Entries with annotation  Number of annotations  Coverage
PubMed citations              7,406                    10,718 unique          94.3%
Alternative products          945                      2,589                  12%
General annotation            7,774                    53,200                 99%
  Function                    6,665                    6,799                  84.9%
  Catalytic activity          1,759                    2,020                  22.4%
  Subcellular location        6,710                    14,789                 85.4%
Sequence annotation           7,583                    102,392                96.5%
  Amino acid modifications    5,225                    29,005                 66.5%
  Natural variant             114                      273                    1.5%
Cross-references              7,854                    263,585                100%
  EMBL                        7,746                    14,985                 98.6%
  InterPro                    7,476                    26,894                 95.2%
  PDB                         504                      2,192                  6.4%
  RefSeq                      6,937                    7,667                  88.3%
  RGD                         7,764                    7,768                  98.9%

100% of reviewed rat entries are annotated with at least one keyword.

96.6% of reviewed rat entries are annotated with at least one GO (Gene Ontology) term.