
Prokaryotic protein annotation program

Statistics

UniProt release 2013_05 (May 1, 2013) contains a total of 540,052 reviewed entries, including 347,875 entries from 2,076 species of Bacteria and Archaea.

Statistics for complete bacterial proteomes

Statistics for complete archaeal proteomes

Escherichia coli (strain K12) - 4,431 reviewed entries (HAMAP complete proteome)

Evidence for the existence of protein   Percentage of entries
at protein level                        61.1%
at transcript level                     2.9%
inferred from homology                  16.3%
predicted                               16.4%
uncertain                               3.4%

Annotation categories         Entries with   Number of annotations   Coverage
PubMed citations              4,399          6,890 unique            99.3%
General annotation            4,141          19,320                  93.5%
Function                      2,676          2,776                   60.4%
Catalytic activity            1,239          1,389                   28%
Subcellular location          2,023          19,320                  45.7%
Sequence annotation           3,609          59,850                  81.4%
Amino acid modifications      471            781                     10.6%
Natural Variant               132            434                     3%
Cross-references              4,431          177,681                 100%
EMBL                          4,431          16,128                  100%
InterPro                      4,166          12,392                  94%
PDB                           1,272          9,185                   28.7%
RefSeq                        4,374          9,074                   98.7%
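
The Coverage column appears to be the number of entries carrying an annotation divided by the total number of reviewed entries for the organism. The page does not state this formula; it is an assumption that happens to reproduce the published figures. A minimal sketch, with the helper name `coverage_percent` invented for illustration and a few of the Escherichia coli (strain K12) rows as input:

```python
def coverage_percent(entries_with: int, total_entries: int) -> float:
    """Share of entries carrying an annotation, as a percentage rounded to one decimal.

    Assumed formula (not stated on the page): 100 * entries_with / total_entries.
    """
    if total_entries <= 0:
        raise ValueError("total_entries must be positive")
    return round(100.0 * entries_with / total_entries, 1)


# Reviewed entries for Escherichia coli (strain K12), taken from the heading above.
TOTAL_E_COLI = 4431

# "Entries with" values copied from a few rows of the table.
entries_with = {
    "PubMed citations": 4399,
    "General annotation": 4141,
    "Function": 2676,
    "PDB": 1272,
}

coverage = {name: coverage_percent(n, TOTAL_E_COLI) for name, n in entries_with.items()}

for name, pct in coverage.items():
    print(f"{name}: {pct}%")
```

Under this assumption the four rows come out to 99.3%, 93.5%, 60.4% and 28.7%, matching the table.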

100% of reviewed Escherichia coli (strain K12) entries are annotated with at least one keyword.

87.5% of reviewed Escherichia coli (strain K12) entries are annotated with at least one GO (Gene Ontology) term.

Bacillus subtilis - 4,188 reviewed entries (HAMAP complete proteome)

Evidence for the existence of protein   Percentage of entries
at protein level                        29.6%
at transcript level                     8.7%
inferred from homology                  32.1%
predicted                               29.5%
uncertain                               0.1%

Annotation categories         Entries with   Number of annotations   Coverage
PubMed citations              4,188          2,018 unique            100%
General annotation            3,543          14,066                  84.6%
Function                      1,844          1,890                   44%
Catalytic activity            953            1,045                   22.8%
Subcellular location          1,754          14,066                  41.9%
Sequence annotation           3,201          26,479                  76.4%
Amino acid modifications      388            592                     9.3%
Natural Variant               42             141                     1%
Cross-references              4,188          104,484                 100%
EMBL                          4,188          9,177                   100%
InterPro                      3,638          10,254                  86.9%
PDB                           468            1,076                   11.2%
RefSeq                        4,136          4,137                   98.8%

100% of reviewed Bacillus subtilis entries are annotated with at least one keyword.

96.6% of reviewed Bacillus subtilis entries are annotated with at least one GO (Gene Ontology) term.

Mycobacterium tuberculosis - 2,022 reviewed entries (HAMAP complete proteome)

Evidence for the existence of protein   Percentage of entries
at protein level                        36.4%
at transcript level                     4.2%
inferred from homology                  43.2%
predicted                               16.1%
uncertain                               0.1%

Annotation categories         Entries with   Number of annotations   Coverage
PubMed citations              2,018          989 unique              99.8%
General annotation            1,912          8,642                   94.6%
Function                      1,111          1,160                   54.9%
Catalytic activity            687            756                     34%
Subcellular location          740            8,642                   36.6%
Sequence annotation           1,562          15,990                  77.3%
Amino acid modifications      237            342                     11.7%
Natural Variant               50             120                     2.5%
Cross-references              2,019          64,774                  99.9%
EMBL                          2,018          6,028                   99.8%
InterPro                      1,864          5,528                   92.2%
PDB                           326            1,106                   16.1%
RefSeq                        1,993          5,794                   98.6%

100% of reviewed Mycobacterium tuberculosis entries are annotated with at least one keyword.

91.6% of reviewed Mycobacterium tuberculosis entries are annotated with at least one GO (Gene Ontology) term.