Skip Header

UniProt release 6.3

Published October 25, 2005

Headlines

Over 100'000 prokaryotic entries in UniProtKB/Swiss-Prot

We have reached >100'000 prokaryotic (bacterial and archaeal) entries in UniProtKB/Swiss-Prot. Of these, just over 10'000 are archaeal entries. To deal with the enormous increase in the amount of available prokaryotic protein sequences, the Swiss-Prot group started the HAMAP project, which aims to automatically annotate, with a high-throughput but with no decrease in quality, proteins from complete microbial genomes that belong to a family (well-defined or uncharacterized).

The HAMAP annotation system is based on manually curated family rules, which contain the information, derived from searches of the available literature, that can be safely propagated to all members of the family. Profiles are generated from an alignment of seed members; these profiles, in turn, are used to scan all available sequences in Swiss-Prot and TrEMBL to identify family members. The goal of the system is not to search for distant sequence similarities, but to annotate only the proteins that can be conservatively assigned to a HAMAP family. Cases and conditions are included in most rules so that warnings are generated if some conserved features, such as active sites or metal-binding amino acids, are not present in a given protein sequence, or if there are other problems, such as size or taxonomic range. All entries that contain warnings are subjected to manual verification. In fact, since the implementation of the system and for the time being, ALL the proteins that have been annotated using the HAMAP family rules have been manually verified to check the reliability of the HAMAP annotation module.

More than 1'200 family rules are available, and almost 70'000 prokaryotic entries belong to one (or more) HAMAP families. From the HAMAP website it is possible to scan protein sequences for matches against HAMAP families, and it is also possible to submit a whole proteome, by confidential ftp, to be scanned against the collection of HAMAP family rules.


UniProtKB News

Changes concerning keywords

New keywords:

Deleted keywords:

  • Dehydrin