Biocuration in UniProt
One of the central activities of the UniProt Consortium is the biocuration of the UniProt Knowledgebase (UniProtKB). Biocuration involves interpreting and integrating biologically relevant information into a database or resource that brings together the scientific literature and large data sets. Its primary goals are an accurate and comprehensive representation of biological knowledge, easy access to these data for working scientists, and a basis for computational analysis. To respond to the flood of sequencing data, UniProt provides both manual curation and automatic annotation. UniProtKB consists of two sections, UniProtKB/Swiss-Prot and UniProtKB/TrEMBL. The former contains manually reviewed records with annotation extracted from the literature and from curator-evaluated computational analysis, while the latter contains computationally generated records enhanced by automatic classification and annotation.
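In practice, the two sections can be told apart in UniProtKB FASTA downloads, where the header of each record begins with `sp` for UniProtKB/Swiss-Prot or `tr` for UniProtKB/TrEMBL. The following is a minimal sketch of that check; the function name `uniprotkb_section` is an illustrative assumption, not part of any UniProt tool.

```python
def uniprotkb_section(fasta_header: str) -> str:
    """Return the UniProtKB section implied by a FASTA header's database prefix.

    Hypothetical helper for illustration: it only inspects the text before
    the first '|' and does not validate the rest of the header.
    """
    prefix = fasta_header.lstrip(">").split("|", 1)[0]
    sections = {
        "sp": "UniProtKB/Swiss-Prot",  # manually reviewed records
        "tr": "UniProtKB/TrEMBL",      # computationally generated records
    }
    if prefix not in sections:
        raise ValueError(f"unrecognised database prefix: {prefix!r}")
    return sections[prefix]
```

For example, a header beginning with `>sp|P69905|HBA_HUMAN` would be reported as UniProtKB/Swiss-Prot.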
UniProt manual curation
Manual curation consists of a critical review of experimental and predicted data for each protein as well as manual verification of each protein sequence. Curation methods applied to UniProtKB/Swiss-Prot include manual extraction and structuring of information from the literature, manual verification of results from computational analyses, mining and integration of large-scale data sets, and continuous updating as new information becomes available.
UniProt automatic annotation
UniProt has developed two complementary approaches to automatically annotate protein sequences with a high degree of accuracy. UniRule is a collection of manually curated annotation rules that define annotations to be propagated when specific conditions are met, while the Statistical Automatic Annotation System (SAAS) is an automatic, decision-tree-based rule-generating system. The central components of both approaches are rules based on InterPro classification and the manually curated data in UniProtKB/Swiss-Prot.
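The idea of condition-based propagation can be illustrated with a toy example. This is a minimal sketch under stated assumptions, not UniProt's implementation: the `Protein` and `Rule` classes, the `annotate` function, the rule identifier, and the `IPR_DEMO_*` signature names are all hypothetical, and real rules can carry richer conditions than the two shown here.

```python
from dataclasses import dataclass
from typing import FrozenSet, List, Optional, Tuple


@dataclass(frozen=True)
class Protein:
    accession: str
    interpro_ids: FrozenSet[str]   # InterPro signatures matched by the sequence
    lineage: Tuple[str, ...]       # taxonomic lineage, broadest taxon first


@dataclass(frozen=True)
class Rule:
    """Hypothetical annotation rule: conditions plus annotations to propagate."""
    rule_id: str
    required_interpro: FrozenSet[str]
    required_taxon: Optional[str]  # None means no taxonomic restriction
    annotations: Tuple[str, ...]

    def matches(self, protein: Protein) -> bool:
        # Every required signature must be present on the protein.
        has_signatures = self.required_interpro <= protein.interpro_ids
        # The taxonomic condition, if any, must appear in the lineage.
        in_taxon = self.required_taxon is None or self.required_taxon in protein.lineage
        return has_signatures and in_taxon


def annotate(protein: Protein, rules: List[Rule]) -> List[Tuple[str, str]]:
    """Return (rule_id, annotation) pairs for every rule whose conditions hold."""
    propagated = []
    for rule in rules:
        if rule.matches(protein):
            propagated.extend((rule.rule_id, text) for text in rule.annotations)
    return propagated


# Illustrative data only; none of these identifiers refer to real entries.
demo_rule = Rule(
    rule_id="DEMO_RULE_1",
    required_interpro=frozenset({"IPR_DEMO_A"}),
    required_taxon="Bacteria",
    annotations=("Function: placeholder enzyme activity",),
)
demo_protein = Protein(
    accession="DEMO00",
    interpro_ids=frozenset({"IPR_DEMO_A", "IPR_DEMO_B"}),
    lineage=("Bacteria", "Proteobacteria"),
)
result = annotate(demo_protein, [demo_rule])
```

Recording the rule identifier alongside each propagated annotation keeps the provenance of automatic annotations traceable, which is one reasonable design choice for a system of this kind.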
UniProt annotation flow diagram