Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.

How is protein family membership assigned in UniProtKB?

Last modified September 3, 2013

How is protein family membership assigned in UniProtKB?

The task of assigning proteins to families makes use of a range of sources including protein family databases, sequence analysis tools, scientific literature and sequence similarity search tools.

InterPro integrates predictive signatures representing protein domains, families and functional sites into a single resource from a number of family and domain databases: Gene3D, PANTHER, Pfam, PIRSF, PRINTS, ProDom, PROSITE, SMART, SUPERFAMILY and TIGRFAMs. Use is made of InterPro and its member databases in assigning UniProtKB proteins to families.

Protein families for which no predictive signatures exist are defined through sequence similarity searches combined with use of scientific literature and sequence analysis tools. Factors such as end-to-end similarity with other family members and shared organisation such as common domain order and topology are considered. These newly identified families are passed to InterPro and its member databases to facilitate creation of new signatures which in turn leads to detection of additional family members.

The full list of protein families in UniProtKB and the entries which belong to each family is available here.

See also: