UniProtKB query fields
Supported query fields for searching specific data in UniProtKB (see also query syntax).
| Field | Example | Description |
|---|---|---|
| accession |
accession:P62988
|
Lists all entries with the primary or secondary accession number P62988. |
| active |
active:no
|
Lists all obsolete entries. |
| annotation |
annotation:(type:non-positional)
|
Lists all entries with:
|
| author |
author:ashburner
|
Lists all entries with at least one reference co-authored by Michael Ashburner. |
| cdantigen |
cdantigen:CD233
|
Lists all entries whose cell differenciation number is CD233 (see cdlist.txt). |
| citation |
citation:("intracellular structural proteins")
|
Lists all entries with a literature citation:
|
| cluster |
cluster:UniRef90_A5YMT3
|
Lists all entries in the UniRef 90% identity cluster whose representative sequence is UniProtKB entry A5YMT3 (about UniRef). |
| count |
annotation:(type:transmem count:5)
|
Lists all entries with:
|
| created |
created:[20071001 TO *]
|
Lists all entries created since October 1st 2007. |
| database |
database:(type:pfam)
|
Lists all entries with:
|
| domain |
domain:VWFA
|
Lists all entries with a Von Willebrand factor type A domain described in the general annotation section (Index of protein domains and families). |
| ec |
ec:3.2.1.23
|
Lists all beta-galactosidases (Enzyme nomenclature database). |
| existence |
existence:"inferred from homology"
|
See Protein existence criteria. |
| family |
family:serpin
|
Lists all entries belonging to the Serpin family of proteins (Index of protein domains and families). |
| fragment |
fragment:yes
|
Lists all entries with an incomplete sequence. |
| gene |
gene:HSPC233
|
Lists all entries for proteins encoded by gene HSPC233. |
| go |
go:cytoskeleton
|
Lists all entries associated with:
|
| host |
host:mouse
|
Lists all entries for viruses infecting:
|
| id |
id:P00750
|
Returns the entry with the primary accession number P00750. |
| inn |
inn:Anakinra
|
Lists all entries whose "International Nonproprietary Name" is Anakinra. |
| interactor |
interactor:P00520
|
Lists all entries describing interactions with the protein described by entry P00520. |
| keyword |
keyword:toxin
|
Lists all entries associated with the keyword Toxin (UniProtKB Keywords). |
| length |
length:[500 TO 700]
|
Lists all entries describing sequences of length between 500 and 700 residues. |
| lineage |
This field is a synonym for the field taxonomy.
|
|
| mass |
mass:[500000 TO *]
|
Lists all entries describing sequences with a mass of at least 500,000 Da. |
| method |
method:maldi
|
Lists all entries for proteins identified by: matrix-assisted laser
desorption/ionization (MALDI), crystallography (X-Ray). The
method field searches names of physico-chemical
identification methods in the general annotation, reference and
cross-reference sections.
|
| mnemonic |
mnemonic:ATP6_HUMAN
|
Lists all entries with entry name (ID) ATP6_HUMAN. Searches also obsolete entry names (What is the difference between an accession number (AC) and the entry name?). |
| modified |
modified:[20060101 TO 20060301]
|
Lists all entries that were modified between January and March 2006. |
| name |
name:"prion protein"
|
Lists all entries for prion proteins. |
| organelle |
organelle:Mitochondrion
|
Lists all entries for proteins encoded by a gene of the mitochondrial chromosome. |
| organism |
organism:"Ovis aries"
|
Lists all entries for proteins expressed in sheep (first 2 examples) and organisms whose name contains the term "sheep" (UniProt taxonomy). |
| plasmid |
plasmid:ColE1
|
Lists all entries for proteins encoded by a gene of plasmid ColE1 (Controlled vocabulary of plasmids). |
| replaces |
replaces:P02023
|
Lists all entries that were created from a merge with entry P02023 (see FAQ). |
| reviewed |
reviewed:yes
|
Lists all UniProtKB/Swiss-Prot entries (about UniProtKB). |
| scope |
scope:mutagenesis
|
Lists all entries containing a reference that was used to gather information about mutagenesis (Entry view: "Cited for", See reference section of the user manual). |
| sequence |
sequence:P05067-9
|
Lists all entries containing a link to isoform 9 of the sequence described in entry P05067. Allows searching by specific sequence identifier. |
| source |
source:intact
|
Lists all entries containing a GO term whose annotation source is the IntAct database. |
| strain |
strain:wistar
|
Lists all entries containing a reference relevant to strain wistar (Lists of strains in reference comments and Taxonomy help: organism strains). |
| taxonomy |
taxonomy:40674
|
Lists all entries for proteins expressed in Mammals. This field is used to retrieve entries for all organisms classified below a given taxonomic node (taxonomy classification). |
| tissue |
tissue:liver
|
Lists all entries containing a reference describing the protein sequence obtained from a clone isolated from liver (Controlled vocabulary of tissues). |
| web |
web:wikipedia
|
Lists all entries for proteins that are described in Wikipedia. |
