Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.


StatusReference proteome
Proteinsi <p>Number of protein entries associated with this proteome: UniProtKB entries for regular proteomes or UniParc entries for redundant proteomes (<a href="/help/proteome_redundancy">more...</a>)</p> 3,258
Gene counti <p>This is the total number of unique genes found in the proteome set, algorithmically computed. For each gene, a single representative protein sequence is chosen from the proteome. Where possible, reviewed (Swiss-Prot) protein sequences are chosen as the representatives.</p> - Download one protein sequence per gene (FASTA)
Proteome IDi <p>The proteome identifier (UPID) is the unique identifier assigned to the set of proteins that constitute the <a href="">proteome</a>. It consists of the characters 'UP' followed by 9 digits, is stable across releases and can therefore be used to cite a UniProt proteome.<p><a href='/help/proteome_id' target='_top'>More...</a></p>UP000002676
Taxonomy257313 - Bordetella pertussis (strain Tohama I / ATCC BAA-589 / NCTC 13251)
StrainTohama I / ATCC BAA-589 / NCTC 13251
Last modifiedJune 15, 2021
Genome assembly and annotationi <p>Identifier for the genome assembly (<a href="">more...</a>)</p> GCA_000195715.1 from ENA/EMBL full
Pan proteomei <p>A pan proteome is the full set of proteins thought to be expressed by a group of highly related organisms (e.g. multiple strains of the same bacterial species).<p><a href='/help/pan_proteomes' target='_top'>More...</a></p> This proteome is part of the Bordetella pertussis (strain Tohama I / ATCC BAA-589 / NCTC 13251) pan proteome (fasta)
Buscoi <p>The Benchmarking Universal Single-Copy Ortholog (BUSCO) assessment tool is used, for eukaryotic and bacterial proteomes, to provide quantitative measures of UniProt proteome data completeness in terms of expected gene content. BUSCO scores include percentages of complete (C) single-copy (S) genes, complete (C) duplicated (D) genes, fragmented (F) and missing (M) genes, as well as the total number of orthologous clusters (n) used in the BUSCO assessment, and the name of the taxonomic lineage dataset used.</p> C:98.1%[S:97.7%,D:0.4%],F:0%,M:1.9%,n:688 burkholderiales_odb10
Completenessi <p>Complete Proteome Detector (CPD) is an algorithm which employs statistical evaluation of the completeness and quality of proteomes in UniProt, by looking at the sizes of taxonomically close proteomes. Possible values are 'Standard', 'Close to Standard' and 'Outlier'.</p> Outlier (low value)

The genus Bordetella contains nine designated species, three of which are so closely related that they are considered subspecies and are referred to as the "classical Bordetella"; B. pertussis, B. parapertussis and B. bronchiseptica. Although highly similar at the DNA sequence level, these 3 vary in host specificity, severity of diseases, and their ability to cause acute versus chronic infection. B. bronchiseptica causes infections ranging from lethal pneumonia to asymptomatic respiratory carriage and chronically colonizes the respiratory tracts of various mammalian hosts, with some lineages primarily isolated from humans. B. pertussis and B. parapertussis(hu), which are thought to have evolved independently from a B. bronchiseptica-like progenitor, are causative agents of whooping cough in humans. Another distinct lineage only isolated from sheep has been designated B. parapertussis(ov). Comparison of seven divergent classical Bordetella isolates shows only approximately 50% of genes are shared by all strains in the core genome consisting of 2,857 gene families. The pan-genome consists of at least 5,558 gene families; analysis suggests that the classical Bordetellae pan-genome is "open" with limited gene acquisition, although some evidence of horizontal gene transfer is seen.

Bordetella pertussis is a human pathogen which colonizes the respiratory tract. It is the etiologic agent of whooping cough.

Componentsi <p>Genomic components encoding the proteome</p>

Component nameGenome Accession(s)
Component representationProteins
UniProt is an ELIXIR core data resource
Main funding by: National Institutes of Health

We'd like to inform you that we have updated our Privacy Notice to comply with Europe’s new General Data Protection Regulation (GDPR) that applies since 25 May 2018.

Do not show this banner again