Proteinsi
<p>Number of protein entries associated with this proteome: UniProtKB entries for regular proteomes or UniParc entries for redundant proteomes (<a href="/help/proteome%5Fredundancy">more...</a>)</p>
| 5,829 |
Gene counti
<p>This is the total number of unique genes found in the proteome set, algorithmically computed. For each gene, a single representative protein sequence is chosen from the proteome. Where possible, reviewed (Swiss-Prot) protein sequences are chosen as the representatives.</p>
| - Download one protein sequence per gene (FASTA) |
Proteome IDi
<p>The proteome identifier (UPID) is the unique identifier assigned to the set of proteins that constitute the <a href="http://www.uniprot.org/manual/proteomes%5Fmanual">proteome</a>. It consists of the characters 'UP' followed by 9 digits, is stable across releases and can therefore be used to cite a UniProt proteome.<p><a href='/help/proteome_id' target='_top'>More...</a></p> | UP000002605 |
Taxonomy | 573826 - Candida dubliniensis (strain CD36 / ATCC MYA-646 / CBS 7987 / NCPF 3949 / NRRL Y-17841) |
Strain | CD36 / ATCC MYA-646 / CBS 7987 / NCPF 3949 / NRRL Y-17841 |
Last modified | August 23, 2020 |
Genome assembly and annotationi
<p>Identifier for the genome assembly (<a href="https://www.ensembl.org/Help/Faq?id=216">more...</a>)</p>
|
GCA_000026945.1
from ENA/EMBL
full |
Pan proteomei
<p>A pan proteome is the full set of proteins thought to be expressed by a group of highly related organisms (e.g. multiple strains of the same bacterial species).<p><a href='/help/pan_proteomes' target='_top'>More...</a></p> |
This proteome is part of the
Candida albicans (strain SC5314 / ATCC MYA-2876) (Yeast)
pan proteome (fasta)
|
Buscoi
<p>The Benchmarking Universal Single-Copy Ortholog (BUSCO) assessment tool is used, for eukaryotic and bacterial proteomes, to provide quantitative measures of UniProt proteome data completeness in terms of expected gene content. BUSCO scores include percentages of complete (C) single-copy (S) genes, complete (C) duplicated (D) genes, fragmented (F) and missing (F) genes, as well as the total number of orthologous clusters (n) used in the BUSCO assessment.</p>
| C:99.3%[S:98.7%,D:0.6%],F:0.2%,M:0.4%,n:2137 saccharomycetes_odb10 |
Completenessi
<p>Complete Proteome Detector (CPD) is an algorithm which employs statistical evaluation of the completeness and quality of proteomes in UniProt, by looking at the sizes of taxonomically close proteomes. Possible values are 'Standard', 'Close to Standard' and 'Outlier'.</p>
| Standard |