Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.


StatusReference proteome
Proteinsi <p>Number of protein entries associated with this proteome: UniProtKB entries for regular proteomes or UniParc entries for redundant proteomes (<a href="/help/proteome%5Fredundancy">more...</a>)</p> 32,609
Gene counti <p>This is the total number of unique genes found in the proteome set, algorithmically computed. For each gene, a single representative protein sequence is chosen from the proteome. Where possible, reviewed (Swiss-Prot) protein sequences are chosen as the representatives.</p> - Download one protein sequence per gene (FASTA)
Proteome IDi <p>The proteome identifier (UPID) is the unique identifier assigned to the set of proteins that constitute the <a href="">proteome</a>. It consists of the characters 'UP' followed by 9 digits, is stable across releases and can therefore be used to cite a UniProt proteome.<p><a href='/help/proteome_id' target='_top'>More...</a></p>UP000291000
Taxonomy9925 - Capra hircus
Last modifiedAugust 25, 2020
Genome assembly and annotationi <p>Identifier for the genome assembly (<a href="">more...</a>)</p> GCA_001704415.1 from Ensembl full
Buscoi <p>The Benchmarking Universal Single-Copy Ortholog (BUSCO) assessment tool is used, for eukaryotic and bacterial proteomes, to provide quantitative measures of UniProt proteome data completeness in terms of expected gene content. BUSCO scores include percentages of complete (C) single-copy (S) genes, complete (C) duplicated (D) genes, fragmented (F) and missing (F) genes, as well as the total number of orthologous clusters (n) used in the BUSCO assessment.</p> C:97.1%[S:58.3%,D:38.9%],F:0.9%,M:2%,n:12234 laurasiatheria_odb10
Completenessi <p>Complete Proteome Detector (CPD) is an algorithm which employs statistical evaluation of the completeness and quality of proteomes in UniProt, by looking at the sizes of taxonomically close proteomes. Possible values are 'Standard', 'Close to Standard' and 'Outlier'.</p> Standard

Componentsi <p>Genomic components encoding the proteome</p>

Component nameGenome Accession(s)
Component representationProteins
Chromosome 6919
Chromosome 21391
Chromosome 191698
Chromosome 111364
Chromosome 24475
Chromosome 14720
Chromosome 151273
Chromosome 31811
Chromosome 81132
Chromosome 21808
Chromosome 251053
Chromosome 17783
Chromosome 20495
Chromosome 41117
Chromosome 28450
Chromosome 26593
Chromosome 29922
Chromosome 71762
Chromosome 101329
Chromosome 51819
Chromosome 9743
Chromosome 131184
Chromosome 181674
Chromosome 16968
Chromosome 22813
Chromosome 23916
Chromosome 12518
Chromosome 11330
Chromosome 27378
Unassembled WGS sequence2186


  1. "Polished mammalian reference genomes with single-molecule sequencing and chromosome conformation capture applied to the Capra hircus genome."
    Bickhart D.M., Koren S., Rosen B., Hastie A., Liachko I., Sullivan S.T., Burton J., Sayre B.L., Huson H.J., Lee J., Lam E., Kelley C.M., Hutchison J.L., Zhou Y., Sun J., Crisa A., Schwartz J.C., Hammond J.A.
    Smith T.P.
    Submitted (APR-2016) to the EMBL/GenBank/DDBJ databases
UniProt is an ELIXIR core data resource
Main funding by: National Institutes of Health

We'd like to inform you that we have updated our Privacy Notice to comply with Europe’s new General Data Protection Regulation (GDPR) that applies since 25 May 2018.

Do not show this banner again