Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.


StatusReference proteome
Proteinsi <p>Number of protein entries associated with this proteome: UniProtKB entries for regular proteomes or UniParc entries for redundant proteomes (<a href="/help/proteome%5Fredundancy">more...</a>)</p> 48,769
Gene counti <p>This is the total number of unique genes found in the proteome set, algorithmically computed. For each gene, a single representative protein sequence is chosen from the proteome. Where possible, reviewed (Swiss-Prot) protein sequences are chosen as the representatives.</p> - Download one protein sequence per gene (FASTA)
Proteome IDi <p>The proteome identifier (UPID) is the unique identifier assigned to the set of proteins that constitute the <a href="">proteome</a>. It consists of the characters 'UP' followed by 9 digits, is stable across releases and can therefore be used to cite a UniProt proteome.<p><a href='/help/proteome_id' target='_top'>More...</a></p>UP000002277
Taxonomy9598 - Pan troglodytes
Genome assembly and annotationi <p>Identifier for the genome assembly (<a href="">more...</a>)</p> GCA_000001515.5 from Ensembl full
Buscoi <p>The Benchmarking Universal Single-Copy Ortholog (BUSCO) assessment tool is used, for eukaryotic and bacterial proteomes, to provide quantitative measures of UniProt proteome data completeness in terms of expected gene content. BUSCO scores include percentages of complete (C) single-copy (S) genes, complete (C) duplicated (D) genes, fragmented (F) and missing (M) genes, as well as the total number of orthologous clusters (n) used in the BUSCO assessment, and the name of the taxonomic lineage dataset used.</p> C:97.8%[S:50.3%,D:47.5%],F:0.5%,M:1.7%,n:13780 primates_odb10
Completenessi <p>Complete Proteome Detector (CPD) is an algorithm which employs statistical evaluation of the completeness and quality of proteomes in UniProt, by looking at the sizes of taxonomically close proteomes. Possible values are 'Standard', 'Close to Standard' and 'Outlier'.</p> Close to standard (high value)

The genus Pan is divided into two species Pan troglodytes and Pan paniscus or Bonobo. Pan troglodytes or the common chimpanzee is a great ape and is evolutionarily the closest living species to humans, with around 30% of all proteins identical to us. Four subspecies of common chimpanzee have been identified and are spread across west and central Africa. Chimpanzees weigh between 40-70kg with an average standing height of 1.3 to 1.6m. They have dense black hair covering their bodies except the face, fingers, toes, palms of the hands and soles of the feet. They usually walk on all four limbs and use their knuckles rather than the palms of their hands, but can walk short distances upright. The thumbs and big toes are opposable providing grip for climbing and tool use. They are very adaptable and can live in a variety of habitats and are mostly herbivorous but eat insects, small mammals and occasionally other primates. They are highly intelligent and have a complex language and a hierarchical and territorial system which, when threatened, can result in intense aggressive behavior.

The current assembly covers about 97 percent of the Pan troglodytes genome and contains 3.0Gb with 18,759 protein-coding genes and 29,160 currently predicted transcripts. This low number is a result of alignment errors between human and chimp genomes resulting in loss of transfer of a number of exons.

Componentsi <p>Genomic components encoding the proteome</p>

Component nameGenome Accession(s)
Component representationProteins
Chromosome 14531
Chromosome 2A1559
Chromosome 2B1757
Chromosome 32789
Chromosome 41875
Chromosome 52114
Chromosome 62442
Chromosome 72185
Chromosome 81629
Chromosome 91832
Chromosome 101864
Chromosome 112899
Chromosome 122559
Chromosome 13800
Chromosome 141577
Chromosome 151531
Chromosome 161838
Chromosome 172621
Chromosome 18766
Chromosome 193009
Chromosome 201230
Chromosome 221052
Chromosome X1755
Chromosome Y124
Chromosome 21466


  1. "Conservation of Y-linked genes during human evolution revealed by comparative sequencing in chimpanzee."
    Hughes J.F., Skaletsky H., Pyntikova T., Minx P.J., Graves T., Rozen S., Wilson R.K., Page D.C.
    Nature 437:100-103(2005) [PubMed] [Europe PMC] [Abstract]
  2. "Initial sequence of the chimpanzee genome and comparison with the human genome."
    Chimpanzee sequencing and analysis consortium
    Nature 437:69-87(2005) [PubMed] [Europe PMC] [Abstract]
  3. "Recent African origin of modern humans revealed by complete sequences of hominoid mitochondrial DNAs."
    Horai S., Hayasaka K., Kondo R., Tsugane K., Takahata N.
    Proc. Natl. Acad. Sci. U.S.A. 92:532-536(1995) [PubMed] [Europe PMC] [Abstract]
  4. "The DNA sequence of the chimpanzee Y chromosome."
    Hughes J.F., Pyntikova T., Skaletsky H., Minx P.J., Rozen S., Wilson R.K., Page D.C.
    Submitted (JUL-2004) to the EMBL/GenBank/DDBJ databases
UniProt is an ELIXIR core data resource
Main funding by: National Institutes of Health

We'd like to inform you that we have updated our Privacy Notice to comply with Europe’s new General Data Protection Regulation (GDPR) that applies since 25 May 2018.

Do not show this banner again