Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.


StatusReference proteome
Proteinsi <p>Number of protein entries associated with this proteome: UniProtKB entries for regular proteomes or UniParc entries for redundant proteomes (<a href="/help/proteome%5Fredundancy">more...</a>)</p> 39,359
Gene counti <p>This is the total number of unique genes found in the proteome set, algorithmically computed. For each gene, a single representative protein sequence is chosen from the proteome. Where possible, reviewed (Swiss-Prot) protein sequences are chosen as the representatives.</p> - Download one protein sequence per gene (FASTA)
Proteome IDi <p>The proteome identifier (UPID) is the unique identifier assigned to the set of proteins that constitute the <a href="">proteome</a>. It consists of the characters 'UP' followed by 9 digits, is stable across releases and can therefore be used to cite a UniProt proteome.<p><a href='/help/proteome_id' target='_top'>More...</a></p>UP000001595
Taxonomy9601 - Pongo abelii
Genome assemblyi <p>Identifier for the genome assembly (<a href="">more...</a>)</p> from Ensembl
Genome annotationi Ensembl
Buscoi <p>The Benchmarking Universal Single-Copy Ortholog (BUSCO) assessment tool is used, for eukaryotic and bacterial proteomes, to provide quantitative measures of UniProt proteome data completeness in terms of expected gene content. BUSCO scores include percentages of complete (C) single-copy (S) genes, complete (C) duplicated (D) genes, fragmented (F) and missing (M) genes, as well as the total number of orthologous clusters (n) used in the BUSCO assessment, and the name of the taxonomic lineage dataset used.</p> C:88.2%[S:82.9%,D:5.3%],F:3.8%,M:8%,n:13780 primates_odb10
Completenessi <p>Complete Proteome Detector (CPD) is an algorithm which employs statistical evaluation of the completeness and quality of proteomes in UniProt, by looking at the sizes of taxonomically close proteomes. Possible values are 'Standard', 'Close to Standard' and 'Outlier'.</p> Close to standard (low value)

The orangutan is one of two species of Asian ape, the other being the gibbon. Orangutans are found only in the rainforests of Borneo and Sumatra. The two groups have been given separate species status diverging around 400,000 years ago and are named Pongo pygmaeus and Pongo abelii, respectively. The Bornean orangutan (P. pygmaeus) has been further classified into three subspecies. Adult males can reach a height of 1.75m and weigh over 118kg; females are considerably smaller at 1.27m and 45kg. They are covered with long reddish brown hair, which in P. abelii (Sumatran) is lighter in color and more sparse. Orangutans have long hooked fingers and toes, consistent with them being mostly tree dwellers. Their diet is largely fruit with other parts of plants, small insects and bird eggs. Orangutans are very intelligent, making use of tools and making nests but they are much more solitary than other great apes. Their estimated lifespan is over 50 years.

The current assembly in Ensembl contains 3.1Gb with 20,424 protein-coding genes and 29,447 predicted transcripts.

Componentsi <p>Genomic components encoding the proteome</p>

Component nameGenome Accession(s)
Component representationProteins
Chromosome 13810
Chromosome 2a4
Chromosome 2b7
Chromosome 32088
Chromosome 41457
Chromosome 51676
Chromosome 61966
Chromosome 71674
Chromosome 81286
Chromosome 91439
Chromosome 101556
Chromosome 112174
Chromosome 122080
Chromosome 13653
Chromosome 141317
Chromosome 151162
Chromosome 161423
Chromosome 172032
Chromosome 18594
Chromosome 192439
Chromosome 20971
Chromosome 21402
Chromosome 22906
Chromosome X1323


  1. "The mitochondrial DNA molecule of Sumatran orangutan and a molecular proposal for two (Bornean and Sumatran) species of orangutan."
    Xu X., Arnason U.
    J. Mol. Evol. 43:431-437(1996) [PubMed] [Europe PMC] [Abstract]
  2. "A 6x draft sequence assembly of the Pongo pygmaeus abelii genome."
    Wilson R.K., Mardis E.
    Submitted (FEB-2008) to the EMBL/GenBank/DDBJ databases
UniProt is an ELIXIR core data resource
Main funding by: National Institutes of Health

We'd like to inform you that we have updated our Privacy Notice to comply with Europe’s new General Data Protection Regulation (GDPR) that applies since 25 May 2018.

Do not show this banner again