Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.


StatusReference proteome
Proteinsi <p>Number of protein entries associated with this proteome: UniProtKB entries for regular proteomes or UniParc entries for redundant proteomes (<a href="/help/proteome%5Fredundancy">more...</a>)</p> 6,454
Gene counti <p>This is the total number of unique genes found in the proteome set, algorithmically computed. For each gene, a single representative protein sequence is chosen from the proteome. Where possible, reviewed (Swiss-Prot) protein sequences are chosen as the representatives.</p> - Download one protein sequence per gene (FASTA)
Proteome IDi <p>The proteome identifier (UPID) is the unique identifier assigned to the set of proteins that constitute the <a href="">proteome</a>. It consists of the characters 'UP' followed by 9 digits, is stable across releases and can therefore be used to cite a UniProt proteome.<p><a href='/help/proteome_id' target='_top'>More...</a></p>UP000001300
Taxonomy284591 - Yarrowia lipolytica (strain CLIB 122 / E 150)
StrainCLIB 122 / E 150
Last modifiedMarch 7, 2021
Genome assembly and annotationi <p>Identifier for the genome assembly (<a href="">more...</a>)</p> GCA_000002525.1 from ENA/EMBL full
Pan proteomei <p>A pan proteome is the full set of proteins thought to be expressed by a group of highly related organisms (e.g. multiple strains of the same bacterial species).<p><a href='/help/pan_proteomes' target='_top'>More...</a></p> This proteome is part of the Yarrowia lipolytica (strain CLIB 122 / E 150) (Yeast) (Candida lipolytica) pan proteome (fasta)
Buscoi <p>The Benchmarking Universal Single-Copy Ortholog (BUSCO) assessment tool is used, for eukaryotic and bacterial proteomes, to provide quantitative measures of UniProt proteome data completeness in terms of expected gene content. BUSCO scores include percentages of complete (C) single-copy (S) genes, complete (C) duplicated (D) genes, fragmented (F) and missing (F) genes, as well as the total number of orthologous clusters (n) used in the BUSCO assessment.</p> C:95.6%[S:95.2%,D:0.5%],F:1.3%,M:3%,n:2137 saccharomycetes_odb10
Completenessi <p>Complete Proteome Detector (CPD) is an algorithm which employs statistical evaluation of the completeness and quality of proteomes in UniProt, by looking at the sizes of taxonomically close proteomes. Possible values are 'Standard', 'Close to Standard' and 'Outlier'.</p> Close to standard (high value)

Yarrowia lipolytica is a methane-using yeast which assimilates hydrocarbons and produces citric acid from n-alkanes, vegetable oils or glucose under aerobic conditions. Yarrowia lipolytica is routinely isolated from different food media (ie.cheeses, sausages)It is a hemiascomycetous yeast and these yeasts represent a homogeneous phylogenetic group of eukaryotes with a relatively large diversity at physiological and ecological levels. With their relatively small and compact genomes, yeasts offer a unique opportunity to explore eukaryotic genome evolution by comparative analysis of several species. Yeasts are widely used as cell factories, for the production of beer, wine and bread and more recently of various metabolic products such as vitamins, ethanol, citric acid, lipids, and for assimilation of hydrocarbons ie. Yarrowia lipolytica. Complete sequencing and comparison of four hemiascomycetous yeasts has been undertaken these are: Candida glabrata, Kluyveromyces lactis , Debaryomyces hansenii , Yarrowia lipolytica. They were selected on the basis of their phylogenetic positions and their specific interest as human pathogens, or as industrially or environmentally important yeasts. Yarrowia lipolytica is very distantly related to the rest of the yeasts; instead it shares a number of common properties with filamentous fungi. For each species, the haploid type strain was sequenced. Of importance for evolutionary studies, the four yeast species display different mechanisms of sexuality. Yarrowia lipolytica has a haplo-diplontic cycle (that is, it alternates between haploid and diploid phases of similar importance). This work, which represents the first multispecies exploration of genome evolution across an entire eukaryotic phylum, reveals the variety of events and mechanisms that have taken place, and shouldallow useful comparisons with other phyla of multicellular organisms when more genome sequences are determined.

Componentsi <p>Genomic components encoding the proteome</p>

Component nameGenome Accession(s)
Component representationProteins
Chromosome A684
Chromosome B948
Chromosome F1318
Chromosome E1462
Chromosome C924
Chromosome D1097
UniProt is an ELIXIR core data resource
Main funding by: National Institutes of Health

We'd like to inform you that we have updated our Privacy Notice to comply with Europe’s new General Data Protection Regulation (GDPR) that applies since 25 May 2018.

Do not show this banner again