Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.


StatusReference proteome
Proteinsi <p>Number of protein entries associated with this proteome: UniProtKB entries for regular proteomes or UniParc entries for redundant proteomes (<a href="/help/proteome%5Fredundancy">more...</a>)</p> 22,080
Gene counti <p>This is the total number of unique genes found in the proteome set, algorithmically computed. For each gene, a single representative protein sequence is chosen from the proteome. Where possible, reviewed (Swiss-Prot) protein sequences are chosen as the representatives.</p> - Download one protein sequence per gene (FASTA)
Proteome IDi <p>The proteome identifier (UPID) is the unique identifier assigned to the set of proteins that constitute the <a href="">proteome</a>. It consists of the characters 'UP' followed by 9 digits, is stable across releases and can therefore be used to cite a UniProt proteome.<p><a href='/help/proteome_id' target='_top'>More...</a></p>UP000000803
Taxonomy7227 - Drosophila melanogaster
Genome assembly and annotationi <p>Identifier for the genome assembly (<a href="">more...</a>)</p> GCA_000001215.4 from ENA/EMBL full
Buscoi <p>The Benchmarking Universal Single-Copy Ortholog (BUSCO) assessment tool is used, for eukaryotic and bacterial proteomes, to provide quantitative measures of UniProt proteome data completeness in terms of expected gene content. BUSCO scores include percentages of complete (C) single-copy (S) genes, complete (C) duplicated (D) genes, fragmented (F) and missing (M) genes, as well as the total number of orthologous clusters (n) used in the BUSCO assessment, and the name of the taxonomic lineage dataset used.</p> C:100%[S:41.8%,D:58.2%],F:0%,M:0%,n:3285 diptera_odb10
Completenessi <p>Complete Proteome Detector (CPD) is an algorithm which employs statistical evaluation of the completeness and quality of proteomes in UniProt, by looking at the sizes of taxonomically close proteomes. Possible values are 'Standard', 'Close to Standard' and 'Outlier'.</p> Outlier (high value)

Drosophila melanogaster is a fruit fly and the most studied species from the family Drosophilidae. It has been used as a model organism for genetic and developmental studies since the first experiments on sex-linkage and genetic recombination carried out by Thomas Hunt Morgan in the 1900s. Its importance for human health was recognized by the award of the Nobel Prize in medicine/physiology to Ed Lewis, Christiane Nusslein-Volhard and Eric Wieschaus in 1995 for their research on the genetic control of embryonic development. Moreover, recent genetic analyses have shown that around 75% of human disease genes have homologs in Drosophila melanogaster

The Drosophila melanogaster complete genome sequence was published in 2000. It contains 180 Mb (one third of which is centric heterochromatin) and approximately 14,000 protein-coding genes.

Componentsi <p>Genomic components encoding the proteome</p>

Component nameGenome Accession(s)
Component representationProteins
Chromosome 2L3886
Chromosome 2R4590
Chromosome 3L4400
Chromosome 3R5306
Chromosome 4201
Chromosome X3660
Chromosome Y23
Unplaced scaffolds3


  1. "The transposable elements of the Drosophila melanogaster euchromatin: a genomics perspective."
    Kaminker J.S., Bergman C.M., Kronmiller B., Carlson J., Svirskas R., Patel S., Frise E., Wheeler D.A., Lewis S.E., Rubin G.M., Ashburner M., Celniker S.E.
    Genome Biol. 3:RESEARCH0084-RESEARCH0084(2002) [PubMed] [Europe PMC] [Abstract]
  2. "Heterochromatic sequences in a Drosophila whole-genome shotgun assembly."
    Hoskins R.A., Smith C.D., Carlson J.W., Carvalho A.B., Halpern A., Kaminker J.S., Kennedy C., Mungall C.J., Sullivan B.A., Sutton G.G., Yasuhara J.C., Wakimoto B.T., Myers E.W., Celniker S.E., Rubin G.M., Karpen G.H.
    Genome Biol. 3:RESEARCH0085-RESEARCH0085(2002) [PubMed] [Europe PMC] [Abstract]
  3. "Combined evidence annotation of transposable elements in genome sequences."
    Quesneville H., Bergman C.M., Andrieu O., Autard D., Nouaud D., Ashburner M., Anxolabehere D.
    PLoS Comput. Biol. 1:166-175(2005) [PubMed] [Europe PMC] [Abstract]
  4. "The genome sequence of Drosophila melanogaster."
    Adams M.D., Celniker S.E., Holt R.A., Evans C.A., Gocayne J.D., Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A., Galle R.F., George R.A., Lewis S.E., Richards S., Ashburner M., Henderson S.N., Sutton G.G., Wortman J.R., Yandell M.D.
    Venter J.C.
    Science 287:2185-2195(2000) [PubMed] [Europe PMC] [Abstract]
  5. "The Release 5.1 annotation of Drosophila melanogaster heterochromatin."
    Smith C.D., Shu S., Mungall C.J., Karpen G.H.
    Science 316:1586-1591(2007) [PubMed] [Europe PMC] [Abstract]
  6. Wan K., Celniker S.
    Submitted (AUG-2014) to the EMBL/GenBank/DDBJ databases
UniProt is an ELIXIR core data resource
Main funding by: National Institutes of Health

We'd like to inform you that we have updated our Privacy Notice to comply with Europe’s new General Data Protection Regulation (GDPR) that applies since 25 May 2018.

Do not show this banner again