Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.

Overview

StatusReference proteome
Proteinsi <p>Number of protein entries associated with this proteome: UniProtKB entries for regular proteomes or UniParc entries for redundant proteomes (<a href="/help/proteome%5Fredundancy">more...</a>)</p> 18,165
Gene counti <p>This is the total number of unique genes found in the proteome set, algorithmically computed. For each gene, a single representative protein sequence is chosen from the proteome. Where possible, reviewed (Swiss-Prot) protein sequences are chosen as the representatives.</p> - Download one protein sequence per gene (FASTA)
Proteome IDi <p>The proteome identifier (UPID) is the unique identifier assigned to the set of proteins that constitute the <a href="http://www.uniprot.org/manual/proteomes%5Fmanual">proteome</a>. It consists of the characters 'UP' followed by 9 digits, is stable across releases and can therefore be used to cite a UniProt proteome.<p><a href='/help/proteome_id' target='_top'>More...</a></p>UP000009192
Taxonomy7230 - Drosophila mojavensis
StrainTucson 15081-1352.22
Last modifiedMarch 4, 2021
Genome assembly and annotationi <p>Identifier for the genome assembly (<a href="https://www.ensembl.org/Help/Faq?id=216">more...</a>)</p> GCA_000005175.1 from ENA/EMBL full
Buscoi <p>The Benchmarking Universal Single-Copy Ortholog (BUSCO) assessment tool is used, for eukaryotic and bacterial proteomes, to provide quantitative measures of UniProt proteome data completeness in terms of expected gene content. BUSCO scores include percentages of complete (C) single-copy (S) genes, complete (C) duplicated (D) genes, fragmented (F) and missing (F) genes, as well as the total number of orthologous clusters (n) used in the BUSCO assessment.</p> C:99.3%[S:73.3%,D:26.1%],F:0.2%,M:0.4%,n:3285 diptera_odb10
Completenessi <p>Complete Proteome Detector (CPD) is an algorithm which employs statistical evaluation of the completeness and quality of proteomes in UniProt, by looking at the sizes of taxonomically close proteomes. Possible values are 'Standard', 'Close to Standard' and 'Outlier'.</p> Standard

Drosophila mojavensis (repleta group, mulleri subgroup) is a cactophilic fruit fly from the southwestern United States and Mexico. It is found in four different geographic regions and feeds on a different cactus host in each of these areas. Populations of the fruit fly are isolated by areas in which there are no host plants, potentially limiting their gene flow and making D. mojavensis a good model for speciation studies.

D. mojavensis is one of 12 fruit fly genomes sequenced for a large comparative study by the Drosophila 12 Genomes Consortium. Its genome is 130 Mb in size and contains 14,849 protein-coding genes (81% of which have homologues in D. melanogaster). The sequenced strain of D. mojavensis was from a small, isolated population on the Santa Catalina Island that feeds on prickly pear.

Componentsi <p>Genomic components encoding the proteome</p>

Component nameGenome Accession(s)
Component representationProteins
Mitochondrion13
Unassembled WGS sequence18152

Publications

  1. "Comparative genomics of Drosophila mtDNA: Novel features of conservation and change across functional domains and lineages."
    Montooth K.L., Abt D.N., Hofmann J.W., Rand D.M.
    J. Mol. Evol. 69:94-114(2009) [PubMed] [Europe PMC] [Abstract]
  2. "Evolution of genes and genomes on the Drosophila phylogeny."
    Drosophila 12 genomes consortium
    Nature 450:203-218(2007) [PubMed] [Europe PMC] [Abstract]
UniProt is an ELIXIR core data resource
Main funding by: National Institutes of Health

We'd like to inform you that we have updated our Privacy Notice to comply with Europe’s new General Data Protection Regulation (GDPR) that applies since 25 May 2018.

Do not show this banner again