Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.


StatusReference proteome
Proteinsi <p>Number of protein entries associated with this proteome: UniProtKB entries for regular proteomes or UniParc entries for redundant proteomes (<a href="/help/proteome_redundancy">more...</a>)</p> 14,776
Gene counti <p>This is the total number of unique genes found in the proteome set, algorithmically computed. For each gene, a single representative protein sequence is chosen from the proteome. Where possible, reviewed (Swiss-Prot) protein sequences are chosen as the representatives.</p> - Download one protein sequence per gene (FASTA)
Proteome IDi <p>The proteome identifier (UPID) is the unique identifier assigned to the set of proteins that constitute the <a href="">proteome</a>. It consists of the characters 'UP' followed by 9 digits, is stable across releases and can therefore be used to cite a UniProt proteome.<p><a href='/help/proteome_id' target='_top'>More...</a></p>UP000005204
Taxonomy7091 - Bombyx mori
Last modifiedMay 24, 2021
Genome assembly and annotationi <p>Identifier for the genome assembly (<a href="">more...</a>)</p> GCA_000151625.1 from EnsemblMetazoa full
Buscoi <p>The Benchmarking Universal Single-Copy Ortholog (BUSCO) assessment tool is used, for eukaryotic and bacterial proteomes, to provide quantitative measures of UniProt proteome data completeness in terms of expected gene content. BUSCO scores include percentages of complete (C) single-copy (S) genes, complete (C) duplicated (D) genes, fragmented (F) and missing (M) genes, as well as the total number of orthologous clusters (n) used in the BUSCO assessment, and the name of the taxonomic lineage dataset used.</p> C:84.8%[S:83.8%,D:1%],F:6.2%,M:9%,n:5286 lepidoptera_odb10
Completenessi <p>Complete Proteome Detector (CPD) is an algorithm which employs statistical evaluation of the completeness and quality of proteomes in UniProt, by looking at the sizes of taxonomically close proteomes. Possible values are 'Standard', 'Close to Standard' and 'Outlier'.</p> Standard

Bombyx mori, the silkworm, belongs to the order Lepidoptera in the phylum Arthropoda. Its larvae, which feed on mulberry leaves, are farmed for the production of silk (sericulture) and are thus of great economic importance. The origins of sericulture date back about 5,000 years and today B. mori is a fully domesticated insect species. In science, B. mori is used as a model of lepidopteran biology.

The genome sequence was assembled via the merging of two whole-genome shotgun sequencing (WGS) datasets and published in 2008. It comprises 432 Mb in 28 chromosomes containing roughly 14,500 predicted protein-coding genes. Upon publication, 3,000 of those genes did not have a homolog in other insect or vertebrate species. A large proportion of the genome, about 44%, appears to consists of transposable elements.

Componentsi <p>Genomic components encoding the proteome</p>

Component nameGenome Accession(s)
Component representationProteins
Unassembled WGS sequence14776


  1. "The genome of a lepidopteran model insect, the silkworm Bombyx mori."
    International Silkworm Genome Consortium
    Insect Biochem. Mol. Biol. 38:1036-1045(2008) [PubMed] [Europe PMC] [Abstract]
UniProt is an ELIXIR core data resource
Main funding by: National Institutes of Health

We'd like to inform you that we have updated our Privacy Notice to comply with Europe’s new General Data Protection Regulation (GDPR) that applies since 25 May 2018.

Do not show this banner again