Status: Reference proteome
Proteins: 2,041
    Number of protein entries associated with this proteome: UniProtKB entries for regular proteomes or UniParc entries for redundant proteomes.
Gene count: -
    The total number of unique genes found in the proteome set, algorithmically computed. For each gene, a single representative protein sequence is chosen from the proteome. Where possible, reviewed (Swiss-Prot) protein sequences are chosen as the representatives.
Proteome ID: UP000000819
    The proteome identifier (UPID) is the unique identifier assigned to the set of proteins that constitute the proteome. It consists of the characters 'UP' followed by 9 digits, is stable across releases and can therefore be used to cite a UniProt proteome.
Taxonomy: 284813 - Encephalitozoon cuniculi (strain GB-M1)
Genome assembly and annotation: GCA_000091225.2 from ENA/EMBL full
    Identifier for the genome assembly.
Pan proteome: This proteome is part of the Encephalitozoon cuniculi (strain GB-M1) (Microsporidian parasite) pan proteome.
    A pan proteome is the full set of proteins thought to be expressed by a group of highly related organisms (e.g. multiple strains of the same bacterial species).
BUSCO: C:100%[S:99.8%,D:0.2%],F:0%,M:0%,n:600 microsporidia_odb10
    The Benchmarking Universal Single-Copy Ortholog (BUSCO) assessment tool is used, for eukaryotic and bacterial proteomes, to provide quantitative measures of UniProt proteome data completeness in terms of expected gene content. BUSCO scores include percentages of complete (C) single-copy (S) genes, complete (C) duplicated (D) genes, fragmented (F) and missing (M) genes, as well as the total number of orthologous clusters (n) used in the BUSCO assessment, and the name of the taxonomic lineage dataset used.
Completeness: Unknown
    Complete Proteome Detector (CPD) is an algorithm which employs statistical evaluation of the completeness and quality of proteomes in UniProt, by looking at the sizes of taxonomically close proteomes. Possible values are 'Standard', 'Close to Standard' and 'Outlier'.
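
Two of the fields above have a fixed textual shape: the proteome ID ('UP' followed by 9 digits) and the BUSCO summary string. The following is a minimal Python sketch of how they might be checked and unpacked. The function names `is_valid_upid` and `parse_busco` are hypothetical helpers written for this illustration, not part of any UniProt or BUSCO tool, and the regular expression only covers the summary layout shown on this page.

```python
import re

# 'UP' followed by exactly 9 digits, as described for the proteome ID field.
UPID_RE = re.compile(r"UP\d{9}")

# Layout assumed from the BUSCO value on this page:
# C:<pct>%[S:<pct>%,D:<pct>%],F:<pct>%,M:<pct>%,n:<int> <lineage>
BUSCO_RE = re.compile(
    r"C:(?P<C>[\d.]+)%\[S:(?P<S>[\d.]+)%,D:(?P<D>[\d.]+)%\],"
    r"F:(?P<F>[\d.]+)%,M:(?P<M>[\d.]+)%,n:(?P<n>\d+)"
    r"(?:\s+(?P<lineage>\S+))?"
)


def is_valid_upid(text: str) -> bool:
    """Return True if text has the UPID shape ('UP' plus 9 digits)."""
    return UPID_RE.fullmatch(text) is not None


def parse_busco(summary: str) -> dict:
    """Split a BUSCO summary string into its named components."""
    match = BUSCO_RE.fullmatch(summary.strip())
    if match is None:
        raise ValueError(f"unrecognised BUSCO summary: {summary!r}")
    # C, S, D, F, M are percentages; n is the number of ortholog clusters.
    scores = {key: float(match[key]) for key in "CSDFM"}
    scores["n"] = int(match["n"])
    scores["lineage"] = match["lineage"]  # None if no lineage name is given
    return scores


if __name__ == "__main__":
    print(is_valid_upid("UP000000819"))
    print(parse_busco(
        "C:100%[S:99.8%,D:0.2%],F:0%,M:0%,n:600 microsporidia_odb10"
    ))
```

For this proteome, the parsed single-copy (S) and duplicated (D) percentages add up to the complete (C) percentage, which matches the way the field description groups them.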

Microsporidia are unicellular parasites that lack mitochondria and peroxisomes and are characterized by production of resistant spores. Encephalitozoon cuniculi is one of the 14 microsporidians that have been identified as human pathogens, causing various pathologies affecting the nervous system as well as the respiratory and digestive tracts. It can also infect a wide range of other hosts, being a major health issue for rabbits. Encephalitozoon cuniculi has a very small and specialized genome, which can be used as a tool for identifying genes that are involved in energy metabolism and in infections that parasites cause in humans.

The reference proteome for E. cuniculi is derived from the genome sequence published in 2001. The genome is remarkably compact, at about 2.9 Mb. This compaction is reflected in reduced intergenic spacers and in the shortness of most putative proteins relative to their eukaryotic orthologs.

Components (genomic components encoding the proteome)

Component name     Proteins
Chromosome I       145
Chromosome II      168
Chromosome III     166
Chromosome IV      189
Chromosome V       179
Chromosome VI      183
Chromosome VII     200
Chromosome VIII    221
Chromosome IX      221
Chromosome X       197
Chromosome XI      227
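
The per-chromosome counts above can be tallied with a short script. The following is a minimal Python sketch. `parse_components` is a hypothetical helper, and the row pattern is an assumption that accepts the chromosome's Roman numeral followed by the protein count, with or without whitespace between them. The counts sum to 2,096, which is not the same as the 2,041 listed under Proteins. The page does not say how entries are assigned to components, so the two figures need not agree.

```python
import re

# A row is 'Chromosome', a Roman numeral, then the protein count.
# Whitespace between numeral and count is optional, so 'Chromosome I145'
# and 'Chromosome I   145' are both accepted.
ROW_RE = re.compile(r"Chromosome\s+([IVX]+)\s*(\d+)")

TABLE = """\
Component name     Proteins
Chromosome I       145
Chromosome II      168
Chromosome III     166
Chromosome IV      189
Chromosome V       179
Chromosome VI      183
Chromosome VII     200
Chromosome VIII    221
Chromosome IX      221
Chromosome X       197
Chromosome XI      227
"""


def parse_components(text: str) -> dict:
    """Map each component name to its protein count; skip non-matching lines."""
    counts = {}
    for line in text.splitlines():
        match = ROW_RE.fullmatch(line.strip())
        if match:
            counts["Chromosome " + match.group(1)] = int(match.group(2))
    return counts


if __name__ == "__main__":
    counts = parse_components(TABLE)
    total = sum(counts.values())
    largest = max(counts, key=counts.get)
    print(f"{len(counts)} components, {total} proteins in total")
    print(f"largest component: {largest} ({counts[largest]} proteins)")
```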


  1. "Sequence and analysis of chromosome I of the amitochondriate intracellular parasite Encephalitozoon cuniculi (Microspora)."
    Peyret P., Katinka M.D., Duprat S., Duffieux F., Barbe V., Barbazanges M., Weissenbach J., Saurin W., Vivares C.P.
    Genome Res. 11:198-207 (2001)
