Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.


StatusReference proteome
Proteinsi <p>Number of protein entries associated with this proteome: UniProtKB entries for regular proteomes or UniParc entries for redundant proteomes (<a href="/help/proteome%5Fredundancy">more...</a>)</p> 7,959
Gene counti <p>This is the total number of unique genes found in the proteome set, algorithmically computed. For each gene, a single representative protein sequence is chosen from the proteome. Where possible, reviewed (Swiss-Prot) protein sequences are chosen as the representatives.</p> - Download one protein sequence per gene (FASTA)
Proteome IDi <p>The proteome identifier (UPID) is the unique identifier assigned to the set of proteins that constitute the <a href="">proteome</a>. It consists of the characters 'UP' followed by 9 digits, is stable across releases and can therefore be used to cite a UniProt proteome.<p><a href='/help/proteome_id' target='_top'>More...</a></p>UP000001926
Taxonomy5759 - Entamoeba histolytica
StrainATCC 30459 / HM-1:IMSS
Last modifiedMarch 8, 2021
Genome assembly and annotationi <p>Identifier for the genome assembly (<a href="">more...</a>)</p> GCA_000208925.2 from ENA/EMBL full
Buscoi <p>The Benchmarking Universal Single-Copy Ortholog (BUSCO) assessment tool is used, for eukaryotic and bacterial proteomes, to provide quantitative measures of UniProt proteome data completeness in terms of expected gene content. BUSCO scores include percentages of complete (C) single-copy (S) genes, complete (C) duplicated (D) genes, fragmented (F) and missing (F) genes, as well as the total number of orthologous clusters (n) used in the BUSCO assessment.</p> C:52.5%[S:46.3%,D:6.3%],F:5.1%,M:42.4%,n:255 eukaryota_odb10

Entamoeba histolytica is an anaerobic parasitic protozoan, part of the genus Entamoeba, which predominantly infects humans and other primates. It is an intestinal parasite and the causative agent of amoebiasis. This disease is a significant source of mortality in developing countries, and is estimated to infect about 40-50 million people worldwide.

The parasite feeds on bacteria in the lumen of the colon and lyses host epithelial cells after invasion of the intestinal wall. Entamoeba histolytica (strain HM-1:IMSS) is the first human amoeba to have its genome sequenced, assembled and analyzed.

The reference proteome of Entamoeba histolytica is derived from the genome sequence published in 2005. The genome is 24Mb in size and is split into 14 chromosomes. Approximately 9,938 predicted protein-coding genes with an average size of 1.2 kb comprise 49% of the genome. One third of this organism's predicted proteins do not have identifiable sequence homologs in other species.

Componentsi <p>Genomic components encoding the proteome</p>

Component nameGenome Accession(s)
Component representationProteins
Partially assembled WGS sequence7959
UniProt is an ELIXIR core data resource
Main funding by: National Institutes of Health

We'd like to inform you that we have updated our Privacy Notice to comply with Europe’s new General Data Protection Regulation (GDPR) that applies since 25 May 2018.

Do not show this banner again