Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.


StatusReference proteome
Gene counti <p>This is the total number of unique genes found in the proteome set, algorithmically computed. For each gene, a single representative protein sequence is chosen from the proteome. Where possible, reviewed (Swiss-Prot) protein sequences are chosen as the representatives.</p> - Download one protein sequence per gene (FASTA)
Proteome IDi <p>The proteome identifier (UPID) is the unique identifier assigned to the set of proteins that constitute the <a href="">proteome</a>. It consists of the characters ‘UP’ followed by 9 digits, is stable across releases and can therefore be used to cite a UniProt proteome.<p><a href='/help/proteome_id' target='_top'>More...</a></p>UP000001926
Taxonomy5759 - Entamoeba histolytica
StrainATCC 30459 / HM-1:IMSS
Last modifiedOctober 26, 2018
Genome assembly and annotationi GCA_000208925.2 from ENA/EMBL

Entamoeba histolytica is an anaerobic parasitic protozoan, part of the genus Entamoeba, which predominantly infects humans and other primates. It is an intestinal parasite and the causative agent of amoebiasis. This disease is a significant source of mortality in developing countries, and is estimated to infect about 40-50 million people worldwide.

The parasite feeds on bacteria in the lumen of the colon and lyses host epithelial cells after invasion of the intestinal wall. Entamoeba histolytica (strain HM-1:IMSS) is the first human amoeba to have its genome sequenced, assembled and analyzed.

The reference proteome of Entamoeba histolytica is derived from the genome sequence published in 2005. The genome is 24Mb in size and is split into 14 chromosomes. Approximately 9,938 predicted protein-coding genes with an average size of 1.2 kb comprise 49% of the genome. One third of this organism's predicted proteins do not have identifiable sequence homologs in other species.

Componentsi <p>Genomic components encoding the proteome</p>

Component nameGenome Accession(s)
Partially assembled WGS sequence7959
UniProt is an ELIXIR core data resource
Main funding by: National Institutes of Health

We'd like to inform you that we have updated our Privacy Notice to comply with Europe’s new General Data Protection Regulation (GDPR) that applies since 25 May 2018.

Do not show this banner again