Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.


StatusReference proteome
Proteinsi <p>Number of protein entries associated with this proteome: UniProtKB entries for regular proteomes or UniParc entries for redundant proteomes (<a href="/help/proteome_redundancy">more...</a>)</p> 10,465
Gene counti <p>This is the total number of unique genes found in the proteome set, algorithmically computed. For each gene, a single representative protein sequence is chosen from the proteome. Where possible, reviewed (Swiss-Prot) protein sequences are chosen as the representatives.</p> - Download one protein sequence per gene (FASTA)
Proteome IDi <p>The proteome identifier (UPID) is the unique identifier assigned to the set of proteins that constitute the <a href="">proteome</a>. It consists of the characters 'UP' followed by 9 digits, is stable across releases and can therefore be used to cite a UniProt proteome.<p><a href='/help/proteome_id' target='_top'>More...</a></p>UP000000759
Taxonomy556484 - Phaeodactylum tricornutum (strain CCAP 1055/1)
StrainCCAP 1055/1
Last modifiedMay 24, 2021
Genome assembly and annotationi <p>Identifier for the genome assembly (<a href="">more...</a>)</p> GCA_000150955.2 from ENA/EMBL full
Buscoi <p>The Benchmarking Universal Single-Copy Ortholog (BUSCO) assessment tool is used, for eukaryotic and bacterial proteomes, to provide quantitative measures of UniProt proteome data completeness in terms of expected gene content. BUSCO scores include percentages of complete (C) single-copy (S) genes, complete (C) duplicated (D) genes, fragmented (F) and missing (M) genes, as well as the total number of orthologous clusters (n) used in the BUSCO assessment, and the name of the taxonomic lineage dataset used.</p> C:97%[S:95%,D:2%],F:3%,M:0%,n:100 stramenopiles_odb10
Completenessi <p>Complete Proteome Detector (CPD) is an algorithm which employs statistical evaluation of the completeness and quality of proteomes in UniProt, by looking at the sizes of taxonomically close proteomes. Possible values are 'Standard', 'Close to Standard' and 'Outlier'.</p> Unknown

Phaeodactylum tricornutum (strain CCAP 1055/1) is a diatom, a major algal group and among the most common types of phytoplankton. It has been found in several locations around the world, typically in coastal areas with wide fluctuations in salinity. Unlike other diatoms, it can exist in different morphotypes, and changes in cell shape can be stimulated by environmental conditions.

The completed P. tricornutum genome is approximately 27 Mb and is predicted to contain 10,402 protein-genes. It is the second diatom for which a whole genome sequence has been generated, the first being Thalassiosira pseudonana. The availability of both diatom genomes provides the basis for comparative genomics studies of diatoms with other eukaryotes. The clone of P. tricornutum that was sequenced is CCAP 1055/1. It represents a monoclonal culture derived from a fusiform cell in May 2003 from strain CCMP632, which was originally isolated in 1956 off the coast of Blackpool, UK.

Componentsi <p>Genomic components encoding the proteome</p>

Component nameGenome Accession(s)
Component representationProteins
Chromosome 11037
Chromosome 2610
Chromosome 3552
Chromosome 4538
Chromosome 5427
Chromosome 6442
Chromosome 7389
Chromosome 8402
Chromosome 9378
Chromosome 10377
Chromosome 11367
Chromosome 12352
Chromosome 13357
Chromosome 14313
Chromosome 15322
Chromosome 16291
Chromosome 17259
Chromosome 18259
Chromosome 19258
Chromosome 20259
Chromosome 21254
Chromosome 22237
Chromosome 23191
Chromosome 24158
Chromosome 25182
Chromosome 26140
Chromosome 27148
Chromosome 28167
Chromosome 2995
Chromosome 30104
Chromosome 3178
Chromosome 3323
Chromosome 3244
Unassembled WGS sequence372


  1. "Chloroplast genomes of the diatoms Phaeodactylum tricornutum and Thalassiosira pseudonana: comparison with other plastid genomes of the red lineage."
    Oudot-Le Secq M.-P., Grimwood J., Shapiro H., Armbrust E.V., Bowler C., Green B.R.
    Mol. Genet. Genomics 277:427-439(2007) [PubMed] [Europe PMC] [Abstract]
  2. "The Phaeodactylum genome reveals the evolutionary history of diatom genomes."
    Bowler C., Allen A.E., Badger J.H., Grimwood J., Jabbari K., Kuo A., Maheswari U., Martens C., Maumus F., Otillar R.P., Rayko E., Salamov A., Vandepoele K., Beszteri B., Gruber A., Heijde M., Katinka M., Mock T.
    Grigoriev I.V.
    Nature 456:239-244(2008) [PubMed] [Europe PMC] [Abstract]
UniProt is an ELIXIR core data resource
Main funding by: National Institutes of Health

We'd like to inform you that we have updated our Privacy Notice to comply with Europe’s new General Data Protection Regulation (GDPR) that applies since 25 May 2018.

Do not show this banner again