Fungal protein annotation project
The Fungal protein annotation project focuses on the manual annotation of fungi-specific proteins and protein families.
The fungal kingdom
Species of the fungal kingdom occupy almost every ecological niche, and include important human, animal and plant pathogens. They are of tremendous industrial importance. Not only are they employed in beer, wine, bread and cheese manufacturing, but they are also natural producers of antibiotics (i.e. penicillin), and can be engineered to make biotechnologically relevant enzymes like hemicellulases, vitamins, and other industrially important secondary metabolites. In addition, fungi share many important biological processes with higher eukaryotes, and their experimental tractability make certain species of fungi among the most useful model systems in cell biology.
Our major manual curation efforts are directed towards the annotation of proteins from the well-studied model yeasts Saccharomyces cerevisiae and Schizosaccharomyces pombe as well as the annotation of representative members of fungal-specific protein families from other model fungi like Aspergillus nidulans and Neurospora crassa, or the human pathogens Candida albicans and Cryptococcus neoformans. This is done with special emphasis on proteins of particular interest within these individual species and also on proteins belonging to conserved fungal families, but which have no discernable members in the two model yeasts S. cerevisiae and S. pombe. This allows an even broader coverage of biochemically characterized fungal proteins in our knowledgebase.
- All manually reviewed Saccharomyces cerevisiae entries can be found here (statistics)
- All manually reviewed Schizosaccharomyces pombe entries can be found here (statistics)
- All manually reviewed Aspergillus nidulans entries can be found here
- All manually reviewed Neurospora crassa entries can be found here
- All manually reviewed Candida albicans entries can be found here
- All manually reviewed Cryptococcus neoformans entries can be found here
With the availability of increasing numbers of completely sequenced fungal genomes, the scope of this annotation project was extended to cover the identification and annotation of the orthologs of characterized proteins in all fungal species. Orthologs are identified by similarity searches and phylogenetic analysis, and annotation from characterized proteins is transferred to other family members using strict rules, to avoid propagation of inaccurate information and preserve the high standard of UniProtKB/Swiss-Prot manual annotation.
S. cerevisiae has always been at the forefront of biochemical research. Its ease of cultivation coupled with its genetic tractability made it an organism of choice in the early days of research, and it has retained its pre-eminence in the post-genomics area: many innovative and large-scale investigative techniques have been pioneered in S. cerevisiae in recent years. Therefore yeast remains one of the (if not THE) best-characterized eukaryotic organisms, and the functions of orthologous proteins in other organisms are often inferred at least in part based on studies of their yeast counterpart. For this reason, the Fungal Protein Annotation project has prioritized S. cerevisiae annotation and pursued the goal of completely annotating all yeast ORFs thought to code for a protein, resulting in the complete S. cerevisiae proteome integration into UniProtKB/Swiss-Prot in early 2007.
Another yeast that was recognized early on as a valuable model organism for the study of conserved pathways and cell mechanisms is Schizosaccharomyces pombe, also commonly known as fission yeast. S. pombe shares many features with cells of more complicated eukaryotes and has long been used as a model for research into cell cycle progression and cell division. S. pombe grows by length extension and divides by medial fission, properties that allow for facile determination of cell cycle stage. Gene sequence comparisons and phylogenetic analyses have suggested that fission yeast diverged from budding yeast around 330-420 million years ago, and from Metazoa and plants around 1,000-1,200 million years ago, with some gene sequences as divergent between the two yeasts as they are from their human homologs. The wealth of biochemical information for S. pombe proteins, its interesting phylogenetic position and, not at least, the fact that it was the second completely sequenced genome of a fungus in 2002 prompted us to direct special annotation efforts to this species, which resulted in the complete S. pombe proteome integration into UniProtKB/Swiss-Prot in 2009.