Status: Reference proteome
Proteins: 31,341
    Number of protein entries associated with this proteome: UniProtKB entries for regular proteomes or UniParc entries for redundant proteomes.
Gene count: -
    The total number of unique genes found in the proteome set, algorithmically computed. For each gene, a single representative protein sequence is chosen from the proteome. Where possible, reviewed (Swiss-Prot) protein sequences are chosen as the representatives.
Proteome ID: UP000007754
    The proteome identifier (UPID) is the unique identifier assigned to the set of proteins that constitute the proteome. It consists of the characters 'UP' followed by 9 digits, is stable across releases and can therefore be used to cite a UniProt proteome.
Taxonomy: 59729 - Taeniopygia guttata
Last modified: December 21, 2020
Genome assembly: from Ensembl
    Identifier for the genome assembly.
Genome annotation: Ensembl
BUSCO: C:94.9%[S:49.2%,D:45.6%],F:1.1%,M:4.1%,n:10844 passeriformes_odb10
    The Benchmarking Universal Single-Copy Ortholog (BUSCO) assessment tool is used, for eukaryotic and bacterial proteomes, to provide quantitative measures of UniProt proteome data completeness in terms of expected gene content. BUSCO scores include percentages of complete (C) genes, split into single-copy (S) and duplicated (D), fragmented (F) and missing (M) genes, as well as the total number of orthologous clusters (n) used in the BUSCO assessment.
Completeness: Outlier (high)
    Complete Proteome Detector (CPD) is an algorithm that statistically evaluates the completeness and quality of proteomes in UniProt by looking at the sizes of taxonomically close proteomes. Possible values are 'Standard', 'Close to Standard' and 'Outlier'.
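
The identifier and BUSCO formats described above are regular enough to check and parse mechanically. The following is a minimal sketch, not a UniProt or BUSCO API: the function names `is_upid` and `parse_busco` are hypothetical, and the regular expressions assume the summary string has exactly the layout shown on this page.

```python
import re

# Assumption: a UPID is exactly 'UP' followed by 9 digits, as described above.
UPID_RE = re.compile(r"UP\d{9}")

# Assumption: the BUSCO summary follows the layout shown on this page,
# e.g. "C:94.9%[S:49.2%,D:45.6%],F:1.1%,M:4.1%,n:10844".
BUSCO_RE = re.compile(
    r"C:(?P<C>[\d.]+)%\[S:(?P<S>[\d.]+)%,D:(?P<D>[\d.]+)%\],"
    r"F:(?P<F>[\d.]+)%,M:(?P<M>[\d.]+)%,n:(?P<n>\d+)"
)


def is_upid(text):
    """Return True if text has the form of a proteome identifier."""
    return UPID_RE.fullmatch(text) is not None


def parse_busco(summary):
    """Extract the C, S, D, F, M percentages and the cluster count n."""
    match = BUSCO_RE.search(summary)
    if match is None:
        raise ValueError("not a recognised BUSCO summary: %r" % summary)
    # Percentages become floats; n is the integer number of clusters.
    scores = {
        key: float(value)
        for key, value in match.groupdict().items()
        if key != "n"
    }
    scores["n"] = int(match.group("n"))
    return scores


if __name__ == "__main__":
    print(is_upid("UP000007754"))
    print(parse_busco(
        "C:94.9%[S:49.2%,D:45.6%],F:1.1%,M:4.1%,n:10844 passeriformes_odb10"
    ))
```

The published percentages are rounded, so S and D need not sum exactly to C; any consistency check on parsed values should allow a small tolerance.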

Taeniopygia guttata, the zebra finch, belongs to the order Passeriformes. It is a model organism in neuroscience, especially for the study of vocal learning as a model for human speech.

The genome was published in 2010. The sequence is that of a male, chosen to maximize coverage of the Z chromosome. It comprises 1.2 Gb. Gene models were built by Ensembl using alignments with proteins from chicken, other birds and mammals.

Components: genomic components encoding the proteome

Component name      Proteins
Chromosome 1        2255
Chromosome 1A       1687
Chromosome 1B       0
Chromosome 2        2688
Chromosome 3        2333
Chromosome 4        1563
Chromosome 4A       654
Chromosome 5        1833
Chromosome 6        1064
Chromosome 7        1104
Chromosome 8        999
Chromosome 9        794
Chromosome 10       838
Chromosome 11       629
Chromosome 12       651
Chromosome 13       677
Chromosome 14       787
Chromosome 15       657
Chromosome 16       74
Chromosome 17       591
Chromosome 18       582
Chromosome 19       509
Chromosome 20       617
Chromosome 21       499
Chromosome 22       385
Chromosome 23       394
Chromosome 24       309
Chromosome 25       450
Chromosome 26       480
Chromosome 27       479
Chromosome 28       520
Chromosome LG5      0
Chromosome LG2      0
Chromosome LGE22    0
Chromosome Z        1495
