Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.


Proteinsi <p>Number of protein entries associated with this proteome: UniProtKB entries for regular proteomes or UniParc entries for redundant proteomes (<a href="/help/proteome%5Fredundancy">more...</a>)</p> 4,990
Gene counti <p>This is the total number of unique genes found in the proteome set, algorithmically computed. For each gene, a single representative protein sequence is chosen from the proteome. Where possible, reviewed (Swiss-Prot) protein sequences are chosen as the representatives.</p> - Download one protein sequence per gene (FASTA)
Proteome IDi <p>The proteome identifier (UPID) is the unique identifier assigned to the set of proteins that constitute the <a href="">proteome</a>. It consists of the characters 'UP' followed by 9 digits, is stable across releases and can therefore be used to cite a UniProt proteome.<p><a href='/help/proteome_id' target='_top'>More...</a></p>UP000002424
Taxonomy322710 - Azotobacter vinelandii (strain DJ / ATCC BAA-1303)
StrainDJ / ATCC BAA-1303
Genome assembly and annotationi <p>Identifier for the genome assembly (<a href="">more...</a>)</p> GCA_000021045.1 from ENA/EMBL full
Pan proteomei <p>A pan proteome is the full set of proteins thought to be expressed by a group of highly related organisms (e.g. multiple strains of the same bacterial species).<p><a href='/help/pan_proteomes' target='_top'>More...</a></p> This proteome is part of the Azotobacter chroococcum NCIMB 8003 pan proteome (fasta)
Buscoi <p>The Benchmarking Universal Single-Copy Ortholog (BUSCO) assessment tool is used, for eukaryotic and bacterial proteomes, to provide quantitative measures of UniProt proteome data completeness in terms of expected gene content. BUSCO scores include percentages of complete (C) single-copy (S) genes, complete (C) duplicated (D) genes, fragmented (F) and missing (M) genes, as well as the total number of orthologous clusters (n) used in the BUSCO assessment, and the name of the taxonomic lineage dataset used.</p> C:98.2%[S:96.7%,D:1.5%],F:0.5%,M:1.3%,n:782 pseudomonadales_odb10
Completenessi <p>Complete Proteome Detector (CPD) is an algorithm which employs statistical evaluation of the completeness and quality of proteomes in UniProt, by looking at the sizes of taxonomically close proteomes. Possible values are 'Standard', 'Close to Standard' and 'Outlier'.</p> Close to standard (low value)

Azotobacter vinelandii is a free-living, obligately aerobic, nitrogen-fixing gamma-proteobacteria. It is found in soils world-wide, with features of nitrogen and energy metabolism relevant to agriculture. In response to carbon starvation it differentiates to form cysts that are impervious to chemical and physical challenge. Studies have been focused on its ability to fix diatmospheric nitrogen under free-living conditions, a process that occurs in the presence of oxygen levels that typically inactivate the nitrogenase enzyme. Unusually it encodes three distinct nitrogenase systems, the molybdenum, vanadium and iron-only nitrogenases, expression of which is differentially regulated by metal availability from the medium. Diazotrophic growth under aerobic conditions is possible because it adjusts oxygen-consumption rates to help maintain low levels of cytoplasmic oxygen, a phenomenon called respiratory protection. It is able to produce alginate, a polymer that further protects the organism from excess exogenous oxygen, and it has multiple duplications of alginate modification genes, which may alter alginate composition in response to oxygen availability. Other oxygen-sensitive enzymes have also been identified in the genome (carbon monoxide dehydrogenase, a formate dehydrogenase, and another hydrogenase). Genome annotation was done by supervised teams of undergraduate students using a web-based system over a preliminary automated annotation, both developed by J. C. Setubal (adapted from PubMed 19429624).

Componentsi <p>Genomic components encoding the proteome</p>

Component nameGenome Accession(s)
Component representationProteins
UniProt is an ELIXIR core data resource
Main funding by: National Institutes of Health

We'd like to inform you that we have updated our Privacy Notice to comply with Europe’s new General Data Protection Regulation (GDPR) that applies since 25 May 2018.

Do not show this banner again