Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein

Thiazole synthase

Gene

thiG

Organism
Gluconacetobacter diazotrophicus (strain ATCC 49037 / DSM 5601 / PAl5)
Status
Reviewed-Annotation score: Annotation score: 3 out of 5-Protein inferred from homologyi

Functioni

Catalyzes the rearrangement of 1-deoxy-D-xylulose 5-phosphate (DXP) to produce the thiazole phosphate moiety of thiamine. Sulfur is provided by the thiocarboxylate moiety of the carrier protein ThiS. In vitro, sulfur can be provided by H2S.UniRule annotation

Catalytic activityi

1-deoxy-D-xylulose 5-phosphate + 2-iminoacetate + thiocarboxy-adenylate-[sulfur-carrier protein ThiS] = 2-((2R,5Z)-2-carboxy-4-methylthiazol-5(2H)-ylidene)ethyl phosphate + [sulfur-carrier protein ThiS] + 2 H2O.UniRule annotation

Pathwayi: thiamine diphosphate biosynthesis

This protein is involved in the pathway thiamine diphosphate biosynthesis, which is part of Cofactor biosynthesis.UniRule annotation
View all proteins of this organism that are known to be involved in the pathway thiamine diphosphate biosynthesis and in Cofactor biosynthesis.

Sites

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Active sitei95 – 951Schiff-base intermediate with DXPUniRule annotation
Binding sitei156 – 1561DXP; via amide nitrogenUniRule annotation

GO - Molecular functioni

GO - Biological processi

Complete GO annotation...

Keywords - Molecular functioni

Transferase

Keywords - Biological processi

Thiamine biosynthesis

Keywords - Ligandi

Schiff base

Enzyme and pathway databases

BioCyciGDIA272568:GJPS-1814-MONOMER.
UniPathwayiUPA00060.

Names & Taxonomyi

Protein namesi
Recommended name:
Thiazole synthaseUniRule annotation (EC:2.8.1.10UniRule annotation)
Gene namesi
Name:thiGUniRule annotation
Ordered Locus Names:GDI1781, Gdia_0011
OrganismiGluconacetobacter diazotrophicus (strain ATCC 49037 / DSM 5601 / PAl5)
Taxonomic identifieri272568 [NCBI]
Taxonomic lineageiBacteriaProteobacteriaAlphaproteobacteriaRhodospirillalesAcetobacteraceaeGluconacetobacter
Proteomesi
  • UP000000736 Componenti: Chromosome
  • UP000001176 Componenti: Chromosome

Subcellular locationi

  • Cytoplasm UniRule annotation

GO - Cellular componenti

Complete GO annotation...

Keywords - Cellular componenti

Cytoplasm

PTM / Processingi

Molecule processing

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Chaini1 – 256256Thiazole synthasePRO_1000080871Add
BLAST

Interactioni

Subunit structurei

Homotetramer. Forms heterodimers with either ThiH or ThiS.UniRule annotation

Protein-protein interaction databases

STRINGi272568.Gdia_0011.

Structurei

3D structure databases

ProteinModelPortaliA9HI56.
ModBaseiSearch...
MobiDBiSearch...

Family & Domainsi

Region

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Regioni183 – 1842DXP bindingUniRule annotation
Regioni205 – 2062DXP bindingUniRule annotation

Sequence similaritiesi

Belongs to the ThiG family.UniRule annotation

Phylogenomic databases

eggNOGiENOG4105CA8. Bacteria.
COG2022. LUCA.
HOGENOMiHOG000248049.
KOiK03149.
OMAiAQYPSPA.
OrthoDBiEOG6KMBD9.

Family and domain databases

Gene3Di3.20.20.70. 1 hit.
HAMAPiMF_00443. ThiG.
InterProiIPR013785. Aldolase_TIM.
IPR008867. ThiG.
[Graphical view]
SUPFAMiSSF110399. SSF110399. 1 hit.

Sequencei

Sequence statusi: Complete.

A9HI56-1 [UniParc]FASTAAdd to basket

« Hide

        10         20         30         40         50
MLFYGTELKS RLMLGTAQYP SPEILSDAVR AAEAGVVTVS LRRESAGQRA
60 70 80 90 100
GQAFWSMIRD LGVPVLPNTA GCHTVKEAVT TAHMAREVFD TDWIKLEVIG
110 120 130 140 150
ESDTLQPDMF GLVEAARILS RDGFKVFPYM TEDLVGAERL LQAGCEVLMP
160 170 180 190 200
WGAPIGSGKG LNNVFGLRAL RAHFPDVPLV VDAGIGLPSH AAQAMELGYD
210 220 230 240 250
AVLINTAVAK AGDPVRMARA FRLAVEAGLI AREADPIEER DMAAPSTPVL

GRAMLA
Length:256
Mass (Da):27,401
Last modified:February 5, 2008 - v1
Checksum:i4C4F7AEF870EB6CB
GO

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
AM889285 Genomic DNA. Translation: CAP55724.1.
CP001189 Genomic DNA. Translation: ACI49813.1.
RefSeqiWP_012225292.1. NC_011365.1.

Genome annotation databases

EnsemblBacteriaiACI49813; ACI49813; Gdia_0011.
CAP55724; CAP55724; GDI1781.
KEGGigdi:GDI1781.
gdj:Gdia_0011.
PATRICi22048378. VBIGluDia203729_0011.

Cross-referencesi

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
AM889285 Genomic DNA. Translation: CAP55724.1.
CP001189 Genomic DNA. Translation: ACI49813.1.
RefSeqiWP_012225292.1. NC_011365.1.

3D structure databases

ProteinModelPortaliA9HI56.
ModBaseiSearch...
MobiDBiSearch...

Protein-protein interaction databases

STRINGi272568.Gdia_0011.

Protocols and materials databases

Structural Biology KnowledgebaseSearch...

Genome annotation databases

EnsemblBacteriaiACI49813; ACI49813; Gdia_0011.
CAP55724; CAP55724; GDI1781.
KEGGigdi:GDI1781.
gdj:Gdia_0011.
PATRICi22048378. VBIGluDia203729_0011.

Phylogenomic databases

eggNOGiENOG4105CA8. Bacteria.
COG2022. LUCA.
HOGENOMiHOG000248049.
KOiK03149.
OMAiAQYPSPA.
OrthoDBiEOG6KMBD9.

Enzyme and pathway databases

UniPathwayiUPA00060.
BioCyciGDIA272568:GJPS-1814-MONOMER.

Family and domain databases

Gene3Di3.20.20.70. 1 hit.
HAMAPiMF_00443. ThiG.
InterProiIPR013785. Aldolase_TIM.
IPR008867. ThiG.
[Graphical view]
SUPFAMiSSF110399. SSF110399. 1 hit.
ProtoNetiSearch...

Publicationsi

  1. "Complete genome sequence of the sugarcane nitrogen-fixing endophyte Gluconacetobacter diazotrophicus Pal5."
    Bertalan M., Albano R., de Padua V., Rouws L., Rojas C., Hemerly A., Teixeira K., Schwab S., Araujo J., Oliveira A., Franca L., Magalhaes V., Alqueres S., Cardoso A., Almeida W., Loureiro M.M., Nogueira E., Cidade D.
    , Oliveira D., Simao T., Macedo J., Valadao A., Dreschsel M., Freitas F., Vidal M., Guedes H., Rodrigues E., Meneses C., Brioso P., Pozzer L., Figueiredo D., Montano H., Junior J., de Souza Filho G., Martin Quintana Flores V., Ferreira B., Branco A., Gonzalez P., Guillobel H., Lemos M., Seibel L., Macedo J., Alves-Ferreira M., Sachetto-Martins G., Coelho A., Santos E., Amaral G., Neves A., Pacheco A.B., Carvalho D., Lery L., Bisch P., Rossle S.C., Urmenyi T., Rael Pereira A., Silva R., Rondinelli E., von Kruger W., Martins O., Baldani J.I., Ferreira P.C.
    BMC Genomics 10:450-450(2009) [PubMed] [Europe PMC] [Abstract]
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
    Strain: ATCC 49037 / DSM 5601 / PAl5.
  2. "Two genome sequences of the same bacterial strain, Gluconacetobacter diazotrophicus PAl 5, suggest a new standard in genome sequence submission."
    Giongo A., Tyler H.L., Zipperer U.N., Triplett E.W.
    Stand. Genomic Sci. 2:309-317(2010) [PubMed] [Europe PMC] [Abstract]
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
    Strain: ATCC 49037 / DSM 5601 / PAl5.

Entry informationi

Entry nameiTHIG_GLUDA
AccessioniPrimary (citable) accession number: A9HI56
Secondary accession number(s): B5ZJ35
Entry historyi
Integrated into UniProtKB/Swiss-Prot: May 20, 2008
Last sequence update: February 5, 2008
Last modified: July 6, 2016
This is version 57 of the entry and version 1 of the sequence. [Complete history]
Entry statusiReviewed (UniProtKB/Swiss-Prot)
Annotation programProkaryotic Protein Annotation Program

Miscellaneousi

Keywords - Technical termi

Complete proteome, Reference proteome

Documents

  1. PATHWAY comments
    Index of metabolic and biosynthesis pathways
  2. SIMILARITY comments
    Index of protein domains and families

Similar proteinsi

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a seed sequence).
50%UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.