Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein

Thiazole synthase

Gene

thiG

Organism
Escherichia coli O139:H28 (strain E24377A / ETEC)
Status
Reviewed-Annotation score: Annotation score: 3 out of 5-Protein inferred from homologyi

Functioni

Catalyzes the rearrangement of 1-deoxy-D-xylulose 5-phosphate (DXP) to produce the thiazole phosphate moiety of thiamine. Sulfur is provided by the thiocarboxylate moiety of the carrier protein ThiS. In vitro, sulfur can be provided by H2S.UniRule annotation

Catalytic activityi

1-deoxy-D-xylulose 5-phosphate + 2-iminoacetate + thiocarboxy-adenylate-[sulfur-carrier protein ThiS] = 2-((2R,5Z)-2-carboxy-4-methylthiazol-5(2H)-ylidene)ethyl phosphate + [sulfur-carrier protein ThiS] + 2 H2O.UniRule annotation

Pathwayi: thiamine diphosphate biosynthesis

This protein is involved in the pathway thiamine diphosphate biosynthesis, which is part of Cofactor biosynthesis.UniRule annotation
View all proteins of this organism that are known to be involved in the pathway thiamine diphosphate biosynthesis and in Cofactor biosynthesis.

Sites

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Active sitei95 – 951Schiff-base intermediate with DXPUniRule annotation
Binding sitei156 – 1561DXP; via amide nitrogenUniRule annotation

GO - Molecular functioni

GO - Biological processi

Complete GO annotation...

Keywords - Molecular functioni

Transferase

Keywords - Biological processi

Thiamine biosynthesis

Keywords - Ligandi

Schiff base

Enzyme and pathway databases

BioCyciECOL331111:GH7P-4513-MONOMER.
UniPathwayiUPA00060.

Names & Taxonomyi

Protein namesi
Recommended name:
Thiazole synthaseUniRule annotation (EC:2.8.1.10UniRule annotation)
Gene namesi
Name:thiGUniRule annotation
Ordered Locus Names:EcE24377A_4532
OrganismiEscherichia coli O139:H28 (strain E24377A / ETEC)
Taxonomic identifieri331111 [NCBI]
Taxonomic lineageiBacteriaProteobacteriaGammaproteobacteriaEnterobacterialesEnterobacteriaceaeEscherichia
Proteomesi
  • UP000001122 Componenti: Chromosome

Subcellular locationi

  • Cytoplasm UniRule annotation

GO - Cellular componenti

Complete GO annotation...

Keywords - Cellular componenti

Cytoplasm

PTM / Processingi

Molecule processing

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Chaini1 – 256256Thiazole synthasePRO_1000060249Add
BLAST

Interactioni

Subunit structurei

Homotetramer. Forms heterodimers with either ThiH or ThiS.UniRule annotation

Structurei

3D structure databases

ProteinModelPortaliA7ZUK5.
SMRiA7ZUK5. Positions 1-238.
ModBaseiSearch...
MobiDBiSearch...

Family & Domainsi

Region

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Regioni182 – 1832DXP bindingUniRule annotation
Regioni204 – 2052DXP bindingUniRule annotation

Sequence similaritiesi

Belongs to the ThiG family.UniRule annotation

Phylogenomic databases

HOGENOMiHOG000248049.
KOiK03149.
OMAiAQYPSPA.
OrthoDBiEOG6KMBD9.

Family and domain databases

Gene3Di3.20.20.70. 1 hit.
HAMAPiMF_00443. ThiG.
InterProiIPR013785. Aldolase_TIM.
IPR008867. ThiG.
[Graphical view]
SUPFAMiSSF110399. SSF110399. 1 hit.

Sequencei

Sequence statusi: Complete.

A7ZUK5-1 [UniParc]FASTAAdd to basket

« Hide

        10         20         30         40         50
MLRIADKTFD SHLFTGTGKF ASSQLMVEAI RASGSQLVTL AMKRVDLRQH
60 70 80 90 100
NDAILEPLIA AGVTLLPNTS GAKTAEEAIF AAHLAREALG TNWLKLEIHP
110 120 130 140 150
DARWLLPDPI ETLKAAETLV QQGFVVLPYC GADPVLCKRL EEVGCAAVMP
160 170 180 190 200
LGAPIGSNQG LETRAMLEII IQQATVPVVV DAGIGVPSHA AQALEMGADA
210 220 230 240 250
VLVNTAIAVA DDPVNMAKAF RLAVEAGLLA RQSGPGSRSH FAHATSPLTG

FLEASA
Length:256
Mass (Da):26,870
Last modified:October 23, 2007 - v1
Checksum:i9767AEE617104E38
GO

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
CP000800 Genomic DNA. Translation: ABV17396.1.
RefSeqiWP_000944103.1. NC_009801.1.

Genome annotation databases

EnsemblBacteriaiABV17396; ABV17396; EcE24377A_4532.
KEGGiecw:EcE24377A_4532.
PATRICi18298330. VBIEscCol31211_4745.

Cross-referencesi

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
CP000800 Genomic DNA. Translation: ABV17396.1.
RefSeqiWP_000944103.1. NC_009801.1.

3D structure databases

ProteinModelPortaliA7ZUK5.
SMRiA7ZUK5. Positions 1-238.
ModBaseiSearch...
MobiDBiSearch...

Protocols and materials databases

Structural Biology KnowledgebaseSearch...

Genome annotation databases

EnsemblBacteriaiABV17396; ABV17396; EcE24377A_4532.
KEGGiecw:EcE24377A_4532.
PATRICi18298330. VBIEscCol31211_4745.

Phylogenomic databases

HOGENOMiHOG000248049.
KOiK03149.
OMAiAQYPSPA.
OrthoDBiEOG6KMBD9.

Enzyme and pathway databases

UniPathwayiUPA00060.
BioCyciECOL331111:GH7P-4513-MONOMER.

Family and domain databases

Gene3Di3.20.20.70. 1 hit.
HAMAPiMF_00443. ThiG.
InterProiIPR013785. Aldolase_TIM.
IPR008867. ThiG.
[Graphical view]
SUPFAMiSSF110399. SSF110399. 1 hit.
ProtoNetiSearch...

Publicationsi

  1. "The pangenome structure of Escherichia coli: comparative genomic analysis of E. coli commensal and pathogenic isolates."
    Rasko D.A., Rosovitz M.J., Myers G.S.A., Mongodin E.F., Fricke W.F., Gajer P., Crabtree J., Sebaihia M., Thomson N.R., Chaudhuri R., Henderson I.R., Sperandio V., Ravel J.
    J. Bacteriol. 190:6881-6893(2008) [PubMed] [Europe PMC] [Abstract]
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
    Strain: E24377A / ETEC.

Entry informationi

Entry nameiTHIG_ECO24
AccessioniPrimary (citable) accession number: A7ZUK5
Entry historyi
Integrated into UniProtKB/Swiss-Prot: February 5, 2008
Last sequence update: October 23, 2007
Last modified: July 6, 2016
This is version 54 of the entry and version 1 of the sequence. [Complete history]
Entry statusiReviewed (UniProtKB/Swiss-Prot)
Annotation programProkaryotic Protein Annotation Program

Miscellaneousi

Keywords - Technical termi

Complete proteome

Documents

  1. PATHWAY comments
    Index of metabolic and biosynthesis pathways
  2. SIMILARITY comments
    Index of protein domains and families

Similar proteinsi

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a seed sequence).
50%UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.