Protein

Thiazole synthase

Gene

thiG

Organism
Escherichia coli (strain SE11)
Status
Reviewed - Annotation score: 3 out of 5 - Protein inferred from homology

Function

Catalyzes the rearrangement of 1-deoxy-D-xylulose 5-phosphate (DXP) to produce the thiazole phosphate moiety of thiamine. Sulfur is provided by the thiocarboxylate moiety of the carrier protein ThiS. In vitro, sulfur can be provided by H2S. (UniRule annotation)

Catalytic activity

1-deoxy-D-xylulose 5-phosphate + 2-iminoacetate + thiocarboxy-adenylate-[sulfur-carrier protein ThiS] = 2-((2R,5Z)-2-carboxy-4-methylthiazol-5(2H)-ylidene)ethyl phosphate + [sulfur-carrier protein ThiS] + 2 H2O. (UniRule annotation)

Pathway: thiamine diphosphate biosynthesis

This protein is involved in the pathway thiamine diphosphate biosynthesis, which is part of Cofactor biosynthesis. (UniRule annotation)

Sites

Feature key     Position(s)   Length   Description
Active site     95 – 95       1        Schiff-base intermediate with DXP (UniRule annotation)
Binding site    156 – 156     1        DXP; via amide nitrogen (UniRule annotation)

GO - Molecular function

GO - Biological process

Keywords - Molecular function

Transferase

Keywords - Biological process

Thiamine biosynthesis

Keywords - Ligand

Schiff base

Enzyme and pathway databases

BioCyc: ECOL409438:GHUU-4369-MONOMER.
UniPathway: UPA00060.

Names & Taxonomy

Protein names
Recommended name: Thiazole synthase (EC 2.8.1.10) (UniRule annotation)
Gene names
Name: thiG (UniRule annotation)
Ordered Locus Names: ECSE_4278
Organism: Escherichia coli (strain SE11)
Taxonomic identifier: 409438 [NCBI]
Taxonomic lineage: Bacteria > Proteobacteria > Gammaproteobacteria > Enterobacteriales > Enterobacteriaceae > Escherichia
Proteomes
  • UP000008199 Component: Chromosome

Subcellular location

  • Cytoplasm (UniRule annotation)

GO - Cellular component

Keywords - Cellular component

Cytoplasm

PTM / Processing

Molecule processing

Feature key     Position(s)   Length   Description         Feature identifier
Chain           1 – 256       256      Thiazole synthase   PRO_1000196856

Interaction

Subunit structure

Homotetramer. Forms heterodimers with either ThiH or ThiS. (UniRule annotation)

Structure

3D structure databases

ProteinModelPortal: B6I5K1.
SMR: B6I5K1. Positions 1-238.

Family & Domains

Region

Feature key     Position(s)   Length   Description
Region          182 – 183     2        DXP binding (UniRule annotation)
Region          204 – 205     2        DXP binding (UniRule annotation)
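
The positions in the Sites and Region tables are 1-based and inclusive, so they can be mapped directly onto the sequence given in the Sequence section of this entry. The following is a minimal sketch of that lookup; the names `SEQUENCE`, `FEATURES`, and `residues` are illustrative and not part of any UniProt tooling.

```python
# Sequence copied from the Sequence section of this entry (B6I5K1).
SEQUENCE = (
    "MLRIADKTFDSHLFTGTGKFASSQLMVEAIRASGSQLVTLAMKRVDLRQH"
    "NDAILEPLIAAGVTLLPNTSGAKTAEEAIFAAHLAREALGTNWLKLEIHP"
    "DARWLLPDPIETLKAAETLVQQGFVVLPYCGADPVLCKRLEEVGCAAVMP"
    "LGAPIGSNQGLETRAMLEIIIQQATVPVVVDAGIGVPSHAAQALEMGADA"
    "VLVNTAIAVADDPVNMAKAFRLAVEAGLLARQSGPGSRSHFAHATSPLTG"
    "FLEASA"
)

# (feature key, start, end, description), positions as listed in the
# Sites and Region tables: 1-based, inclusive.
FEATURES = [
    ("Active site", 95, 95, "Schiff-base intermediate with DXP"),
    ("Binding site", 156, 156, "DXP; via amide nitrogen"),
    ("Region", 182, 183, "DXP binding"),
    ("Region", 204, 205, "DXP binding"),
]


def residues(seq: str, start: int, end: int) -> str:
    """Return the residues for a 1-based, inclusive position range."""
    return seq[start - 1:end]


for key, start, end, description in FEATURES:
    print(f"{key:<13}{start:>4}-{end:<4} {residues(SEQUENCE, start, end):<3} {description}")
```

With the sequence as printed in this entry, position 95 is a lysine (K), which fits the Schiff-base annotation, and position 156 is a glycine (G).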

Sequence similarities

Belongs to the ThiG family. (UniRule annotation)

Phylogenomic databases

HOGENOM: HOG000248049.
KO: K03149.
OMA: AQYPSPA.
OrthoDB: EOG6KMBD9.

Family and domain databases

Gene3D: 3.20.20.70. 1 hit.
HAMAP: MF_00443. ThiG.
InterPro: IPR013785. Aldolase_TIM.
          IPR008867. ThiG.
SUPFAM: SSF110399. SSF110399. 1 hit.

Sequence

Sequence status: Complete.

B6I5K1-1

        10         20         30         40         50
MLRIADKTFD SHLFTGTGKF ASSQLMVEAI RASGSQLVTL AMKRVDLRQH
        60         70         80         90        100
NDAILEPLIA AGVTLLPNTS GAKTAEEAIF AAHLAREALG TNWLKLEIHP
       110        120        130        140        150
DARWLLPDPI ETLKAAETLV QQGFVVLPYC GADPVLCKRL EEVGCAAVMP
       160        170        180        190        200
LGAPIGSNQG LETRAMLEII IQQATVPVVV DAGIGVPSHA AQALEMGADA
       210        220        230        240        250
VLVNTAIAVA DDPVNMAKAF RLAVEAGLLA RQSGPGSRSH FAHATSPLTG
FLEASA
Length: 256
Mass (Da): 26,870
Last modified: December 16, 2008 - v1
Checksum: 9767AEE617104E38
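
The length and mass above can be cross-checked from the sequence block itself. The sketch below strips the position rulers and spacing, then sums average residue masses plus one water molecule. The residue masses are standard average values supplied here as an assumption; they are not taken from this entry, and `clean_sequence` and `average_mass` are illustrative names.

```python
import re

# Sequence block as laid out on the page, position rulers included.
RAW_BLOCK = """
        10         20         30         40         50
MLRIADKTFD SHLFTGTGKF ASSQLMVEAI RASGSQLVTL AMKRVDLRQH
        60         70         80         90        100
NDAILEPLIA AGVTLLPNTS GAKTAEEAIF AAHLAREALG TNWLKLEIHP
       110        120        130        140        150
DARWLLPDPI ETLKAAETLV QQGFVVLPYC GADPVLCKRL EEVGCAAVMP
       160        170        180        190        200
LGAPIGSNQG LETRAMLEII IQQATVPVVV DAGIGVPSHA AQALEMGADA
       210        220        230        240        250
VLVNTAIAVA DDPVNMAKAF RLAVEAGLLA RQSGPGSRSH FAHATSPLTG
FLEASA
"""

# Average (not monoisotopic) residue masses in Da. These are assumed
# reference values, not data from this entry.
AVERAGE_RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167,
    "V": 99.1326, "T": 101.1051, "C": 103.1388, "L": 113.1594,
    "I": 113.1594, "N": 114.1038, "D": 115.0886, "Q": 128.1307,
    "K": 128.1741, "E": 129.1155, "M": 131.1926, "H": 137.1411,
    "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER_MASS = 18.01528  # one water for the free N- and C-termini


def clean_sequence(block: str) -> str:
    """Drop ruler digits and whitespace, keeping only residue letters."""
    return re.sub(r"[^A-Za-z]", "", block).upper()


def average_mass(sequence: str) -> float:
    """Sum the average residue masses and add one water molecule."""
    return sum(AVERAGE_RESIDUE_MASS[aa] for aa in sequence) + WATER_MASS


sequence = clean_sequence(RAW_BLOCK)
print(len(sequence))                   # 256
print(f"{average_mass(sequence):.1f}")  # compare with the Mass (Da) value above
```

Under these assumed residue masses the result should land within about 1 Da of the listed 26,870 Da. A larger gap would suggest a transcription error in the sequence block.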

Sequence databases

EMBL / GenBank / DDBJ: AP009240 Genomic DNA. Translation: BAG79802.1.
RefSeq: WP_000944103.1. NC_011415.1.

Genome annotation databases

EnsemblBacteria: BAG79802; BAG79802; ECSE_4278.
KEGG: ecy:ECSE_4278.
PATRIC: 18427262. VBIEscCol83070_4457.

Publications

  1. "Complete genome sequence and comparative analysis of the wild-type commensal Escherichia coli strain SE11 isolated from a healthy adult."
    Oshima K., Toh H., Ogura Y., Sasamoto H., Morita H., Park S.-H., Ooka T., Iyoda S., Taylor T.D., Hayashi T., Itoh K., Hattori M.
    DNA Res. 15:375-386(2008)
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
    Strain: SE11.

Entry information

Entry name: THIG_ECOSE
Accession: Primary (citable) accession number: B6I5K1
Entry history
Integrated into UniProtKB/Swiss-Prot: April 14, 2009
Last sequence update: December 16, 2008
Last modified: July 6, 2016
This is version 51 of the entry and version 1 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Prokaryotic Protein Annotation Program

Miscellaneous

Keywords - Technical term

Complete proteome

Documents

  1. PATHWAY comments
    Index of metabolic and biosynthesis pathways
  2. SIMILARITY comments
    Index of protein domains and families

Similar proteins

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%: UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%: UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a. the seed sequence).
50%: UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.