Skip Header

Contribute Send feedback
Read comments (?) or add your own

P30139 (THIG_ECOLI) Reviewed, UniProtKB/Swiss-Prot

Last modified January 25, 2012. Version 96. Feed History...

Clusters with 100%, 90%, 50% identity | Documents (3) | Third-party data text xml rdf/xml gff fasta
to top of pageNames·Attributes·General annotation·Ontologies·Sequence annotation·Sequences·References·Cross-refs·Entry info·DocumentsCustomize order

Names and origin

Protein namesRecommended name:
Thiazole synthase

EC=2.8.1.10
Gene names
Name:thiG
Ordered Locus Names:b3991, JW5549
OrganismEscherichia coli (strain K12)
Taxonomic identifier83333 [NCBI]
Taxonomic lineageBacteriaProteobacteriaGammaproteobacteriaEnterobacterialesEnterobacteriaceaeEscherichia

Protein attributes

Sequence length256 AA.
Sequence statusComplete.
Protein existenceEvidence at protein level

General annotation (Comments)

Function

Catalyzes the rearrangement of 1-deoxy-D-xylulose 5-phosphate (DXP) to produce the thiazole phosphate moiety of thiamine. Sulfur is provided by the thiocarboxylate moiety of the carrier protein ThiS. In vitro, sulfur can be provided by H2S. Ref.5

Catalytic activity

1-deoxy-D-xylulose 5-phosphate + 2-iminoacetate + thiocarboxy-adenylate-[sulfur-carrier protein ThiS] = 2-((2R,5Z)-2-carboxy-4-methylthiazol-5(2H)-ylidene)ethyl phosphate + [sulfur-carrier protein ThiS] + 2 H2O. HAMAP MF_00443

Pathway

Cofactor biosynthesis; thiamine diphosphate biosynthesis. HAMAP MF_00443

Subunit structure

Homotetramer. Forms heterodimers with either ThiH or ThiS. Ref.5

Subcellular location

Cytoplasm HAMAP MF_00443.

Sequence similarities

Belongs to the ThiG family.

Mass spectrometry

Molecular mass is 26893.3 Da from positions 1 - 256. Determined by ESI. Ref.5

Molecular mass is 26896.5 Da from positions 1 - 256. Determined by ESI. Ref.6

Sequence caution

The sequence AAC43089.1 differs from that shown. Reason: Erroneous initiation.

Ontologies

Keywords
   Biological processThiamine biosynthesis
   Cellular componentCytoplasm
   LigandSchiff base
   Molecular functionTransferase
   Technical termComplete proteome
Direct protein sequencing
Reference proteome
Gene Ontology (GO)
   Biological processthiamine biosynthetic process

Inferred from electronic annotation. Source: UniProtKB-KW

   Cellular componentcytoplasm

Inferred from electronic annotation. Source: UniProtKB-SubCell

   Molecular functionlyase activity

Inferred from electronic annotation. Source: UniProtKB-KW

Complete GO annotation...

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Chain1 – 256256Thiazole synthase HAMAP MF_00443
PRO_0000162815

Regions

Region182 – 1832DXP binding By similarity
Region204 – 2052DXP binding By similarity

Sites

Active site951Schiff-base intermediate with DXP By similarity
Binding site1561DXP; via amide nitrogen By similarity

Sequences

Sequence LengthMass (Da)Tools
P30139 [UniParc].

Last modified August 29, 2001. Version 3.
Checksum: 8B0BDEE617104E38

FASTA25626,896
        10         20         30         40         50         60 
MLRIADKTFD SHLFTGTGKF ASSQLMVEAI RASGSQLVTL AMKRVDLRQH NDAILEPLIA 

        70         80         90        100        110        120 
AGVTLLPNTS GAKTAEEAIF AAHLAREALG TNWLKLEIHP DARWLLPDPI ETLKAAETLV 

       130        140        150        160        170        180 
QQGFVVLPYC GADPVLCKRL EEVGCAAVMP LGAPIGSNQG LETRAMLEII IQQATVPVVV 

       190        200        210        220        230        240 
DAGIGVPSHA AQALEMGADA VLVNTAIAVA DDPVNMAKAF RLAVEAGLLA RQSGPGSRSY 

       250 
FAHATSPLTG FLEASA 

« Hide

References

« Hide 'large scale' references
[1]"Structural genes for thiamine biosynthetic enzymes (thiCEFGH) in Escherichia coli K-12."
Vander Horn P.B., Backstrom A.D., Stewart V., Begley T.P.
J. Bacteriol. 175:982-992(1993) [PubMed: 8432721] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA].
Strain: K12.
[2]"Analysis of the Escherichia coli genome. IV. DNA sequence of the region from 89.2 to 92.8 minutes."
Blattner F.R., Burland V.D., Plunkett G. III, Sofia H.J., Daniels D.L.
Nucleic Acids Res. 21:5408-5417(1993) [PubMed: 8265357] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: K12 / MG1655 / ATCC 47076.
[3]"The complete genome sequence of Escherichia coli K-12."
Blattner F.R., Plunkett G. III, Bloch C.A., Perna N.T., Burland V., Riley M., Collado-Vides J., Glasner J.D., Rode C.K., Mayhew G.F., Gregor J., Davis N.W., Kirkpatrick H.A., Goeden M.A., Rose D.J., Mau B., Shao Y.
Science 277:1453-1474(1997) [PubMed: 9278503] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: K12 / MG1655 / ATCC 47076.
[4]"Highly accurate genome sequences of Escherichia coli K-12 strains MG1655 and W3110."
Hayashi K., Morooka N., Yamamoto Y., Fujita K., Isono K., Choi S., Ohtsubo E., Baba T., Wanner B.L., Mori H., Horiuchi T.
Mol. Syst. Biol. 2:E1-E5(2006) [PubMed: 16738553] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: K12 / W3110 / ATCC 27325 / DSM 5911.
[5]"Thiamine biosynthesis in Escherichia coli: isolation and initial characterisation of the ThiGH complex."
Leonardi R., Fairhurst S.A., Kriek M., Lowe D.J., Roach P.L.
FEBS Lett. 539:95-99(2003) [PubMed: 12650933] [Abstract]
Cited for: PROTEIN SEQUENCE OF 1-7, FUNCTION, MASS SPECTROMETRY, SUBUNIT, INTERACTION WITH THIH.
[6]"Efficient sequence analysis of the six gene products (7-74 kDa) from the Escherichia coli thiamin biosynthetic operon by tandem high-resolution mass spectrometry."
Kelleher N.L., Taylor S.V., Grannis D., Kinsland C., Chiu H.-J., Begley T.P., McLafferty F.W.
Protein Sci. 7:1796-1801(1998) [PubMed: 10082377] [Abstract]
Cited for: MASS SPECTROMETRY.
+Additional computationally mapped references.

Cross-references

Sequence databases

EMBL
GenBank
DDBJ
M88701 Genomic DNA. Translation: AAB95621.1.
U00006 Genomic DNA. Translation: AAC43089.1. Different initiation.
U00096 Genomic DNA. Translation: AAC76965.2.
AP009048 Genomic DNA. Translation: BAE77329.1.
PIRB65206.
RefSeqNP_418418.2. NC_000913.2.

3D structure databases

ProteinModelPortalP30139.
SMRP30139. Positions 1-238.
ModBaseSearch...

Protein-protein interaction databases

DIPDIP-6868N.
IntActP30139. 9 interactions.
MINTMINT-1289283.

Protocols and materials databases

StructuralBiologyKnowledgebaseSearch...

Genome annotation databases

EnsemblBacteriaEBESCT00000000061; EBESCP00000000061; EBESCG00000000047.
EBESCT00000016457; EBESCP00000015748; EBESCG00000015517.
GeneID948493.
GenomeReviewsGene locus JW5549 in contig AP009048_GR.
Gene locus b3991 in contig U00096_GR.
KEGGecj:JW5549.
eco:b3991.
PATRIC32123503. VBIEscCol129921_4104.

Organism-specific databases

EchoBASEEB1547.
EcoGeneEG11589. thiG.

Phylogenomic databases

eggNOGCOG2022.
GeneTreeEBGT00050000011495.
HOGENOMHBG296821.
OMATAGCFTA.
PhylomeDBP30139.
ProtClustDBPRK00208.

Enzyme and pathway databases

BioCycEcoCyc:THIG-MONOMER.
MetaCyc:THIG-MONOMER.

Gene expression databases

GenevestigatorP30139.

Family and domain databases

HAMAPMF_00443. ThiG.
[Tree]
InterProIPR013785. Aldolase_TIM.
IPR008867. ThiG.
[Graphical view]
Gene3DG3DSA:3.20.20.70. Aldolase_TIM. 1 hit.
KOK03149.
PfamPF05690. ThiG. 1 hit.
[Graphical view]
SUPFAMSSF110399. ThiG. 1 hit.
ProtoNetSearch...

Entry information

Entry nameTHIG_ECOLI
AccessionPrimary (citable) accession number: P30139
Secondary accession number(s): P76779, Q2M8S7
Entry history
Integrated into UniProtKB/Swiss-Prot: April 1, 1993
Last sequence update: August 29, 2001
Last modified: January 25, 2012
This is version 96 of the entry and version 3 of the sequence. [Complete history]
Entry statusReviewed (UniProtKB/Swiss-Prot)
Annotation programProkaryotic Protein Annotation Program

Relevant documents

Escherichia coli

Escherichia coli (strain K12): entries and cross-references to EcoGene

PATHWAY comments

Index of metabolic and biosynthesis pathways

SIMILARITY comments

Index of protein domains and families