Protein

Thiamine-binding periplasmic protein

Gene

thiB

Organism
Escherichia coli (strain K12)
Status
Reviewed - Annotation score: 3 out of 5 - Experimental evidence at protein level

Function

Part of the ABC transporter complex ThiBPQ involved in thiamine import (1 publication).

Keywords - Biological process

Transport

Enzyme and pathway databases

BioCyc: EcoCyc:SFUA-MONOMER.
        ECOL316407:JW0067-MONOMER.

Protein family/group databases

TCDB: 3.A.1.19.1. The ATP-binding cassette (ABC) superfamily.

Names & Taxonomy

Protein names
Recommended name: Thiamine-binding periplasmic protein
Gene names
Name: thiB
Synonyms: tbpA, yabL
Ordered Locus Names: b0068, JW0067
Organism: Escherichia coli (strain K12)
Taxonomic identifier: 83333 [NCBI]
Taxonomic lineage: Bacteria > Proteobacteria > Gammaproteobacteria > Enterobacteriales > Enterobacteriaceae > Escherichia
Proteomes
  • UP000000318 Component: Chromosome
  • UP000000625 Component: Chromosome

Organism-specific databases

EcoGene: EG11574. tbpA.

Subcellular location

Keywords - Cellular component

Periplasm

PTM / Processing

Molecule processing

Feature key      Position(s)   Length   Description                            Feature identifier
Signal peptide   1 – 18        18
Chain            19 – 327      309      Thiamine-binding periplasmic protein   PRO_0000031705

Proteomic databases

EPD: P31550.
PaxDb: P31550.
PRIDE: P31550.

Interaction

Subunit structure

The complex is composed of two ATP-binding proteins (ThiQ), two transmembrane proteins (ThiP) and a solute-binding protein (ThiB). [Curated]

Protein-protein interaction databases

BioGrid: 4261341. 8 interactions.
DIP: DIP-10967N.
IntAct: P31550. 3 interactions.
STRING: 511145.b0068.

Chemistry

BindingDB: P31550.

Structure

Secondary structure

Evidence for all rows: Combined sources.

Feature key   Position(s)   Length
Beta strand   21 – 26       6
Helix         28 – 31       4
Helix         37 – 45       9
Beta strand   50 – 56       7
Helix         60 – 70       11
Helix         71 – 73       3
Beta strand   77 – 83       7
Helix         84 – 86       3
Helix         87 – 93       7
Helix         103 – 105     3
Beta strand   118 – 130     13
Turn          131 – 133     3
Helix         141 – 146     6
Beta strand   153 – 156     4
Turn          158 – 160     3
Helix         162 – 175     14
Helix         176 – 178     3
Helix         179 – 187     9
Beta strand   190 – 196     7
Helix         197 – 205     9
Beta strand   210 – 215     6
Helix         218 – 226     9
Beta strand   231 – 233     3
Beta strand   240 – 250     11
Helix         256 – 266     11
Helix         269 – 272     4
Helix         275 – 278     4
Beta strand   281 – 285     5
Helix         293 – 295     3
Beta strand   300 – 303     4
Helix         307 – 325     19
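
The secondary structure rows above can be tallied to see how many residues fall into each element type. The following is a minimal Python sketch, not part of the entry itself: the `ELEMENTS` list is transcribed by hand from the rows of this entry, and the helper name `residues_per_type` is a hypothetical choice made for illustration. Positions are treated as 1-based and inclusive, so each length is `end - start + 1`.

```python
from collections import defaultdict

# (type, start, end) transcribed from the secondary structure rows of this
# entry; coordinates are 1-based and inclusive.
ELEMENTS = [
    ("strand", 21, 26), ("helix", 28, 31), ("helix", 37, 45),
    ("strand", 50, 56), ("helix", 60, 70), ("helix", 71, 73),
    ("strand", 77, 83), ("helix", 84, 86), ("helix", 87, 93),
    ("helix", 103, 105), ("strand", 118, 130), ("turn", 131, 133),
    ("helix", 141, 146), ("strand", 153, 156), ("turn", 158, 160),
    ("helix", 162, 175), ("helix", 176, 178), ("helix", 179, 187),
    ("strand", 190, 196), ("helix", 197, 205), ("strand", 210, 215),
    ("helix", 218, 226), ("strand", 231, 233), ("strand", 240, 250),
    ("helix", 256, 266), ("helix", 269, 272), ("helix", 275, 278),
    ("strand", 281, 285), ("helix", 293, 295), ("strand", 300, 303),
    ("helix", 307, 325),
]


def residues_per_type(elements):
    """Sum the number of residues covered by each element type."""
    totals = defaultdict(int)
    for kind, start, end in elements:
        totals[kind] += end - start + 1  # inclusive range
    return dict(totals)


totals = residues_per_type(ELEMENTS)
for kind, count in sorted(totals.items()):
    print(f"{kind}: {count} residues")
```

With the rows listed in this entry, the tally comes to 131 helix, 73 beta strand and 6 turn residues out of the 327 positions of the precursor.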

3D structure databases

Entry   Method   Resolution (Å)   Chain     Positions
2QRY    X-ray    2.25             A/B/C/D   19-327

ProteinModelPortal: P31550.
SMR: P31550. Positions 19-327.

Miscellaneous databases

EvolutionaryTrace: P31550.

Family & Domains

Keywords - Domain

Signal

Phylogenomic databases

eggNOG: ENOG4105DP9. Bacteria.
        COG4143. LUCA.
HOGENOM: HOG000272499.
InParanoid: P31550.
KO: K02064.
OMA: HYMQVEV.
OrthoDB: EOG6D8BCG.
PhylomeDB: P31550.

Family and domain databases

InterPro: IPR006061. SBP_1_CS.
          IPR005948. Thi_ABC_peri-bd.
          IPR005967. ThiB_ABC_peri-bd.
TIGRFAMs: TIGR01254. sfuA. 1 hit.
          TIGR01276. thiB. 1 hit.
PROSITE: PS01037. SBP_BACTERIAL_1. 1 hit.

Sequence

Sequence status: Complete.

Sequence processing: The displayed sequence is further processed into a mature form.

P31550-1 [UniParc]

        10         20         30         40         50
MLKKCLPLLL LCTAPVFAKP VLTVYTYDSF AADWGPGPVV KKAFEADCNC
        60         70         80         90        100
ELKLVALEDG VSLLNRLRME GKNSKADVVL GLDNNLLDAA SKTGLFAKSG
       110        120        130        140        150
VAADAVNVPG GWNNDTFVPF DYGYFAFVYD KNKLKNPPQS LKELVESDQN
       160        170        180        190        200
WRVIYQDPRT STPGLGLLLW MQKVYGDDAP QAWQKLAKKT VTVTKGWSEA
       210        220        230        240        250
YGLFLKGESD LVLSYTTSPA YHILEEKKDN YAAANFSEGH YLQVEVAART
       260        270        280        290        300
AASKQPELAQ KFLQFMVSPA FQNAIPTGNW MYPVANVTLP AGFEKLTKPA
       310        320
TTLEFTPAEV AAQRQAWISE WQRAVSR
Length: 327
Mass (Da): 36,163
Last modified: November 1, 1997 - v2
Checksum: 348A5625FFC94597
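
The entry states that the displayed sequence is processed into a mature form, with a signal peptide at positions 1 – 18 and the chain at 19 – 327. As a minimal Python sketch (the names `SEQUENCE` and `feature_slice` are illustrative assumptions, not part of any UniProt tooling), the 1-based inclusive feature coordinates can be mapped onto string slices like this:

```python
# Sequence of P31550 as displayed in this entry, with the position rulers
# and the spaces between 10-residue groups removed.
SEQUENCE = (
    "MLKKCLPLLLLCTAPVFAKPVLTVYTYDSFAADWGPGPVVKKAFEADCNC"
    "ELKLVALEDGVSLLNRLRMEGKNSKADVVLGLDNNLLDAASKTGLFAKSG"
    "VAADAVNVPGGWNNDTFVPFDYGYFAFVYDKNKLKNPPQSLKELVESDQN"
    "WRVIYQDPRTSTPGLGLLLWMQKVYGDDAPQAWQKLAKKTVTVTKGWSEA"
    "YGLFLKGESDLVLSYTTSPAYHILEEKKDNYAAANFSEGHYLQVEVAART"
    "AASKQPELAQKFLQFMVSPAFQNAIPTGNWMYPVANVTLPAGFEKLTKPA"
    "TTLEFTPAEVAAQRQAWISEWQRAVSR"
)


def feature_slice(sequence: str, start: int, end: int) -> str:
    """Return residues start..end using 1-based inclusive coordinates."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError("feature coordinates fall outside the sequence")
    # Python slices are 0-based and end-exclusive, hence start - 1.
    return sequence[start - 1:end]


signal_peptide = feature_slice(SEQUENCE, 1, 18)
mature_chain = feature_slice(SEQUENCE, 19, 327)

print(len(SEQUENCE), len(signal_peptide), len(mature_chain))
```

The three lengths printed should agree with the values given in the entry (327 for the precursor, 18 for the signal peptide and 309 for the chain), which is a quick way to confirm that a copied sequence has not lost or gained residues.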

Experimental Info

Feature key         Position(s)   Length   Description
Sequence conflict   1 – 12        12       MLKKC…LLLLC → MSAPAVAV AA sequence (Ref. 1) [Curated]

Sequence databases

EMBL/GenBank/DDBJ: U09984 Genomic DNA. Translation: AAA18833.1.
                   U00096 Genomic DNA. Translation: AAC73179.1.
                   AP009048 Genomic DNA. Translation: BAB96637.2.
PIR: D64728.
RefSeq: NP_414610.1. NC_000913.3.
        WP_001301364.1. NZ_LN832404.1.

Genome annotation databases

EnsemblBacteria: AAC73179; AAC73179; b0068.
                 BAB96637; BAB96637; BAB96637.
GeneID: 946306.
KEGG: ecj:JW0067.
      eco:b0068.
PATRIC: 32115237. VBIEscCol129921_0070.

Cross-references

Organism-specific databases

EchoBASE: EB1534.
EcoGene: EG11574. tbpA.

Miscellaneous databases

EvolutionaryTrace: P31550.
PRO: P31550.

Publications

  1. "E. coli periplasmic thiamin binding protein: cloning, overexpression, purification, and characterization."
    Hollenbach A.D., Dickson K.A., Washabaugh M.W.
    Submitted (MAY-1994) to the EMBL/GenBank/DDBJ databases
    Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA], PARTIAL PROTEIN SEQUENCE, CHARACTERIZATION.
    Strain: K12 / C600 / ATCC 23724 / DSM 3925 / LMG 3041 / NCIB 10222.
  2. "Systematic sequencing of the Escherichia coli genome: analysis of the 0-2.4 min region."
    Yura T., Mori H., Nagai H., Nagata T., Ishihama A., Fujita N., Isono K., Mizobuchi K., Nakata A.
    Nucleic Acids Res. 20:3305-3308(1992)
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
    Strain: K12.
  3. Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
    Strain: K12 / MG1655 / ATCC 47076.
  4. "Highly accurate genome sequences of Escherichia coli K-12 strains MG1655 and W3110."
    Hayashi K., Morooka N., Yamamoto Y., Fujita K., Isono K., Choi S., Ohtsubo E., Baba T., Wanner B.L., Mori H., Horiuchi T.
    Mol. Syst. Biol. 2:E1-E5(2006)
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA], SEQUENCE REVISION TO 1-12.
    Strain: K12 / W3110 / ATCC 27325 / DSM 5911.
  5. "thiBPQ encodes an ABC transporter required for transport of thiamine and thiamine pyrophosphate in Salmonella typhimurium."
    Webb E., Claas K., Downs D.
    J. Biol. Chem. 273:8946-8950(1998)
    Cited for: FUNCTION.

Entry information

Entry name: THIB_ECOLI
Accession: Primary (citable) accession number: P31550
Secondary accession number(s): P75637
Entry history
Integrated into UniProtKB/Swiss-Prot: July 1, 1993
Last sequence update: November 1, 1997
Last modified: March 16, 2016
This is version 129 of the entry and version 2 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Prokaryotic Protein Annotation Program

Miscellaneous

Keywords - Technical term

3D-structure, Complete proteome, Direct protein sequencing, Reference proteome

Documents

  1. Escherichia coli
    Escherichia coli (strain K12): entries and cross-references to EcoGene
  2. PDB cross-references
    Index of Protein Data Bank (PDB) cross-references
  3. SIMILARITY comments
    Index of protein domains and families

Similar proteins

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%: UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%: UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a. seed sequence).
50%: UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.
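
The UniRef90 and UniRef50 criteria described above come down to two thresholds checked against the seed (longest) sequence. The sketch below only illustrates that membership test and is not the clustering procedure UniRef actually runs. It assumes the identity and overlap fractions have already been computed by some alignment step, and the names `THRESHOLDS` and `meets_cluster_criteria` are hypothetical.

```python
# (minimum identity, minimum overlap) relative to the seed sequence,
# taken from the descriptions in the text above.
THRESHOLDS = {
    "UniRef90": (0.90, 0.80),
    "UniRef50": (0.50, 0.80),
}


def meets_cluster_criteria(level: str, identity: float, overlap: float) -> bool:
    """Check precomputed identity/overlap fractions against a UniRef level.

    Both values are fractions between 0 and 1, measured against the seed
    sequence; how they are obtained is outside the scope of this sketch.
    """
    min_identity, min_overlap = THRESHOLDS[level]
    return identity >= min_identity and overlap >= min_overlap


# A sequence 93% identical to the seed over 85% of its length qualifies
# for UniRef90; the same identity over only 70% of the seed does not.
print(meets_cluster_criteria("UniRef90", 0.93, 0.85))
print(meets_cluster_criteria("UniRef90", 0.93, 0.70))
```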