Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein

Trehalose-phosphate synthase

Gene

otsA

Organism
Mycobacterium smegmatis (strain ATCC 700084 / mc(2)155)
Status
Reviewed-Annotation score: Annotation score: 3 out of 5-Experimental evidence at protein leveli

Functioni

Catalyzes the transfer of glucose from a nucleoside diphosphate-glucose to glucose-6-phosphate to form trehalose-6-phosphate and nucleoside diphosphate.By similarity

Catalytic activityi

Nucleoside diphosphate-glucose + D-glucose 6-phosphate = trehalose 6-phosphate + nucleoside diphosphate.

Pathwayi: trehalose biosynthesis

This protein is involved in the pathway trehalose biosynthesis, which is part of Glycan biosynthesis.
View all proteins of this organism that are known to be involved in the pathway trehalose biosynthesis and in Glycan biosynthesis.

Sites

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Binding sitei22 – 221Glucose-6-phosphateBy similarity
Binding sitei94 – 941Glucose-6-phosphateBy similarity
Sitei103 – 1031Involved in alpha anomer selectivityBy similarity
Binding sitei148 – 1481Glucose-6-phosphateBy similarity
Sitei173 – 1731Involved in alpha anomer selectivityBy similarity
Binding sitei290 – 2901NDP-glucoseBy similarity
Binding sitei295 – 2951NDP-glucoseBy similarity
Binding sitei328 – 3281Glucose-6-phosphateBy similarity

GO - Molecular functioni

GO - Biological processi

Complete GO annotation...

Keywords - Molecular functioni

Glycosyltransferase, Transferase

Enzyme and pathway databases

BioCyciMSME246196:GJ4Y-5890-MONOMER.
UniPathwayiUPA00299.

Protein family/group databases

CAZyiGT20. Glycosyltransferase Family 20.

Names & Taxonomyi

Protein namesi
Recommended name:
Trehalose-phosphate synthase (EC:2.4.1.-)
Short name:
TPS
Alternative name(s):
Trehalose-6-phosphate synthase
Gene namesi
Name:otsA
Ordered Locus Names:MSMEG_5892, MSMEI_5731
OrganismiMycobacterium smegmatis (strain ATCC 700084 / mc(2)155)
Taxonomic identifieri246196 [NCBI]
Taxonomic lineageiBacteriaActinobacteriaCorynebacterialesMycobacteriaceaeMycobacterium
Proteomesi
  • UP000006158 Componenti: Chromosome
  • UP000000757 Componenti: Chromosome

PTM / Processingi

Molecule processing

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Chaini1 – 503503Trehalose-phosphate synthasePRO_0000348910Add
BLAST

Proteomic databases

PRIDEiA0R4M9.

Interactioni

Subunit structurei

Homotetramer.By similarity

Protein-protein interaction databases

STRINGi246196.MSMEG_5892.

Structurei

3D structure databases

ProteinModelPortaliA0R4M9.
ModBaseiSearch...
MobiDBiSearch...

Family & Domainsi

Region

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Regioni42 – 432NDP-glucose bindingBy similarity
Regioni393 – 3975NDP-glucose bindingBy similarity

Sequence similaritiesi

Belongs to the glycosyltransferase 20 family.Curated

Phylogenomic databases

eggNOGiENOG4105C1K. Bacteria.
COG0380. LUCA.
HOGENOMiHOG000191477.
KOiK00697.
OMAiPEWIGKI.
OrthoDBiEOG6CCH1M.

Family and domain databases

InterProiIPR001830. Glyco_trans_20.
[Graphical view]
PfamiPF00982. Glyco_transf_20. 1 hit.
[Graphical view]

Sequencei

Sequence statusi: Complete.

A0R4M9-1 [UniParc]FASTAAdd to basket

« Hide

        10         20         30         40         50
MSPESGHETI SGTSDFVVVA NRLPVDLERL PDGTTRWKRS PGGLVTALEP
60 70 80 90 100
LLRKRRGSWI GWAGVADSDE EPIVQDGLQL HPVRLSADDV AKYYEGFSNA
110 120 130 140 150
TLWPLYHDLI VKPEYHREWW DRYVEVNRRF AEATARAAAE GATVWIQDYQ
160 170 180 190 200
LQLVPKMLRM LRPDVTIGFF LHIPFPPVEL FMQMPWRTEI VEGLLGADLV
210 220 230 240 250
GFHLPGGAQN FLVLSRRLVG ANTSRASIGV RSRFGEVQVG FRTVKVGAFP
260 270 280 290 300
ISIDSAELDG KARNRAIRQR ARQIRAELGN PRKIMLGVDR LDYTKGIDVR
310 320 330 340 350
LRALSELLEE KRIKRDDTVL VQLATPSRER VESYIAMRED IERQVGHING
360 370 380 390 400
EYGEVGHPIV HYLHRPIPRD ELIAFFVAAD VMLVTPLRDG MNLVAKEYVA
410 420 430 440 450
CRSDLGGALV LSEFTGAAAE LRQAYLVNPH DLEGVKDKIE AAVNQNPEEG
460 470 480 490 500
KRRMRALRRQ VLAHDVDRWA RSFLDALAAT GETGDSGVTG ESTPAPESDS

GSF
Length:503
Mass (Da):56,280
Last modified:January 9, 2007 - v1
Checksum:i3D4B5CE2E6DA5ACF
GO

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
CP000480 Genomic DNA. Translation: ABK71484.1.
CP001663 Genomic DNA. Translation: AFP42165.1.
RefSeqiWP_011730872.1. NZ_CP009494.1.
YP_890117.1. NC_008596.1.

Genome annotation databases

EnsemblBacteriaiABK71484; ABK71484; MSMEG_5892.
AFP42165; AFP42165; MSMEI_5731.
GeneIDi4532640.
KEGGimsb:LJ00_29135.
msg:MSMEI_5731.
msm:MSMEG_5892.
PATRICi18084027. VBIMycSme59918_5733.

Cross-referencesi

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
CP000480 Genomic DNA. Translation: ABK71484.1.
CP001663 Genomic DNA. Translation: AFP42165.1.
RefSeqiWP_011730872.1. NZ_CP009494.1.
YP_890117.1. NC_008596.1.

3D structure databases

ProteinModelPortaliA0R4M9.
ModBaseiSearch...
MobiDBiSearch...

Protein-protein interaction databases

STRINGi246196.MSMEG_5892.

Protein family/group databases

CAZyiGT20. Glycosyltransferase Family 20.

Proteomic databases

PRIDEiA0R4M9.

Protocols and materials databases

Structural Biology KnowledgebaseSearch...

Genome annotation databases

EnsemblBacteriaiABK71484; ABK71484; MSMEG_5892.
AFP42165; AFP42165; MSMEI_5731.
GeneIDi4532640.
KEGGimsb:LJ00_29135.
msg:MSMEI_5731.
msm:MSMEG_5892.
PATRICi18084027. VBIMycSme59918_5733.

Phylogenomic databases

eggNOGiENOG4105C1K. Bacteria.
COG0380. LUCA.
HOGENOMiHOG000191477.
KOiK00697.
OMAiPEWIGKI.
OrthoDBiEOG6CCH1M.

Enzyme and pathway databases

UniPathwayiUPA00299.
BioCyciMSME246196:GJ4Y-5890-MONOMER.

Family and domain databases

InterProiIPR001830. Glyco_trans_20.
[Graphical view]
PfamiPF00982. Glyco_transf_20. 1 hit.
[Graphical view]
ProtoNetiSearch...

Publicationsi

  1. Fleischmann R.D., Dodson R.J., Haft D.H., Merkel J.S., Nelson W.C., Fraser C.M.
    Submitted (OCT-2006) to the EMBL/GenBank/DDBJ databases
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
    Strain: ATCC 700084 / mc(2)155.
  2. "Interrupted coding sequences in Mycobacterium smegmatis: authentic mutations or sequencing errors?"
    Deshayes C., Perrodou E., Gallien S., Euphrasie D., Schaeffer C., Van-Dorsselaer A., Poch O., Lecompte O., Reyrat J.-M.
    Genome Biol. 8:R20.1-R20.9(2007) [PubMed] [Europe PMC] [Abstract]
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
    Strain: ATCC 700084 / mc(2)155.
  3. "Ortho-proteogenomics: multiple proteomes investigation through orthology and a new MS-based protocol."
    Gallien S., Perrodou E., Carapito C., Deshayes C., Reyrat J.-M., Van Dorsselaer A., Poch O., Schaeffer C., Lecompte O.
    Genome Res. 19:128-135(2009) [PubMed] [Europe PMC] [Abstract]
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA], IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
    Strain: ATCC 700084 / mc(2)155.

Entry informationi

Entry nameiOTSA_MYCS2
AccessioniPrimary (citable) accession number: A0R4M9
Secondary accession number(s): I7FLH5
Entry historyi
Integrated into UniProtKB/Swiss-Prot: September 2, 2008
Last sequence update: January 9, 2007
Last modified: December 9, 2015
This is version 62 of the entry and version 1 of the sequence. [Complete history]
Entry statusiReviewed (UniProtKB/Swiss-Prot)
Annotation programProkaryotic Protein Annotation Program

Miscellaneousi

Keywords - Technical termi

Complete proteome, Reference proteome

Documents

  1. PATHWAY comments
    Index of metabolic and biosynthesis pathways
  2. SIMILARITY comments
    Index of protein domains and families

Similar proteinsi

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a seed sequence).
50%UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.