Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein

Malate synthase G

Gene

glcB

Organism
Escherichia coli O139:H28 (strain E24377A / ETEC)
Status
Unreviewed-Annotation score: Annotation score: 3 out of 5-Protein inferred from homologyi

Functioni

Involved in the glycolate utilization. Catalyzes the condensation and subsequent hydrolysis of acetyl-coenzyme A (acetyl-CoA) and glyoxylate to form malate and CoA.UniRule annotationSAAS annotation

Catalytic activityi

Acetyl-CoA + H2O + glyoxylate = (S)-malate + CoA.UniRule annotationSAAS annotation

Cofactori

Mg2+UniRule annotation

Pathwayi

Sites

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Binding sitei118 – 1181Acetyl-CoA; via carbonyl oxygenUniRule annotation
Binding sitei274 – 2741Acetyl-CoAUniRule annotation
Binding sitei311 – 3111Acetyl-CoAUniRule annotation
Active sitei338 – 3381Proton acceptorUniRule annotation
Binding sitei338 – 3381GlyoxylateUniRule annotation
Metal bindingi427 – 4271MagnesiumUniRule annotation
Binding sitei427 – 4271GlyoxylateUniRule annotation
Metal bindingi455 – 4551MagnesiumUniRule annotation
Binding sitei536 – 5361Acetyl-CoA; via carbonyl oxygenUniRule annotation
Active sitei631 – 6311Proton donorUniRule annotation

GO - Molecular functioni

  1. malate synthase activity Source: UniProtKB-HAMAP
  2. metal ion binding Source: UniProtKB-KW

GO - Biological processi

  1. glyoxylate cycle Source: UniProtKB-HAMAP
  2. tricarboxylic acid cycle Source: UniProtKB-KW
Complete GO annotation...

Keywords - Molecular functioni

AcyltransferaseImported, Transferase

Keywords - Biological processi

Glyoxylate bypassUniRule annotation, Tricarboxylic acid cycleUniRule annotation

Keywords - Ligandi

MagnesiumUniRule annotationSAAS annotation, Metal-bindingUniRule annotationSAAS annotation

Enzyme and pathway databases

BioCyciECOL331111:GH7P-3416-MONOMER.
UniPathwayiUPA00703; UER00720.
UPA00703; UER00720.

Names & Taxonomyi

Protein namesi
Recommended name:
Malate synthase GUniRule annotation (EC:2.3.3.9UniRule annotation)
Gene namesi
Name:glcBUniRule annotationImported
Ordered Locus Names:EcE24377A_3435Imported
OrganismiEscherichia coli O139:H28 (strain E24377A / ETEC)Imported
Taxonomic identifieri331111 [NCBI]
Taxonomic lineageiBacteriaProteobacteriaGammaproteobacteriaEnterobacterialesEnterobacteriaceaeEscherichia
ProteomesiUP000001122: Chromosome

Subcellular locationi

Cytoplasm UniRule annotationSAAS annotation

GO - Cellular componenti

  1. cytoplasm Source: UniProtKB-SubCell
Complete GO annotation...

Keywords - Cellular componenti

CytoplasmUniRule annotationSAAS annotation

PTM / Processingi

Amino acid modifications

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Modified residuei617 – 6171Cysteine sulfenic acid (-SOH)UniRule annotation
Modified residuei688 – 6881Cysteine sulfenic acid (-SOH)UniRule annotation

Keywords - PTMi

OxidationUniRule annotation

Interactioni

Subunit structurei

Monomer.UniRule annotation

Protein-protein interaction databases

STRINGi331111.EcE24377A_3435.

Structurei

3D structure databases

ProteinModelPortaliA7ZRK3.
SMRiA7ZRK3. Positions 1-723.
ModBaseiSearch...
MobiDBiSearch...

Family & Domainsi

Region

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Regioni125 – 1262Acetyl-CoA bindingUniRule annotation
Regioni452 – 4554Glyoxylate bindingUniRule annotation

Sequence similaritiesi

Belongs to the malate synthase family. GlcB subfamily.UniRule annotation

Phylogenomic databases

eggNOGiCOG2225.
HOGENOMiHOG000220740.
KOiK01638.
OMAiIQIDAND.
OrthoDBiEOG6HJ286.

Family and domain databases

Gene3Di2.170.170.11. 2 hits.
HAMAPiMF_00641. Malate_synth_G.
InterProiIPR011076. Malate_synth-like.
IPR023310. Malate_synth_G_beta_sub_dom.
IPR001465. Malate_synthase.
IPR006253. Malate_synthG.
[Graphical view]
PfamiPF01274. Malate_synthase. 1 hit.
[Graphical view]
SUPFAMiSSF51645. SSF51645. 1 hit.
TIGRFAMsiTIGR01345. malate_syn_G. 1 hit.

Sequencei

Sequence statusi: Complete.

A7ZRK3-1 [UniParc]FASTAAdd to Basket

« Hide

        10         20         30         40         50
MSQTITQGRL RIDANFKRFV DEEVLPGTGQ DAAAFWRNFD EIVHDLAPEN
60 70 80 90 100
RQLLAERDRI QAALDEWHRS NPGPVKDKAA YKSFLRELGY LVPQPERVTV
110 120 130 140 150
ETTGIDSEIT SQAGPQLVVP AMNARYALNA ANARWGSLYD ALYGSDIIPQ
160 170 180 190 200
EGAMVSGYDP QRGEQVIAWV RRFLDESLPL ENGSYQDVVA FKVVDKQLRI
210 220 230 240 250
QLKNGKETTL RTPAQFVGYR GDAAALTCIL LKNNGLHIEL QIDANGRIGK
260 270 280 290 300
DDPAHINDVI VEAAISTILD CEDSVAAVDA EDKILLYRNL LGLMQGTLQE
310 320 330 340 350
KMEKNGRQIV RKLNDDRHYT AADGSEISLH GRSLLFIRNV GHLMTIPVIW
360 370 380 390 400
DSEGNEIPEG ILDGVMTGAI ALYDLKVQKN SRTGSVYIVK PKMHGPQEVA
410 420 430 440 450
FANKLFTRIE TMLGMAPNTL KMGIMDEERR TSLNLRSCIA QARNRVAFIN
460 470 480 490 500
TGFLDRTGDE MHSVMEAGPM LRKNQMKSTP WIKAYERNNV LSGLFCGLRG
510 520 530 540 550
KAQIGKGMWA MPDLMADMYS QKGDQLRAGA NTAWVPSPTA ATLHALHYHQ
560 570 580 590 600
TNVQSVQANI AQTEFNAEFE PLLDDLLTIP VAENANWSVE EIQQELDNNV
610 620 630 640 650
QGILGYVVRW VEQGIGCSKV PDIHNVALME DRATLRISSQ HIANWLRHGI
660 670 680 690 700
LTKEQVQASL ENMAKVVDQQ NAGDPAYRPM AGNFANSCAF KAASDLIFLG
710 720
VKQPNGYTEP LLHAWRLREK ESH
Length:723
Mass (Da):80,519
Last modified:October 23, 2007 - v1
Checksum:i4B160E959727C3F2
GO

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
CP000800 Genomic DNA. Translation: ABV18902.1.
RefSeqiWP_000084091.1. NC_009801.1.
YP_001464432.1. NC_009801.1.

Genome annotation databases

EnsemblBacteriaiABV18902; ABV18902; EcE24377A_3435.
GeneIDi5590624.
KEGGiecw:EcE24377A_3435.
PATRICi18296186. VBIEscCol31211_3708.

Cross-referencesi

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
CP000800 Genomic DNA. Translation: ABV18902.1.
RefSeqiWP_000084091.1. NC_009801.1.
YP_001464432.1. NC_009801.1.

3D structure databases

ProteinModelPortaliA7ZRK3.
SMRiA7ZRK3. Positions 1-723.
ModBaseiSearch...
MobiDBiSearch...

Protein-protein interaction databases

STRINGi331111.EcE24377A_3435.

Protocols and materials databases

Structural Biology KnowledgebaseSearch...

Genome annotation databases

EnsemblBacteriaiABV18902; ABV18902; EcE24377A_3435.
GeneIDi5590624.
KEGGiecw:EcE24377A_3435.
PATRICi18296186. VBIEscCol31211_3708.

Phylogenomic databases

eggNOGiCOG2225.
HOGENOMiHOG000220740.
KOiK01638.
OMAiIQIDAND.
OrthoDBiEOG6HJ286.

Enzyme and pathway databases

UniPathwayiUPA00703; UER00720.
UPA00703; UER00720.
BioCyciECOL331111:GH7P-3416-MONOMER.

Family and domain databases

Gene3Di2.170.170.11. 2 hits.
HAMAPiMF_00641. Malate_synth_G.
InterProiIPR011076. Malate_synth-like.
IPR023310. Malate_synth_G_beta_sub_dom.
IPR001465. Malate_synthase.
IPR006253. Malate_synthG.
[Graphical view]
PfamiPF01274. Malate_synthase. 1 hit.
[Graphical view]
SUPFAMiSSF51645. SSF51645. 1 hit.
TIGRFAMsiTIGR01345. malate_syn_G. 1 hit.
ProtoNetiSearch...

Publicationsi

  1. "The pangenome structure of Escherichia coli: comparative genomic analysis of E. coli commensal and pathogenic isolates."
    Rasko D.A., Rosovitz M.J., Myers G.S.A., Mongodin E.F., Fricke W.F., Gajer P., Crabtree J., Sebaihia M., Thomson N.R., Chaudhuri R., Henderson I.R., Sperandio V., Ravel J.
    J. Bacteriol. 190:6881-6893(2008) [PubMed] [Europe PMC] [Abstract]
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
    Strain: E24377A / ETECImported.

Entry informationi

Entry nameiA7ZRK3_ECO24
AccessioniPrimary (citable) accession number: A7ZRK3
Entry historyi
Integrated into UniProtKB/TrEMBL: October 23, 2007
Last sequence update: October 23, 2007
Last modified: February 4, 2015
This is version 66 of the entry and version 1 of the sequence. [Complete history]
Entry statusiUnreviewed (UniProtKB/TrEMBL)

Miscellaneousi

Keywords - Technical termi

Complete proteomeImported

External Data

Dasty 3

Similar proteinsi

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into Uniref entry.
90%UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a seed sequence).
50%UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.