A7ZRK3 (A7ZRK3_ECO24) Unreviewed, UniProtKB/TrEMBL

Last modified December 14, 2011. Version 43.

Names and origin

Protein names
Recommended name: Malate synthase G HAMAP MF_00641
EC=2.3.3.9 HAMAP MF_00641

Gene names
Name: glcB HAMAP MF_00641
Ordered Locus Names: EcE24377A_3435

Organism: Escherichia coli O139:H28 (strain E24377A / ETEC) [Complete proteome] [HAMAP] EMBL ABV18902.1
Taxonomic identifier: 331111 [NCBI]
Taxonomic lineage: Bacteria > Proteobacteria > Gammaproteobacteria > Enterobacteriales > Enterobacteriaceae > Escherichia

Protein attributes

Sequence length: 723 AA.
Sequence status: Complete.
Protein existence: Inferred from homology

General annotation (Comments)

Catalytic activity

Acetyl-CoA + H2O + glyoxylate = (S)-malate + CoA. HAMAP MF_00641 SAAS SAAS006253
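As a quick sanity check on the reaction above, the element counts on both sides can be compared. The sketch below is illustrative only. The molecular formulas are those of the neutral (fully protonated) species and are an assumption brought in for the example, not part of this entry; the names `FORMULAS`, `REACTANTS`, `PRODUCTS` and `element_totals` are made up here.

```python
from collections import Counter

# Molecular formulas of the neutral species (assumed; not taken from the entry).
FORMULAS = {
    "acetyl-CoA": {"C": 23, "H": 38, "N": 7, "O": 17, "P": 3, "S": 1},
    "H2O": {"H": 2, "O": 1},
    "glyoxylate": {"C": 2, "H": 2, "O": 3},  # written as glyoxylic acid
    "(S)-malate": {"C": 4, "H": 6, "O": 5},  # written as malic acid
    "CoA": {"C": 21, "H": 36, "N": 7, "O": 16, "P": 3, "S": 1},
}

REACTANTS = ["acetyl-CoA", "H2O", "glyoxylate"]
PRODUCTS = ["(S)-malate", "CoA"]


def element_totals(species):
    """Sum the atom counts over a list of species names."""
    total = Counter()
    for name in species:
        total.update(FORMULAS[name])
    return total


left = element_totals(REACTANTS)
right = element_totals(PRODUCTS)
print("balanced" if left == right else "not balanced")
```

Using the neutral acid forms on both sides keeps the hydrogen count comparable; with the ionized forms, charges would have to be tracked as well.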

Pathway

Carbohydrate metabolism; glyoxylate cycle; (S)-malate from isocitrate: step 2/2. HAMAP MF_00641 SAAS SAAS006253

Subunit structure

Monomer (by similarity). HAMAP MF_00641 SAAS SAAS006253

Subcellular location

Cytoplasm (by similarity). HAMAP MF_00641

Sequence similarities

Belongs to the malate synthase family. GlcB subfamily. HAMAP MF_00641

Sequence annotation (Features)

Sites

Feature key    Position(s)    Length    Description
Active site    338            1         Proton acceptor (by similarity). HAMAP MF_00641
Active site    631            1         Proton donor (by similarity). HAMAP MF_00641
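To see which residues the two annotated positions point to, the positions can be looked up in the sequence rows that contain them. This is a minimal sketch, assuming the usual 1-based numbering of the feature table; the two rows are copied from the Sequences section of this entry (residues 301-360 and 601-660), and the names `ROWS` and `residue_at` are hypothetical.

```python
# Two 60-residue rows from the Sequences section, keyed by the
# 1-based position of their first residue.
ROWS = {
    301: "KMEKNGRQIV RKLNDDRHYT AADGSEISLH GRSLLFIRNV GHLMTIPVIW DSEGNEIPEG",
    601: "QGILGYVVRW VEQGIGCSKV PDIHNVALME DRATLRISSQ HIANWLRHGI LTKEQVQASL",
}


def residue_at(position):
    """Return the one-letter code at a 1-based position covered by ROWS."""
    for start, row in ROWS.items():
        residues = row.replace(" ", "")  # drop the 10-residue block spacing
        if start <= position < start + len(residues):
            return residues[position - start]
    raise KeyError(f"position {position} is not covered by the excerpt")


for pos in (338, 631):
    print(pos, residue_at(pos))
```

The lookup only reports the letters at those positions; the roles (proton acceptor, proton donor) come from the annotation itself, which is inferred by similarity.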

Sequences

A7ZRK3 [UniParc].

Last modified October 23, 2007. Version 1.
Checksum: 4B160E959727C3F2

Length: 723 AA. Mass: 80,519 Da.
        10         20         30         40         50         60 
MSQTITQGRL RIDANFKRFV DEEVLPGTGQ DAAAFWRNFD EIVHDLAPEN RQLLAERDRI 

        70         80         90        100        110        120 
QAALDEWHRS NPGPVKDKAA YKSFLRELGY LVPQPERVTV ETTGIDSEIT SQAGPQLVVP 

       130        140        150        160        170        180 
AMNARYALNA ANARWGSLYD ALYGSDIIPQ EGAMVSGYDP QRGEQVIAWV RRFLDESLPL 

       190        200        210        220        230        240 
ENGSYQDVVA FKVVDKQLRI QLKNGKETTL RTPAQFVGYR GDAAALTCIL LKNNGLHIEL 

       250        260        270        280        290        300 
QIDANGRIGK DDPAHINDVI VEAAISTILD CEDSVAAVDA EDKILLYRNL LGLMQGTLQE 

       310        320        330        340        350        360 
KMEKNGRQIV RKLNDDRHYT AADGSEISLH GRSLLFIRNV GHLMTIPVIW DSEGNEIPEG 

       370        380        390        400        410        420 
ILDGVMTGAI ALYDLKVQKN SRTGSVYIVK PKMHGPQEVA FANKLFTRIE TMLGMAPNTL 

       430        440        450        460        470        480 
KMGIMDEERR TSLNLRSCIA QARNRVAFIN TGFLDRTGDE MHSVMEAGPM LRKNQMKSTP 

       490        500        510        520        530        540 
WIKAYERNNV LSGLFCGLRG KAQIGKGMWA MPDLMADMYS QKGDQLRAGA NTAWVPSPTA 

       550        560        570        580        590        600 
ATLHALHYHQ TNVQSVQANI AQTEFNAEFE PLLDDLLTIP VAENANWSVE EIQQELDNNV 

       610        620        630        640        650        660 
QGILGYVVRW VEQGIGCSKV PDIHNVALME DRATLRISSQ HIANWLRHGI LTKEQVQASL 

       670        680        690        700        710        720 
ENMAKVVDQQ NAGDPAYRPM AGNFANSCAF KAASDLIFLG VKQPNGYTEP LLHAWRLREK 


ESH 
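The listing above mixes position rulers, blank lines and space-separated blocks of ten residues. A small sketch of turning that layout back into a plain sequence string follows. It assumes ruler lines contain only numbers; the function name `parse_sequence_block` and the `EXCERPT` constant (the first two rows of this entry) are made up for the example.

```python
def parse_sequence_block(text):
    """Join residue blocks into one string, skipping rulers and blank lines."""
    residues = []
    for line in text.splitlines():
        tokens = line.split()
        # Ruler lines hold only position numbers; blank lines have no tokens.
        if not tokens or all(token.isdigit() for token in tokens):
            continue
        residues.append("".join(tokens))
    return "".join(residues)


# First two rows of the sequence listing (residues 1-120).
EXCERPT = """\
        10         20         30         40         50         60
MSQTITQGRL RIDANFKRFV DEEVLPGTGQ DAAAFWRNFD EIVHDLAPEN RQLLAERDRI

        70         80         90        100        110        120
QAALDEWHRS NPGPVKDKAA YKSFLRELGY LVPQPERVTV ETTGIDSEIT SQAGPQLVVP
"""

sequence = parse_sequence_block(EXCERPT)
print(len(sequence), sequence[:10])
```

Applied to the full listing, the same function should yield a string whose length can be compared with the 723 AA stated for this entry.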

References

[1] "The pangenome structure of Escherichia coli: comparative genomic analysis of E. coli commensal and pathogenic isolates."
Rasko D.A., Rosovitz M.J., Myers G.S.A., Mongodin E.F., Fricke W.F., Gajer P., Crabtree J., Sebaihia M., Thomson N.R., Chaudhuri R., Henderson I.R., Sperandio V., Ravel J.
J. Bacteriol. 190:6881-6893(2008) [PubMed: 18676672]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].

Cross-references

Sequence databases

EMBL / GenBank / DDBJ: CP000800 Genomic DNA. Translation: ABV18902.1.
RefSeq: YP_001464432.1. NC_009801.1.

3D structure databases

ProteinModelPortal: A7ZRK3.
SMR: A7ZRK3. Positions 1-723.

Protein-protein interaction databases

STRING: A7ZRK3.

Genome annotation databases

EnsemblBacteria: EBESCT00000020242; EBESCP00000019304; EBESCG00000019296.
GeneID: 5590624.
GenomeReviews: Gene locus EcE24377A_3435 in contig CP000800_GR.
KEGG: ecw:EcE24377A_3435.
NMPDR: fig|331111.3.peg.1125.
PATRIC: 18296186. VBIEscCol31211_3708.

Phylogenomic databases

eggNOG: COG2225.
GeneTree: EBGT00050000010684.
HOGENOM: HBG350005.
OMA: ENANWSV.
ProtClustDB: PRK02999.

Family and domain databases

HAMAP: MF_00641. Malate_synth_G.
InterPro: IPR011076. Malate_synth-like.
IPR023310. Malate_synth_G_beta_sub_dom.
IPR001465. Malate_synthase.
IPR006253. Malate_synthG.
Gene3D: G3DSA:2.170.170.11. Malate_synth_G_beta_sub_dom. 2 hits.
KO: K01638.
PANTHER: PTHR21631:SF1. Malate_synthase. 1 hit.
Pfam: PF01274. Malate_synthase. 1 hit.
SUPFAM: SSF51645. Malat_synth_like. 1 hit.
TIGRFAMs: TIGR01345. Malate_syn_G. 1 hit.

Entry information

Entry name: A7ZRK3_ECO24
Accession: Primary (citable) accession number: A7ZRK3
Entry history
Integrated into UniProtKB/TrEMBL: October 23, 2007
Last sequence update: October 23, 2007
Last modified: December 14, 2011
This is version 43 of the entry and version 1 of the sequence.
Entry status: Unreviewed (UniProtKB/TrEMBL)