Skip Header

Contribute Send feedback
Read comments (?) or add your own

A8A4C9 (A8A4C9_ECOHS) Unreviewed, UniProtKB/TrEMBL

Last modified December 14, 2011. Version 42. Feed History...

Clusters with 100%, 90%, 50% identity | Third-party data text xml rdf/xml gff fasta
to top of pageNames·Attributes·General annotation·Ontologies·Sequence annotation·Sequences·References·Cross-refs·Entry infoCustomize order

Names and origin

Protein namesRecommended name:
Malate synthase G HAMAP MF_00641

EC=2.3.3.9 HAMAP MF_00641
Gene names
Name:glcB HAMAP MF_00641
Ordered Locus Names:EcHS_A3148
OrganismEscherichia coli O9:H4 (strain HS) [Complete proteome] [HAMAP] EMBL ABV07383.1
Taxonomic identifier331112 [NCBI]
Taxonomic lineageBacteriaProteobacteriaGammaproteobacteriaEnterobacterialesEnterobacteriaceaeEscherichia

Protein attributes

Sequence length723 AA.
Sequence statusComplete.
Protein existenceInferred from homology

General annotation (Comments)

Catalytic activity

Acetyl-CoA + H2O + glyoxylate = (S)-malate + CoA. HAMAP MF_00641 SAAS SAAS006253

Pathway

Carbohydrate metabolism; glyoxylate cycle; (S)-malate from isocitrate: step 2/2. HAMAP MF_00641 SAAS SAAS006253

Subunit structure

Monomer By similarity. HAMAP MF_00641 SAAS SAAS006253

Subcellular location

Cytoplasm By similarity HAMAP MF_00641.

Sequence similarities

Belongs to the malate synthase family. GlcB subfamily. HAMAP MF_00641

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Sites

Active site3381Proton acceptor By similarity HAMAP MF_00641
Active site6311Proton donor By similarity HAMAP MF_00641

Sequences

Sequence LengthMass (Da)Tools
A8A4C9 [UniParc].

Last modified October 23, 2007. Version 1.
Checksum: 820F177D3FE02632

FASTA72380,489
        10         20         30         40         50         60 
MSQTITQSRL RIDANFKRFV DEEVLPGTGL DAAAFWRNFD EIVHDLAPEN RQLLAERDRI 

        70         80         90        100        110        120 
QAALDEWHRS NPGPVKDKAA YKSFLRELGY LVPQPERVTV ETTGIDSEIT SQAGPQLVVP 

       130        140        150        160        170        180 
AMNARYALNA ANARWGSLYD ALYGSDIIPQ EGAMVSGYDP QRGEQVIAWV RRFLDESLPL 

       190        200        210        220        230        240 
ENGSYQDVVA FKVVDKQLRI QLKNGKETTL RTPAQFVGYR GDAAAPTCIL LKNNGLHIEL 

       250        260        270        280        290        300 
QIDANGRIGK DDPAHINDVI VEAAISTILD CEDSVAAVDA EDKILLYRNL LGLMQGTLQE 

       310        320        330        340        350        360 
KMEKNGRQIV RKLNDDRHYT AADGSEISLH GRSLLFIRNV GHLMTIPVIW DSEGNEIPEG 

       370        380        390        400        410        420 
ILDGVMTGAI ALYDLKVQKN SRTGSVYIVK PKMHGPQEVA FANKLFTRIE TMLGMAPNTL 

       430        440        450        460        470        480 
KMGIMDEERR TSLNLRSCIA QARNRVAFIN TGFLDRTGDE MHSVMEAGPM LRKNQMKSTP 

       490        500        510        520        530        540 
WIKAYERNNV LSGLFCGLRG KAQIGKGMWA MPDLMADMYS QKGDQLRAGA NTAWVPSPTA 

       550        560        570        580        590        600 
ATLHALHYHQ TNVQSVQANI AQTEFNAEFE PLLDDLLTIP VAENANWSAQ EIQQELDNNV 

       610        620        630        640        650        660 
QGILGYVVRW VEQGIGCSKV PDIHNVALME DRATLRISSQ HIANWLRHGI LTKEQVQASL 

       670        680        690        700        710        720 
ENMAKVVDQQ NAGDPAYRPM AGNFANSCAF KAASDLIFLG VKQPNGYTEP LLHAWRLREK 


ESH 

« Hide

References

[1]"The pangenome structure of Escherichia coli: comparative genomic analysis of E. coli commensal and pathogenic isolates."
Rasko D.A., Rosovitz M.J., Myers G.S.A., Mongodin E.F., Fricke W.F., Gajer P., Crabtree J., Sebaihia M., Thomson N.R., Chaudhuri R., Henderson I.R., Sperandio V., Ravel J.
J. Bacteriol. 190:6881-6893(2008) [PubMed: 18676672] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].

Cross-references

Sequence databases

EMBL
GenBank
DDBJ
CP000802 Genomic DNA. Translation: ABV07383.1.
RefSeqYP_001459766.1. NC_009800.1.

3D structure databases

ProteinModelPortalA8A4C9.
SMRA8A4C9. Positions 1-723.
ModBaseSearch...

Protein-protein interaction databases

STRINGA8A4C9.

Protocols and materials databases

StructuralBiologyKnowledgebaseSearch...

Genome annotation databases

EnsemblBacteriaEBESCT00000054697; EBESCP00000052682; EBESCG00000053745.
GeneID5593708.
GenomeReviewsGene locus EcHS_A3148 in contig CP000802_GR.
KEGGecx:EcHS_A3148.
PATRIC18316275. VBIEscCol77814_3076.

Organism-specific databases

CMRSearch...

Phylogenomic databases

eggNOGCOG2225.
GeneTreeEBGT00050000010684.
HOGENOMHBG350005.
OMAENANWSV.
ProtClustDBPRK02999.

Family and domain databases

HAMAPMF_00641. Malate_synth_G.
[Tree]
InterProIPR011076. Malate_synth-like.
IPR023310. Malate_synth_G_beta_sub_dom.
IPR001465. Malate_synthase.
IPR006253. Malate_synthG.
[Graphical view]
Gene3DG3DSA:2.170.170.11. Malate_synth_G_beta_sub_dom. 2 hits.
KOK01638.
PANTHERPTHR21631:SF1. Malate_synthase. 1 hit.
PfamPF01274. Malate_synthase. 1 hit.
[Graphical view]
SUPFAMSSF51645. Malat_synth_like. 1 hit.
TIGRFAMsTIGR01345. Malate_syn_G. 1 hit.
ProtoNetSearch...

Entry information

Entry nameA8A4C9_ECOHS
AccessionPrimary (citable) accession number: A8A4C9
Entry history
Integrated into UniProtKB/TrEMBL: October 23, 2007
Last sequence update: October 23, 2007
Last modified: December 14, 2011
This is version 42 of the entry and version 1 of the sequence. [Complete history]
Entry statusUnreviewed (UniProtKB/TrEMBL)