Skip Header

Contribute Send feedback
Read comments (?) or add your own

B1LEE2 (B1LEE2_ECOSM) Unreviewed, UniProtKB/TrEMBL

Last modified December 14, 2011. Version 39. Feed History...

Clusters with 100%, 90%, 50% identity | Third-party data text xml rdf/xml gff fasta
to top of pageNames·Attributes·General annotation·Ontologies·Sequence annotation·Sequences·References·Cross-refs·Entry infoCustomize order

Names and origin

Protein namesRecommended name:
Malate synthase G HAMAP MF_00641

EC=2.3.3.9 HAMAP MF_00641
Gene names
Name:glcB HAMAP MF_00641
Ordered Locus Names:EcSMS35_3253
OrganismEscherichia coli (strain SMS-3-5 / SECEC) [Complete proteome] [HAMAP] EMBL ACB17661.1
Taxonomic identifier439855 [NCBI]
Taxonomic lineageBacteriaProteobacteriaGammaproteobacteriaEnterobacterialesEnterobacteriaceaeEscherichia

Protein attributes

Sequence length723 AA.
Sequence statusComplete.
Protein existenceInferred from homology

General annotation (Comments)

Catalytic activity

Acetyl-CoA + H2O + glyoxylate = (S)-malate + CoA. HAMAP MF_00641 SAAS SAAS006253

Pathway

Carbohydrate metabolism; glyoxylate cycle; (S)-malate from isocitrate: step 2/2. HAMAP MF_00641 SAAS SAAS006253

Subunit structure

Monomer By similarity. HAMAP MF_00641 SAAS SAAS006253

Subcellular location

Cytoplasm By similarity HAMAP MF_00641.

Sequence similarities

Belongs to the malate synthase family. GlcB subfamily. HAMAP MF_00641

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Sites

Active site3381Proton acceptor By similarity HAMAP MF_00641
Active site6311Proton donor By similarity HAMAP MF_00641

Sequences

Sequence LengthMass (Da)Tools
B1LEE2 [UniParc].

Last modified April 29, 2008. Version 1.
Checksum: 5429C999466BB247

FASTA72380,415
        10         20         30         40         50         60 
MSQTITQGRL RIDANFKRFV DEEVLPGVEL DAAAFWHNVD EIVHDLAPEN RQLLAERDRI 

        70         80         90        100        110        120 
QAVLDEWHRS NPGPVKDKAA YKSFLRELGY LVPQPERVTV ETTGIDSEIT SQAGPQLVVP 

       130        140        150        160        170        180 
AMNARYALNA ANARWGSLYD ALYGSDIIPQ EGAMVSGYDP QRGAQVIAWV RRFLDESLPL 

       190        200        210        220        230        240 
ENGSYQDVVA FKVVDKQLRI QLKNGKETTL RTPAQFVGYR GDAAAPTCIL LKNNGLHIEL 

       250        260        270        280        290        300 
QIDANGRIGK DDPAHINDVI VEAAISTILD CEDSVAAVDA EDKILLYRNL LGLMQGTLQE 

       310        320        330        340        350        360 
KMEKNGRQIV RKLNDDRHYT AADGSEISLH GRSLLFIRNV GHLMTIPVIW DSEGNEIPEG 

       370        380        390        400        410        420 
ILDGVMTGAI ALYDLKVQKN SRTGSVYIVK PKMHGPQEVA FANKLFTRIE TMLGMAPNTL 

       430        440        450        460        470        480 
KMGIMDEERR TSLNLRSCIA QARNRVAFIN TGFLDRTGDE MHSVMEAGPM LRKNQMKSTP 

       490        500        510        520        530        540 
WIKAYERNNV LSGLFCGLRG KAQIGKGMWA MPDLMADMYS QKGDQLRAGA NTAWVPSPTA 

       550        560        570        580        590        600 
ATLHALHYHQ TNVQSVQANI AQTEFNAEFE PLLDDLLTIP VAENANWSAQ EIQQELDNNV 

       610        620        630        640        650        660 
QGILGYVVRW VEQGIGCSKV PDIHNVALME DRATLRISSQ HIANWLRHGI LTKEQVQASL 

       670        680        690        700        710        720 
ENMAKVVDQQ NAGDPAYRPM AGNFANSSAF KAASDLIFLG VKQPNGYTEP LLHAWRLREK 


ESH 

« Hide

References

[1]"Insights into the environmental resistance gene pool from the genome sequence of the multidrug-resistant environmental isolate Escherichia coli SMS-3-5."
Fricke W.F., Wright M.S., Lindell A.H., Harkins D.M., Baker-Austin C., Ravel J., Stepanauskas R.
J. Bacteriol. 190:6779-6794(2008) [PubMed: 18708504] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].

Cross-references

Sequence databases

EMBL
GenBank
DDBJ
CP000970 Genomic DNA. Translation: ACB17661.1.
RefSeqYP_001745233.1. NC_010498.1.

3D structure databases

ProteinModelPortalB1LEE2.
SMRB1LEE2. Positions 1-723.
ModBaseSearch...

Protein-protein interaction databases

STRINGB1LEE2.

Protocols and materials databases

StructuralBiologyKnowledgebaseSearch...

Genome annotation databases

EnsemblBacteriaEBESCT00000064894; EBESCP00000062536; EBESCG00000063941.
GeneID6146304.
GenomeReviewsGene locus EcSMS35_3253 in contig CP000970_GR.
KEGGecm:EcSMS35_3253.
PATRIC18435439. VBIEscCol6161_3413.

Organism-specific databases

CMRSearch...

Phylogenomic databases

GeneTreeEBGT00050000010684.
HOGENOMHBG350005.
OMAENANWSV.
ProtClustDBPRK02999.

Family and domain databases

HAMAPMF_00641. Malate_synth_G.
[Tree]
InterProIPR011076. Malate_synth-like.
IPR023310. Malate_synth_G_beta_sub_dom.
IPR001465. Malate_synthase.
IPR006253. Malate_synthG.
[Graphical view]
Gene3DG3DSA:2.170.170.11. Malate_synth_G_beta_sub_dom. 2 hits.
KOK01638.
PANTHERPTHR21631:SF1. Malate_synthase. 1 hit.
PfamPF01274. Malate_synthase. 1 hit.
[Graphical view]
SUPFAMSSF51645. Malat_synth_like. 1 hit.
TIGRFAMsTIGR01345. Malate_syn_G. 1 hit.
ProtoNetSearch...

Entry information

Entry nameB1LEE2_ECOSM
AccessionPrimary (citable) accession number: B1LEE2
Entry history
Integrated into UniProtKB/TrEMBL: April 29, 2008
Last sequence update: April 29, 2008
Last modified: December 14, 2011
This is version 39 of the entry and version 1 of the sequence. [Complete history]
Entry statusUnreviewed (UniProtKB/TrEMBL)