Q1R725 (Q1R725_ECOUT) Unreviewed, UniProtKB/TrEMBL

Last modified December 14, 2011. Version 53.

Names and origin

Protein names
Recommended name: Malate synthase G HAMAP MF_00641
EC=2.3.3.9 HAMAP MF_00641
Gene names
Name: glcB HAMAP MF_00641
Ordered Locus Names: UTI89_C3392
Organism: Escherichia coli (strain UTI89 / UPEC) [Complete proteome] [HAMAP] EMBL ABE08839.1
Taxonomic identifier: 364106 [NCBI]
Taxonomic lineage: Bacteria > Proteobacteria > Gammaproteobacteria > Enterobacteriales > Enterobacteriaceae > Escherichia

Protein attributes

Sequence length: 723 AA.
Sequence status: Complete.
Protein existence: Inferred from homology

General annotation (Comments)

Catalytic activity

Acetyl-CoA + H2O + glyoxylate = (S)-malate + CoA. HAMAP MF_00641 SAAS SAAS006253

Pathway

Carbohydrate metabolism; glyoxylate cycle; (S)-malate from isocitrate: step 2/2. HAMAP MF_00641 SAAS SAAS006253

Subunit structure

Monomer By similarity. HAMAP MF_00641 SAAS SAAS006253

Subcellular location

Cytoplasm By similarity. HAMAP MF_00641

Sequence similarities

Belongs to the malate synthase family. GlcB subfamily. HAMAP MF_00641

Sequence annotation (Features)

Sites

Feature key    Position(s)    Length    Description
Active site    338            1         Proton acceptor By similarity. HAMAP MF_00641
Active site    631            1         Proton donor By similarity. HAMAP MF_00641
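
The two site rows can be cross-checked against the sequence shown under Sequences in this entry, reading them as position 338 and position 631, each of length 1. The following is a minimal sketch of a 1-based residue lookup; `ROWS` and `residue_at` are hypothetical names introduced here, and the two rows are copied from this entry's sequence display (the rows covering positions 301-360 and 601-660).

```python
# Rows copied from the sequence display of this entry, keyed by the
# 1-based position of their first residue (an illustrative layout).
ROWS = {
    301: "KMEKNGRQIV RKLNDDRQYT AADGSEISLH GRSLLFIRNV GHLMTIPVIW DSEGNEIPEG",
    601: "QGILGYVVRW VEQGIGCSKV PDIHNVALME DRATLRISSQ HIANWLRHGI LTKDQVQASL",
}


def residue_at(position: int) -> str:
    """Return the one-letter residue at a 1-based sequence position."""
    for start, row in ROWS.items():
        residues = row.replace(" ", "")  # drop the 10-residue group spacing
        if start <= position < start + len(residues):
            return residues[position - start]
    raise KeyError(f"position {position} is not covered by the copied rows")


print(residue_at(338))  # R
print(residue_at(631))  # D
```

Under this reading, the annotated proton acceptor site falls on an arginine and the proton donor site on an aspartate.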

Sequences

Sequence: Q1R725 [UniParc].
Last modified May 16, 2006. Version 1.
Checksum: 237C92925F79F8FB
Length: 723
Mass (Da): 80,551

        10         20         30         40         50         60 
MSQTITQGRL RIDANFKRFV DEEVLPGVEL DAAAFWHNVD EIVHDLAPEN RQLLAERDRI 

        70         80         90        100        110        120 
QAALDEWHRS NPGPVKDKAA YKSFLRELGY LVPQPDHVTV ETTGIDSEIT SQAGPQLVVP 

       130        140        150        160        170        180 
AMNARYALNA ANARWGSLYD ALYGSDIIPQ EGAMVSGYDP QRGEQVIAWV RRFLDESLPL 

       190        200        210        220        230        240 
ENGSYQDVVA FKVVDKQLRI QLKNGKETTL RTPAQFVGYR GDTAAPTCIL LKNNGLHIEL 

       250        260        270        280        290        300 
QIDANGRIGK DDPAHINDVI VEAAISTILD CEDSVAAVDA EDKILLYRNL LGLMQGTLQE 

       310        320        330        340        350        360 
KMEKNGRQIV RKLNDDRQYT AADGSEISLH GRSLLFIRNV GHLMTIPVIW DSEGNEIPEG 

       370        380        390        400        410        420 
ILDGVMTGAI ALYDLKVQKN SRTGSVYIVK PKMHGPQEVA FANKLFSRVE TMLGMAPNTL 

       430        440        450        460        470        480 
KMGIMDEERR TSLNLRSCIA QARNRVAFIN TGFLDRTGDE MHSVMEAGPM LRKNQMKSTP 

       490        500        510        520        530        540 
WIKAYERNNV LSGLFCGLRG KAQIGKGMWA MPDLMADMYS QKGDQLRAGA NTAWVPSPTA 

       550        560        570        580        590        600 
ATLHALHYHQ TNVQSVQANI AQTEFNAEFE PLLDDLLTIP VAENANWSAQ EIQQELDNNV 

       610        620        630        640        650        660 
QGILGYVVRW VEQGIGCSKV PDIHNVALME DRATLRISSQ HIANWLRHGI LTKDQVQASL 

       670        680        690        700        710        720 
ENMAKVVDQQ NAGDPAYRPM VENFANSCAF KAACDLIFLG VKQPNGYTEP LLHAWRLREK 


ENH 
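
The display above interleaves position rulers and 10-residue groups, so it cannot be pasted directly into most tools. The sketch below is one way to flatten it into a plain string and re-wrap it as FASTA. The names `DISPLAY`, `parse_display`, and `to_fasta` are hypothetical, and the FASTA header used is a minimal placeholder, not the official UniProt header format.

```python
# Sequence rows copied from this entry; the first ruler line is kept to
# show that position numbers are discarded along with the spaces.
DISPLAY = """
        10         20         30         40         50         60
MSQTITQGRL RIDANFKRFV DEEVLPGVEL DAAAFWHNVD EIVHDLAPEN RQLLAERDRI
QAALDEWHRS NPGPVKDKAA YKSFLRELGY LVPQPDHVTV ETTGIDSEIT SQAGPQLVVP
AMNARYALNA ANARWGSLYD ALYGSDIIPQ EGAMVSGYDP QRGEQVIAWV RRFLDESLPL
ENGSYQDVVA FKVVDKQLRI QLKNGKETTL RTPAQFVGYR GDTAAPTCIL LKNNGLHIEL
QIDANGRIGK DDPAHINDVI VEAAISTILD CEDSVAAVDA EDKILLYRNL LGLMQGTLQE
KMEKNGRQIV RKLNDDRQYT AADGSEISLH GRSLLFIRNV GHLMTIPVIW DSEGNEIPEG
ILDGVMTGAI ALYDLKVQKN SRTGSVYIVK PKMHGPQEVA FANKLFSRVE TMLGMAPNTL
KMGIMDEERR TSLNLRSCIA QARNRVAFIN TGFLDRTGDE MHSVMEAGPM LRKNQMKSTP
WIKAYERNNV LSGLFCGLRG KAQIGKGMWA MPDLMADMYS QKGDQLRAGA NTAWVPSPTA
ATLHALHYHQ TNVQSVQANI AQTEFNAEFE PLLDDLLTIP VAENANWSAQ EIQQELDNNV
QGILGYVVRW VEQGIGCSKV PDIHNVALME DRATLRISSQ HIANWLRHGI LTKDQVQASL
ENMAKVVDQQ NAGDPAYRPM VENFANSCAF KAACDLIFLG VKQPNGYTEP LLHAWRLREK
ENH
"""


def parse_display(text: str) -> str:
    """Keep only letters, dropping ruler digits, spaces, and newlines."""
    return "".join(ch for ch in text if ch.isalpha())


def to_fasta(header: str, sequence: str, width: int = 60) -> str:
    """Format a sequence as FASTA text with fixed-width sequence lines."""
    chunks = [sequence[i:i + width] for i in range(0, len(sequence), width)]
    return ">" + header + "\n" + "\n".join(chunks)


SEQUENCE = parse_display(DISPLAY)

# Compare against the sequence length stated in this entry (723 AA).
print(len(SEQUENCE) == 723)

# Placeholder header; only the entry name is taken from this record.
print(to_fasta("Q1R725_ECOUT", SEQUENCE).splitlines()[0])
```

Filtering on `str.isalpha` is a deliberately simple choice: it works because the display contains nothing but residue letters, digits, and whitespace.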

References

[1] "Identification of genes subject to positive selection in uropathogenic strains of Escherichia coli: a comparative genomics approach."
Chen S.L., Hung C.-S., Xu J., Reigstad C.S., Magrini V., Sabo A., Blasiar D., Bieri T., Meyer R.R., Ozersky P., Armstrong J.R., Fulton R.S., Latreille J.P., Spieth J., Hooton T.M., Mardis E.R., Hultgren S.J., Gordon J.I.
Proc. Natl. Acad. Sci. U.S.A. 103:5977-5982(2006) [PubMed: 16585510]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].

Cross-references

Sequence databases

EMBL/GenBank/DDBJ: CP000243 Genomic DNA. Translation: ABE08839.1.
RefSeq: YP_542370.1. NC_007946.1.

3D structure databases

ProteinModelPortal: Q1R725.
SMR: Q1R725. Positions 1-723.
ModBase: Search...

Protein-protein interaction databases

STRING: Q1R725.

Protocols and materials databases

StructuralBiologyKnowledgebase: Search...

Genome annotation databases

EnsemblBacteria: EBESCT00000066222; EBESCP00000063700; EBESCG00000065269.
GeneID: 3991959.
GenomeReviews: Gene locus UTI89_C3392 in contig CP000243_GR.
KEGG: eci:UTI89_C3392.
PATRIC: 18456286. VBIEscCol42261_3360.

Organism-specific databases

CMR: Search...

Phylogenomic databases

eggNOG: COG2225.
GeneTree: EBGT00050000010684.
HOGENOM: HBG350005.
OMA: ENANWSV.
ProtClustDB: PRK02999.

Family and domain databases

HAMAP: MF_00641. Malate_synth_G.
InterPro: IPR011076. Malate_synth-like.
IPR023310. Malate_synth_G_beta_sub_dom.
IPR001465. Malate_synthase.
IPR006253. Malate_synthG.
Gene3D: G3DSA:2.170.170.11. Malate_synth_G_beta_sub_dom. 2 hits.
KO: K01638.
PANTHER: PTHR21631:SF1. Malate_synthase. 1 hit.
Pfam: PF01274. Malate_synthase. 1 hit.
SUPFAM: SSF51645. Malat_synth_like. 1 hit.
TIGRFAMs: TIGR01345. Malate_syn_G. 1 hit.
ProtoNet: Search...

Entry information

Entry name: Q1R725_ECOUT
Accession: Primary (citable) accession number: Q1R725
Entry history
Integrated into UniProtKB/TrEMBL: May 16, 2006
Last sequence update: May 16, 2006
Last modified: December 14, 2011
This is version 53 of the entry and version 1 of the sequence.
Entry status: Unreviewed (UniProtKB/TrEMBL)
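
When handling identifiers like the ones in this section programmatically, it can help to distinguish the accession (Q1R725) from the entry name (Q1R725_ECOUT). The sketch below is a minimal illustration: the pattern follows the accession-number format that UniProt documents in its help pages, while `is_accession` and `split_entry_name` are hypothetical helper names introduced here.

```python
import re

# Accession pattern as documented by UniProt (6- or 10-character forms).
ACCESSION_RE = re.compile(
    r"[OPQ][0-9][A-Z0-9]{3}[0-9]"
    r"|[A-NR-Z][0-9](?:[A-Z][A-Z0-9]{2}[0-9]){1,2}"
)


def is_accession(text: str) -> bool:
    """Return True if the whole string has the shape of an accession."""
    return ACCESSION_RE.fullmatch(text) is not None


def split_entry_name(entry_name: str) -> tuple[str, str]:
    """Split an entry name into the part before and after the underscore."""
    prefix, sep, mnemonic = entry_name.partition("_")
    if not sep or not prefix or not mnemonic:
        raise ValueError(f"not an entry name: {entry_name!r}")
    return prefix, mnemonic


print(is_accession("Q1R725"))            # True
print(is_accession("Q1R725_ECOUT"))      # False
print(split_entry_name("Q1R725_ECOUT"))  # ('Q1R725', 'ECOUT')
```

For this unreviewed entry the part before the underscore is the accession itself; the sketch checks only the shape of the strings and does not confirm that a record exists.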