Skip Header

Contribute Send feedback
Read comments (?) or add your own

P07547 (ARO1_EMENI) Reviewed, UniProtKB/Swiss-Prot

Last modified April 3, 2013. Version 131. Feed History...

Clusters with 100%, 90%, 50% identity | Documents (3) | Third-party data text xml rdf/xml gff fasta
to top of pageNames·Attributes·General annotation·Ontologies·Sequence annotation·Sequences·References·Cross-refs·Entry info·DocumentsCustomize order

Names and origin

Protein namesRecommended name:
Pentafunctional AROM polypeptide

Including the following 5 domains:

  1. 3-dehydroquinate synthase
    Short name=DHQS
    EC=4.2.3.4
  2. 3-phosphoshikimate 1-carboxyvinyltransferase
    EC=2.5.1.19
    Alternative name(s):
    5-enolpyruvylshikimate-3-phosphate synthase
    Short name=EPSP synthase
    Short name=EPSPS
  3. Shikimate kinase
    Short name=SK
    EC=2.7.1.71
  4. 3-dehydroquinate dehydratase
    Short name=3-dehydroquinase
    EC=4.2.1.10
  5. Shikimate dehydrogenase
    EC=1.1.1.25
Gene names
Name:aromA
Synonyms:aroA, aroM
ORF Names:AN0708
OrganismEmericella nidulans (strain FGSC A4 / ATCC 38163 / CBS 112.46 / NRRL 194 / M139) (Aspergillus nidulans) [Reference proteome]
Taxonomic identifier227321 [NCBI]
Taxonomic lineageEukaryotaFungiDikaryaAscomycotaPezizomycotinaEurotiomycetesEurotiomycetidaeEurotialesTrichocomaceaeEmericella

Protein attributes

Sequence length1583 AA.
Sequence statusComplete.
Protein existenceEvidence at protein level

General annotation (Comments)

Function

The AROM polypeptide catalyzes 5 consecutive enzymatic reactions in prechorismate polyaromatic amino acid biosynthesis. HAMAP-Rule MF_03143

Catalytic activity

3-deoxy-D-arabino-hept-2-ulosonate 7-phosphate = 3-dehydroquinate + phosphate. HAMAP-Rule MF_03143

3-dehydroquinate = 3-dehydroshikimate + H2O. HAMAP-Rule MF_03143

Shikimate + NADP+ = 3-dehydroshikimate + NADPH. HAMAP-Rule MF_03143

ATP + shikimate = ADP + shikimate 3-phosphate. HAMAP-Rule MF_03143

Phosphoenolpyruvate + 3-phosphoshikimate = phosphate + 5-O-(1-carboxyvinyl)-3-phosphoshikimate. HAMAP-Rule MF_03143

Cofactor

Binds 2 zinc ions per subunit.

Pathway

Metabolic intermediate biosynthesis; chorismate biosynthesis; chorismate from D-erythrose 4-phosphate and phosphoenolpyruvate: step 2/7. HAMAP-Rule MF_03143

Metabolic intermediate biosynthesis; chorismate biosynthesis; chorismate from D-erythrose 4-phosphate and phosphoenolpyruvate: step 3/7.

Metabolic intermediate biosynthesis; chorismate biosynthesis; chorismate from D-erythrose 4-phosphate and phosphoenolpyruvate: step 4/7.

Metabolic intermediate biosynthesis; chorismate biosynthesis; chorismate from D-erythrose 4-phosphate and phosphoenolpyruvate: step 5/7.

Metabolic intermediate biosynthesis; chorismate biosynthesis; chorismate from D-erythrose 4-phosphate and phosphoenolpyruvate: step 6/7.

Subunit structure

Homodimer. Ref.7

Subcellular location

Cytoplasm HAMAP-Rule MF_03143.

Sequence similarities

In the N-terminal section; belongs to the dehydroquinate synthase family.

In the 2nd section; belongs to the EPSP synthase family.

In the 3rd section; belongs to the shikimate kinase family.

In the 4th section; belongs to the type-I 3-dehydroquinase family.

In the C-terminal section; belongs to the shikimate dehydrogenase family.

Sequence caution

The sequence CAA28836.1 differs from that shown. Reason: Erroneous gene model prediction.

The sequence CAA28836.1 differs from that shown. Reason: Frameshift at positions 534, 545, 708, 733 and 743.

The sequence EAA65484.1 differs from that shown. Reason: Erroneous gene model prediction.

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Chain1 – 15831583Pentafunctional AROM polypeptide HAMAP-Rule MF_03143
PRO_0000140859

Regions

Nucleotide binding44 – 463NAD HAMAP-Rule MF_03143
Nucleotide binding81 – 844NAD HAMAP-Rule MF_03143
Nucleotide binding114 – 1163NAD HAMAP-Rule MF_03143
Nucleotide binding139 – 1402NAD HAMAP-Rule MF_03143
Nucleotide binding179 – 1824NAD HAMAP-Rule MF_03143
Nucleotide binding870 – 8778ATP By similarity
Region1 – 3843843-dehydroquinate synthase HAMAP-Rule MF_03143
Region194 – 1974Substrate binding 2 HAMAP-Rule MF_03143
Region264 – 2685Substrate binding 2 HAMAP-Rule MF_03143
Region397 – 842446EPSP synthase HAMAP-Rule MF_03143
Region863 – 1055193Shikimate kinase HAMAP-Rule MF_03143
Region1056 – 12762213-dehydroquinase HAMAP-Rule MF_03143
Region1289 – 1583295Shikimate dehydrogenase HAMAP-Rule MF_03143

Sites

Active site2601Proton acceptor; for 3-dehydroquinate synthase activity
Active site2751Proton acceptor; for 3-dehydroquinate synthase activity
Active site8241For EPSP synthase activity Potential
Active site11791Proton acceptor; for 3-dehydroquinate dehydratase activity By similarity
Active site12071Schiff-base intermediate with substrate; for 3-dehydroquinate dehydratase activity By similarity
Metal binding1941Zinc; catalytic
Metal binding2711Zinc; catalytic
Metal binding2871Zinc; catalytic
Binding site1191NAD
Binding site1301Substrate 1
Binding site1461Substrate 2
Binding site1521Substrate 2
Binding site1611NAD
Binding site1621Substrate 2
Binding site1901NAD
Binding site2501Substrate 2
Binding site2711Substrate 2
Binding site2871Substrate 2
Binding site3561Substrate 2

Experimental info

Sequence conflict621A → R in CAA28836. Ref.1
Sequence conflict771A → R in CAA28836. Ref.1
Sequence conflict2261R → T in CAA28836. Ref.1
Sequence conflict2351I → T in CAA28836. Ref.1
Sequence conflict4031Q → H in CAA28836. Ref.1
Sequence conflict5351A → P in CAA28836. Ref.1
Sequence conflict6461S → C in CAA28836. Ref.1
Sequence conflict8621A → G in CAA28836. Ref.1
Sequence conflict8621A → G no nucleotide entry Ref.4
Sequence conflict9841T → S in CAA28836. Ref.1
Sequence conflict9841T → S no nucleotide entry Ref.4
Sequence conflict10481K → G no nucleotide entry Ref.4
Sequence conflict10931D → N no nucleotide entry Ref.4
Sequence conflict11541E → D in CAA28836. Ref.1
Sequence conflict11541E → D no nucleotide entry Ref.4
Sequence conflict1345 – 13495SVTIP → FRNNS in CAA28836. Ref.1

Secondary structure

....................................................................... 1583
Helix Strand Turn

Details...

Sequences

Sequence LengthMass (Da)Tools
P07547 [UniParc].

Last modified May 26, 2009. Version 3.
Checksum: FBC8610B961840D8

FASTA1,583172,664
        10         20         30         40         50         60 
MSNPTKISIL GRESIIADFG LWRNYVAKDL ISDCSSTTYV LVTDTNIGSI YTPSFEEAFR 

        70         80         90        100        110        120 
KAAAEITPSP RLLIYNAPPG EVSKSRQTKA DIEDWMLSQN PPCGRDTVVI ALGGGVIGDL 

       130        140        150        160        170        180 
TGFVASTYMR GVRYVQVPTT LLAMVDSSIG GKTAIDTPLG KNLIGAIWQP TKIYIDLEFL 

       190        200        210        220        230        240 
ETLPVREFIN GMAEVIKTAA ISSEEEFTAL EENAETILKA VRREVRPGEH RFEGIEEILK 

       250        260        270        280        290        300 
ARILASARHK AYVVSADERE GGLRNLLNWG HSIGHAIEAI LTPQILHGEC VAIGMVKEAE 

       310        320        330        340        350        360 
LARHLGILKG VAVSRIVKCL AAYGLPTSLK DARIRKLTAG KHCSVDQLMF NMALDKKNDG 

       370        380        390        400        410        420 
PKKKIVLLSA IGTPYETRAS VVANEDIRVV LAPSIEVHPG VAQSSNVICA PPGSKSISNR 

       430        440        450        460        470        480 
ALVLAALGSG TCRIKNLLHS DDTEVMLNAL ERLGAATFSW EEEGEVLVVN GKGGNLQASS 

       490        500        510        520        530        540 
SPLYLGNAGT ASRFLTTVAT LANSSTVDSS VLTGNNRMKQ RPIGDLVDAL TANGASIEYV 

       550        560        570        580        590        600 
ERTGSLPLKI AASGGFAGGN INLAAKVSSQ YVSSLLMCAP YAKEPVTLRL VGGKPISQPY 

       610        620        630        640        650        660 
IDMTTAMMRS FGIDVQKSTT EEHTYHIPQG RYVNPAEYVI ESDASSATYP LAVAAVTGTT 

       670        680        690        700        710        720 
CTVPNIGSAS LQGDARFAVE VLRPMGCTVE QTETSTTVTG PSDGILRPLP NVDMEPMTDA 

       730        740        750        760        770        780 
FLGASVLAAI ARGKESNHTT RIYGIANQRV KECNRIKAMK DELAKFGVIC REHDDGLEID 

       790        800        810        820        830        840 
GIDRSNLRQP VGGVFCYDDH RVAFSFSVLS LVTPQPTLIL EKECVGKTWP GWWDTLRQLF 

       850        860        870        880        890        900 
KVKLEGKELE EEPVAASGPD RANASIYIIG MRGAGKSTAG NWVSKALNRP FVDLDTELET 

       910        920        930        940        950        960 
VEGMTIPDII KTRGWQGFRN AELEILKRTL KERSRGYVFA CGGGVVEMPE ARKLLTDYHK 

       970        980        990       1000       1010       1020 
TKGNVLLLMR DIKKIMDFLS IDKTRPAYVE DMMGVWLRRK PWFQECSNIQ YYSRDASPSG 

      1030       1040       1050       1060       1070       1080 
LARASEDFNR FLQVATGQID SLSIIKEKEH SFFASLTLPD LREAGDILEE VCVGSDAVEL 

      1090       1100       1110       1120       1130       1140 
RVDLLKDPAS NNDIPSVDYV VEQLSFLRSR VTLPIIFTIR TQSQGGRFPD NAHDAALELY 

      1150       1160       1170       1180       1190       1200 
RLAFRSGCEF VDLEIAFPED MLRAVTEMKG FSKIIASHHD PKGELSWANM SWIKFYNKAL 

      1210       1220       1230       1240       1250       1260 
EYGDIIKLVG VARNIDDNTA LRKFKNWAAE AHDVPLIAIN MGDQGQLSRI LNGFMTPVSH 

      1270       1280       1290       1300       1310       1320 
PSLPFKAAPG QLSATEIRKG LSLMGEIKPK KFAIFGSPIS QSRSPALHNT LFAQVGLPHN 

      1330       1340       1350       1360       1370       1380 
YTRLETTNAQ DVQEFIRSPD FGGASVTIPL KLDIMPLLDE VAAEAEIIGA VNTIIPVSTG 

      1390       1400       1410       1420       1430       1440 
KNTPSRLVGR NTDWQGMILS LRKAGVYGPK RKDQEQSALV VGGGGTARAA IYALHNMGYS 

      1450       1460       1470       1480       1490       1500 
PIYIVGRTPS KLENMVSTFP SSYNIRIVES PSSFESVPHV AIGTIPADQP IDPTMRETLC 

      1510       1520       1530       1540       1550       1560 
HMFERAQEAD AEAVKAIEHA PRILLEMAYK PQVTALMRLA SDSGWKTIPG LEVLVGQGWY 

      1570       1580 
QFKYWTGISP LYESARACSS PLI 

« Hide

References

« Hide 'large scale' references
[1]"The isolation and nucleotide sequence of the complex AROM locus of Aspergillus nidulans."
Charles I.G., Keyte J.W., Brammar W.J., Smith M., Hawkins A.R.
Nucleic Acids Res. 14:2201-2213(1986) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA].
Strain: R153.
[2]"Sequencing of Aspergillus nidulans and comparative analysis with A. fumigatus and A. oryzae."
Galagan J.E., Calvo S.E., Cuomo C., Ma L.-J., Wortman J.R., Batzoglou S., Lee S.-I., Bastuerkmen M., Spevak C.C., Clutterbuck J., Kapitonov V., Jurka J., Scazzocchio C., Farman M.L., Butler J., Purcell S., Harris S., Braus G.H. expand/collapse author list , Draht O., Busch S., D'Enfert C., Bouchier C., Goldman G.H., Bell-Pedersen D., Griffiths-Jones S., Doonan J.H., Yu J., Vienken K., Pain A., Freitag M., Selker E.U., Archer D.B., Penalva M.A., Oakley B.R., Momany M., Tanaka T., Kumagai T., Asai K., Machida M., Nierman W.C., Denning D.W., Caddick M.X., Hynes M., Paoletti M., Fischer R., Miller B.L., Dyer P.S., Sachs M.S., Osmani S.A., Birren B.W.
Nature 438:1105-1115(2005) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: FGSC A4 / ATCC 38163 / CBS 112.46 / NRRL 194 / M139.
[3]"The 2008 update of the Aspergillus nidulans genome annotation: a community effort."
Wortman J.R., Gilsenan J.M., Joardar V., Deegan J., Clutterbuck J., Andersen M.R., Archer D., Bencina M., Braus G., Coutinho P., von Dohren H., Doonan J., Driessen A.J., Durek P., Espeso E., Fekete E., Flipphi M., Estrada C.G. expand/collapse author list , Geysens S., Goldman G., de Groot P.W., Hansen K., Harris S.D., Heinekamp T., Helmstaedt K., Henrissat B., Hofmann G., Homan T., Horio T., Horiuchi H., James S., Jones M., Karaffa L., Karanyi Z., Kato M., Keller N., Kelly D.E., Kiel J.A., Kim J.M., van der Klei I.J., Klis F.M., Kovalchuk A., Krasevec N., Kubicek C.P., Liu B., Maccabe A., Meyer V., Mirabito P., Miskei M., Mos M., Mullins J., Nelson D.R., Nielsen J., Oakley B.R., Osmani S.A., Pakula T., Paszewski A., Paulsen I., Pilsyk S., Pocsi I., Punt P.J., Ram A.F., Ren Q., Robellet X., Robson G., Seiboth B., van Solingen P., Specht T., Sun J., Taheri-Talesh N., Takeshita N., Ussery D., vanKuyk P.A., Visser H., van de Vondervoort P.J., de Vries R.P., Walton J., Xiang X., Xiong Y., Zeng A.P., Brandt B.W., Cornell M.J., van den Hondel C.A., Visser J., Oliver S.G., Turner G.
Fungal Genet. Biol. 46:S2-13(2009) [PubMed] [Europe PMC] [Abstract]
Cited for: GENOME REANNOTATION.
Strain: FGSC A4 / ATCC 38163 / CBS 112.46 / NRRL 194 / M139.
[4]"Nucleotide sequence encoding the biosynthetic dehydroquinase function of the penta-functional arom locus of Aspergillus nidulans."
Charles I.G., Keyte J.W., Brammar W.J., Hawkins A.R.
Nucleic Acids Res. 13:8119-8128(1985) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 843-1473.
Strain: R153.
[5]Hawkins A.R.
Submitted (OCT-1998) to the EMBL/GenBank/DDBJ databases
Cited for: SEQUENCE REVISION TO C-TERMINUS.
[6]"Comparative analysis of the QUTR transcription repressor protein and the three C-terminal domains of the pentafunctional AROM enzyme."
Lamb H.K., Moore J.D., Lakey J.H., Levett L.J., Wheeler K.A., Lago H., Coggins J.R., Hawkins A.R.
Biochem. J. 313:941-950(1996) [PubMed] [Europe PMC] [Abstract]
Cited for: IDENTIFICATION OF INTRON.
[7]"Structure of dehydroquinate synthase reveals an active site capable of multistep catalysis."
Carpenter E.P., Hawkins A.R., Frost J.W., Brown K.A.
Nature 394:299-302(1998) [PubMed] [Europe PMC] [Abstract]
Cited for: X-RAY CRYSTALLOGRAPHY (1.8 ANGSTROMS) OF 1-393 IN COMPLEX WITH NAD; ZINC AND SUBSTRATE ANALOG, SUBUNIT.
[8]"Ligand-induced conformational changes and a mechanism for domain closure in Aspergillus nidulans dehydroquinate synthase."
Nichols C.E., Ren J., Lamb H.K., Hawkins A.R., Stammers D.K.
J. Mol. Biol. 327:129-144(2003) [PubMed] [Europe PMC] [Abstract]
Cited for: X-RAY CRYSTALLOGRAPHY (2.1 ANGSTROMS) OF 1-393 IN COMPLEX WITH ADP; NAD; ZINC AND SUBSTRATE ANALOG.
[9]"Structure of the 'open' form of Aspergillus nidulans 3-dehydroquinate synthase at 1.7 A resolution from crystals grown following enzyme turnover."
Nichols C.E., Hawkins A.R., Stammers D.K.
Acta Crystallogr. D 60:971-973(2004) [PubMed] [Europe PMC] [Abstract]
Cited for: X-RAY CRYSTALLOGRAPHY (1.7 ANGSTROMS) OF 1-393.

Cross-references

Sequence databases

EMBL
GenBank
DDBJ
X05204 Genomic DNA. Translation: CAA28836.1. Sequence problems.
AACD01000010 Genomic DNA. Translation: EAA65484.1. Sequence problems.
AACD01000011 Genomic DNA. No translation available.
BN001308 Genomic DNA. Translation: CBF88944.1.
PIRBVASA1. A24962.
RefSeqXP_658312.1. XM_653220.1.

3D structure databases

PDBe
RCSB PDB
PDBj
EntryMethodResolution (Å)ChainPositionsPDBsum
1DQSX-ray1.80A/B1-393[»]
1NR5X-ray2.10A/B1-393[»]
1NRXX-ray2.90A/B1-393[»]
1NUAX-ray2.85A/B1-393[»]
1NVAX-ray2.62A/B1-393[»]
1NVBX-ray2.70A/B1-393[»]
1NVDX-ray2.51A/B1-393[»]
1NVEX-ray2.58A/B/C/D1-393[»]
1NVFX-ray2.80A/B/C1-393[»]
1SG6X-ray1.70A/B1-393[»]
ProteinModelPortalP07547.
ModBaseSearch...

Protein-protein interaction databases

STRING162425.CADANIAP00001961.

Protocols and materials databases

StructuralBiologyKnowledgebaseSearch...

Genome annotation databases

GeneID2876485.
KEGGani:AN0708.2.

Phylogenomic databases

eggNOGCOG0337.
HOGENOMHOG000007970.
OrthoDBEOG4PRWZT.

Enzyme and pathway databases

SABIO-RKP07547.
UniPathwayUPA00053; UER00085.
UPA00053; UER00086.
UPA00053; UER00087.
UPA00053; UER00088.
UPA00053; UER00089.

Family and domain databases

Gene3D3.20.20.70. 1 hit.
3.40.50.720. 1 hit.
3.65.10.10. 2 hits.
HAMAPMF_03143. Pentafunct_AroM.
InterProIPR018508. 3-dehydroquinate_DH_AS.
IPR013785. Aldolase_TIM.
IPR016037. DHQ_synth_AroB.
IPR001381. DHquinase_I.
IPR001986. Enolpyruvate_Tfrase_dom.
IPR006264. EPSP_synthase.
IPR023193. EPSP_synthase_CS.
IPR016040. NAD(P)-bd_dom.
IPR008289. Pentafunct_AroM.
IPR013792. RNA3'P_cycl/enolpyr_Trfase_a/b.
IPR013708. Shikimate_DH-bd_N.
IPR010110. Shikimate_DH_AroM-type.
IPR000623. Shikimate_kinase.
IPR023000. Shikimate_kinase_CS.
IPR006151. Shikm_DH/Glu-tRNA_Rdtase.
[Graphical view]
PANTHERPTHR21090:SF1. PTHR21090:SF1. 1 hit.
PfamPF01487. DHquinase_I. 1 hit.
PF00275. EPSP_synthase. 1 hit.
PF01488. Shikimate_DH. 1 hit.
PF08501. Shikimate_dh_N. 1 hit.
PF01202. SKI. 1 hit.
[Graphical view]
PIRSFPIRSF000514. Pentafunct_AroM. 1 hit.
PRINTSPR01100. SHIKIMTKNASE.
SUPFAMSSF55205. RNA3'_cycl/enolpyr_transf_A/B. 1 hit.
TIGRFAMsTIGR01356. aroA. 1 hit.
TIGR01357. aroB. 1 hit.
TIGR01093. aroD. 1 hit.
TIGR01809. Shik-DH-AROM. 1 hit.
PROSITEPS01028. DEHYDROQUINASE_I. 1 hit.
PS00104. EPSP_SYNTHASE_1. 1 hit.
PS00885. EPSP_SYNTHASE_2. 1 hit.
PS01128. SHIKIMATE_KINASE. 1 hit.
[Graphical view]
ProtoNetSearch...

Other

EvolutionaryTraceP07547.

Entry information

Entry nameARO1_EMENI
AccessionPrimary (citable) accession number: P07547
Secondary accession number(s): C8VRD2, Q5BFH2
Entry history
Integrated into UniProtKB/Swiss-Prot: April 1, 1988
Last sequence update: May 26, 2009
Last modified: April 3, 2013
This is version 131 of the entry and version 3 of the sequence. [Complete history]
Entry statusReviewed (UniProtKB/Swiss-Prot)
Annotation programFungal Protein Annotation Program

Relevant documents

PATHWAY comments

Index of metabolic and biosynthesis pathways

PDB cross-references

Index of Protein Data Bank (PDB) cross-references

SIMILARITY comments

Index of protein domains and families