Skip Header

Contribute Send feedback
Read comments (?) or add your own

O54666 (O54666_AMYMD) Unreviewed, UniProtKB/TrEMBL

Last modified December 14, 2011. Version 79. Feed History...

Clusters with 100%, 90%, 50% identity | Third-party data text xml rdf/xml gff fasta
to top of pageNames·Attributes·Ontologies·Sequence annotation·Sequences·References·Cross-refs·Entry infoCustomize order

Names and origin

Protein names

Rifamycin polyketide synthase, type 1 EMBL CAA11035.1
Gene names
Name:rifA EMBL AAC01710.1
OrganismAmycolatopsis mediterranei (Nocardia mediterranei) EMBL AAC01710.1
Taxonomic identifier33910 [NCBI]
Taxonomic lineageBacteriaActinobacteriaActinobacteridaeActinomycetalesPseudonocardineaePseudonocardiaceaeAmycolatopsis

Protein attributes

Sequence length4735 AA.
Sequence statusComplete.
Protein existencePredicted

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Chain1 – 631631polyketide synthase loading domain EMBL AAC01710.1
PRO_5000053932
Chain632 – 219415633-AHBA CoA ligase module 1 EMBL AAC01710.1
PRO_5000053934
Chain632 – 1054423ketoacyl synthase EMBL AAC01710.1
PRO_5000053933
Chain1147 – 1456310methylmalonyl acyltransferase EMBL AAC01710.1
PRO_5000053935
Chain1467 – 1622156dehydratase EMBL AAC01710.1
PRO_5000053936
Chain1829 – 2075247ketoreductase EMBL AAC01710.1
PRO_5000053937
Chain2104 – 217875acyl carrier protein EMBL AAC01710.1
PRO_5000053938
Chain2195 – 3171977polyketide synthase module 2 EMBL AAC01710.1
PRO_5000053940
Chain2195 – 2618424ketoacyl synthase EMBL AAC01710.1
PRO_5000053939
Chain2718 – 3022305malonyl acyltransferase EMBL AAC01710.1
PRO_5000053941
Chain3081 – 315575acyl carrier protein EMBL AAC01710.1
PRO_5000053942
Chain3172 – 47351564polyketide synthase module 3 EMBL AAC01710.1
PRO_5000053944
Chain3172 – 3596425ketoacyl synthase EMBL AAC01710.1
PRO_5000053943
Chain3697 – 4011315methylmalonyl acyltransferase EMBL AAC01710.1
PRO_5000053945
Chain4307 – 4548242ketoreductase EMBL AAC01710.1
PRO_5000053946
Chain4580 – 465475acyl carrier protein EMBL AAC01710.1
PRO_5000053947

Sequences

Sequence LengthMass (Da)Tools
O54666 [UniParc].

Last modified June 1, 1998. Version 1.
Checksum: 0B717178FB68C39C

FASTA4,735495,219
        10         20         30         40         50         60 
MRTDLIKPLH VALLENATRF AGKPAFADDH RTVTYGDLEA RTRRLAGHLA GLGVRHGDRV 

        70         80         90        100        110        120 
AICLGNRVST VESYFAILRA GAVGVPLNPG SATAELEHPL TDSGATVVVT DAAQAARLRL 

       130        140        150        160        170        180 
APHVELLVTG DDVPEGAHSY DELALSEPAE PAADDLELDE PAWMFYTSGT TGRPKGVVST 

       190        200        210        220        230        240 
QRNCLWSVAS CYVPFPGLSD QDRVLWPLPL FHSLSHIACV LSATVVGASV RIADGSSADD 

       250        260        270        280        290        300 
VMRLIEAESS TFLAGVPTTY HHLVRAARQR GFSAPSLRIG LAGGAVLGAG LRSEFEETFG 

       310        320        330        340        350        360 
VPLIDAYGST ETCGAITMNP PDGARVEGSC GLAVPGVDVR VVDPDTGLDV PAGEEGEVWV 

       370        380        390        400        410        420 
SGPNVMLGYH NSPEATAAAM RDGWFRTGDL ARRDDAGYFT ICGRIKELII RGGANIHPGE 

       430        440        450        460        470        480 
VEAVLRTVDG VADAAVGGVP HDTLGEVPVA YVIPGPTGFD PAALIEKCRE QLSAYKVPDR 

       490        500        510        520        530        540 
ILEVAHIPRT ASGKIRRGLL TDEPAQLRYA ATEHEEQSRH ADESVAAALR ARLSGLDERA 

       550        560        570        580        590        600 
QCELLEDLVR TQAADVLGQP VPDGRAFRDL GFTSLAIVEL RNRLTEHTGL WLPASAVFDH 

       610        620        630        640        650        660 
PTPAALAARV RAELLGITQA VAEPVVAADP GEPIAIVGMA CRLPGGVASP EDLWRLVAER 

       670        680        690        700        710        720 
VDAVSEFPGD RGWDLDSLID PDRERAGTSY VGQGGFLHDA GEFDAGFFGI SPREAVAMDP 

       730        740        750        760        770        780 
QQRLLLETSW EALENAGVDP IALKGTDTGV FSGLMGQGYG SGAVAPELEG FVTTGVASSV 

       790        800        810        820        830        840 
ASGRVSYVLG LEGPAVTVDT ACSSSLVAMH LAAQALRQGE CSMALAGGVT VMATPGSFVE 

       850        860        870        880        890        900 
FSRQRALAPD GRCKAFAAAA DGTGWSEGVG VVVLERLSVA RERGHRILAV LRGSAVNQDG 

       910        920        930        940        950        960 
ASNGLTAPNG LSQQRVIRRA LAAAGLAPSD VDVVEAHGTG TTLGDPIEAQ ALLATYGQER 

       970        980        990       1000       1010       1020 
KQPLWLGSLK SNIGHAQAAA GVAGVIKMVQ ALRHETLPPT LHVDKPTLEV DWSAGAIELL 

      1030       1040       1050       1060       1070       1080 
TEARAWPRNG RPRRAGVSSF GVSGTNAHLI LEEAPAEEPV AAPELPVVPL VVSARSTESL 

      1090       1100       1110       1120       1130       1140 
SGQAERLASL LEGDVSLTEV AGALVSRRAV LDERAVVVAG SREEAVTGLR ALNTAGSGTP 

      1150       1160       1170       1180       1190       1200 
GKVVWVFPGQ GTQWAGMGRE LLAESPVFAE RIAECAAALA PWIDWSLVDV LRGEGDLGRV 

      1210       1220       1230       1240       1250       1260 
DVLQPACFAV MVGLAAVWES VGVRPDAVVG HSQGEIAAAC VSGALSLEDA AKVVALRSQA 

      1270       1280       1290       1300       1310       1320 
IAAELSGRGG MASVALGEDD VVSRLVDGVE VAAVNGPSSV VIAGDAHALD ATLEILSGEG 

      1330       1340       1350       1360       1370       1380 
IRVRRVAVDY ASHTRHVEDI RDTLAETLAG ISAQAPAVPF YSTVTSEWVR DAGVLDGGYW 

      1390       1400       1410       1420       1430       1440 
YRNLRNQVRF GAAATALLEQ GHTVFVEVSA HPVTVQPLSE LTGDAIGTLR REDGGLRRLL 

      1450       1460       1470       1480       1490       1500 
ASMGELFVRG IDVDWTAMVP AAGWVDLPTY AFEHRHYWLE PAEPASAGDP LLGTVVSTPG 

      1510       1520       1530       1540       1550       1560 
SDRLTAVAQW SRRAQPWAVD GLVPNAALVE AAIRLGDLAG TPVVGELVVD APVVLPRRGS 

      1570       1580       1590       1600       1610       1620 
REVQLIVGEP GEQRRRPIEV FSREADEPWT RHAHGTLAPA AAAVPEPAAA GDATDVTVAG 

      1630       1640       1650       1660       1670       1680 
LRDADRYGIH PALLDAAVRT VVGDDLLPSV WTGVSLLASG ATAVTVTPTA TGLRLTDPAG 

      1690       1700       1710       1720       1730       1740 
QPVLTVESVR GTPFVAEQGT TDALFRVDWP EIPLPTAETA DFLPYEATSA EATLSALQAW 

      1750       1760       1770       1780       1790       1800 
LADPAETRLA VVTGDCTEPG AAAIWGLVRS AQSEHPGRIV LADLDDPAVL PAVVASGEPQ 

      1810       1820       1830       1840       1850       1860 
VRVRNGVASV PRLTRVTPRQ DARPLDPEGT VLITGGTGTL GALTARHLVT AHGVRHLVLV 

      1870       1880       1890       1900       1910       1920 
SRRGEAPELQ EELTALGASV AIAACDVADR AQLEAVLRAI PAEHPLTAVI HTAGVLDDGV 

      1930       1940       1950       1960       1970       1980 
VTELTPDRLA TVRRPKVDAA RLLDELTREA DLAAFVLFSS AAGVLGNPGQ AGYAAANAEL 

      1990       2000       2010       2020       2030       2040 
DALARQRNSL DLPAVSIAWG YWATVSGMTE HLGDADLRRN QRIGMSGLPA DEGMALLDAA 

      2050       2060       2070       2080       2090       2100 
IATGGTLVAA KFDVAALRAT AKAGGPVPPL LRGLAPLPRR AAAKTASLTE RLAGLAETEQ 

      2110       2120       2130       2140       2150       2160 
AAALLDLVRR HAAEVLGHSG AESVHSGRTF KDAGFDSLTA VELRNRLAAA TGLTLSPAMI 

      2170       2180       2190       2200       2210       2220 
FDYPKPPALA DHLRAKLFGS AANRPAEIGT AAAEEPIAIV AMACRFPGGV HSPEDLWRLV 

      2230       2240       2250       2260       2270       2280 
ADGADAVTEF PADRGWDTDR LYHEDPDHEG TTYVRHGAFL DDAAGFDAAF FGISPNEALA 

      2290       2300       2310       2320       2330       2340 
MDPQQRLLLE TSWELFERAA IDPTTLAGQD IGVFAGVNSH DYSMRMHRAA GVEGFRLTGG 

      2350       2360       2370       2380       2390       2400 
SASVLSGRVA YHFGVEGPAV TVDTACSSSL VALHMAVQAL QRGECSMALA GGVMVMGTVE 

      2410       2420       2430       2440       2450       2460 
TFVEFSRQRG LAPDGRCKAF ADGADGTGWS EGVGLLLVER LSEAQRRGHQ VLAVVRGSAV 

      2470       2480       2490       2500       2510       2520 
NSDGASNGLT APNGPSQQRV IRKALAAAGL STSDVDAVEA HGTGTTLGDP IEAEALLATY 

      2530       2540       2550       2560       2570       2580 
GQNRETPLWL GSVKSNLGHT QAAAGVAGVI KMVMAMRHGV LPRTLHVDRP SSYVDWSAGA 

      2590       2600       2610       2620       2630       2640 
VELLTEARDW VSNGHPRRAG VSSFGIGGTN AHVVLEEVAA PITTPQPEPA EFLVPVLVSA 

      2650       2660       2670       2680       2690       2700 
RTAAGLRGQA GRLAAFLGDR TDVRVPDAAY ALATTRAQLD HRAVVLASDR AQLCADLAAF 

      2710       2720       2730       2740       2750       2760 
GSGVVTGTPV DGKLAVLFTG QGSQWAGMGR ELAETFPVFR DAFEAACEAV DTHLRERPLR 

      2770       2780       2790       2800       2810       2820 
EVVFDDSALL DQTMYTQGAL FAVETALFRL FESWGVRPGL LAGHSIGELA AAHVSGVLDL 

      2830       2840       2850       2860       2870       2880 
ADAGELVAAR GRLMQALPAG GAMVAVQATE DEVAPLLDGT VCVAAVNGPD SVVLSGTEAA 

      2890       2900       2910       2920       2930       2940 
VLAVADELAG RGRKTRRLAV SHAFHSPLME PMLDDFRAVA ERLTYRAGSL PVVSTLTGEL 

      2950       2960       2970       2980       2990       3000 
AALDSPDYWV GQVRNAVRFS DAVTALGAQG ASTFLELGPG GALAAMALGT LGGPEQSCVA 

      3010       3020       3030       3040       3050       3060 
TLRKNGAEVP DVLTALAELH VRGVGVDWTT VLDEPATAVG TVLPTYAFQH QRFWVDVDET 

      3070       3080       3090       3100       3110       3120 
AAVSVTPPPA EPIVDRPVQD VLELVRESAA VVLGHRDAGS FDLDRSFKDH GFDSLSAVKL 

      3130       3140       3150       3160       3170       3180 
RNRLRDFTGV ELPSTLIFDY PNPAVLADHL RAELLGERPA APAPVTRDVS DEPIAIVGMS 

      3190       3200       3210       3220       3230       3240 
TRLPGGADSP EELWKLVAEG RDAVSGFPVD RGWDLDGLYH PDPAHAGTSY TRSGGFLHDA 

      3250       3260       3270       3280       3290       3300 
AQFDAGLFGI SPREALAMDP QQRLLLETSW EALERAGVDP LSARGSDVGV FTGIVHHDYV 

      3310       3320       3330       3340       3350       3360 
TRLREVPEDV QGYTMTGTAS SVASGRVAYV FGFEGPAVTV DTACSSSLVA MHLAAQALRQ 

      3370       3380       3390       3400       3410       3420 
GECSMALAGG ATVMASPDAF LEFSRQRGLS ADGRCKAYAE GADGTGWAEG VGVVVLERLS 

      3430       3440       3450       3460       3470       3480 
VARERGHRVL AVLRGSAVNQ DGASNGLTAP NGPSQQRVIR GALASAGLAP SDVDVVEGHG 

      3490       3500       3510       3520       3530       3540 
TGTALGDPIE VQALLATYGQ EREQPLWLGS LKSNLGHTQA AAGVVGVIKM IMAMRHGVMP 

      3550       3560       3570       3580       3590       3600 
ATLHVDERTS QVDWSAGAIE VLTEAREWPR TGRPRRAGVS SFGASGTNAH LIIEEGPAEE 

      3610       3620       3630       3640       3650       3660 
AVDEEVASVV PLVVSARSAG SLAGQAGRLA AVLENESLAG VAGALVSGRA TLNERAVVIA 

      3670       3680       3690       3700       3710       3720 
GSRDEAQDGL QALARGENAP GVVTGTAGKP GKVVWVFPGQ GSQWMGMGRD LLDSSPVFAA 

      3730       3740       3750       3760       3770       3780 
RIKECAAALE QWTDWSLLDV LRGDADLLDR VDVVQPASFA MMVGLAAVWT SLGVTPDAVL 

      3790       3800       3810       3820       3830       3840 
GHSQGEIAAA CVSGALSLDD AAKVVALRSQ AIAGELAGRG GMASVALSEE DAVARLTPWA 

      3850       3860       3870       3880       3890       3900 
NRVEVAAVNS PSSVVIAGDA QALDEALEAL AGDGVRVRRV AVDYASHTRH VEAIAETLAK 

      3910       3920       3930       3940       3950       3960 
TLAGIDARVP AIPFYSTVLG TWIEQAVVDA GYWYRNLRQQ VRFGPSVADL AGLGHTVFVE 

      3970       3980       3990       4000       4010       4020 
ISAHPVLVQP LSEISDDAVV TGSLRRDDGG LRRLLASAAE LYVRGVAVDW TAAVPAAGWV 

      4030       4040       4050       4060       4070       4080 
DLPTYAFDRR HFWLHEAETA EAAEGMDGEF WTAIEQSDVD SLAELLELVP EQRGALSTVV 

      4090       4100       4110       4120       4130       4140 
PVLAQWRDRR RERSTAEKLR YQVTWQPLER EAAGVPGGRW LAVVPAGTTD ALLKELTGQG 

      4150       4160       4170       4180       4190       4200 
LDIVRLEIEE ASRAQLAEQL RNVLAEHDLT GVLSLLALDG GPADAAEITA STLALVQALG 

      4210       4220       4230       4240       4250       4260 
DTTTSAPLWC LTSGAVNIGI QDAVTAPAQA AVWGLGRAVA LERLDRWGGL VDLPAAIDAR 

      4270       4280       4290       4300       4310       4320 
TAQALLGVLN GAAGEDQLAV RRSGVYRRRL VRKPVPESAT SRWEPRGTVL VTGGAEGLGR 

      4330       4340       4350       4360       4370       4380 
HASVWLAQSG AERLIVTGTD GVDELTAELA EFGTTVEFCA DTDRDAIAQL VADSEVTAVV 

      4390       4400       4410       4420       4430       4440 
HAADIAQTSS VDDTGVADLD EVFAAKVTTA VWLDQLFEDT PLDAFVVFSS IAGIWGGGGQ 

      4450       4460       4470       4480       4490       4500 
GPAGAANAVL DALVEWRRAR GLKATSIAWG ALDQIGIGMD EAALAQLRRR GVIPMAPPLA 

      4510       4520       4530       4540       4550       4560 
VTAMVQAVAG NEKAVAVADM DWAAFIPAFT SVRPSPLFAD LPEAKAILRA AQDDGEDGDT 

      4570       4580       4590       4600       4610       4620 
ASSLADSLRA VPDAEQNRIL LKLVRGHAST VLGHSGAEGI GPRQAFQEVG FDSLAAVNLR 

      4630       4640       4650       4660       4670       4680 
NSLHAATGLR LPATLIFDYP TPEALVGYLR VELLREADDG LDGREDDLRR VLAAVPFARF 

      4690       4700       4710       4720       4730 
KEAGVLDTLL GLADTGTEPG TDAETTEAAP AADDAELIDA LDISGLVQRA LGQTS 

« Hide

References

[1]"Biosynthesis of the ansamycin antibiotic rifamycin: deductions from the molecular analysis of the rif biosynthetic gene cluster of Amycolatopsis mediterranei S699."
August P.R., Tang L., Yoon Y.J., Ning S., Muller R., Yu T.W., Taylor M., Hoffmann D., Kim C.G., Zhang X., Hutchinson C.R., Floss H.G.
Chem. Biol. 5:69-79(1998) [PubMed: 9512878] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE.
Strain: S699 EMBL AAC01710.1.
[2]"Mutational analysis and reconstituted expression of the biosynthetic genes involved in the formation of 3-amino-5-hydroxybenzoic acid, the starter unit of rifamycin biosynthesis in amycolatopsis Mediterranei S699."
Yu T.W., Muller R., Muller M., Zhang X., Draeger G., Kim C.G., Leistner E., Floss H.G.
J. Biol. Chem. 276:12546-12555(2001) [PubMed: 11278540] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE.
Strain: S699 EMBL AAC01710.1.
[3]"3-amino-5-hydroxybenzoic acid synthase, the terminal enzyme in the formation of the precursor of mC7N units in rifamycin and related antibiotics."
Kim C.G., Yu T.W., Fryhle C.B., Handa S., Floss H.G.
J. Biol. Chem. 273:6030-6040(1998) [PubMed: 9497318] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE.
Strain: S699 EMBL AAC01710.1.
[4]"Rifamycin insusceptibility: exploring the rif gene cluster of Amycolatopsis mediterranei S699."
Yu T.-W., Pogosova-Agadjanyan E.L., Kuan L.-Y., Bai L., Tin A.M., Adman E., Floss H.G.
Submitted (JAN-1998) to the EMBL/GenBank/DDBJ databases
Cited for: NUCLEOTIDE SEQUENCE.
Strain: S699 EMBL AAC01710.1.
[5]"Cloning and sequence analysis of the putative rifamycin polyketide synthase gene cluster from Amycolatopsis mediterranei."
Schupp T., Toupet C., Engel N., Goff S.
Submitted (DEC-1997) to the EMBL/GenBank/DDBJ databases
Cited for: NUCLEOTIDE SEQUENCE.
Strain: LBG A3136 EMBL CAA11035.1.
[6]Yu T., August P.R., Tang L., Yoon Y.J., Ning S., Mueller R., Taylor M., Kim C., Zhang X., Pogosova-Agadjanyan E.L., Tin A.M., Hutchinson C.R., Floss H.G.
Submitted (JAN-2003) to the EMBL/GenBank/DDBJ databases
Cited for: NUCLEOTIDE SEQUENCE.
Strain: S699 EMBL AAC01710.1.

Cross-references

Sequence databases

EMBL
GenBank
DDBJ
AF040570 Genomic DNA. Translation: AAC01710.1.
AJ223012 Genomic DNA. Translation: CAA11035.1.
PIRT17463.

3D structure databases

HSSPHSSP built from PDB template 1LCI based on UniProtKB P08659.
ProteinModelPortalO54666.
SMRO54666. Positions 626-1480, 3168-4035, 4562-4653.
ModBaseSearch...

Protocols and materials databases

StructuralBiologyKnowledgebaseSearch...

Family and domain databases

InterProIPR001227. Ac_transferase_dom.
IPR009081. Acyl_carrier_prot-like.
IPR014043. Acyl_transferase.
IPR016035. Acyl_Trfase/lysoPLipase.
IPR020845. AMP-binding_CS.
IPR000873. AMP-dep_Synth/Lig.
IPR000794. Beta-ketoacyl_synthase.
IPR018201. Ketoacyl_synth_AS.
IPR014031. Ketoacyl_synth_C.
IPR014030. Ketoacyl_synth_N.
IPR016036. Malonyl_transacylase_ACP-bd.
IPR016040. NAD(P)-bd_dom.
IPR006163. Phsphopanteth-bd.
IPR020842. PKS/FAS_KR.
IPR020801. PKS_acyl_transferase.
IPR020841. PKS_Beta-ketoAc_synthase_dom.
IPR020807. PKS_dehydratase.
IPR013968. PKS_KR.
IPR020806. PKS_PP-bd.
IPR006162. PPantetheine_attach_site.
IPR016039. Thiolase-like.
IPR016038. Thiolase-like_subgr.
[Graphical view]
Gene3DG3DSA:3.40.366.10. Ac_transferase_reg. 6 hits.
G3DSA:1.10.1200.10. ACP_like. 4 hits.
G3DSA:3.40.50.720. NAD(P)-bd. 2 hits.
G3DSA:3.40.47.10. Thiolase-like_subgr. 6 hits.
PANTHERPTHR11712. Ketoacyl_synth. 1 hit.
PfamPF00698. Acyl_transf_1. 3 hits.
PF00501. AMP-binding. 1 hit.
PF00109. ketoacyl-synt. 3 hits.
PF02801. Ketoacyl-synt_C. 3 hits.
PF08659. KR. 2 hits.
PF00550. PP-binding. 4 hits.
[Graphical view]
SMARTSM00827. PKS_AT. 3 hits.
SM00826. PKS_DH. 1 hit.
SM00822. PKS_KR. 2 hits.
SM00825. PKS_KS. 3 hits.
SM00823. PKS_PP. 4 hits.
[Graphical view]
SUPFAMSSF47336. ACP_like. 4 hits.
SSF52151. Acyl_Trfase/lysoPlipase. 3 hits.
SSF55048. Malonyl_transacylase_ACP-bd. 3 hits.
SSF53901. Thiolase-like. 3 hits.
PROSITEPS50075. ACP_DOMAIN. 4 hits.
PS00455. AMP_BINDING. 1 hit.
PS00606. B_KETOACYL_SYNTHASE. 3 hits.
PS00012. PHOSPHOPANTETHEINE. 2 hits.
[Graphical view]
ProtoNetSearch...

Entry information

Entry nameO54666_AMYMD
AccessionPrimary (citable) accession number: O54666
Entry history
Integrated into UniProtKB/TrEMBL: June 1, 1998
Last sequence update: June 1, 1998
Last modified: December 14, 2011
This is version 79 of the entry and version 1 of the sequence. [Complete history]
Entry statusUnreviewed (UniProtKB/TrEMBL)