Protein

Baruol synthase

Gene

BARS1

Organism
Arabidopsis thaliana (Mouse-ear cress)
Status
Reviewed. Annotation score: 3 out of 5. Experimental evidence at protein level.

Function

Converts oxidosqualene to baruol (90%) and 22 minor products. (1 publication)

Catalytic activity

(3S)-2,3-epoxy-2,3-dihydrosqualene = baruol. (1 publication)

GO - Molecular function

  1. baruol synthase activity Source: TAIR

GO - Biological process

  1. tetracyclic triterpenoid biosynthetic process Source: TAIR

Keywords - Molecular function

Isomerase

Enzyme and pathway databases

BioCyc: MetaCyc:AT4G15370-MONOMER.
Reactome: REACT_318780. Cholesterol biosynthesis.

Names & Taxonomy

Protein names
Recommended name:
Baruol synthase (EC:5.4.99.57)
Short name:
AtBARS1
Alternative name(s):
Pentacyclic triterpene synthase 2
Short name:
AtPEN2
Gene names
Name: BARS1
Synonyms: PEN2
Ordered Locus Names: At4g15370
ORF Names: dl3730c, FCAALL.279
Organism: Arabidopsis thaliana (Mouse-ear cress)
Taxonomic identifier: 3702 [NCBI]
Taxonomic lineage: Eukaryota > Viridiplantae > Streptophyta > Embryophyta > Tracheophyta > Spermatophyta > Magnoliophyta > eudicotyledons > Gunneridae > Pentapetalae > rosids > malvids > Brassicales > Brassicaceae > Camelineae > Arabidopsis
Proteomes: UP000006548. Component: Chromosome 4

Organism-specific databases

TAIR: AT4G15370.

PTM / Processing

Molecule processing

Feature key   Position(s)   Length   Description       Feature identifier
Chain         1 – 759       759      Baruol synthase   PRO_0000366137

Proteomic databases

PaxDb: O23390.
PRIDE: O23390.

Expression

Gene expression databases

Genevestigator: O23390.

Interaction

Protein-protein interaction databases

STRING: 3702.AT4G15370.1-P.

Structure

3D structure databases

ProteinModelPortal: O23390.
SMR: O23390. Positions 586-659.

Family & Domains

Domains and Repeats

Feature key   Position(s)   Length   Description
Repeat        149 – 190     42       PFTB 1
Repeat        522 – 564     43       PFTB 2
Repeat        641 – 682     42       PFTB 3
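The Length value for each feature follows from its 1-based, inclusive residue coordinates (end - start + 1). As a minimal sketch, the Python below checks this for the three PFTB repeats listed above. The `Feature` container and the `inclusive_length` helper are illustrative names introduced here, not part of any UniProt tooling.

```python
from typing import NamedTuple


class Feature(NamedTuple):
    """One row of the feature table (hypothetical container for illustration)."""
    key: str
    start: int  # first residue, 1-based
    end: int    # last residue, 1-based, inclusive
    description: str


def inclusive_length(feature: Feature) -> int:
    """Number of residues covered by a 1-based, inclusive range."""
    return feature.end - feature.start + 1


# Coordinates copied from the Domains and Repeats table.
PFTB_REPEATS = [
    Feature("Repeat", 149, 190, "PFTB 1"),
    Feature("Repeat", 522, 564, "PFTB 2"),
    Feature("Repeat", 641, 682, "PFTB 3"),
]

if __name__ == "__main__":
    for feat in PFTB_REPEATS:
        print(f"{feat.description}: {feat.start}-{feat.end} "
              f"-> {inclusive_length(feat)} residues")
```

The same helper applies to the chain feature (positions 1 to 759), which spans the full sequence length.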

Sequence similarities

Belongs to the terpene cyclase/mutase family. (Curated)
Contains 3 PFTB repeats. (Curated)

Keywords - Domain

Repeat

Phylogenomic databases

eggNOG: COG1657.
HOGENOM: HOG000234317.
InParanoid: O23390.
KO: K16206.
OMA: IFTHEHR.
PhylomeDB: O23390.

Family and domain databases

Gene3D: 1.50.10.20. 2 hits.
InterPro: IPR001330. Prenyltrans.
  IPR018333. Squalene_cyclase.
  IPR002365. Terpene_synthase_CS.
  IPR008930. Terpenoid_cyclase/PrenylTrfase.
Pfam: PF00432. Prenyltrans. 1 hit.
SUPFAM: SSF48239. SSF48239. 2 hits.
TIGRFAMs: TIGR01787. squalene_cyclas. 1 hit.
PROSITE: PS01074. TERPENE_SYNTHASES. 1 hit.

Sequence

Sequence status: Complete.

O23390-1

        10         20         30         40         50
MWRLRIGAKA KDNTHLFTTN NYVGRQIWEF DANAGSPEEL AEVEEARRNF
60 70 80 90 100
SNNRSRFKAS ADLLWRMQFL REKKFEQKIP RVIVEDAEKI TYEDAKTALR
110 120 130 140 150
RGLLYFTALQ ADDGHWPAEN AGSIFFNAPF VICLYITGHL EKIFTHEHRV
160 170 180 190 200
ELLRYMYNHQ NEDGGWGLHV ESPSNMFCSV INYICLRILG VEAGHDDKGS
210 220 230 240 250
ACARARKWIL DHGGATYSPL IGKAWLSVLG VYDWSGCKPI PPEFWFLPSF
260 270 280 290 300
FPVNGGTLWI YLRDIFMGLS YLYGKNFVAT STPLILQLRE EIYPEPYTNI
310 320 330 340 350
SWRQARNRCA KEDLYYPQSF LQDLFWKGVH VFSENILNRW PFNNLIRQRA
360 370 380 390 400
LRTTMELVHY HDEATRYITG GSVPKVIAVF HMLACWVEDP ESDYFKKHLA
410 420 430 440 450
RVPDFIWIGE DGLKIQSFGS QVWDTALSLH VFIDGFDDDV DEEIRSTLLK
460 470 480 490 500
GYDYLEKSQV TENPPGDYMK MFRHMAKGGW TFSDQDQGWP VSDCTAESLE
510 520 530 540 550
CCLFFESMSS EFIGKKMDVE KLYDAVDFLL YLQSDNGGIT AWQPADGKLV
560 570 580 590 600
EFIEDAVVEH EYVECTGSAI VALAQFNKQF PGYKKEEVER FITKGVKYIE
610 620 630 640 650
DLQMVDGSWY GNWGVCFIYG TFFAVRGLVA AGKCYNNCEA IRRAVRFILD
660 670 680 690 700
TQNTEGGWGE SYLSCPRKKY IPLIGNKTNV VNTGQALMVL IMGNQMKRDP
710 720 730 740 750
LPVHRAAKVL INSQMDNGDF PQQEIMGVFK MNVMLHFPTY RNMFTLWALT

HYTKALRGL
Length: 759
Mass (Da): 87,455
Last modified: March 3, 2009 - v2
Checksum: 620E96D9085213C1
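The sequence display above interleaves position rulers with residues grouped in blocks of ten, so it has to be flattened before the stated length can be checked. The following is a minimal Python sketch of one way to do that; `parse_sequence_block` is a hypothetical helper name, and the sample holds only the first and last rows of the display rather than the full sequence.

```python
def parse_sequence_block(block: str) -> str:
    """Join a ruler-annotated sequence display into one residue string."""
    residues = []
    for line in block.splitlines():
        tokens = line.split()
        # Skip blank lines and ruler lines, whose tokens are all position numbers.
        if not tokens or all(tok.isdigit() for tok in tokens):
            continue
        # Residue lines: drop the spaces that separate the blocks of ten.
        residues.append("".join(tokens))
    return "".join(residues)


# First and last rows of the display above (the middle rows are omitted here).
SAMPLE = """\
        10         20         30         40         50
MWRLRIGAKA KDNTHLFTTN NYVGRQIWEF DANAGSPEEL AEVEEARRNF

HYTKALRGL
"""

if __name__ == "__main__":
    seq = parse_sequence_block(SAMPLE)
    print(len(seq), seq[:10], seq[-9:])
```

Run on the complete display instead of the two-row sample, the length of the returned string can be compared against the Length value given for the entry.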

Sequence caution

The sequence CAB10316.1 differs from that shown. Reason: Erroneous gene model prediction. (Curated)
The sequence CAB78579.1 differs from that shown. Reason: Erroneous gene model prediction. (Curated)

Sequence databases

Z97338 Genomic DNA. Translation: CAB10316.1. Sequence problems.
AL161541 Genomic DNA. Translation: CAB78579.1. Sequence problems.
CP002687 Genomic DNA. Translation: AEE83589.1.
PIR: B71418.
RefSeq: NP_193272.1. NM_117625.1.
UniGene: At.54340.

Genome annotation databases

EnsemblPlants: AT4G15370.1; AT4G15370.1; AT4G15370.
GeneID: 827203.
KEGG: ath:AT4G15370.

Cross-references

Miscellaneous databases

PRO: O23390.


Publications

  1. "Analysis of 1.9 Mb of contiguous sequence from chromosome 4 of Arabidopsis thaliana."
    Bevan M., Bancroft I., Bent E., Love K., Goodman H.M., Dean C., Bergkamp R., Dirkse W., van Staveren M., Stiekema W., Drost L., Ridley P., Hudson S.-A., Patel K., Murphy G., Piffanelli P., Wedler H., Wedler E., Wambutt R., Weitzenegger T., Pohl T., Terryn N., Gielen J., Villarroel R., De Clercq R., van Montagu M., Lecharny A., Aubourg S., Gy I., Kreis M., Lao N., Kavanagh T., Hempel S., Kotter P., Entian K.-D., Rieger M., Schaefer M., Funk B., Mueller-Auer S., Silvey M., James R., Monfort A., Pons A., Puigdomenech P., Douka A., Voukelatou E., Milioni D., Hatzopoulos P., Piravandi E., Obermaier B., Hilbert H., Duesterhoeft A., Moores T., Jones J.D.G., Eneva T., Palme K., Benes V., Rechmann S., Ansorge W., Cooke R., Berger C., Delseny M., Voet M., Volckaert G., Mewes H.-W., Klosterman S., Schueller C., Chalwatzis N.
    Nature 391:485-488(1998)
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
    Strain: cv. Columbia.
  2. "Sequence and analysis of chromosome 4 of the plant Arabidopsis thaliana."
    Mayer K.F.X., Schueller C., Wambutt R., Murphy G., Volckaert G., Pohl T., Duesterhoeft A., Stiekema W., Entian K.-D., Terryn N., Harris B., Ansorge W., Brandt P., Grivell L.A., Rieger M., Weichselgartner M., de Simone V., Obermaier B., Mache R., Mueller M., Kreis M., Delseny M., Puigdomenech P., Watson M., Schmidtheini T., Reichert B., Portetelle D., Perez-Alonso M., Boutry M., Bancroft I., Vos P., Hoheisel J., Zimmermann W., Wedler H., Ridley P., Langham S.-A., McCullagh B., Bilham L., Robben J., van der Schueren J., Grymonprez B., Chuang Y.-J., Vandenbussche F., Braeken M., Weltjens I., Voet M., Bastiaens I., Aert R., Defoor E., Weitzenegger T., Bothe G., Ramsperger U., Hilbert H., Braun M., Holzer E., Brandt A., Peters S., van Staveren M., Dirkse W., Mooijman P., Klein Lankhorst R., Rose M., Hauf J., Koetter P., Berneiser S., Hempel S., Feldpausch M., Lamberth S., Van den Daele H., De Keyser A., Buysshaert C., Gielen J., Villarroel R., De Clercq R., van Montagu M., Rogers J., Cronin A., Quail M.A., Bray-Allen S., Clark L., Doggett J., Hall S., Kay M., Lennard N., McLay K., Mayes R., Pettett A., Rajandream M.A., Lyne M., Benes V., Rechmann S., Borkova D., Bloecker H., Scharfe M., Grimm M., Loehnert T.-H., Dose S., de Haan M., Maarse A.C., Schaefer M., Mueller-Auer S., Gabel C., Fuchs M., Fartmann B., Granderath K., Dauner D., Herzl A., Neumann S., Argiriou A., Vitale D., Liguori R., Piravandi E., Massenet O., Quigley F., Clabauld G., Muendlein A., Felber R., Schnabl S., Hiller R., Schmidt W., Lecharny A., Aubourg S., Chefdor F., Cooke R., Berger C., Monfort A., Casacuberta E., Gibbons T., Weber N., Vandenbol M., Bargues M., Terol J., Torres A., Perez-Perez A., Purnelle B., Bent E., Johnson S., Tacon D., Jesse T., Heijnen L., Schwarz S., Scholler P., Heber S., Francs P., Bielke C., Frishman D., Haase D., Lemcke K., Mewes H.-W., Stocker S., Zaccaria P., Bevan M., Wilson R.K., de la Bastide M., Habermann K., Parnell L., Dedhia N., Gnoj L., Schutz K., Huang E., Spiegel L., Sekhon M., Murray J., Sheet P., Cordes M., Abu-Threideh J., Stoneking T., Kalicki J., Graves T., Harmon G., Edwards J., Latreille P., Courtney L., Cloud J., Abbott A., Scott K., Johnson D., Minx P., Bentley D., Fulton B., Miller N., Greco T., Kemp K., Kramer J., Fulton L., Mardis E., Dante M., Pepin K., Hillier L.W., Nelson J., Spieth J., Ryan E., Andrews S., Geisel C., Layman D., Du H., Ali J., Berghoff A., Jones K., Drone K., Cotton M., Joshu C., Antonoiu B., Zidanic M., Strong C., Sun H., Lamar B., Yordan C., Ma P., Zhong J., Preston R., Vil D., Shekher M., Matero A., Shah R., Swaby I.K., O'Shaughnessy A., Rodriguez M., Hoffman J., Till S., Granat S., Shohdy N., Hasegawa A., Hameed A., Lodhi M., Johnson A., Chen E., Marra M.A., Martienssen R., McCombie W.R.
    Nature 402:769-777(1999)
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
    Strain: cv. Columbia.
  3. The Arabidopsis Information Resource (TAIR)
    Submitted (APR-2011) to the EMBL/GenBank/DDBJ databases
    Cited for: GENOME REANNOTATION.
    Strain: cv. Columbia.
  4. "Molecular cloning and expression in yeast of 2,3-oxidosqualene-triterpenoid cyclases from Arabidopsis thaliana."
    Husselstein-Muller T., Schaller H., Benveniste P.
    Plant Mol. Biol. 45:75-92(2001)
    Cited for: IDENTIFICATION, NOMENCLATURE.
  5. "An oxidosqualene cyclase makes numerous products by diverse mechanisms: a challenge to prevailing concepts of triterpene biosynthesis."
    Lodeiro S., Xiong Q., Wilson W.K., Kolesnikova M.D., Onak C.S., Matsuda S.P.T.
    J. Am. Chem. Soc. 129:11213-11222(2007)
    Cited for: FUNCTION, CATALYTIC ACTIVITY.

Entry information

Entry name: BARS1_ARATH
Accession: Primary (citable) accession number: O23390
Entry history
Integrated into UniProtKB/Swiss-Prot: March 3, 2009
Last sequence update: March 3, 2009
Last modified: April 29, 2015
This is version 90 of the entry and version 2 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Plant Protein Annotation Program

Miscellaneous

Keywords - Technical term

Complete proteome, Reference proteome
