Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein

Baruol synthase

Gene

BARS1

Organism
Arabidopsis thaliana (Mouse-ear cress)
Status
Reviewed-Annotation score: Annotation score: 3 out of 5-Experimental evidence at protein leveli

Functioni

Converts oxidosqualene to baruol (90%) and 22 minor products.1 Publication

Catalytic activityi

(3S)-2,3-epoxy-2,3-dihydrosqualene = baruol.1 Publication

GO - Molecular functioni

  • baruol synthase activity Source: TAIR

GO - Biological processi

  • tetracyclic triterpenoid biosynthetic process Source: TAIR
Complete GO annotation...

Keywords - Molecular functioni

Isomerase

Enzyme and pathway databases

BioCyciMetaCyc:AT4G15370-MONOMER.

Names & Taxonomyi

Protein namesi
Recommended name:
Baruol synthase (EC:5.4.99.57)
Short name:
AtBARS1
Alternative name(s):
Pentacyclic triterpene synthase 2
Short name:
AtPEN2
Gene namesi
Name:BARS1
Synonyms:PEN2
Ordered Locus Names:At4g15370
ORF Names:dl3730c, FCAALL.279
OrganismiArabidopsis thaliana (Mouse-ear cress)
Taxonomic identifieri3702 [NCBI]
Taxonomic lineageiEukaryotaViridiplantaeStreptophytaEmbryophytaTracheophytaSpermatophytaMagnoliophytaeudicotyledonsGunneridaePentapetalaerosidsmalvidsBrassicalesBrassicaceaeCamelineaeArabidopsis
Proteomesi
  • UP000006548 Componenti: Chromosome 4

Organism-specific databases

TAIRiAT4G15370.

PTM / Processingi

Molecule processing

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Chaini1 – 759759Baruol synthasePRO_0000366137Add
BLAST

Proteomic databases

PaxDbiO23390.
PRIDEiO23390.

Expressioni

Gene expression databases

GenevisibleiO23390. AT.

Interactioni

Protein-protein interaction databases

STRINGi3702.AT4G15370.1.

Structurei

3D structure databases

ProteinModelPortaliO23390.
ModBaseiSearch...
MobiDBiSearch...

Family & Domainsi

Domains and Repeats

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Repeati149 – 19042PFTB 1Add
BLAST
Repeati522 – 56443PFTB 2Add
BLAST
Repeati641 – 68242PFTB 3Add
BLAST

Sequence similaritiesi

Belongs to the terpene cyclase/mutase family.Curated
Contains 3 PFTB repeats.Curated

Keywords - Domaini

Repeat

Phylogenomic databases

eggNOGiKOG0497. Eukaryota.
COG1657. LUCA.
HOGENOMiHOG000234317.
InParanoidiO23390.
KOiK16206.
OMAiIFTHEHR.
OrthoDBiEOG093611Z0.
PhylomeDBiO23390.

Family and domain databases

Gene3Di1.50.10.20. 2 hits.
InterProiIPR032696. SQ_cyclase_C.
IPR032697. SQ_cyclase_N.
IPR018333. Squalene_cyclase.
IPR002365. Terpene_synthase_CS.
IPR008930. Terpenoid_cyclase/PrenylTrfase.
[Graphical view]
PfamiPF13243. SQHop_cyclase_C. 1 hit.
PF13249. SQHop_cyclase_N. 1 hit.
[Graphical view]
SUPFAMiSSF48239. SSF48239. 2 hits.
TIGRFAMsiTIGR01787. squalene_cyclas. 1 hit.
PROSITEiPS01074. TERPENE_SYNTHASES. 1 hit.
[Graphical view]

Sequencei

Sequence statusi: Complete.

O23390-1 [UniParc]FASTAAdd to basket

« Hide

        10         20         30         40         50
MWRLRIGAKA KDNTHLFTTN NYVGRQIWEF DANAGSPEEL AEVEEARRNF
60 70 80 90 100
SNNRSRFKAS ADLLWRMQFL REKKFEQKIP RVIVEDAEKI TYEDAKTALR
110 120 130 140 150
RGLLYFTALQ ADDGHWPAEN AGSIFFNAPF VICLYITGHL EKIFTHEHRV
160 170 180 190 200
ELLRYMYNHQ NEDGGWGLHV ESPSNMFCSV INYICLRILG VEAGHDDKGS
210 220 230 240 250
ACARARKWIL DHGGATYSPL IGKAWLSVLG VYDWSGCKPI PPEFWFLPSF
260 270 280 290 300
FPVNGGTLWI YLRDIFMGLS YLYGKNFVAT STPLILQLRE EIYPEPYTNI
310 320 330 340 350
SWRQARNRCA KEDLYYPQSF LQDLFWKGVH VFSENILNRW PFNNLIRQRA
360 370 380 390 400
LRTTMELVHY HDEATRYITG GSVPKVIAVF HMLACWVEDP ESDYFKKHLA
410 420 430 440 450
RVPDFIWIGE DGLKIQSFGS QVWDTALSLH VFIDGFDDDV DEEIRSTLLK
460 470 480 490 500
GYDYLEKSQV TENPPGDYMK MFRHMAKGGW TFSDQDQGWP VSDCTAESLE
510 520 530 540 550
CCLFFESMSS EFIGKKMDVE KLYDAVDFLL YLQSDNGGIT AWQPADGKLV
560 570 580 590 600
EFIEDAVVEH EYVECTGSAI VALAQFNKQF PGYKKEEVER FITKGVKYIE
610 620 630 640 650
DLQMVDGSWY GNWGVCFIYG TFFAVRGLVA AGKCYNNCEA IRRAVRFILD
660 670 680 690 700
TQNTEGGWGE SYLSCPRKKY IPLIGNKTNV VNTGQALMVL IMGNQMKRDP
710 720 730 740 750
LPVHRAAKVL INSQMDNGDF PQQEIMGVFK MNVMLHFPTY RNMFTLWALT

HYTKALRGL
Length:759
Mass (Da):87,455
Last modified:March 3, 2009 - v2
Checksum:i620E96D9085213C1
GO

Sequence cautioni

The sequence CAB10316 differs from that shown. Reason: Erroneous gene model prediction. Curated
The sequence CAB78579 differs from that shown. Reason: Erroneous gene model prediction. Curated

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
Z97338 Genomic DNA. Translation: CAB10316.1. Sequence problems.
AL161541 Genomic DNA. Translation: CAB78579.1. Sequence problems.
CP002687 Genomic DNA. Translation: AEE83589.1.
PIRiB71418.
RefSeqiNP_193272.1. NM_117625.1.
UniGeneiAt.54340.

Genome annotation databases

EnsemblPlantsiAT4G15370.1; AT4G15370.1; AT4G15370.
GeneIDi827203.
GrameneiAT4G15370.1; AT4G15370.1; AT4G15370.
KEGGiath:AT4G15370.

Cross-referencesi

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
Z97338 Genomic DNA. Translation: CAB10316.1. Sequence problems.
AL161541 Genomic DNA. Translation: CAB78579.1. Sequence problems.
CP002687 Genomic DNA. Translation: AEE83589.1.
PIRiB71418.
RefSeqiNP_193272.1. NM_117625.1.
UniGeneiAt.54340.

3D structure databases

ProteinModelPortaliO23390.
ModBaseiSearch...
MobiDBiSearch...

Protein-protein interaction databases

STRINGi3702.AT4G15370.1.

Proteomic databases

PaxDbiO23390.
PRIDEiO23390.

Protocols and materials databases

Structural Biology KnowledgebaseSearch...

Genome annotation databases

EnsemblPlantsiAT4G15370.1; AT4G15370.1; AT4G15370.
GeneIDi827203.
GrameneiAT4G15370.1; AT4G15370.1; AT4G15370.
KEGGiath:AT4G15370.

Organism-specific databases

TAIRiAT4G15370.

Phylogenomic databases

eggNOGiKOG0497. Eukaryota.
COG1657. LUCA.
HOGENOMiHOG000234317.
InParanoidiO23390.
KOiK16206.
OMAiIFTHEHR.
OrthoDBiEOG093611Z0.
PhylomeDBiO23390.

Enzyme and pathway databases

BioCyciMetaCyc:AT4G15370-MONOMER.

Miscellaneous databases

PROiO23390.

Gene expression databases

GenevisibleiO23390. AT.

Family and domain databases

Gene3Di1.50.10.20. 2 hits.
InterProiIPR032696. SQ_cyclase_C.
IPR032697. SQ_cyclase_N.
IPR018333. Squalene_cyclase.
IPR002365. Terpene_synthase_CS.
IPR008930. Terpenoid_cyclase/PrenylTrfase.
[Graphical view]
PfamiPF13243. SQHop_cyclase_C. 1 hit.
PF13249. SQHop_cyclase_N. 1 hit.
[Graphical view]
SUPFAMiSSF48239. SSF48239. 2 hits.
TIGRFAMsiTIGR01787. squalene_cyclas. 1 hit.
PROSITEiPS01074. TERPENE_SYNTHASES. 1 hit.
[Graphical view]
ProtoNetiSearch...

Entry informationi

Entry nameiBARS1_ARATH
AccessioniPrimary (citable) accession number: O23390
Entry historyi
Integrated into UniProtKB/Swiss-Prot: March 3, 2009
Last sequence update: March 3, 2009
Last modified: September 7, 2016
This is version 98 of the entry and version 2 of the sequence. [Complete history]
Entry statusiReviewed (UniProtKB/Swiss-Prot)
Annotation programPlant Protein Annotation Program

Miscellaneousi

Keywords - Technical termi

Complete proteome, Reference proteome

Documents

  1. Arabidopsis thaliana
    Arabidopsis thaliana: entries and gene names
  2. SIMILARITY comments
    Index of protein domains and families

Similar proteinsi

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a seed sequence).
50%UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.