
O23390 (BARS1_ARATH) Reviewed, UniProtKB/Swiss-Prot

Last modified April 16, 2014. Version 82.


Names and origin

Protein names
Recommended name: Baruol synthase
   Short name: AtBARS1
   EC: 5.4.99.57
Alternative name(s): Pentacyclic triterpene synthase 2
   Short name: AtPEN2
Gene names
Name: BARS1
Synonyms: PEN2
Ordered Locus Names: At4g15370
ORF Names: dl3730c, FCAALL.279
Organism: Arabidopsis thaliana (Mouse-ear cress) [Reference proteome]
Taxonomic identifier: 3702 [NCBI]
Taxonomic lineage: Eukaryota > Viridiplantae > Streptophyta > Embryophyta > Tracheophyta > Spermatophyta > Magnoliophyta > eudicotyledons > Gunneridae > Pentapetalae > rosids > malvids > Brassicales > Brassicaceae > Camelineae > Arabidopsis

Protein attributes

Sequence length: 759 AA.
Sequence status: Complete.
Protein existence: Evidence at protein level

General annotation (Comments)

Function

Converts oxidosqualene to baruol (90%) and 22 minor products. Ref.5

Catalytic activity

(3S)-2,3-epoxy-2,3-dihydrosqualene = baruol. Ref.5

Sequence similarities

Belongs to the terpene cyclase/mutase family.

Contains 3 PFTB repeats.

Sequence caution

The sequence CAB10316.1 differs from that shown. Reason: Erroneous gene model prediction.

The sequence CAB78579.1 differs from that shown. Reason: Erroneous gene model prediction.

Ontologies

Keywords
   Domain: Repeat
   Molecular function: Isomerase
   Technical term: Complete proteome, Reference proteome
Gene Ontology (GO)
   Biological_process: tetracyclic triterpenoid biosynthetic process
      Inferred from direct assay. Ref.5. Source: TAIR
   Molecular_function: baruol synthase activity
      Inferred from direct assay. Ref.5. Source: TAIR


Sequence annotation (Features)

Feature key   Position(s)   Length   Description       Feature identifier

Molecule processing

Chain         1 – 759       759      Baruol synthase   PRO_0000366137

Regions

Repeat        149 – 190     42       PFTB 1
Repeat        522 – 564     43       PFTB 2
Repeat        641 – 682     42       PFTB 3
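
The Length column follows from the positions, which are 1-based and inclusive at both ends (length = end - start + 1). A minimal Python sketch that checks the stated lengths is shown here; the coordinates are copied from the feature table, and the helper name `span_length` is hypothetical, introduced only for illustration.

```python
# Feature rows copied from the table above:
# (feature key, description, start, end, stated length)
FEATURES = [
    ("Chain", "Baruol synthase", 1, 759, 759),
    ("Repeat", "PFTB 1", 149, 190, 42),
    ("Repeat", "PFTB 2", 522, 564, 43),
    ("Repeat", "PFTB 3", 641, 682, 42),
]


def span_length(start: int, end: int) -> int:
    """Length of a feature whose start and end positions are both inclusive."""
    return end - start + 1


# Collect any rows whose stated length disagrees with the positions.
mismatches = [row for row in FEATURES if span_length(row[2], row[3]) != row[4]]
print(mismatches)  # prints [] because every stated length matches
```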

Sequences

Sequence: O23390 [UniParc].
Last modified March 3, 2009. Version 2.
Checksum: 620E96D9085213C1
Length: 759 AA
Mass: 87,455 Da
        10         20         30         40         50         60 
MWRLRIGAKA KDNTHLFTTN NYVGRQIWEF DANAGSPEEL AEVEEARRNF SNNRSRFKAS 

        70         80         90        100        110        120 
ADLLWRMQFL REKKFEQKIP RVIVEDAEKI TYEDAKTALR RGLLYFTALQ ADDGHWPAEN 

       130        140        150        160        170        180 
AGSIFFNAPF VICLYITGHL EKIFTHEHRV ELLRYMYNHQ NEDGGWGLHV ESPSNMFCSV 

       190        200        210        220        230        240 
INYICLRILG VEAGHDDKGS ACARARKWIL DHGGATYSPL IGKAWLSVLG VYDWSGCKPI 

       250        260        270        280        290        300 
PPEFWFLPSF FPVNGGTLWI YLRDIFMGLS YLYGKNFVAT STPLILQLRE EIYPEPYTNI 

       310        320        330        340        350        360 
SWRQARNRCA KEDLYYPQSF LQDLFWKGVH VFSENILNRW PFNNLIRQRA LRTTMELVHY 

       370        380        390        400        410        420 
HDEATRYITG GSVPKVIAVF HMLACWVEDP ESDYFKKHLA RVPDFIWIGE DGLKIQSFGS 

       430        440        450        460        470        480 
QVWDTALSLH VFIDGFDDDV DEEIRSTLLK GYDYLEKSQV TENPPGDYMK MFRHMAKGGW 

       490        500        510        520        530        540 
TFSDQDQGWP VSDCTAESLE CCLFFESMSS EFIGKKMDVE KLYDAVDFLL YLQSDNGGIT 

       550        560        570        580        590        600 
AWQPADGKLV EFIEDAVVEH EYVECTGSAI VALAQFNKQF PGYKKEEVER FITKGVKYIE 

       610        620        630        640        650        660 
DLQMVDGSWY GNWGVCFIYG TFFAVRGLVA AGKCYNNCEA IRRAVRFILD TQNTEGGWGE 

       670        680        690        700        710        720 
SYLSCPRKKY IPLIGNKTNV VNTGQALMVL IMGNQMKRDP LPVHRAAKVL INSQMDNGDF 

       730        740        750 
PQQEIMGVFK MNVMLHFPTY RNMFTLWALT HYTKALRGL 
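
The sequence above is displayed in blocks of 10 residues under a ruler of position numbers. To recover one contiguous string from this layout (for example, to confirm the stated length of 759 AA), the ruler lines and spaces have to be dropped. The following is a minimal Python sketch, assuming the layout shown here; the function name `parse_display_sequence` is hypothetical, and only the first displayed line is used as sample input.

```python
def parse_display_sequence(text: str) -> str:
    """Join a sequence shown as space-separated residue blocks under position rulers."""
    blocks = []
    for line in text.splitlines():
        tokens = line.split()
        # Skip blank lines and ruler lines, whose tokens are all digits (e.g. "10 20 30").
        if not tokens or all(token.isdigit() for token in tokens):
            continue
        blocks.extend(tokens)
    return "".join(blocks)


# First ruler line and first residue line, copied from the entry above.
sample = """
        10         20         30         40         50         60
MWRLRIGAKA KDNTHLFTTN NYVGRQIWEF DANAGSPEEL AEVEEARRNF SNNRSRFKAS
"""

seq = parse_display_sequence(sample)
print(len(seq))  # prints 60: six blocks of 10 residues
```

Applied to the full display block, the same function should return a string whose length can be compared with the 759 AA reported for this entry.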


References

[1] "Analysis of 1.9 Mb of contiguous sequence from chromosome 4 of Arabidopsis thaliana."
Bevan M., Bancroft I., Bent E., Love K., Goodman H.M., Dean C., Bergkamp R., Dirkse W., van Staveren M., Stiekema W., Drost L., Ridley P., Hudson S.-A., Patel K., Murphy G., Piffanelli P., Wedler H., Wedler E., Wambutt R., Weitzenegger T., Pohl T., Terryn N., Gielen J., Villarroel R., De Clercq R., van Montagu M., Lecharny A., Aubourg S., Gy I., Kreis M., Lao N., Kavanagh T., Hempel S., Kotter P., Entian K.-D., Rieger M., Schaefer M., Funk B., Mueller-Auer S., Silvey M., James R., Monfort A., Pons A., Puigdomenech P., Douka A., Voukelatou E., Milioni D., Hatzopoulos P., Piravandi E., Obermaier B., Hilbert H., Duesterhoeft A., Moores T., Jones J.D.G., Eneva T., Palme K., Benes V., Rechmann S., Ansorge W., Cooke R., Berger C., Delseny M., Voet M., Volckaert G., Mewes H.-W., Klosterman S., Schueller C., Chalwatzis N.
Nature 391:485-488(1998)
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: cv. Columbia.
[2] "Sequence and analysis of chromosome 4 of the plant Arabidopsis thaliana."
Mayer K.F.X., Schueller C., Wambutt R., Murphy G., Volckaert G., Pohl T., Duesterhoeft A., Stiekema W., Entian K.-D., Terryn N., Harris B., Ansorge W., Brandt P., Grivell L.A., Rieger M., Weichselgartner M., de Simone V., Obermaier B., Mache R., Mueller M., Kreis M., Delseny M., Puigdomenech P., Watson M., Schmidtheini T., Reichert B., Portetelle D., Perez-Alonso M., Boutry M., Bancroft I., Vos P., Hoheisel J., Zimmermann W., Wedler H., Ridley P., Langham S.-A., McCullagh B., Bilham L., Robben J., van der Schueren J., Grymonprez B., Chuang Y.-J., Vandenbussche F., Braeken M., Weltjens I., Voet M., Bastiaens I., Aert R., Defoor E., Weitzenegger T., Bothe G., Ramsperger U., Hilbert H., Braun M., Holzer E., Brandt A., Peters S., van Staveren M., Dirkse W., Mooijman P., Klein Lankhorst R., Rose M., Hauf J., Koetter P., Berneiser S., Hempel S., Feldpausch M., Lamberth S., Van den Daele H., De Keyser A., Buysshaert C., Gielen J., Villarroel R., De Clercq R., van Montagu M., Rogers J., Cronin A., Quail M.A., Bray-Allen S., Clark L., Doggett J., Hall S., Kay M., Lennard N., McLay K., Mayes R., Pettett A., Rajandream M.A., Lyne M., Benes V., Rechmann S., Borkova D., Bloecker H., Scharfe M., Grimm M., Loehnert T.-H., Dose S., de Haan M., Maarse A.C., Schaefer M., Mueller-Auer S., Gabel C., Fuchs M., Fartmann B., Granderath K., Dauner D., Herzl A., Neumann S., Argiriou A., Vitale D., Liguori R., Piravandi E., Massenet O., Quigley F., Clabauld G., Muendlein A., Felber R., Schnabl S., Hiller R., Schmidt W., Lecharny A., Aubourg S., Chefdor F., Cooke R., Berger C., Monfort A., Casacuberta E., Gibbons T., Weber N., Vandenbol M., Bargues M., Terol J., Torres A., Perez-Perez A., Purnelle B., Bent E., Johnson S., Tacon D., Jesse T., Heijnen L., Schwarz S., Scholler P., Heber S., Francs P., Bielke C., Frishman D., Haase D., Lemcke K., Mewes H.-W., Stocker S., Zaccaria P., Bevan M., Wilson R.K., de la Bastide M., Habermann K., Parnell L., Dedhia N., Gnoj L., Schutz K., Huang E., Spiegel L., Sekhon M., Murray J., Sheet P., Cordes M., Abu-Threideh J., Stoneking T., Kalicki J., Graves T., Harmon G., Edwards J., Latreille P., Courtney L., Cloud J., Abbott A., Scott K., Johnson D., Minx P., Bentley D., Fulton B., Miller N., Greco T., Kemp K., Kramer J., Fulton L., Mardis E., Dante M., Pepin K., Hillier L.W., Nelson J., Spieth J., Ryan E., Andrews S., Geisel C., Layman D., Du H., Ali J., Berghoff A., Jones K., Drone K., Cotton M., Joshu C., Antonoiu B., Zidanic M., Strong C., Sun H., Lamar B., Yordan C., Ma P., Zhong J., Preston R., Vil D., Shekher M., Matero A., Shah R., Swaby I.K., O'Shaughnessy A., Rodriguez M., Hoffman J., Till S., Granat S., Shohdy N., Hasegawa A., Hameed A., Lodhi M., Johnson A., Chen E., Marra M.A., Martienssen R., McCombie W.R.
Nature 402:769-777(1999)
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: cv. Columbia.
[3] The Arabidopsis Information Resource (TAIR)
Submitted (APR-2011) to the EMBL/GenBank/DDBJ databases
Cited for: GENOME REANNOTATION.
Strain: cv. Columbia.
[4] "Molecular cloning and expression in yeast of 2,3-oxidosqualene-triterpenoid cyclases from Arabidopsis thaliana."
Husselstein-Muller T., Schaller H., Benveniste P.
Plant Mol. Biol. 45:75-92(2001)
Cited for: IDENTIFICATION, NOMENCLATURE.
[5] "An oxidosqualene cyclase makes numerous products by diverse mechanisms: a challenge to prevailing concepts of triterpene biosynthesis."
Lodeiro S., Xiong Q., Wilson W.K., Kolesnikova M.D., Onak C.S., Matsuda S.P.T.
J. Am. Chem. Soc. 129:11213-11222(2007)
Cited for: FUNCTION, CATALYTIC ACTIVITY.

Cross-references

Sequence databases

EMBL / GenBank / DDBJ:
   Z97338 Genomic DNA. Translation: CAB10316.1. Sequence problems.
   AL161541 Genomic DNA. Translation: CAB78579.1. Sequence problems.
   CP002687 Genomic DNA. Translation: AEE83589.1.
PIR: B71418.
RefSeq: NP_193272.1. NM_117625.1.
UniGene: At.54340.

3D structure databases

ProteinModelPortal: O23390.
SMR: O23390. Positions 95-756.

Protein-protein interaction databases

STRING: 3702.AT4G15370.1-P.

Proteomic databases

PaxDb: O23390.
PRIDE: O23390.

Genome annotation databases

EnsemblPlants: AT4G15370.1; AT4G15370.1; AT4G15370.
GeneID: 827203.
KEGG: ath:AT4G15370.

Organism-specific databases

TAIR: AT4G15370.

Phylogenomic databases

eggNOG: COG1657.
HOGENOM: HOG000234317.
KO: K16206.
OMA: IFTHEHR.
PhylomeDB: O23390.
ProtClustDB: CLSN2679533.

Enzyme and pathway databases

BioCyc: MetaCyc:AT4G15370-MONOMER.

Gene expression databases

Genevestigator: O23390.

Family and domain databases

Gene3D: 1.50.10.20. 2 hits.
InterPro:
   IPR001330. Prenyltrans.
   IPR018333. Squalene_cyclase.
   IPR002365. Terpene_synthase_CS.
   IPR008930. Terpenoid_cyclase/PrenylTrfase.
Pfam: PF00432. Prenyltrans. 1 hit.
SUPFAM: SSF48239. SSF48239. 2 hits.
TIGRFAMs: TIGR01787. squalene_cyclas. 1 hit.
PROSITE: PS01074. TERPENE_SYNTHASES. 1 hit.

Entry information

Entry name: BARS1_ARATH
Accession: Primary (citable) accession number: O23390
Entry history
   Integrated into UniProtKB/Swiss-Prot: March 3, 2009
   Last sequence update: March 3, 2009
   Last modified: April 16, 2014
   This is version 82 of the entry and version 2 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Plant Protein Annotation Program

Relevant documents

SIMILARITY comments

Index of protein domains and families

Arabidopsis thaliana

Arabidopsis thaliana: entries and gene names