Skip Header

You are using a version of Internet Explorer that may not display all features of this website. Please upgrade to a modern browser.
Contribute Send feedback
Read comments (?) or add your own

Q9C8Y9 (BGL22_ARATH) Reviewed, UniProtKB/Swiss-Prot

Last modified April 16, 2014. Version 88. Feed History...

Clusters with 100%, 90%, 50% identity | Documents (3) | Third-party data text xml rdf/xml gff fasta
to top of pageNames·Attributes·General annotation·Ontologies·Alt products·Sequence annotation·Sequences·References·Cross-refs·Entry info·DocumentsCustomize order

Names and origin

Protein namesRecommended name:
Beta-glucosidase 22

Short name=AtBGLU22
EC=3.2.1.21
Gene names
Name:BGLU22
Ordered Locus Names:At1g66280
ORF Names:T27F4.3
OrganismArabidopsis thaliana (Mouse-ear cress) [Reference proteome]
Taxonomic identifier3702 [NCBI]
Taxonomic lineageEukaryotaViridiplantaeStreptophytaEmbryophytaTracheophytaSpermatophytaMagnoliophytaeudicotyledonsGunneridaePentapetalaerosidsmalvidsBrassicalesBrassicaceaeCamelineaeArabidopsis

Protein attributes

Sequence length524 AA.
Sequence statusComplete.
Sequence processingThe displayed sequence is further processed into a mature form.
Protein existenceEvidence at transcript level

General annotation (Comments)

Catalytic activity

Hydrolysis of terminal, non-reducing beta-D-glucosyl residues with release of beta-D-glucose.

Subcellular location

Endoplasmic reticulum lumen Potential.

Sequence similarities

Belongs to the glycosyl hydrolase 1 family.

Alternative products

This entry describes 2 isoforms produced by alternative splicing. [Align] [Select]
Isoform 1 (identifier: Q9C8Y9-1)

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.
Isoform 2 (identifier: Q9C8Y9-2)

The sequence of this isoform differs from the canonical sequence as follows:
     456-456: C → W
     457-524: Missing.
Note: No experimental confirmation available.

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Signal peptide1 – 2424 Potential
Chain25 – 524500Beta-glucosidase 22
PRO_0000389584

Regions

Region475 – 4762Substrate binding By similarity
Motif521 – 5244Prevents secretion from ER Potential

Sites

Active site2041Proton donor By similarity
Active site4181Nucleophile By similarity
Binding site551Substrate By similarity
Binding site1581Substrate By similarity
Binding site2031Substrate By similarity
Binding site3461Substrate By similarity
Binding site4681Substrate By similarity

Amino acid modifications

Glycosylation611N-linked (GlcNAc...) Potential
Glycosylation4941N-linked (GlcNAc...) Potential
Disulfide bond223 ↔ 230 By similarity

Natural variations

Alternative sequence4561C → W in isoform 2.
VSP_038458
Alternative sequence457 – 52468Missing in isoform 2.
VSP_038459

Experimental info

Sequence conflict2801H → R in BAH56964. Ref.3

Sequences

Sequence LengthMass (Da)Tools
Isoform 1 [UniParc].

Last modified June 1, 2001. Version 1.
Checksum: B05574AECFC56C41

FASTA52459,781
        10         20         30         40         50         60 
MALQKFPLLG LLFLITIVVS STIAVDDPVC PTTSKLSRAS FPNGFVFGTA TAAFQVEGAI 

        70         80         90        100        110        120 
NETCRGPALW DIFCKRNPER CSGHNADVAV DFFHRYKEDI QLMKNLNTDA FRLSIAWSRI 

       130        140        150        160        170        180 
FPHGRKEKGV SQAGVKFYHD LIDELLKNGI IPFVTVFHWD TPQDLEDEYG GFLSENIVKD 

       190        200        210        220        230        240 
FREYADYVFT EYGGKVKNWI TFNEPWVFAH AGYDVGKKAP GRCSRYLKGC EDRDGRSGYE 

       250        260        270        280        290        300 
AYLVSHNLLN AHAEAVEVFR QKVKGGKIGI AHSPAWFEPH DLKDSNDVPT VSRVLDFMLG 

       310        320        330        340        350        360 
WHLDPTTFGD YPQIMKDLLG HRLPKFTSSQ KAKLKDSTDF VGLNYYTSTF SNHNEKPDPS 

       370        380        390        400        410        420 
TPSWKQDSLV AWEPKNVDHS AIGSQPLTAA LPVYAKGFRS LLKYIKDKYA NPEIMIMENG 

       430        440        450        460        470        480 
YGDKLKDKDS VEVGTADYNR KYYLQRHLLA MNEAICIDKV RVTGYFVWSL LDNFEWQDGY 

       490        500        510        520 
NNRFGLYYVD FKNNLTRYEK ESAKYYKDFL GQGVRPSALK KDEL 

« Hide

Isoform 2 [UniParc].

Checksum: 88D47178B41A2E9E
Show »

FASTA45651,669

References

« Hide 'large scale' references
[1]"Sequence and analysis of chromosome 1 of the plant Arabidopsis thaliana."
Theologis A., Ecker J.R., Palm C.J., Federspiel N.A., Kaul S., White O., Alonso J., Altafi H., Araujo R., Bowman C.L., Brooks S.Y., Buehler E., Chan A., Chao Q., Chen H., Cheuk R.F., Chin C.W., Chung M.K. expand/collapse author list , Conn L., Conway A.B., Conway A.R., Creasy T.H., Dewar K., Dunn P., Etgu P., Feldblyum T.V., Feng J.-D., Fong B., Fujii C.Y., Gill J.E., Goldsmith A.D., Haas B., Hansen N.F., Hughes B., Huizar L., Hunter J.L., Jenkins J., Johnson-Hopson C., Khan S., Khaykin E., Kim C.J., Koo H.L., Kremenetskaia I., Kurtz D.B., Kwan A., Lam B., Langin-Hooper S., Lee A., Lee J.M., Lenz C.A., Li J.H., Li Y.-P., Lin X., Liu S.X., Liu Z.A., Luros J.S., Maiti R., Marziali A., Militscher J., Miranda M., Nguyen M., Nierman W.C., Osborne B.I., Pai G., Peterson J., Pham P.K., Rizzo M., Rooney T., Rowley D., Sakano H., Salzberg S.L., Schwartz J.R., Shinn P., Southwick A.M., Sun H., Tallon L.J., Tambunga G., Toriumi M.J., Town C.D., Utterback T., Van Aken S., Vaysberg M., Vysotskaia V.S., Walker M., Wu D., Yu G., Fraser C.M., Venter J.C., Davis R.W.
Nature 408:816-820(2000) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: cv. Columbia.
[2]The Arabidopsis Information Resource (TAIR)
Submitted (APR-2011) to the EMBL/GenBank/DDBJ databases
Cited for: GENOME REANNOTATION.
Strain: cv. Columbia.
[3]"Empirical analysis of transcriptional activity in the Arabidopsis genome."
Yamada K., Lim J., Dale J.M., Chen H., Shinn P., Palm C.J., Southwick A.M., Wu H.C., Kim C.J., Nguyen M., Pham P.K., Cheuk R.F., Karlin-Newmann G., Liu S.X., Lam B., Sakano H., Wu T., Yu G. expand/collapse author list , Miranda M., Quach H.L., Tripp M., Chang C.H., Lee J.M., Toriumi M.J., Chan M.M., Tang C.C., Onodera C.S., Deng J.M., Akiyama K., Ansari Y., Arakawa T., Banh J., Banno F., Bowser L., Brooks S.Y., Carninci P., Chao Q., Choy N., Enju A., Goldsmith A.D., Gurjal M., Hansen N.F., Hayashizaki Y., Johnson-Hopson C., Hsuan V.W., Iida K., Karnes M., Khan S., Koesema E., Ishida J., Jiang P.X., Jones T., Kawai J., Kamiya A., Meyers C., Nakajima M., Narusaka M., Seki M., Sakurai T., Satou M., Tamse R., Vaysberg M., Wallender E.K., Wong C., Yamamura Y., Yuan S., Shinozaki K., Davis R.W., Theologis A., Ecker J.R.
Science 302:842-846(2003) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1).
Strain: cv. Columbia.
[4]"Analysis of multiple occurrences of alternative splicing events in Arabidopsis thaliana using novel sequenced full-length cDNAs."
Iida K., Fukami-Kobayashi K., Toyoda A., Sakaki Y., Kobayashi M., Seki M., Shinozaki K.
DNA Res. 16:155-164(2009) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 2).
Strain: cv. Columbia.
[5]"Functional genomic analysis of Arabidopsis thaliana glycoside hydrolase family 1."
Xu Z., Escamilla-Trevino L.L., Zeng L., Lalgondar M., Bevan D.R., Winkel B.S.J., Mohamed A., Cheng C.-L., Shih M.-C., Poulton J.E., Esen A.
Plant Mol. Biol. 55:343-367(2004) [PubMed] [Europe PMC] [Abstract]
Cited for: GENE FAMILY, NOMENCLATURE.

Cross-references

Sequence databases

EMBL
GenBank
DDBJ
AC020665 Genomic DNA. Translation: AAG52159.1.
CP002684 Genomic DNA. Translation: AEE34490.1.
AY074378 mRNA. Translation: AAL67074.1.
AK318849 mRNA. Translation: BAH56964.1.
PIRH96687.
RefSeqNP_176802.1. NM_105299.2.
UniGeneAt.47576.
At.75574.

3D structure databases

ProteinModelPortalQ9C8Y9.
SMRQ9C8Y9. Positions 34-503.
ModBaseSearch...
MobiDBSearch...

Protein-protein interaction databases

IntActQ9C8Y9. 3 interactions.
STRING3702.AT1G66280.1-P.

Protein family/group databases

CAZyGH1. Glycoside Hydrolase Family 1.

Proteomic databases

PaxDbQ9C8Y9.
PRIDEQ9C8Y9.

Protocols and materials databases

StructuralBiologyKnowledgebaseSearch...

Genome annotation databases

EnsemblPlantsAT1G66280.1; AT1G66280.1; AT1G66280. [Q9C8Y9-1]
GeneID842945.
KEGGath:AT1G66280.

Organism-specific databases

TAIRAT1G66280.

Phylogenomic databases

eggNOGCOG2723.
HOGENOMHOG000088630.
InParanoidQ9C8Y9.
KOK01188.
OMADVIDECL.
PhylomeDBQ9C8Y9.
ProtClustDBCLSN2679905.

Enzyme and pathway databases

BioCycARA:AT1G66280-MONOMER.

Gene expression databases

GenevestigatorQ9C8Y9.

Family and domain databases

Gene3D3.20.20.80. 1 hit.
InterProIPR001360. Glyco_hydro_1.
IPR018120. Glyco_hydro_1_AS.
IPR013781. Glyco_hydro_catalytic_dom.
IPR017853. Glycoside_hydrolase_SF.
[Graphical view]
PANTHERPTHR10353. PTHR10353. 1 hit.
PfamPF00232. Glyco_hydro_1. 1 hit.
[Graphical view]
PRINTSPR00131. GLHYDRLASE1.
SUPFAMSSF51445. SSF51445. 1 hit.
PROSITEPS00014. ER_TARGET. 1 hit.
PS00653. GLYCOSYL_HYDROL_F1_2. 1 hit.
[Graphical view]
ProtoNetSearch...

Entry information

Entry nameBGL22_ARATH
AccessionPrimary (citable) accession number: Q9C8Y9
Secondary accession number(s): C0Z2N5
Entry history
Integrated into UniProtKB/Swiss-Prot: November 24, 2009
Last sequence update: June 1, 2001
Last modified: April 16, 2014
This is version 88 of the entry and version 1 of the sequence. [Complete history]
Entry statusReviewed (UniProtKB/Swiss-Prot)
Annotation programPlant Protein Annotation Program

Relevant documents

SIMILARITY comments

Index of protein domains and families

Glycosyl hydrolases

Classification of glycosyl hydrolase families and list of entries

Arabidopsis thaliana

Arabidopsis thaliana: entries and gene names