
Q84WV2 (BGL20_ARATH) Reviewed, UniProtKB/Swiss-Prot

Last modified February 19, 2014. Version 79.


Names and origin

Protein names
Recommended name: Beta-glucosidase 20
Short name=AtBGLU20
EC=3.2.1.21
Gene names
Name: BGLU20
Ordered Locus Names: At1g75940
ORF Names: T4O12.15
Organism: Arabidopsis thaliana (Mouse-ear cress) [Reference proteome]
Taxonomic identifier: 3702 [NCBI]
Taxonomic lineage: Eukaryota > Viridiplantae > Streptophyta > Embryophyta > Tracheophyta > Spermatophyta > Magnoliophyta > eudicotyledons > Gunneridae > Pentapetalae > rosids > malvids > Brassicales > Brassicaceae > Camelineae > Arabidopsis

Protein attributes

Sequence length: 535 AA.
Sequence status: Complete.
Sequence processing: The displayed sequence is further processed into a mature form.
Protein existence: Evidence at transcript level.

General annotation (Comments)

Catalytic activity

Hydrolysis of terminal, non-reducing beta-D-glucosyl residues with release of beta-D-glucose.

Subcellular location

Endoplasmic reticulum lumen (Potential).

Sequence similarities

Belongs to the glycosyl hydrolase 1 family.

Sequence caution

The sequence AAF26759.2 differs from that shown. Reason: Erroneous gene model prediction. The predicted gene At1g75930 has been split into 2 genes: At1g75930 and At1g75940.

Ontologies

Sequence annotation (Features)

Feature key        Position(s)   Length   Description                              Feature identifier

Molecule processing

Signal peptide     1 – 24        24       (Potential)
Chain              25 – 535      511      Beta-glucosidase 20                      PRO_0000389582

Regions

Region             482 – 483     2        Substrate binding (By similarity)
Motif              532 – 535     4        Prevents secretion from ER (Potential)

Sites

Active site        205           1        Proton donor (By similarity)
Active site        424           1        Nucleophile (By similarity)
Binding site       56            1        Substrate (By similarity)
Binding site       159           1        Substrate (By similarity)
Binding site       204           1        Substrate (By similarity)
Binding site       351           1        Substrate (By similarity)
Binding site       475           1        Substrate (By similarity)

Amino acid modifications

Glycosylation      187           1        N-linked (GlcNAc...) (Potential)
Glycosylation      468           1        N-linked (GlcNAc...) (Potential)
Glycosylation      501           1        N-linked (GlcNAc...) (Potential)
Disulfide bond     224 ↔ 235              (By similarity)

Experimental info

Sequence conflict  362           1        P → T in AAC39504 (Ref.1)

Sequences

Q84WV2 [UniParc].
Length: 535 AA
Mass: 61,675 Da
Last modified June 1, 2003. Version 1.
Checksum: C717D02DA37F1C81

        10         20         30         40         50         60 
MGRFHKFPLL GLVLFLGLTG SLIAANEYAC SSTDIHFTRA NFPKGFIFGT ATAAFQVEGA 

        70         80         90        100        110        120 
VNEGCRGPSM WDVYTKKFPH KCNYHNADVA VDFYHRYKED IKLMKNLNTD GFRFSIAWPR 

       130        140        150        160        170        180 
IFPHGRMEKG ISKAGVQYYH DLIDELLANG ITPLVTVFHW DTPQDLEDEY GGFLSDRIIK 

       190        200        210        220        230        240 
DFTEYANFTF QEYGDKVKHW ITFNEPWVFS RAGYDIGNKA PGRCSKYIKE HGEMCHDGRS 

       250        260        270        280        290        300 
GHEAYIVSHN MLLAHADAVD AFRKCDKCKG GKIGIAHSPA WFEAHELSDE EHETPVTGLI 

       310        320        330        340        350        360 
DFILGWHLHP TTYGDYPQSM KDHIGHRLPK FTEAQKEKLK NSADFVGINY YTSVFALHDE 

       370        380        390        400        410        420 
EPDPSQPSWQ SDSLVDWEPR YVDKFNAFAN KPDVAKVEVY AKGLRSLLKY IKDKYGNPEI 

       430        440        450        460        470        480 
MITENGYGED LGEQDTSLVV ALSDQHRTYY IQKHLLSLHE AICDDKVNVT GYFHWSLMDN 

       490        500        510        520        530 
FEWQDGYKAR FGLYYVDYKN NLTRHEKLSA QWYSSFLHDG SKEFEIEHEF EHDEL 
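
The positional annotations in this entry can be cross-checked against the sequence itself. The following Python sketch is one way to do that: it converts the 1-based positions from the feature table into string slices and scans for N-X-S/T sequons (X not proline). The helper names (`residue`, `segment`, `find_sequons`) are hypothetical and not part of any UniProt tooling, and using the N-X-S/T consensus to explain the "Potential" glycosylation sites is an assumption about how those sites were predicted, not something the entry states.

```python
import re

# Sequence of Q84WV2 (BGL20_ARATH), copied from the listing above.
# Spaces and line breaks are layout only and are stripped below.
RAW_SEQUENCE = """
MGRFHKFPLL GLVLFLGLTG SLIAANEYAC SSTDIHFTRA NFPKGFIFGT ATAAFQVEGA
VNEGCRGPSM WDVYTKKFPH KCNYHNADVA VDFYHRYKED IKLMKNLNTD GFRFSIAWPR
IFPHGRMEKG ISKAGVQYYH DLIDELLANG ITPLVTVFHW DTPQDLEDEY GGFLSDRIIK
DFTEYANFTF QEYGDKVKHW ITFNEPWVFS RAGYDIGNKA PGRCSKYIKE HGEMCHDGRS
GHEAYIVSHN MLLAHADAVD AFRKCDKCKG GKIGIAHSPA WFEAHELSDE EHETPVTGLI
DFILGWHLHP TTYGDYPQSM KDHIGHRLPK FTEAQKEKLK NSADFVGINY YTSVFALHDE
EPDPSQPSWQ SDSLVDWEPR YVDKFNAFAN KPDVAKVEVY AKGLRSLLKY IKDKYGNPEI
MITENGYGED LGEQDTSLVV ALSDQHRTYY IQKHLLSLHE AICDDKVNVT GYFHWSLMDN
FEWQDGYKAR FGLYYVDYKN NLTRHEKLSA QWYSSFLHDG SKEFEIEHEF EHDEL
"""
SEQ = "".join(RAW_SEQUENCE.split())


def residue(pos: int) -> str:
    """Return the amino acid at a 1-based position (hypothetical helper)."""
    return SEQ[pos - 1]


def segment(start: int, end: int) -> str:
    """Return the inclusive 1-based range start..end (hypothetical helper)."""
    return SEQ[start - 1:end]


def find_sequons(seq: str) -> list[int]:
    """1-based positions of N in N-X-S/T sequons, where X is not proline.

    A lookahead is used so that overlapping candidates are not skipped.
    """
    return [m.start() + 1 for m in re.finditer(r"N(?=[^P][ST])", seq)]


if __name__ == "__main__":
    print("length:", len(SEQ))                          # 535
    print("mature chain length:", len(segment(25, 535)))  # 511
    print("C-terminal motif:", segment(532, 535))       # HDEL
    print("active sites:", residue(205), residue(424))  # E E
    print("disulfide pair:", residue(224), residue(235))  # C C
    print("N-X-S/T sequons:", find_sequons(SEQ))        # [187, 468, 501]
```

Under these assumptions the sketch agrees with the feature table: both annotated active-site residues are glutamate, both ends of the disulfide bond are cysteine, the last four residues are HDEL, and the only N-X-S/T sequons fall at positions 187, 468 and 501.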


References

[1] "Identification, sequence analysis and expression studies of novel anther-specific genes of Arabidopsis thaliana."
Rubinelli P., Hu Y., Ma H.
Plant Mol. Biol. 37:607-619(1998)
Cited for: NUCLEOTIDE SEQUENCE [MRNA].
[2] "Sequence and analysis of chromosome 1 of the plant Arabidopsis thaliana."
Theologis A., Ecker J.R., Palm C.J., Federspiel N.A., Kaul S., White O., Alonso J., Altafi H., Araujo R., Bowman C.L., Brooks S.Y., Buehler E., Chan A., Chao Q., Chen H., Cheuk R.F., Chin C.W., Chung M.K., Conn L., Conway A.B., Conway A.R., Creasy T.H., Dewar K., Dunn P., Etgu P., Feldblyum T.V., Feng J.-D., Fong B., Fujii C.Y., Gill J.E., Goldsmith A.D., Haas B., Hansen N.F., Hughes B., Huizar L., Hunter J.L., Jenkins J., Johnson-Hopson C., Khan S., Khaykin E., Kim C.J., Koo H.L., Kremenetskaia I., Kurtz D.B., Kwan A., Lam B., Langin-Hooper S., Lee A., Lee J.M., Lenz C.A., Li J.H., Li Y.-P., Lin X., Liu S.X., Liu Z.A., Luros J.S., Maiti R., Marziali A., Militscher J., Miranda M., Nguyen M., Nierman W.C., Osborne B.I., Pai G., Peterson J., Pham P.K., Rizzo M., Rooney T., Rowley D., Sakano H., Salzberg S.L., Schwartz J.R., Shinn P., Southwick A.M., Sun H., Tallon L.J., Tambunga G., Toriumi M.J., Town C.D., Utterback T., Van Aken S., Vaysberg M., Vysotskaia V.S., Walker M., Wu D., Yu G., Fraser C.M., Venter J.C., Davis R.W.
Nature 408:816-820(2000)
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: cv. Columbia.
[3] The Arabidopsis Information Resource (TAIR)
Submitted (APR-2011) to the EMBL/GenBank/DDBJ databases
Cited for: GENOME REANNOTATION.
Strain: cv. Columbia.
[4] "Empirical analysis of transcriptional activity in the Arabidopsis genome."
Yamada K., Lim J., Dale J.M., Chen H., Shinn P., Palm C.J., Southwick A.M., Wu H.C., Kim C.J., Nguyen M., Pham P.K., Cheuk R.F., Karlin-Newmann G., Liu S.X., Lam B., Sakano H., Wu T., Yu G., Miranda M., Quach H.L., Tripp M., Chang C.H., Lee J.M., Toriumi M.J., Chan M.M., Tang C.C., Onodera C.S., Deng J.M., Akiyama K., Ansari Y., Arakawa T., Banh J., Banno F., Bowser L., Brooks S.Y., Carninci P., Chao Q., Choy N., Enju A., Goldsmith A.D., Gurjal M., Hansen N.F., Hayashizaki Y., Johnson-Hopson C., Hsuan V.W., Iida K., Karnes M., Khan S., Koesema E., Ishida J., Jiang P.X., Jones T., Kawai J., Kamiya A., Meyers C., Nakajima M., Narusaka M., Seki M., Sakurai T., Satou M., Tamse R., Vaysberg M., Wallender E.K., Wong C., Yamamura Y., Yuan S., Shinozaki K., Davis R.W., Theologis A., Ecker J.R.
Science 302:842-846(2003)
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
Strain: cv. Columbia.
[5] "Functional genomic analysis of Arabidopsis thaliana glycoside hydrolase family 1."
Xu Z., Escamilla-Trevino L.L., Zeng L., Lalgondar M., Bevan D.R., Winkel B.S.J., Mohamed A., Cheng C.-L., Shih M.-C., Poulton J.E., Esen A.
Plant Mol. Biol. 55:343-367(2004)
Cited for: GENE FAMILY, NOMENCLATURE.

Cross-references

Sequence databases

EMBL / GenBank / DDBJ:
AF037590 mRNA. Translation: AAC39504.1.
AC007396 Genomic DNA. Translation: AAF26759.2. Sequence problems.
CP002684 Genomic DNA. Translation: AEE35779.1.
AY074517 mRNA. Translation: AAL67131.1.
BT002735 mRNA. Translation: AAO22564.1.
PIR: T52048.
RefSeq: NP_177722.1. NM_106244.2.
UniGene: At.10790.

3D structure databases

ProteinModelPortal: Q84WV2.
SMR: Q84WV2. Positions 37-517.

Protein-protein interaction databases

STRING: 3702.AT1G75940.1-P.

Protein family/group databases

CAZy: GH1. Glycoside Hydrolase Family 1.

Proteomic databases

PaxDb: Q84WV2.
PRIDE: Q84WV2.

Genome annotation databases

EnsemblPlants: AT1G75940.1; AT1G75940.1; AT1G75940.
GeneID: 843927.
KEGG: ath:AT1G75940.

Organism-specific databases

TAIR: AT1G75940.

Phylogenomic databases

eggNOG: COG2723.
HOGENOM: HOG000088630.
InParanoid: Q84WV2.
KO: K01188.
OMA: HNDISAN.
PhylomeDB: Q84WV2.

Enzyme and pathway databases

BioCyc: ARA:AT1G75940-MONOMER.

Gene expression databases

Genevestigator: Q84WV2.

Family and domain databases

Gene3D: 3.20.20.80. 1 hit.
InterPro: IPR001360. Glyco_hydro_1.
          IPR018120. Glyco_hydro_1_AS.
          IPR013781. Glyco_hydro_catalytic_dom.
          IPR017853. Glycoside_hydrolase_SF.
PANTHER: PTHR10353. PTHR10353. 1 hit.
Pfam: PF00232. Glyco_hydro_1. 1 hit.
PRINTS: PR00131. GLHYDRLASE1.
SUPFAM: SSF51445. SSF51445. 1 hit.
PROSITE: PS00014. ER_TARGET. 1 hit.
         PS00653. GLYCOSYL_HYDROL_F1_2. 1 hit.

Entry information

Entry name: BGL20_ARATH
Accession: Primary (citable) accession number: Q84WV2
Secondary accession number(s): O49117, Q8VXW3, Q9LQS3
Entry history:
Integrated into UniProtKB/Swiss-Prot: November 24, 2009
Last sequence update: June 1, 2003
Last modified: February 19, 2014
This is version 79 of the entry and version 1 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Plant Protein Annotation Program

Relevant documents

SIMILARITY comments

Index of protein domains and families

Glycosyl hydrolases

Classification of glycosyl hydrolase families and list of entries

Arabidopsis thaliana

Arabidopsis thaliana: entries and gene names