Q8GY58 (GUN23_ARATH) Reviewed, UniProtKB/Swiss-Prot

Last modified June 11, 2014. Version 76.

Names and origin

Protein names
Recommended name: Endoglucanase 23
EC=3.2.1.4
Alternative name(s): Endo-1,4-beta glucanase 23
Gene names
Ordered Locus Names: At4g39000
ORF Names: F19H22.100
Organism: Arabidopsis thaliana (Mouse-ear cress) [Reference proteome]
Taxonomic identifier: 3702 [NCBI]
Taxonomic lineage: Eukaryota > Viridiplantae > Streptophyta > Embryophyta > Tracheophyta > Spermatophyta > Magnoliophyta > eudicotyledons > Gunneridae > Pentapetalae > rosids > malvids > Brassicales > Brassicaceae > Camelineae > Arabidopsis

Protein attributes

Sequence length: 493 AA.
Sequence status: Complete.
Sequence processing: The displayed sequence is further processed into a mature form.
Protein existence: Evidence at transcript level.

General annotation (Comments)

Catalytic activity

Endohydrolysis of (1->4)-beta-D-glucosidic linkages in cellulose, lichenin and cereal beta-D-glucans.

Subcellular location

Secreted (by similarity).

Sequence similarities

Belongs to the glycosyl hydrolase 9 (cellulase E) family.

Caution

The conserved 'Asp-461' active site is replaced by a Gly residue.

Ontologies

Keywords
   Biological process: Carbohydrate metabolism; Cell wall biogenesis/degradation; Cellulose degradation; Polysaccharide degradation
   Cellular component: Secreted
   Coding sequence diversity: Alternative splicing
   Domain: Signal
   Molecular function: Glycosidase; Hydrolase
   PTM: Glycoprotein
   Technical term: Complete proteome; Reference proteome
Gene Ontology (GO)
   Biological_process: cellulose catabolic process (inferred from electronic annotation; source: UniProtKB-KW)
   Cellular_component: extracellular region (inferred from electronic annotation; source: UniProtKB-SubCell)
   Molecular_function: cellulase activity (inferred from electronic annotation; source: UniProtKB-EC)

Alternative products

This entry describes 2 isoforms produced by alternative splicing.
Isoform 1 (identifier: Q8GY58-1)

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.
Isoform 2 (identifier: Q8GY58-2)

The sequence of this isoform differs from the canonical sequence as follows:
     292-310: EVLQNNVTAIAAYKDTAEK → VWSNFQNQTDVYIYDKCDR
     311-493: Missing.
Note: No experimental confirmation available.
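
The two changes listed for isoform 2 can be applied to the canonical sequence mechanically. The following is a minimal sketch, not part of the UniProt entry: the function name `derive_isoform_2` is hypothetical, positions are treated as 1-based and inclusive, and the canonical sequence is copied from the Sequences section of this entry.

```python
# Canonical (isoform 1) sequence copied from this entry; whitespace is stripped.
CANONICAL = "".join("""
MKASIYLVTV FILLLLLLPT AIPHDYSDAL RKSILFFEGQ RSGRLPKQQR MAWRRNSALN
DGKNLKTDLV GGYYDAGDNV KFHFPMAFTA TMLAWSSVDF GRYMSQHDFR HNLVAVKWAT
DYLLKTVSQL PNRIFVHVGE VQPDHDCWER PEDMDTPRTA FALDAPYPAS DLAGEIAAAL
AAASIAFKQA NPKYSAILLN KAVQTFQYAD SHRGSYTDNP GIKQAVCPFY CSVNGYKDEL
LWGAAWLRRA TGEDSYLRYL VDNGQAFGES SNYFEFGWDN KVGGVNVLVA KEVLQNNVTA
IAAYKDTAEK MMCSFLPETN GPHMSYTPGG LIYKPGSTQL QNTAALSFLL LTYADYLSTS
SQQLNCGNLK FQPDSLRRIV KRQVDYVLGD NPMKLSYMIG YGERYPGLIH HRGSSIPSVT
VHPAAFGCIA GWNIFSSPNP NPNILIGAVI GGPDVDDRFI GGRTNASETE PTTYINAPFV
GVFAYFKSNP NFS
""".split())


def derive_isoform_2(canonical: str) -> str:
    """Apply the annotated differences (hypothetical helper, 1-based positions)."""
    start, end = 292, 310
    original = "EVLQNNVTAIAAYKDTAEK"
    replacement = "VWSNFQNQTDVYIYDKCDR"
    # Guard against an off-by-one: the replaced segment must match the entry.
    if canonical[start - 1:end] != original:
        raise ValueError("segment 292-310 does not match the canonical sequence")
    # Positions 311-493 are missing, so nothing after the replacement is kept.
    return canonical[:start - 1] + replacement


isoform_2 = derive_isoform_2(CANONICAL)
print(len(CANONICAL), len(isoform_2))  # prints: 493 310
```

The resulting length of 310 residues agrees with the length listed for isoform 2 in the Sequences section.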

Sequence annotation (Features)

Feature key           Position(s)  Length  Description                                      Feature identifier

Molecule processing

Signal peptide        1 – 23       23      Potential
Chain                 24 – 493     470     Endoglucanase 23                                 PRO_0000249275

Sites

Active site           410          1       By similarity
Active site           470          1       By similarity

Amino acid modifications

Glycosylation         297          1       N-linked (GlcNAc...) Potential
Glycosylation         465          1       N-linked (GlcNAc...) Potential

Natural variations

Alternative sequence  292 – 310    19      EVLQN…DTAEK → VWSNFQNQTDVYIYDKCDR in isoform 2.  VSP_020387
Alternative sequence  311 – 493    183     Missing in isoform 2.                            VSP_020388

Experimental info

Sequence conflict     97           1       Missing in BAC42491. Ref.3
Sequence conflict     215          1       S → T in BAC42491. Ref.3

Sequences

Isoform 1

Last modified September 5, 2006. Version 2.
Checksum: B999CEC76F5FDFC3
Length: 493 AA. Mass: 54,692 Da.
        10         20         30         40         50         60 
MKASIYLVTV FILLLLLLPT AIPHDYSDAL RKSILFFEGQ RSGRLPKQQR MAWRRNSALN 

        70         80         90        100        110        120 
DGKNLKTDLV GGYYDAGDNV KFHFPMAFTA TMLAWSSVDF GRYMSQHDFR HNLVAVKWAT 

       130        140        150        160        170        180 
DYLLKTVSQL PNRIFVHVGE VQPDHDCWER PEDMDTPRTA FALDAPYPAS DLAGEIAAAL 

       190        200        210        220        230        240 
AAASIAFKQA NPKYSAILLN KAVQTFQYAD SHRGSYTDNP GIKQAVCPFY CSVNGYKDEL 

       250        260        270        280        290        300 
LWGAAWLRRA TGEDSYLRYL VDNGQAFGES SNYFEFGWDN KVGGVNVLVA KEVLQNNVTA 

       310        320        330        340        350        360 
IAAYKDTAEK MMCSFLPETN GPHMSYTPGG LIYKPGSTQL QNTAALSFLL LTYADYLSTS 

       370        380        390        400        410        420 
SQQLNCGNLK FQPDSLRRIV KRQVDYVLGD NPMKLSYMIG YGERYPGLIH HRGSSIPSVT 

       430        440        450        460        470        480 
VHPAAFGCIA GWNIFSSPNP NPNILIGAVI GGPDVDDRFI GGRTNASETE PTTYINAPFV 

       490 
GVFAYFKSNP NFS 
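
The displayed sequence is grouped into blocks of ten residues under position rulers, so it needs light parsing before the annotated positions can be checked against it. The sketch below is illustrative only and not part of the entry: the helper names `parse_display`, `residue_at` and `is_sequon` are hypothetical, and the N-X-[S/T] rule (X not proline) is the general consensus motif for N-linked glycosylation, assumed here as the check for the two "Potential" sites.

```python
import re

# Sequence block as displayed in this entry, ruler lines included.
DISPLAY = """
        10         20         30         40         50         60
MKASIYLVTV FILLLLLLPT AIPHDYSDAL RKSILFFEGQ RSGRLPKQQR MAWRRNSALN
        70         80         90        100        110        120
DGKNLKTDLV GGYYDAGDNV KFHFPMAFTA TMLAWSSVDF GRYMSQHDFR HNLVAVKWAT
       130        140        150        160        170        180
DYLLKTVSQL PNRIFVHVGE VQPDHDCWER PEDMDTPRTA FALDAPYPAS DLAGEIAAAL
       190        200        210        220        230        240
AAASIAFKQA NPKYSAILLN KAVQTFQYAD SHRGSYTDNP GIKQAVCPFY CSVNGYKDEL
       250        260        270        280        290        300
LWGAAWLRRA TGEDSYLRYL VDNGQAFGES SNYFEFGWDN KVGGVNVLVA KEVLQNNVTA
       310        320        330        340        350        360
IAAYKDTAEK MMCSFLPETN GPHMSYTPGG LIYKPGSTQL QNTAALSFLL LTYADYLSTS
       370        380        390        400        410        420
SQQLNCGNLK FQPDSLRRIV KRQVDYVLGD NPMKLSYMIG YGERYPGLIH HRGSSIPSVT
       430        440        450        460        470        480
VHPAAFGCIA GWNIFSSPNP NPNILIGAVI GGPDVDDRFI GGRTNASETE PTTYINAPFV
       490
GVFAYFKSNP NFS
"""


def parse_display(text: str) -> str:
    """Drop ruler lines and spaces, returning one contiguous sequence."""
    rows = []
    for line in text.splitlines():
        row = line.strip()
        # Residue rows contain only uppercase letters and spaces.
        if row and re.fullmatch(r"[A-Z ]+", row):
            rows.append(row.replace(" ", ""))
    return "".join(rows)


def residue_at(seq: str, pos: int) -> str:
    """Residue at a 1-based position."""
    return seq[pos - 1]


def is_sequon(seq: str, pos: int) -> bool:
    """True if the 1-based position starts N-X-[S/T] with X other than proline."""
    if pos + 1 >= len(seq):
        return False
    return seq[pos - 1] == "N" and seq[pos] != "P" and seq[pos + 1] in "ST"


seq = parse_display(DISPLAY)
print(len(seq))                                                          # 493
print(residue_at(seq, 410), residue_at(seq, 470), residue_at(seq, 461))  # H E G
print(is_sequon(seq, 297), is_sequon(seq, 465))                          # True True
```

The glycine at position 461 matches the Caution note, and both annotated glycosylation positions carry an asparagine at the start of a consensus motif.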

Isoform 2

Checksum: BDC4CF88CD8C2535
Length: 310 AA. Mass: 35,065 Da.

References

[1] "Sequence and analysis of chromosome 4 of the plant Arabidopsis thaliana."
Mayer K.F.X., Schueller C., Wambutt R., Murphy G., Volckaert G., Pohl T., Duesterhoeft A., Stiekema W., Entian K.-D., Terryn N., Harris B., Ansorge W., Brandt P., Grivell L.A., Rieger M., Weichselgartner M., de Simone V., Obermaier B., Mache R., Mueller M., Kreis M., Delseny M., Puigdomenech P., Watson M., Schmidtheini T., Reichert B., Portetelle D., Perez-Alonso M., Boutry M., Bancroft I., Vos P., Hoheisel J., Zimmermann W., Wedler H., Ridley P., Langham S.-A., McCullagh B., Bilham L., Robben J., van der Schueren J., Grymonprez B., Chuang Y.-J., Vandenbussche F., Braeken M., Weltjens I., Voet M., Bastiaens I., Aert R., Defoor E., Weitzenegger T., Bothe G., Ramsperger U., Hilbert H., Braun M., Holzer E., Brandt A., Peters S., van Staveren M., Dirkse W., Mooijman P., Klein Lankhorst R., Rose M., Hauf J., Koetter P., Berneiser S., Hempel S., Feldpausch M., Lamberth S., Van den Daele H., De Keyser A., Buysshaert C., Gielen J., Villarroel R., De Clercq R., van Montagu M., Rogers J., Cronin A., Quail M.A., Bray-Allen S., Clark L., Doggett J., Hall S., Kay M., Lennard N., McLay K., Mayes R., Pettett A., Rajandream M.A., Lyne M., Benes V., Rechmann S., Borkova D., Bloecker H., Scharfe M., Grimm M., Loehnert T.-H., Dose S., de Haan M., Maarse A.C., Schaefer M., Mueller-Auer S., Gabel C., Fuchs M., Fartmann B., Granderath K., Dauner D., Herzl A., Neumann S., Argiriou A., Vitale D., Liguori R., Piravandi E., Massenet O., Quigley F., Clabauld G., Muendlein A., Felber R., Schnabl S., Hiller R., Schmidt W., Lecharny A., Aubourg S., Chefdor F., Cooke R., Berger C., Monfort A., Casacuberta E., Gibbons T., Weber N., Vandenbol M., Bargues M., Terol J., Torres A., Perez-Perez A., Purnelle B., Bent E., Johnson S., Tacon D., Jesse T., Heijnen L., Schwarz S., Scholler P., Heber S., Francs P., Bielke C., Frishman D., Haase D., Lemcke K., Mewes H.-W., Stocker S., Zaccaria P., Bevan M., Wilson R.K., de la Bastide M., Habermann K., Parnell L., Dedhia N., Gnoj L., Schutz K., Huang E., Spiegel L., Sekhon M., Murray J., Sheet P., Cordes M., Abu-Threideh J., Stoneking T., Kalicki J., Graves T., Harmon G., Edwards J., Latreille P., Courtney L., Cloud J., Abbott A., Scott K., Johnson D., Minx P., Bentley D., Fulton B., Miller N., Greco T., Kemp K., Kramer J., Fulton L., Mardis E., Dante M., Pepin K., Hillier L.W., Nelson J., Spieth J., Ryan E., Andrews S., Geisel C., Layman D., Du H., Ali J., Berghoff A., Jones K., Drone K., Cotton M., Joshu C., Antonoiu B., Zidanic M., Strong C., Sun H., Lamar B., Yordan C., Ma P., Zhong J., Preston R., Vil D., Shekher M., Matero A., Shah R., Swaby I.K., O'Shaughnessy A., Rodriguez M., Hoffman J., Till S., Granat S., Shohdy N., Hasegawa A., Hameed A., Lodhi M., Johnson A., Chen E., Marra M.A., Martienssen R., McCombie W.R.
Nature 402:769-777(1999)
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: cv. Columbia.
[2] The Arabidopsis Information Resource (TAIR)
Submitted (APR-2011) to the EMBL/GenBank/DDBJ databases
Cited for: GENOME REANNOTATION.
Strain: cv. Columbia.
[3] "Functional annotation of a full-length Arabidopsis cDNA collection."
Seki M., Narusaka M., Kamiya A., Ishida J., Satou M., Sakurai T., Nakajima M., Enju A., Akiyama K., Oono Y., Muramatsu M., Hayashizaki Y., Kawai J., Carninci P., Itoh M., Ishii Y., Arakawa T., Shibata K., Shinagawa A., Shinozaki K.
Science 296:141-145(2002)
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 2).
Strain: cv. Columbia.
[4] "Simultaneous high-throughput recombinational cloning of open reading frames in closed and open configurations."
Underwood B.A., Vanderhaeghen R., Whitford R., Town C.D., Hilson P.
Plant Biotechnol. J. 4:317-324(2006)
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1).
Strain: cv. Columbia.
[5] "Phylogenetic analysis of the plant endo-beta-1,4-glucanase gene family."
Libertini E., Li Y., McQueen-Mason S.J.
J. Mol. Evol. 58:506-515(2004)
Cited for: GENE FAMILY.

Cross-references

Sequence databases

EMBL / GenBank / DDBJ:
AL035679 Genomic DNA. Translation: CAB38820.1.
AL161594 Genomic DNA. Translation: CAB80563.1.
CP002687 Genomic DNA. Translation: AEE87006.1.
AK117850 mRNA. Translation: BAC42491.1.
DQ446906 mRNA. Translation: ABE66120.1.
PIR: T06060.
RefSeq: NP_195611.1. NM_120060.1. [Q8GY58-1]
UniGene: At.31125.

3D structure databases

ProteinModelPortal: Q8GY58.
SMR: Q8GY58. Positions 25-484.

Protein-protein interaction databases

STRING: 3702.AT4G39000.1-P.

Protein family/group databases

CAZy: GH9. Glycoside Hydrolase Family 9.

Proteomic databases

PRIDE: Q8GY58.

Genome annotation databases

EnsemblPlants: AT4G39000.1; AT4G39000.1; AT4G39000. [Q8GY58-1]
GeneID: 830055.
KEGG: ath:AT4G39000.

Organism-specific databases

TAIR: AT4G39000.

Phylogenomic databases

eggNOG: NOG322234.
InParanoid: Q8GY58.
KO: K01179.
OMA: FQYADSH.
PhylomeDB: Q8GY58.

Enzyme and pathway databases

BioCyc: ARA:AT4G39000-MONOMER.

Gene expression databases

Genevestigator: Q8GY58.

Family and domain databases

Gene3D: 1.50.10.10. 1 hit.
InterPro: IPR008928. 6-hairpin_glycosidase-like.
IPR012341. 6hp_glycosidase.
IPR001701. Glyco_hydro_9.
Pfam: PF00759. Glyco_hydro_9. 1 hit.
SUPFAM: SSF48208. SSF48208. 1 hit.

Entry information

Entry name: GUN23_ARATH
Accession
Primary (citable) accession number: Q8GY58
Secondary accession number(s): Q9SVJ3
Entry history
Integrated into UniProtKB/Swiss-Prot: September 5, 2006
Last sequence update: September 5, 2006
Last modified: June 11, 2014
This is version 76 of the entry and version 2 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Plant Protein Annotation Program

Relevant documents

SIMILARITY comments: Index of protein domains and families

Glycosyl hydrolases: Classification of glycosyl hydrolase families and list of entries

Arabidopsis thaliana: entries and gene names