ID GUN19_ARATH Reviewed; 626 AA. AC Q8L7I0; O82513; DT 05-SEP-2006, integrated into UniProtKB/Swiss-Prot. DT 01-OCT-2002, sequence version 1. DT 16-JUN-2009, entry version 46. DE RecName: Full=Endoglucanase 19; DE EC=3.2.1.4; DE AltName: Full=Endo-1,4-beta glucanase 19; DE Flags: Precursor; GN OrderedLocusNames=At4g11050; ORFNames=F2P3.1, T22B4.30; OS Arabidopsis thaliana (Mouse-ear cress). OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliophyta; eudicotyledons; core eudicotyledons; OC rosids; eurosids II; Brassicales; Brassicaceae; Arabidopsis. OX NCBI_TaxID=3702; RN [1] RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RX MEDLINE=20083488; PubMed=10617198; DOI=10.1038/47134; RA Mayer K.F.X., Schueller C., Wambutt R., Murphy G., Volckaert G., RA Pohl T., Duesterhoeft A., Stiekema W., Entian K.-D., Terryn N., RA Harris B., Ansorge W., Brandt P., Grivell L.A., Rieger M., RA Weichselgartner M., de Simone V., Obermaier B., Mache R., Mueller M., RA Kreis M., Delseny M., Puigdomenech P., Watson M., Schmidtheini T., RA Reichert B., Portetelle D., Perez-Alonso M., Boutry M., Bancroft I., RA Vos P., Hoheisel J., Zimmermann W., Wedler H., Ridley P., RA Langham S.-A., McCullagh B., Bilham L., Robben J., RA van der Schueren J., Grymonprez B., Chuang Y.-J., Vandenbussche F., RA Braeken M., Weltjens I., Voet M., Bastiaens I., Aert R., Defoor E., RA Weitzenegger T., Bothe G., Ramsperger U., Hilbert H., Braun M., RA Holzer E., Brandt A., Peters S., van Staveren M., Dirkse W., RA Mooijman P., Klein Lankhorst R., Rose M., Hauf J., Koetter P., RA Berneiser S., Hempel S., Feldpausch M., Lamberth S., Van den Daele H., RA De Keyser A., Buysshaert C., Gielen J., Villarroel R., De Clercq R., RA van Montagu M., Rogers J., Cronin A., Quail M.A., Bray-Allen S., RA Clark L., Doggett J., Hall S., Kay M., Lennard N., McLay K., Mayes R., RA Pettett A., Rajandream M.A., Lyne M., Benes V., Rechmann S., RA Borkova D., Bloecker H., Scharfe M., Grimm M., Loehnert T.-H., RA Dose S., de Haan M., Maarse A.C., Schaefer M., Mueller-Auer S., RA Gabel C., Fuchs M., Fartmann B., Granderath K., Dauner D., Herzl A., RA Neumann S., Argiriou A., Vitale D., Liguori R., Piravandi E., RA Massenet O., Quigley F., Clabauld G., Muendlein A., Felber R., RA Schnabl S., Hiller R., Schmidt W., Lecharny A., Aubourg S., RA Chefdor F., Cooke R., Berger C., Monfort A., Casacuberta E., RA Gibbons T., Weber N., Vandenbol M., Bargues M., Terol J., Torres A., RA Perez-Perez A., Purnelle B., Bent E., Johnson S., Tacon D., Jesse T., RA Heijnen L., Schwarz S., Scholler P., Heber S., Francs P., Bielke C., RA Frishman D., Haase D., Lemcke K., Mewes H.-W., Stocker S., RA Zaccaria P., Bevan M., Wilson R.K., de la Bastide M., Habermann K., RA Parnell L., Dedhia N., Gnoj L., Schutz K., Huang E., Spiegel L., RA Sekhon M., Murray J., Sheet P., Cordes M., Abu-Threideh J., RA Stoneking T., Kalicki J., Graves T., Harmon G., Edwards J., RA Latreille P., Courtney L., Cloud J., Abbott A., Scott K., Johnson D., RA Minx P., Bentley D., Fulton B., Miller N., Greco T., Kemp K., RA Kramer J., Fulton L., Mardis E., Dante M., Pepin K., Hillier L.W., RA Nelson J., Spieth J., Ryan E., Andrews S., Geisel C., Layman D., RA Du H., Ali J., Berghoff A., Jones K., Drone K., Cotton M., Joshu C., RA Antonoiu B., Zidanic M., Strong C., Sun H., Lamar B., Yordan C., RA Ma P., Zhong J., Preston R., Vil D., Shekher M., Matero A., Shah R., RA Swaby I.K., O'Shaughnessy A., Rodriguez M., Hoffman J., Till S., RA Granat S., Shohdy N., Hasegawa A., Hameed A., Lodhi M., Johnson A., RA Chen E., Marra M.A., Martienssen R., McCombie W.R.; RT "Sequence and analysis of chromosome 4 of the plant Arabidopsis RT thaliana."; RL Nature 402:769-777(1999). RN [2] RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA]. RC STRAIN=cv. Columbia; RX MEDLINE=22954850; PubMed=14593172; DOI=10.1126/science.1088305; RA Yamada K., Lim J., Dale J.M., Chen H., Shinn P., Palm C.J., RA Southwick A.M., Wu H.C., Kim C.J., Nguyen M., Pham P.K., Cheuk R.F., RA Karlin-Newmann G., Liu S.X., Lam B., Sakano H., Wu T., Yu G., RA Miranda M., Quach H.L., Tripp M., Chang C.H., Lee J.M., Toriumi M.J., RA Chan M.M., Tang C.C., Onodera C.S., Deng J.M., Akiyama K., Ansari Y., RA Arakawa T., Banh J., Banno F., Bowser L., Brooks S.Y., Carninci P., RA Chao Q., Choy N., Enju A., Goldsmith A.D., Gurjal M., Hansen N.F., RA Hayashizaki Y., Johnson-Hopson C., Hsuan V.W., Iida K., Karnes M., RA Khan S., Koesema E., Ishida J., Jiang P.X., Jones T., Kawai J., RA Kamiya A., Meyers C., Nakajima M., Narusaka M., Seki M., Sakurai T., RA Satou M., Tamse R., Vaysberg M., Wallender E.K., Wong C., Yamamura Y., RA Yuan S., Shinozaki K., Davis R.W., Theologis A., Ecker J.R.; RT "Empirical analysis of transcriptional activity in the Arabidopsis RT genome."; RL Science 302:842-846(2003). RN [3] RP GENE FAMILY. RX PubMed=15170254; DOI=10.1007/s00239-003-2571-x; RA Libertini E., Li Y., McQueen-Mason S.J.; RT "Phylogenetic analysis of the plant endo-beta-1,4-glucanase gene RT family."; RL J. Mol. Evol. 58:506-515(2004). CC -!- CATALYTIC ACTIVITY: Endohydrolysis of (1->4)-beta-D-glucosidic CC linkages in cellulose, lichenin and cereal beta-D-glucans. CC -!- SUBCELLULAR LOCATION: Secreted (By similarity). CC -!- SIMILARITY: Belongs to the glycosyl hydrolase 9 (cellulase E) CC family. CC -!- SEQUENCE CAUTION: CC Sequence=AAC35539.1; Type=Erroneous gene model prediction; CC Sequence=CAB43040.1; Type=Erroneous gene model prediction; CC Sequence=CAB81206.1; Type=Erroneous gene model prediction; CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; AF080120; AAC35539.1; ALT_SEQ; Genomic_DNA. DR EMBL; AL049876; CAB43040.1; ALT_SEQ; Genomic_DNA. DR EMBL; AL161518; CAB81206.1; ALT_SEQ; Genomic_DNA. DR EMBL; AY133685; AAM91619.1; -; mRNA. DR IPI; IPI00523137; -. DR PIR; T01929; T01929. DR RefSeq; NP_192843.2; -. DR UniGene; At.33589; -. DR HSSP; Q9EYQ2; 1IA6. DR CAZy; CBM49; Carbohydrate-Binding Module Family 49. DR CAZy; GH9; Glycoside Hydrolase Family 9. DR GeneID; 826706; -. DR GenomeReviews; CT486007_GR; AT4G11050. DR KEGG; ath:AT4G11050; -. DR NMPDR; fig|3702.1.peg.18770; -. DR TAIR; At4g11050; -. DR OMA; Q8L7I0; YSANMAM. DR BRENDA; 3.2.1.4; 302. DR GermOnline; AT4G11050; Arabidopsis thaliana. DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-SubCell. DR GO; GO:0030246; F:carbohydrate binding; IEA:InterPro. DR GO; GO:0008810; F:cellulase activity; IEA:EC. DR GO; GO:0007047; P:cell wall organization; IEA:UniProtKB-KW. DR GO; GO:0030245; P:cellulose catabolic process; IEA:UniProtKB-KW. DR InterPro; IPR012341; 6hp_glycosidase. DR InterPro; IPR019028; CBM_49. DR InterPro; IPR001701; Glyco_hydro_9. DR InterPro; IPR018221; Glyco_hydro_9_AS. DR Gene3D; G3DSA:1.50.10.10; CelA/Cel48F_cat; 1. DR PANTHER; PTHR22298:SF3; Glyco_hydro_9; 1. DR Pfam; PF09478; CBM49; 1. DR Pfam; PF00759; Glyco_hydro_9; 1. DR PROSITE; PS00592; GLYCOSYL_HYDROL_F9_1; 1. DR PROSITE; PS00698; GLYCOSYL_HYDROL_F9_2; 1. PE 2: Evidence at transcript level; KW Carbohydrate metabolism; Cell wall biogenesis/degradation; KW Cellulose degradation; Complete proteome; Glycoprotein; Glycosidase; KW Hydrolase; Polysaccharide degradation; Secreted; Signal. FT SIGNAL 1 23 Potential. FT CHAIN 24 626 Endoglucanase 19. FT /FTId=PRO_0000249271. FT ACT_SITE 412 412 By similarity. FT ACT_SITE 464 464 By similarity. FT ACT_SITE 473 473 By similarity. FT CARBOHYD 560 560 N-linked (GlcNAc...) (Potential). FT CARBOHYD 622 622 N-linked (GlcNAc...) (Potential). SQ SEQUENCE 626 AA; 69345 MW; 6F7457C97EEF655F CRC64; MGSRTTISIL VVLLLGLVQL AISGHDYKQA LSKSILFFEA QRSGHLPPNQ RVSWRSHSGL YDGKSSGVDL VGGYYDAGDN VKFGLPMAFT VTTMCWSIIE YGGQLESNGE LGHAIDAVKW GTDYFIKAHP EPNVLYGEVG DGKSDHYCWQ RPEEMTTDRR AYKIDRNNPG SDLAGETAAA MAAASIVFRR SDPSYSAELL RHAHQLFEFA DKYRGKYDSS ITVAQKYYRS VSGYNDELLW AAAWLYQATN DKYYLDYLGK NGDSMGGTGW SMTEFGWDVK YAGVQTLVAK VLMQGKGGEH TAVFERYQQK AEQFMCSLLG KSTKNIKKTP GGLIFRQSWN NMQFVTSASF LATVYSDYLS YSKRDLLCSQ GNISPSQLLE FSKSQVDYIL GDNPRATSYM VGYGENYPRQ VHHRGSSIVS FNVDQKFVTC RGGYATWFSR KGSDPNVLTG ALVGGPDAYD NFADQRDNYE QTEPATYNNA PLLGVLARLI SGSTGFDQLL PGVSPTPSPV IIKPAPVPQR KPTKPPAASS PSPITISQKM TNSWKNEGKV YYRYSTILTN RSTKTLKILK ISITKLYGPI WGVTKTGNSF SFPSWMQSLP SGKSMEFVYI HSASPADVLV SNYSLE //