Skip Header

 
Contribute Send feedback
Read comments (0) or add your own

Reviewed, UniProtKB/Swiss-Prot O04478 (GUN7_ARATH)

Last modified November 3, 2009. Version 60. Feed History...

Clusters with 100%, 90%, 50% identity | Documents (3) | Third-party data | Customize display text xml rdf/xml gff fasta
Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Sequence annotation (Features) · Sequences · References · Cross-references · Entry information · Relevant documents

Names and origin

Protein namesRecommended name:
    Endoglucanase 7
    EC=3.2.1.4
Alternative name(s):
    Endo-1,4-beta glucanase 7
Gene names
Name: KOR2
Ordered Locus Names: At1g65610
ORF Names: F5I14.14
OrganismArabidopsis thaliana (Mouse-ear cress) [Complete proteome]
Taxonomic identifier3702 [NCBI]
Taxonomic lineageEukaryotaViridiplantaeStreptophytaEmbryophytaTracheophytaSpermatophytaMagnoliophytaeudicotyledonscore eudicotyledonsrosidseurosids IIBrassicalesBrassicaceaeArabidopsis

Protein attributes

Sequence length623 AA.
Sequence statusComplete.
Sequence processingThe displayed sequence is not processed.
Protein existenceEvidence at transcript level.

General annotation (Comments)

Catalytic activity

Endohydrolysis of (1->4)-beta-D-glucosidic linkages in cellulose, lichenin and cereal beta-D-glucans.

Subcellular location

Membrane; Single-pass type II membrane protein Potential.

Tissue specificity

Expressed in basal region of leaf blade and proximal parts of leaf and floral organ. Ref.3

Developmental stage

Early expressed in the development of root hairs whithin the root differentiation zone. Expressed late in the development of leaf trichomes when the stalks and branches are expanded. Ref.3

Sequence similarities

Belongs to the glycosyl hydrolase 9 (cellulase E) family.

Ontologies

Keywords
   Biological processCarbohydrate metabolism
Cell wall biogenesis/degradation
Cellulose degradation
Polysaccharide degradation
   Cellular componentMembrane
   DomainSignal-anchor
Transmembrane
   Molecular functionGlycosidase
Hydrolase
   PTMGlycoprotein
   Technical termComplete proteome
Gene Ontology (GO)
   Biological processcell wall organization

Inferred from electronic annotation. Source: UniProtKB-KW

cellulose catabolic process

Inferred from electronic annotation. Source: UniProtKB-KW

   Cellular componentintegral to membrane

Inferred from electronic annotation. Source: UniProtKB-SubCell

   Molecular functioncellulase activity

Inferred from electronic annotation. Source: EC

Complete GO annotation...

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Chain1 – 623623Endoglucanase 7
PRO_0000249260

Regions

Topological domain1 – 7979Cytoplasmic Potential
Transmembrane80 – 10021Signal-anchor for type II membrane protein Potential
Topological domain101 – 623523Extracellular Potential

Sites

Active site5171 By similarity
Active site5651 By similarity
Active site5741 By similarity

Amino acid modifications

Glycosylation1161N-linked (GlcNAc...) Potential
Glycosylation2211N-linked (GlcNAc...) Potential
Glycosylation3281N-linked (GlcNAc...) Potential
Glycosylation3491N-linked (GlcNAc...) Potential
Glycosylation4121N-linked (GlcNAc...) Potential
Glycosylation4291N-linked (GlcNAc...) Potential
Glycosylation4641N-linked (GlcNAc...) Potential
Glycosylation5481N-linked (GlcNAc...) Potential
Glycosylation5711N-linked (GlcNAc...) Potential

Sequences

Sequence LengthMass (Da)Tools
O04478-1 [UniParc].

Last modified July 1, 1997. Version 1.
Checksum: 8186D6B5497D1FD4

FASTA62369,836
        10         20         30         40         50         60 
MHPGNVWGGS LDAVDSDRIA AEEEERLRNT TEWDRGAIHS QRSELDETQQ GWLLAPQDNW 

        70         80         90        100        110        120 
RKKKKKYVNL GCVSVSRTVF LWTVGSIAVL FLVVALPIII VKSLPRHKSA PPPPDNYTLA 

       130        140        150        160        170        180 
LHKALQFFDA QKSGKLPKKN KVSWRGDSGT KDGLPDVVGG LVGGYYDGGS NVKFHFPMAF 

       190        200        210        220        230        240 
SMTMLSWSLI EYSHKYKAID EYDHMRDVLK WGTDYLLLTF NNSATRLDHI YTQVGGGLRD 

       250        260        270        280        290        300 
SESPDDIYCW QKPEDMSYDR PVLSSTSAAD LGAEVSAALA AASIVFTDKP DYAKKLKKGA 

       310        320        330        340        350        360 
ETLYPFFRSK SRRKRYSDGQ PTAQAFYNST SMFDEFMWAG AWLYYATGNK TYIQFATTPS 

       370        380        390        400        410        420 
VPQTAKAFAN RPELMVPSWN NKLPGAMLLM TRYRLFLNPG FPYENMLNRY HNATGITMCA 

       430        440        450        460        470        480 
YLKQYNVFNR TSGGLMQLNL GKPRPLEYVA HASFLASLFA DYLNSTGVPG WYCGPTFVEN 

       490        500        510        520        530        540 
HVLKDFAQSQ IDYILGDNPL KMSYVVGFGK KFPRRVHHRG ATIPNDKKRR SCREGLKYRD 

       550        560        570        580        590        600 
TKNPNPNNIT GAMVGGPNKF DEFHDLRNNY NASEPTLSGN AGLVAALVSL TSSGGQQIDK 

       610        620 
NTMFNSVPPL YSPTPPPPKA WKP 

« Hide

References

« Hide 'large scale' references
[1]"Sequence and analysis of chromosome 1 of the plant Arabidopsis thaliana."
Theologis A., Ecker J.R., Palm C.J., Federspiel N.A., Kaul S., White O., Alonso J., Altafi H., Araujo R., Bowman C.L., Brooks S.Y., Buehler E., Chan A., Chao Q., Chen H., Cheuk R.F., Chin C.W., Chung M.K. expand/collapse author list , Conn L., Conway A.B., Conway A.R., Creasy T.H., Dewar K., Dunn P., Etgu P., Feldblyum T.V., Feng J.-D., Fong B., Fujii C.Y., Gill J.E., Goldsmith A.D., Haas B., Hansen N.F., Hughes B., Huizar L., Hunter J.L., Jenkins J., Johnson-Hopson C., Khan S., Khaykin E., Kim C.J., Koo H.L., Kremenetskaia I., Kurtz D.B., Kwan A., Lam B., Langin-Hooper S., Lee A., Lee J.M., Lenz C.A., Li J.H., Li Y.-P., Lin X., Liu S.X., Liu Z.A., Luros J.S., Maiti R., Marziali A., Militscher J., Miranda M., Nguyen M., Nierman W.C., Osborne B.I., Pai G., Peterson J., Pham P.K., Rizzo M., Rooney T., Rowley D., Sakano H., Salzberg S.L., Schwartz J.R., Shinn P., Southwick A.M., Sun H., Tallon L.J., Tambunga G., Toriumi M.J., Town C.D., Utterback T., Van Aken S., Vaysberg M., Vysotskaia V.S., Walker M., Wu D., Yu G., Fraser C.M., Venter J.C., Davis R.W.
Nature 408:816-820(2000) [PubMed: 11130712] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: cv. Columbia.
[2]"Large-scale analysis of RIKEN Arabidopsis full-length (RAFL) cDNAs."
Totoki Y., Seki M., Ishida J., Nakajima M., Enju A., Kamiya A., Narusaka M., Shin-i T., Nakagawa M., Sakamoto N., Oishi K., Kohara Y., Kobayashi M., Toyoda A., Sakaki Y., Sakurai T., Iida K., Akiyama K. expand/collapse author list , Satou M., Toyoda T., Konagaya A., Carninci P., Kawai J., Hayashizaki Y., Shinozaki K.
Submitted (JUL-2006) to the EMBL/GenBank/DDBJ databases
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
Strain: cv. Columbia.
[3]"Two Arabidopsis thaliana genes, KOR2 and KOR3, which encode membrane-anchored endo-1,4-beta-D-glucanases, are differentially expressed in developing leaf trichomes and their support cells."
Moelhoej M., Joergensen B., Ulvskov P., Borkhardt B.
Plant Mol. Biol. 46:263-275(2001) [PubMed: 11488474] [Abstract]
Cited for: TISSUE SPECIFICITY, DEVELOPMENTAL STAGE.
[4]"Phylogenetic analysis of the plant endo-beta-1,4-glucanase gene family."
Libertini E., Li Y., McQueen-Mason S.J.
J. Mol. Evol. 58:506-515(2004) [PubMed: 15170254] [Abstract]
Cited for: GENE FAMILY.
+Additional computationally mapped references.

Cross-references

Sequence databases

AC001229 Genomic DNA. Translation: AAB60922.1.
AK229402 mRNA. Translation: BAF01264.1.
IPIIPI00531295.
PIRB96681.
RefSeqNP_176738.1.
UniGeneAt.35864

3D structure databases

HSSPHSSP built from PDB template 1IA6 based on UniProtKB Q9EYQ2.
ModBaseSearch...

Protein family/group databases

CAZyGH9. Glycoside Hydrolase Family 9.

Proteomic databases

PRIDEO04478.

Genome annotation databases

GeneID842872.
GenomeReviewsGene locus AT1G65610 in contig CT485782_GR.
KEGGath:AT1G65610.
NMPDRfig|3702.1.peg.5984.

Organism-specific databases

TAIRAt1g65610.

Phylogenomic databases

OMANNITGAM.

Enzyme and pathway databases

BRENDA3.2.1.4. 302.

Gene expression databases

GenevestigatorO04478.
GermOnlineAT1G65610. Arabidopsis thaliana.

Family and domain databases

InterProIPR012341. 6hp_glycosidase.
IPR001701. Glyco_hydro_9.
IPR018221. Glyco_hydro_9_AS.
[Graphical view]
Gene3DG3DSA:1.50.10.10. CelA/Cel48F_cat. 1 hit.
PANTHERPTHR22298:SF3. Glyco_hydro_9. 1 hit.
PfamPF00759. Glyco_hydro_9. 1 hit.
[Graphical view]
PROSITEPS00592. GLYCOSYL_HYDROL_F9_1. 1 hit.
PS00698. GLYCOSYL_HYDROL_F9_2. 1 hit.
[Graphical view]
ProtoNetSearch...

Entry information

Entry nameGUN7_ARATH
AccessionPrimary (citable) accession number: O04478
Entry history
Integrated into UniProtKB/Swiss-Prot: September 19, 2006
Last sequence update: July 1, 1997
Last modified: November 3, 2009
This is version 60 of the entry and version 1 of the sequence. [Complete history]
Entry statusReviewed (UniProtKB/Swiss-Prot)
Annotation projectPPAP (Plant Proteome Annotation Project)

Relevant documents

Arabidopsis thaliana

Arabidopsis thaliana: entries and gene names

Glycosyl hydrolases

Classification of glycosyl hydrolase families and list of entries

SIMILARITY comments

Index of protein domains and families

Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Sequence annotation (Features) · Sequences · References · Cross-references · Entry information · Relevant documents