Skip Header

Contribute Send feedback
Read comments (?) or add your own

Q9ZUM9 (ASHR2_ARATH) Reviewed, UniProtKB/Swiss-Prot

Last modified September 21, 2011. Version 66. Feed History...

Clusters with 100%, 90%, 50% identity | Documents (2) | Third-party data text xml rdf/xml gff fasta
to top of pageNames·Attributes·General annotation·Ontologies·Alt products·Sequence annotation·Sequences·References·Cross-refs·Entry info·DocumentsCustomize order

Names and origin

Protein namesRecommended name:
Histone-lysine N-methyltransferase ASHR2

EC=2.1.1.43
Alternative name(s):
ASH1-related protein 2
Protein SET DOMAIN GROUP 39
Gene names
Name:ASHR2
Synonyms:SDG39, SET39
Ordered Locus Names:At2g19640
ORF Names:F3P11.24
OrganismArabidopsis thaliana (Mouse-ear cress)
Taxonomic identifier3702 [NCBI]
Taxonomic lineageEukaryotaViridiplantaeStreptophytaEmbryophytaTracheophytaSpermatophytaMagnoliophytaeudicotyledonscore eudicotyledonsrosidsmalvidsBrassicalesBrassicaceaeCamelineaeArabidopsis

Protein attributes

Sequence length398 AA.
Sequence statusComplete.
Protein existenceEvidence at transcript level

General annotation (Comments)

Function

Histone methyltransferase By similarity.

Catalytic activity

S-adenosyl-L-methionine + L-lysine-[histone] = S-adenosyl-L-homocysteine + N(6)-methyl-L-lysine-[histone].

Subcellular location

Nucleus By similarity. Chromosome By similarity.

Sequence similarities

Belongs to the histone-lysine methyltransferase family. SET2 subfamily.

Contains 1 SET domain.

Ontologies

Keywords
   Cellular componentChromosome
Nucleus
   Coding sequence diversityAlternative splicing
   LigandS-adenosyl-L-methionine
   Molecular functionChromatin regulator
Methyltransferase
Transferase
   Technical termComplete proteome
Reference proteome
Gene Ontology (GO)
   Cellular componentchromosome

Inferred from electronic annotation. Source: UniProtKB-SubCell

nucleus

Inferred from electronic annotation. Source: UniProtKB-SubCell

   Molecular functionhistone-lysine N-methyltransferase activity

Inferred from electronic annotation. Source: EC

Complete GO annotation...

Alternative products

This entry describes 2 isoforms produced by alternative splicing. [Align] [Select]
Isoform 1 (identifier: Q9ZUM9-1)

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.
Isoform 2 (identifier: Q9ZUM9-2)

The sequence of this isoform differs from the canonical sequence as follows:
     330-341: GENEEEVCGNGV → DICVRRRIVLAL
     342-398: Missing.
Note: No experimental confirmation available.

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Chain1 – 398398Histone-lysine N-methyltransferase ASHR2
PRO_0000233375

Regions

Domain147 – 274128SET
Compositional bias53 – 184132Ser-rich
Compositional bias299 – 33537Glu-rich

Natural variations

Alternative sequence330 – 34112GENEE…CGNGV → DICVRRRIVLAL in isoform 2.
VSP_018134
Alternative sequence342 – 39857Missing in isoform 2.
VSP_018135

Sequences

Sequence LengthMass (Da)Tools
Isoform 1 [UniParc].

Last modified May 2, 2006. Version 3.
Checksum: 32CFC10C8B546F64

FASTA39843,761
        10         20         30         40         50         60 
MINDGGAKPE TLLRVAEIGG RGRSLVAAQS LRAGQVILRE SPLLLYSAFP FLSSSVSPYC 

        70         80         90        100        110        120 
DHCFRLLASS AHQKCQSCSL VSFCSPNCFA SHTPWLCESL RRLHQSSSSA FSDQPSDRQV 

       130        140        150        160        170        180 
QARFLLSAYN LAAASPSDFQ ILLSLQGSGS SNGDPSCSAG DSAAAGFLHS LLSSVCPSLP 

       190        200        210        220        230        240 
VSISPDLTAA LLSKDKVNAF GLMEPCSVSN EKRSVRAYGI YPKTSFFNHD CLPNACRFDY 

       250        260        270        280        290        300 
VDSASDGNTD IIIRMIHDVP EGREVCLSYF PVNMNYSSRQ KRLLEDYGFK CDCDRCKVEF 

       310        320        330        340        350        360 
SWSEGEEDEN EIMEEMEDQD EQEEMEDSVG ENEEEVCGNG VDDESNFPHA YFFVRYMCEK 

       370        380        390 
ENCFGTLAPL PPKTHDASRV LECNVCGSVK EDEVGVNQ 

« Hide

Isoform 2 [UniParc].

Checksum: A206767A5E2E3FD2
Show »

FASTA34137,532

References

« Hide 'large scale' references
[1]"Sequence and analysis of chromosome 2 of the plant Arabidopsis thaliana."
Lin X., Kaul S., Rounsley S.D., Shea T.P., Benito M.-I., Town C.D., Fujii C.Y., Mason T.M., Bowman C.L., Barnstead M.E., Feldblyum T.V., Buell C.R., Ketchum K.A., Lee J.J., Ronning C.M., Koo H.L., Moffat K.S., Cronin L.A. expand/collapse author list , Shen M., Pai G., Van Aken S., Umayam L., Tallon L.J., Gill J.E., Adams M.D., Carrera A.J., Creasy T.H., Goodman H.M., Somerville C.R., Copenhaver G.P., Preuss D., Nierman W.C., White O., Eisen J.A., Salzberg S.L., Fraser C.M., Venter J.C.
Nature 402:761-768(1999) [PubMed: 10617197] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: cv. Columbia.
[2]The Arabidopsis Information Resource (TAIR)
Submitted (APR-2011) to the EMBL/GenBank/DDBJ databases
Cited for: GENOME REANNOTATION.
Strain: cv. Columbia.
[3]"Empirical analysis of transcriptional activity in the Arabidopsis genome."
Yamada K., Lim J., Dale J.M., Chen H., Shinn P., Palm C.J., Southwick A.M., Wu H.C., Kim C.J., Nguyen M., Pham P.K., Cheuk R.F., Karlin-Newmann G., Liu S.X., Lam B., Sakano H., Wu T., Yu G. expand/collapse author list , Miranda M., Quach H.L., Tripp M., Chang C.H., Lee J.M., Toriumi M.J., Chan M.M., Tang C.C., Onodera C.S., Deng J.M., Akiyama K., Ansari Y., Arakawa T., Banh J., Banno F., Bowser L., Brooks S.Y., Carninci P., Chao Q., Choy N., Enju A., Goldsmith A.D., Gurjal M., Hansen N.F., Hayashizaki Y., Johnson-Hopson C., Hsuan V.W., Iida K., Karnes M., Khan S., Koesema E., Ishida J., Jiang P.X., Jones T., Kawai J., Kamiya A., Meyers C., Nakajima M., Narusaka M., Seki M., Sakurai T., Satou M., Tamse R., Vaysberg M., Wallender E.K., Wong C., Yamamura Y., Yuan S., Shinozaki K., Davis R.W., Theologis A., Ecker J.R.
Science 302:842-846(2003) [PubMed: 14593172] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORMS 1 AND 2).
Strain: cv. Columbia.
[4]"The Arabidopsis thaliana genome contains at least 29 active genes encoding SET domain proteins that can be assigned to four evolutionarily conserved classes."
Baumbusch L.O., Thorstensen T., Krauss V., Fischer A., Naumann K., Assalkhou R., Schulz I., Reuter G., Aalen R.B.
Nucleic Acids Res. 29:4319-4333(2001) [PubMed: 11691919] [Abstract]
Cited for: NOMENCLATURE.
+Additional computationally mapped references.

Cross-references

Sequence databases

EMBL
GenBank
DDBJ
AC005917 Genomic DNA. Translation: AAD10162.2.
CP002685 Genomic DNA. Translation: AEC06906.1.
CP002685 Genomic DNA. Translation: AEC06907.1.
AY034953 mRNA. Translation: AAK59459.1.
AY070027 mRNA. Translation: AAL47498.1.
BT004010 mRNA. Translation: AAO42047.1.
IPIIPI00516577.
IPI00545100.
PIRC84579.
RefSeqNP_565457.1. NM_127522.2.
NP_849991.1. NM_179660.1.
UniGeneAt.23821.

3D structure databases

ProteinModelPortalQ9ZUM9.
SMRQ9ZUM9. Positions 223-271.
ModBaseSearch...

Proteomic databases

PRIDEQ9ZUM9.

Protocols and materials databases

StructuralBiologyKnowledgebaseSearch...

Genome annotation databases

EnsemblPlantsAT2G19640.2; AT2G19640.2; AT2G19640.
GeneID816483.
GenomeReviewsGene locus AT2G19640 in contig CT485783_GR.
KEGGath:AT2G19640.
NMPDRfig|3702.1.peg.8958.

Organism-specific databases

TAIRAt2g19640.

Phylogenomic databases

eggNOGKOG2084.
GeneTreeEPGT00070000030233.
HOGENOMHBG601688.
InParanoidQ9ZUM9.
OMADCLPNAC.
PhylomeDBQ9ZUM9.
ProtClustDBCLSN2688255.

Gene expression databases

GenevestigatorQ9ZUM9.
GermOnlineAT2G19640. Arabidopsis thaliana.

Family and domain databases

InterProIPR001214. SET_dom.
[Graphical view]
PfamPF00856. SET. 1 hit.
[Graphical view]
SMARTSM00317. SET. 1 hit.
[Graphical view]
PROSITEPS50280. SET. 1 hit.
[Graphical view]
ProtoNetSearch...

Entry information

Entry nameASHR2_ARATH
AccessionPrimary (citable) accession number: Q9ZUM9
Secondary accession number(s): Q84WB9, Q94CD2
Entry history
Integrated into UniProtKB/Swiss-Prot: May 2, 2006
Last sequence update: May 2, 2006
Last modified: September 21, 2011
This is version 66 of the entry and version 3 of the sequence. [Complete history]
Entry statusReviewed (UniProtKB/Swiss-Prot)
Annotation programPlant Protein Annotation Program

Relevant documents

Arabidopsis thaliana

Arabidopsis thaliana: entries and gene names

SIMILARITY comments

Index of protein domains and families