Skip Header

 
Contribute Send feedback
Read comments (0) or add your own

Reviewed, UniProtKB/Swiss-Prot Q2LAE1 (ASHH2_ARATH)

Last modified June 16, 2009. Version 33. Feed History...

Clusters with 100%, 90%, 50% identity | Documents (2) | Third-party data | Customize display text xml rdf/xml gff fasta
Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Alternative products · Sequence annotation (Features) · Sequences · References · Cross-references · Entry information · Relevant documents

Names and origin

Protein namesRecommended name:
    Histone-lysine N-methyltransferase ASHH2
    EC=2.1.1.43
Alternative name(s):
    H3-K4-HMTase
    Histone H3-K36 methyltransferase 8
    H3-K36-HMTase
    ASH1 homolog 2
    Protein EARLY FLOWERING IN SHORT DAYS
    Protein SET DOMAIN GROUP 8
Gene names
Name: ASHH2
Synonyms: EFS, SDG8, SET8
Ordered Locus Names: At1g77300
ORF Names: T14N5.15
OrganismArabidopsis thaliana (Mouse-ear cress) [Complete proteome]
Taxonomic identifier3702 [NCBI]
Taxonomic lineageEukaryotaViridiplantaeStreptophytaEmbryophytaTracheophytaSpermatophytaMagnoliophytaeudicotyledonscore eudicotyledonsrosidseurosids IIBrassicalesBrassicaceaeArabidopsis

Protein attributes

Sequence length1759 AA.
Sequence statusComplete.
Sequence processingThe displayed sequence is not processed.
Protein existenceEvidence at transcript level.

General annotation (Comments)

Function

Histone methyltransferase. Methylates 'Lys-4' and 'Lys-36' of histone H3. H3 'Lys-4' and 'Lys-36' methylations represent specific tags for epigenetic transcriptional activation. Regulates positively FLC transcription to prevent early flowering transition. Required for flowering transition in response to vernalization. Seems also to modulate several traits including floral organ size, root size and dormancy. Promotes apical dominance. Ref.1 Ref.5 Ref.6

Catalytic activity

S-adenosyl-L-methionine + histone L-lysine = S-adenosyl-L-homocysteine + histone N(6)-methyl-L-lysine.

Subcellular location

Nucleus Probable. Centromere Probable. Note: Associates with centromeric constitutive heterochromatin Probable.

Tissue specificity

Ubiquitous, with higher levels in young tissues, including shoot and root apex. Ref.1 Ref.6

Sequence similarities

Belongs to the histone-lysine methyltransferase family. SET2 subfamily.

Contains 1 AWS domain.

Contains 1 CW-type zinc finger.

Contains 1 post-SET domain.

Contains 1 SET domain.

Sequence caution

The sequence AAC34358.1 differs from that shown. Reason: Erroneous gene model prediction.

Alternative products

This entry describes 2 isoforms produced by alternative splicing. [Align] [Select]
Isoform 1 (identifier: Q2LAE1-1)

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.
Isoform 2 (identifier: Q2LAE1-2)

The sequence of this isoform differs from the canonical sequence as follows:
     1447-1447: G → GLQMLHNIMKQYRGDFKRIPIIRKLLKVLEYLATRKILALEHIIRRP
Note: No experimental confirmation available.

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Chain1 – 17591759Histone-lysine N-methyltransferase ASHH2
PRO_0000233371

Regions

Domain974 – 102451AWS
Domain1025 – 1147123SET
Domain1151 – 116717Post-SET
Zinc finger859 – 91254CW-type

Natural variations

Alternative sequence14471G → GLQMLHNIMKQYRGDFKRIP IIRKLLKVLEYLATRKILAL EHIIRRP in isoform 2.
VSP_018133

Experimental info

Sequence conflict14261I → T in BAD94318. Ref.3

Sequences

Sequence LengthMass (Da)Tools
Isoform 1 [UniParc].

Last modified February 21, 2006. Version 1.
Checksum: 5616BE25ECEF2155

FASTA1,759193,228
        10         20         30         40         50         60 
MDCKENGVGD ASGCNIDANS LASNLAMNTN EDFYEKLSSR GQNLDSVSSL EIPQTASSVN 

        70         80         90        100        110        120 
HTIEGQRKCF TEIEQMGYGN SNSQEDAGNT DDDLYVCYNA DDTQEQGVVS GELEQSQELI 

       130        140        150        160        170        180 
CDTDLLVNCN KLDDGKESQD TNVSLVSIFS GSMQEKEAPQ AKEDEGYGGT TLPIGGSGID 

       190        200        210        220        230        240 
TESTFVNDAP EQFESLETTK HIKPDEVESD GISYRFDDGG KEGRNGPSSD LDTGSSDDIS 

       250        260        270        280        290        300 
LSQSFSFPDS LLDSSVFGCS ATESYLEDAI DIEGNGTIVV SPSLAITEML NNDDGGLCSH 

       310        320        330        340        350        360 
DLNKITVTET INPDLKLVRE DRLDTDLSVM NEKMLKNHVG DSSSESAVAA LSMNNGMAAD 

       370        380        390        400        410        420 
LRAENFSQSS PIDEKTLDME ANSPITDSSL IWNFPLNFGS GGIEVCNPEN AVEPLRIVDD 

       430        440        450        460        470        480 
NGRIGGEVAS ASGSDFCEAG MSSSRRKARD GKQCKVVQTK TSARHLRKSS RKKQSERDIE 

       490        500        510        520        530        540 
SIFKCSKQKR SSLLKTSRSS EWGLPSKTTE IFLQSNNIPY DGPPHHEPQR SQGNLNNGEH 

       550        560        570        580        590        600 
NRSSHNGNVE GSNRNIQASS GSCLRLKVKF GKSGGQNPLN ITVSKVSGNS LPGNGIVKAG 

       610        620        630        640        650        660 
TCLELPGSAH FGEDKMQTVE TKEDLVEKSN PVEKVSYLQS SDSMRDKKYN QDAGGLCRKV 

       670        680        690        700        710        720 
GGDVLDDDPH LSSIRMVEEC ERATGTQSLD AETSPDSEVI NSVPDSIVNI EHKEGLHHGF 

       730        740        750        760        770        780 
FSTPEDVVKK NRVLEKEDEL RASKSPSENG SHLIPNAKKA KHPKSKSNGT KKGKSKFSES 

       790        800        810        820        830        840 
AKDGRKNESH EGVEQRKSLN TSMGRDDSDY PEVGRIESHK TTGALLDADI GKTSATYGTI 

       850        860        870        880        890        900 
SSDVTHGEMV VDVTIEDSYS TESAWVRCDD CFKWRRIPAS VVGSIDESSR WICMNNSDKR 

       910        920        930        940        950        960 
FADCSKSQEM SNEEINEELG IGQDEADAYD CDAAKRGKEK EQKSKRLTGK QKACFKAIKT 

       970        980        990       1000       1010       1020 
NQFLHRNRKS QTIDEIMVCH CKPSPDGRLG CGEECLNRML NIECLQGTCP AGDLCSNQQF 

      1030       1040       1050       1060       1070       1080 
QKRKYVKFER FQSGKKGYGL RLLEDVREGQ FLIEYVGEVL DMQSYETRQK EYAFKGQKHF 

      1090       1100       1110       1120       1130       1140 
YFMTLNGNEV IDAGAKGNLG RFINHSCEPN CRTEKWMVNG EICVGIFSMQ DLKKGQELTF 

      1150       1160       1170       1180       1190       1200 
DYNYVRVFGA AAKKCYCGSS HCRGYIGGDP LNGDVIIQSD SDEEYPELVI LDDDESGEGI 

      1210       1220       1230       1240       1250       1260 
LGATSRTFTD DADEQMPQSF EKVNGYKDLA PDNTQTQSSV SVKLPEREIP PPLLQPTEVL 

      1270       1280       1290       1300       1310       1320 
KELSSGISIT AVQQEVPAEK KTKSTSPTSS SLSRMSPGGT NSDKTTKHGS GEDKKILPRP 

      1330       1340       1350       1360       1370       1380 
RPRMKTSRSS ESSKRDKGGI YPGVNKAQVI PVNKLQQQPI KSKGSEKVSP SIETFEGKLN 

      1390       1400       1410       1420       1430       1440 
ELLDAVGGIS KRRDSAKGYL KLLLLTAASR GTDEEGIYSN RDLSMILDAL LKTKSKSVLV 

      1450       1460       1470       1480       1490       1500 
DIINKNGPFA GMESFKDSVL SFTEHDDYTV HNIARSFRDR WIPKHFRKPW RINREERSES 

      1510       1520       1530       1540       1550       1560 
MRSPINRRFR ASQEPRYDHQ SPRPAEPAAS VTSSKAATPE TASVSEGYSE PNSGLPETNG 

      1570       1580       1590       1600       1610       1620 
RKRKSRWDQP SKTKEQRIMT ILSQQTDETN GNQDVQDDLP PGFSSPCTDV PDAITAQPQQ 

      1630       1640       1650       1660       1670       1680 
KFLSRLPVSY GIPLSIVHQF GSPGKEDPTT WSVAPGMPFY PFPPLPPVSH GEFFAKRNVR 

      1690       1700       1710       1720       1730       1740 
ACSSSMGNLT YSNEILPATP VTDSTAPTRK RELFSSDIGT TYFRQQKQSV PPWLRNNGGE 

      1750 
KTANSPIPGN LTLEKKLNS 

« Hide

Isoform 2.

Checksum: 79D63F4C931B15D0
Show »

FASTA1,805198,851

References

« Hide 'large scale' references
[1]"Prevention of early flowering by expression of FLOWERING LOCUS C requires methylation of histone H3 K36."
Zhao Z., Yu Y., Meyer D., Wu C., Shen W.-H.
Nat. Cell Biol. 7:1256-1260(2005) [PubMed: 16299497] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM 1), FUNCTION, TISSUE SPECIFICITY.
[2]"Sequence and analysis of chromosome 1 of the plant Arabidopsis thaliana."
Theologis A., Ecker J.R., Palm C.J., Federspiel N.A., Kaul S., White O., Alonso J., Altafi H., Araujo R., Bowman C.L., Brooks S.Y., Buehler E., Chan A., Chao Q., Chen H., Cheuk R.F., Chin C.W., Chung M.K. expand/collapse author list , Conn L., Conway A.B., Conway A.R., Creasy T.H., Dewar K., Dunn P., Etgu P., Feldblyum T.V., Feng J.-D., Fong B., Fujii C.Y., Gill J.E., Goldsmith A.D., Haas B., Hansen N.F., Hughes B., Huizar L., Hunter J.L., Jenkins J., Johnson-Hopson C., Khan S., Khaykin E., Kim C.J., Koo H.L., Kremenetskaia I., Kurtz D.B., Kwan A., Lam B., Langin-Hooper S., Lee A., Lee J.M., Lenz C.A., Li J.H., Li Y.-P., Lin X., Liu S.X., Liu Z.A., Luros J.S., Maiti R., Marziali A., Militscher J., Miranda M., Nguyen M., Nierman W.C., Osborne B.I., Pai G., Peterson J., Pham P.K., Rizzo M., Rooney T., Rowley D., Sakano H., Salzberg S.L., Schwartz J.R., Shinn P., Southwick A.M., Sun H., Tallon L.J., Tambunga G., Toriumi M.J., Town C.D., Utterback T., Van Aken S., Vaysberg M., Vysotskaia V.S., Walker M., Wu D., Yu G., Fraser C.M., Venter J.C., Davis R.W.
Nature 408:816-820(2000) [PubMed: 11130712] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: cv. Columbia.
[3]"Large-scale analysis of RIKEN Arabidopsis full-length (RAFL) cDNAs."
Totoki Y., Seki M., Ishida J., Nakajima M., Enju A., Kamiya A., Narusaka M., Shin-i T., Nakagawa M., Sakamoto N., Oishi K., Kohara Y., Kobayashi M., Toyoda A., Sakaki Y., Sakurai T., Iida K., Akiyama K. expand/collapse author list , Satou M., Toyoda T., Konagaya A., Carninci P., Kawai J., Hayashizaki Y., Shinozaki K.
Submitted (MAR-2005) to the EMBL/GenBank/DDBJ databases
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 1299-1759 (ISOFORM 2).
Strain: cv. Columbia.
[4]"The Arabidopsis thaliana genome contains at least 29 active genes encoding SET domain proteins that can be assigned to four evolutionarily conserved classes."
Baumbusch L.O., Thorstensen T., Krauss V., Fischer A., Naumann K., Assalkhou R., Schulz I., Reuter G., Aalen R.B.
Nucleic Acids Res. 29:4319-4333(2001) [PubMed: 11691919] [Abstract]
Cited for: NOMENCLATURE.
[5]"The early-flowering mutant efs is involved in the autonomous promotion pathway of Arabidopsis thaliana."
Soppe W.J.J., Bentsink L., Koornneef M.
Development 126:4763-4770(1999) [PubMed: 10518493] [Abstract]
Cited for: FUNCTION.
[6]"Establishment of the vernalization-responsive, winter-annual habit in Arabidopsis requires a putative histone H3 methyl transferase."
Kim S.Y., He Y., Jacob Y., Noh Y.-S., Michaels S., Amasino R.
Plant Cell 17:3301-3310(2005) [PubMed: 16258034] [Abstract]
Cited for: FUNCTION, TISSUE SPECIFICITY.

Cross-references

Sequence databases

DQ340869 mRNA. Translation: ABC69038.1.
AC004260 Genomic DNA. Translation: AAC34358.1. Sequence problems.
AK221916 mRNA. Translation: BAD94318.1. Different initiation.
IPIIPI00542920.
IPI00759360.
PIRT00458.

3D structure databases

HSSPHSSP built from PDB template 1PEG based on UniProtKB Q8X225.
ModBaseSearch...

Proteomic databases

PRIDEQ2LAE1.

Genome annotation databases

GenomeReviewsGene locus AT1G77300 in contig CT485782_GR.

Organism-specific databases

TAIRAt1g77300.

Enzyme and pathway databases

BRENDA2.1.1.43. 302.

Gene expression databases

GermOnlineAT1G77300. Arabidopsis thaliana.

Family and domain databases

InterProIPR006560. AWS.
IPR003616. Post-SET_Zn_bd.
IPR001214. SET.
IPR011124. Znf_CW.
[Graphical view]
PfamPF00856. SET. 1 hit.
PF07496. zf-CW. 1 hit.
[Graphical view]
SMARTSM00570. AWS. 1 hit.
SM00508. PostSET. 1 hit.
SM00317. SET. 1 hit.
[Graphical view]
PROSITEPS51215. AWS. 1 hit.
PS50868. POST_SET. 1 hit.
PS50280. SET. 1 hit.
PS51050. ZF_CW. 1 hit.
[Graphical view]
ProtoNetSearch...

Entry information

Entry nameASHH2_ARATH
AccessionPrimary (citable) accession number: Q2LAE1
Secondary accession number(s): O80663, Q56WW4
Entry history
Integrated into UniProtKB/Swiss-Prot: May 2, 2006
Last sequence update: February 21, 2006
Last modified: June 16, 2009
This is version 33 of the entry and version 1 of the sequence. [Complete history]
Entry statusReviewed (UniProtKB/Swiss-Prot)
Annotation projectPPAP (Plant Proteome Annotation Project)

Relevant documents

Arabidopsis thaliana

Arabidopsis thaliana: entries and gene names

SIMILARITY comments

Index of protein domains and families

Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Alternative products · Sequence annotation (Features) · Sequences · References · Cross-references · Entry information · Relevant documents