O49139 (CMT1_ARATH) Reviewed, UniProtKB/Swiss-Prot

Last modified June 11, 2014. Version 104.

Names and origin

Protein names
Recommended name:
Putative DNA (cytosine-5)-methyltransferase CMT1

EC=2.1.1.37
Alternative name(s):
Chromomethylase 1
Protein CHROMOMETHYLASE 1
Gene names
Name: CMT1
Ordered Locus Names: At1g80740
ORF Names: F23A5.9
Organism: Arabidopsis thaliana (Mouse-ear cress) [Reference proteome]
Taxonomic identifier: 3702 [NCBI]
Taxonomic lineage: Eukaryota › Viridiplantae › Streptophyta › Embryophyta › Tracheophyta › Spermatophyta › Magnoliophyta › eudicotyledons › Gunneridae › Pentapetalae › rosids › malvids › Brassicales › Brassicaceae › Camelineae › Arabidopsis

Protein attributes

Sequence length: 791 AA.
Sequence status: Complete.
Protein existence: Uncertain

General annotation (Comments)

Function

May be involved in CpXpG methylation and in gene silencing (by similarity).

Catalytic activity

S-adenosyl-L-methionine + DNA = S-adenosyl-L-homocysteine + DNA containing 5-methylcytosine.

Subcellular location

Nucleus (by similarity).

Tissue specificity

Expressed in flowers. Not detected in leaves, roots, seedlings or plants prior to the formation of flower buds (Ref. 1).

Sequence similarities

Belongs to the class I-like SAM-binding methyltransferase superfamily. C5-methyltransferase family.

Contains 1 BAH domain.

Contains 1 chromo domain.

Contains 1 SAM-dependent MTase C5-type domain.

Caution

Could be the product of a pseudogene. The protein is severely truncated in several ecotypes and the gene even harbors a complete retrotransposon in 3 ecotypes.

Sequence caution

The sequence AAA98912.1 differs from that shown. Reason: Erroneous gene model prediction.

Alternative products

This entry describes 2 isoforms produced by alternative splicing.
Isoform 1 (identifier: O49139-1)

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.
Isoform 2 (identifier: O49139-2)

The sequence of this isoform differs from the canonical sequence as follows:
     568-570: VTN → CCR
     571-791: Missing.
Note: Alternative splice site used 50% of the time in cv. Columbia.
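
The isoform 2 description above can be read as a pair of positional edits on the canonical sequence. The following is a minimal sketch of that idea, not UniProt's own tooling: the helper name `apply_isoform_edits` is hypothetical, and the canonical sequence is replaced by a synthetic 791-residue placeholder in which only positions 568-570 (VTN) are taken from the entry.

```python
def apply_isoform_edits(canonical, edits):
    """Apply positional edits (1-based, inclusive) to a canonical sequence.

    Each edit is (start, end, replacement). An empty replacement means the
    range is missing in the isoform.
    """
    seq = canonical
    # Apply edits from the C-terminus backwards so that earlier
    # coordinates are not shifted by later deletions.
    for start, end, replacement in sorted(edits, reverse=True):
        seq = seq[:start - 1] + replacement + seq[end:]
    return seq


# Synthetic stand-in for the canonical sequence (assumption): "X" is a
# placeholder residue; only VTN at 568-570 comes from the entry.
canonical = "X" * 567 + "VTN" + "X" * 221

isoform2_edits = [
    (568, 570, "CCR"),  # 568-570: VTN -> CCR
    (571, 791, ""),     # 571-791: Missing
]

isoform2 = apply_isoform_edits(canonical, isoform2_edits)
print(len(isoform2), isoform2[-3:])
```

Under these assumptions the derived isoform is 570 residues long and ends in CCR, which agrees with the 570-residue length the entry reports for isoform 2.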

Sequence annotation (Features)

Feature key           Position(s)  Length  Description                                       Feature identifier

Molecule processing

Chain                 1 – 791      791     Putative DNA (cytosine-5)-methyltransferase CMT1  PRO_0000246691

Regions

Domain                79 – 199     121     BAH
Domain                225 – 768    544     SAM-dependent MTase C5-type
Domain                339 – 404    66      Chromo
Coiled coil           308 – 333    26      Potential

Sites

Active site           417          1       By similarity

Natural variations

Alternative sequence  568 – 570    3       VTN → CCR in isoform 2.                           VSP_019857
Alternative sequence  571 – 791    221     Missing in isoform 2.                             VSP_019858
Natural variant       148          1       D → G in strain: cv. Metz-0.
Natural variant       196          1       T → I in strain: cv. Kl-0.
Natural variant       198          1       A → T in strain: cv. No-0.
Natural variant       249 – 791    543     Missing in strain: cv. Metz-0.
Natural variant       560          1       D → V in strain: cv. Landsberg erecta, cv. No-0 and cv. RLD.
Natural variant       561 – 791    231     Missing in strain: cv. Landsberg erecta, cv. No-0 and cv. RLD.
Natural variant       604          1       F → C in strain: cv. Nd-1, Nd-0 and cv. Kl-0.
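
The length listed for each feature above is the inclusive span of its positions, that is end − start + 1. A short sketch that re-derives those lengths from the positions in this entry is given below. The tuple layout and the helper name `span_length` are illustrative choices, not part of any UniProt format.

```python
# (feature key, start, end, length as listed in the entry)
features = [
    ("Chain", 1, 791, 791),
    ("Domain BAH", 79, 199, 121),
    ("Domain SAM-dependent MTase C5-type", 225, 768, 544),
    ("Domain Chromo", 339, 404, 66),
    ("Coiled coil", 308, 333, 26),
    ("Active site", 417, 417, 1),
    ("Alternative sequence VSP_019857", 568, 570, 3),
    ("Alternative sequence VSP_019858", 571, 791, 221),
    ("Natural variant (Metz-0 truncation)", 249, 791, 543),
    ("Natural variant (Ler/No-0/RLD truncation)", 561, 791, 231),
]


def span_length(start, end):
    """Length of a 1-based, inclusive residue range."""
    return end - start + 1


# Collect any feature whose listed length disagrees with its positions.
mismatches = [
    (key, listed, span_length(start, end))
    for key, start, end, listed in features
    if span_length(start, end) != listed
]
print(mismatches)
```

An empty list means every listed length is consistent with its positions. The same check is a cheap way to confirm that fused position/length fields in a scraped feature table were split at the right digit.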

Sequences

Isoform 1 [UniParc].

Last modified July 25, 2006. Version 2.
Checksum: A5ECFDBC274B215C
Length: 791 AA. Mass: 89,219 Da.
        10         20         30         40         50         60 
MAARNKQKKR AEPESDLCFA GKPMSVVEST IRWPHRYQSK KTKLQAPTKK PANKGGKKED 

        70         80         90        100        110        120 
EEIIKQAKCH FDKALVDGVL INLNDDVYVT GLPGKLKFIA KVIELFEADD GVPYCRFRWY 

       130        140        150        160        170        180 
YRPEDTLIER FSHLVQPKRV FLSNDENDNP LTCIWSKVNI AKVPLPKITS RIEQRVIPPC 

       190        200        210        220        230        240 
DYYYDMKYEV PYLNFTSADD GSDASSSLSS DSALNCFENL HKDEKFLLDL YSGCGAMSTG 

       250        260        270        280        290        300 
FCMGASISGV KLITKWSVDI NKFACDSLKL NHPETEVRNE AAEDFLALLK EWKRLCEKFS 

       310        320        330        340        350        360 
LVSSTEPVES ISELEDEEVE ENDDIDEAST GAELEPGEFE VEKFLGIMFG DPQGTGEKTL 

       370        380        390        400        410        420 
QLMVRWKGYN SSYDTWEPYS GLGNCKEKLK EYVIDGFKSH LLPLPGTVYT VCGGPPCQGI 

       430        440        450        460        470        480 
SGYNRYRNNE APLEDQKNQQ LLVFLDIIDF LKPNYVLMEN VVDLLRFSKG FLARHAVASF 

       490        500        510        520        530        540 
VAMNYQTRLG MMAAGSYGLP QLRNRVFLWA AQPSEKLPPY PLPTHEVAKK FNTPKEFKDL 

       550        560        570        580        590        600 
QVGRIQMEFL KLDNALTLAD AISDLPPVTN YVANDVMDYN DAAPKTEFEN FISLKRSETL 

       610        620        630        640        650        660 
LPAFGGDPTR RLFDHQPLVL GDDDLERVSY IPKQKGANYR DMPGVLVHNN KAEINPRFRA 

       670        680        690        700        710        720 
KLKSGKNVVP AYAISFIKGK SKKPFGRLWG DEIVNTVVTR AEPHNQCVIH PMQNRVLSVR 

       730        740        750        760        770        780 
ENARLQGFPD CYKLCGTIKE KYIQVGNAVA VPVGVALGYA FGMASQGLTD DEPVIKLPFK 

       790 
YPECMQAKDQ I 
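
The sequence above is displayed in blocks of ten residues under a position ruler. To use it in other tools it usually has to be flattened into one continuous string. The sketch below shows one way to do that. The function name `parse_numbered_sequence` is hypothetical, and only the first two rows of the display are included as sample input.

```python
import re


def parse_numbered_sequence(text):
    """Flatten a ruler-annotated, space-grouped sequence display."""
    residues = []
    for line in text.splitlines():
        stripped = line.strip()
        # Skip blank lines and ruler lines made only of digits and spaces.
        if not stripped or re.fullmatch(r"[\d\s]+", stripped):
            continue
        # Remove the spaces that separate the ten-residue groups.
        residues.append(re.sub(r"\s+", "", stripped))
    return "".join(residues)


# First two rows of the isoform 1 display, copied from the entry.
sample = """
        10         20         30         40         50         60
MAARNKQKKR AEPESDLCFA GKPMSVVEST IRWPHRYQSK KTKLQAPTKK PANKGGKKED

        70         80         90        100        110        120
EEIIKQAKCH FDKALVDGVL INLNDDVYVT GLPGKLKFIA KVIELFEADD GVPYCRFRWY
"""

sequence = parse_numbered_sequence(sample)
print(len(sequence), sequence[:10])
```

Applied to the full display, the flattened string should have 791 residues, matching the sequence length stated for isoform 1.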

Isoform 2 [UniParc].

Checksum: D08B184B4F3C6ADA
Length: 570 AA. Mass: 64,557 Da.

References

[1] "A DNA methyltransferase homolog with a chromodomain exists in multiple polymorphic forms in Arabidopsis."
Henikoff S., Comai L.
Genetics 149:307-318(1998)
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA / MRNA] (ISOFORMS 1 AND 2), TISSUE SPECIFICITY.
Strain: cv. Columbia, cv. Kl-0, cv. Landsberg erecta, cv. Metz-0, cv. Nd-0, cv. Nd-1 and cv. RLD.
[2] "A 37.5 Kb sequence from Arabidopsis thaliana chromosome I."
Goodman H.M., Gallant P., Keifer-Higgins S., Rubenfield M., Church G.M.
Submitted (APR-1996) to the EMBL/GenBank/DDBJ databases
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA].
Strain: cv. Columbia.
[3] "Sequence and analysis of chromosome 1 of the plant Arabidopsis thaliana."
Theologis A., Ecker J.R., Palm C.J., Federspiel N.A., Kaul S., White O., Alonso J., Altafi H., Araujo R., Bowman C.L., Brooks S.Y., Buehler E., Chan A., Chao Q., Chen H., Cheuk R.F., Chin C.W., Chung M.K., Conn L., Conway A.B., Conway A.R., Creasy T.H., Dewar K., Dunn P., Etgu P., Feldblyum T.V., Feng J.-D., Fong B., Fujii C.Y., Gill J.E., Goldsmith A.D., Haas B., Hansen N.F., Hughes B., Huizar L., Hunter J.L., Jenkins J., Johnson-Hopson C., Khan S., Khaykin E., Kim C.J., Koo H.L., Kremenetskaia I., Kurtz D.B., Kwan A., Lam B., Langin-Hooper S., Lee A., Lee J.M., Lenz C.A., Li J.H., Li Y.-P., Lin X., Liu S.X., Liu Z.A., Luros J.S., Maiti R., Marziali A., Militscher J., Miranda M., Nguyen M., Nierman W.C., Osborne B.I., Pai G., Peterson J., Pham P.K., Rizzo M., Rooney T., Rowley D., Sakano H., Salzberg S.L., Schwartz J.R., Shinn P., Southwick A.M., Sun H., Tallon L.J., Tambunga G., Toriumi M.J., Town C.D., Utterback T., Van Aken S., Vaysberg M., Vysotskaia V.S., Walker M., Wu D., Yu G., Fraser C.M., Venter J.C., Davis R.W.
Nature 408:816-820(2000)
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: cv. Columbia.
[4] The Arabidopsis Information Resource (TAIR)
Submitted (APR-2011) to the EMBL/GenBank/DDBJ databases
Cited for: GENOME REANNOTATION.
Strain: cv. Columbia.

Cross-references

Sequence databases

EMBL / GenBank / DDBJ:
AF039364 mRNA. Translation: AAB95485.1.
AF039366 Genomic DNA. Translation: AAC02659.1.
AF039367 Genomic DNA. Translation: AAC02660.1.
AF039368 Genomic DNA. Translation: AAC02661.1.
AF039369 Genomic DNA. Translation: AAC02662.1.
AF039370 Genomic DNA. Translation: AAC02663.1.
AF039371 Genomic DNA. Translation: AAC02665.1.
AF039372 Genomic DNA. Translation: AAC02667.1.
AF039373 Genomic DNA. Translation: AAC02668.1.
U53501 Genomic DNA. Translation: AAA98912.1. Sequence problems.
AC011713 Genomic DNA. Translation: AAF14662.1.
CP002684 Genomic DNA. Translation: AEE36442.1.
PIR: H96839.
RefSeq: NP_565245.1. NM_106722.2. [O49139-1]
UniGene: At.5460.

3D structure databases

ProteinModelPortal: O49139.
SMR: O49139. Positions 12-778.

Protein family/group databases

REBASE: 3261. M.AthCMT1P.

Proteomic databases

PaxDb: O49139.
PRIDE: O49139.

Genome annotation databases

EnsemblPlants: AT1G80740.1; AT1G80740.1; AT1G80740. [O49139-1]
GeneID: 844413.
KEGG: ath:AT1G80740.

Organism-specific databases

TAIR: AT1G80740.

Phylogenomic databases

eggNOG: COG0270.
InParanoid: O49139.
KO: K00558.
OMA: FGMASQG.
PhylomeDB: O49139.

Enzyme and pathway databases

BioCyc: ARA:AT1G80740-MONOMER.

Gene expression databases

Genevestigator: O49139.

Family and domain databases

Gene3D: 3.40.50.150. 2 hits.
InterPro: IPR001025. BAH_dom.
InterPro: IPR001525. C5_MeTfrase.
InterPro: IPR025821. C5_MeTfrase_pln.
InterPro: IPR017984. Chromo_dom_subgr.
InterPro: IPR023780. Chromo_domain.
InterPro: IPR000953. Chromo_domain/shadow.
InterPro: IPR016197. Chromodomain-like.
InterPro: IPR023779. Chromodomain_CS.
InterPro: IPR029063. SAM-dependent_MTases-like.
PANTHER: PTHR10629. PTHR10629. 1 hit.
Pfam: PF01426. BAH. 1 hit.
Pfam: PF00385. Chromo. 1 hit.
Pfam: PF00145. DNA_methylase. 1 hit.
PRINTS: PR00105. C5METTRFRASE.
PRINTS: PR00504. CHROMODOMAIN.
SMART: SM00439. BAH. 1 hit.
SMART: SM00298. CHROMO. 1 hit.
SUPFAM: SSF53335. SSF53335. 4 hits.
SUPFAM: SSF54160. SSF54160. 1 hit.
PROSITE: PS51038. BAH. 1 hit.
PROSITE: PS00598. CHROMO_1. 1 hit.
PROSITE: PS50013. CHROMO_2. 1 hit.
PROSITE: PS51679. SAM_MT_C5. 1 hit.

Entry information

Entry name: CMT1_ARATH
Accession: Primary (citable) accession number: O49139
Secondary accession number(s): O49137, O49138, O49141, O50057, O50073, Q38940, Q7G196
Entry history:
Integrated into UniProtKB/Swiss-Prot: July 25, 2006
Last sequence update: July 25, 2006
Last modified: June 11, 2014
This is version 104 of the entry and version 2 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Plant Protein Annotation Program

Relevant documents

SIMILARITY comments

Index of protein domains and families

Arabidopsis thaliana

Arabidopsis thaliana: entries and gene names