Skip Header

 
Contribute Send feedback
Read comments (0) or add your own

Reviewed, UniProtKB/Swiss-Prot O49139 (CMT1_ARATH)

Last modified November 3, 2009. Version 59. Feed History...

Clusters with 100%, 90%, 50% identity | Documents (2) | Third-party data | Customize display text xml rdf/xml gff fasta
Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Alternative products · Sequence annotation (Features) · Sequences · References · Cross-references · Entry information · Relevant documents

Names and origin

Protein namesRecommended name:
    Putative DNA (cytosine-5)-methyltransferase CMT1
    EC=2.1.1.37
Alternative name(s):
    Chromomethylase 1
    Protein CHROMOMETHYLASE 1
Gene names
Name: CMT1
Ordered Locus Names: At1g80740
ORF Names: F23A5.9
OrganismArabidopsis thaliana (Mouse-ear cress) [Complete proteome]
Taxonomic identifier3702 [NCBI]
Taxonomic lineageEukaryotaViridiplantaeStreptophytaEmbryophytaTracheophytaSpermatophytaMagnoliophytaeudicotyledonscore eudicotyledonsrosidseurosids IIBrassicalesBrassicaceaeArabidopsis

Protein attributes

Sequence length791 AA.
Sequence statusComplete.
Sequence processingThe displayed sequence is not processed.
Protein existenceUncertain.

General annotation (Comments)

Function

May be involved in the CpXpG methylation and in gene silencing By similarity.

Catalytic activity

S-adenosyl-L-methionine + DNA = S-adenosyl-L-homocysteine + DNA containing 5-methylcytosine.

Subcellular location

Nucleus By similarity.

Tissue specificity

Expressed in flowers. Not detected in leaves, roots, seedlings and plants prior formation of flower buds. Ref.1

Sequence similarities

Belongs to the C5-methyltransferase family.

Contains 1 BAH domain.

Contains 1 chromo domain.

Caution

Could be the product of a pseudogene. The protein is severely truncated in several ecotypes and the gene even harbors a complete retrotransposon in 3 ecotypes.

Sequence caution

The sequence AAA98912.1 differs from that shown. Reason: Erroneous gene model prediction.

Alternative products

This entry describes 2 isoforms produced by alternative splicing. [Align] [Select]

Note: Additional isoforms seem to exist.
Isoform 1 (identifier: O49139-1)

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.
Isoform 2 (identifier: O49139-2)

The sequence of this isoform differs from the canonical sequence as follows:
     568-570: VTN → CCR
     571-791: Missing.
Note: Alternative splice site used 50% of the time in cv. Columbia.

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Chain1 – 791791Putative DNA (cytosine-5)-methyltransferase CMT1
PRO_0000246691

Regions

Domain79 – 199121BAH
Domain339 – 40466Chromo
Coiled coil308 – 33326 Potential

Natural variations

Alternative sequence568 – 5703VTN → CCR in isoform 2.
VSP_019857
Alternative sequence571 – 791221Missing in isoform 2.
VSP_019858
Natural variant1481D → G in strain: cv. Metz-0.
Natural variant1961T → I in strain: cv. Kl-0.
Natural variant1981A → T in strain: cv. No-0.
Natural variant249 – 791543Missing in strain: cv. Metz-0.
Natural variant5601D → V in strain: cv. Landsberg erecta, cv. No-0 and cv. RLD.
Natural variant561 – 791231Missing in strain: cv. Landsberg erecta, cv. No-0 and cv. RLD.
Natural variant6041F → C in strain: cv. Nd-1, Nd-0 and cv. Kl-0.

Sequences

Sequence LengthMass (Da)Tools
Isoform 1 [UniParc].

Last modified July 25, 2006. Version 2.
Checksum: A5ECFDBC274B215C

FASTA79189,219
        10         20         30         40         50         60 
MAARNKQKKR AEPESDLCFA GKPMSVVEST IRWPHRYQSK KTKLQAPTKK PANKGGKKED 

        70         80         90        100        110        120 
EEIIKQAKCH FDKALVDGVL INLNDDVYVT GLPGKLKFIA KVIELFEADD GVPYCRFRWY 

       130        140        150        160        170        180 
YRPEDTLIER FSHLVQPKRV FLSNDENDNP LTCIWSKVNI AKVPLPKITS RIEQRVIPPC 

       190        200        210        220        230        240 
DYYYDMKYEV PYLNFTSADD GSDASSSLSS DSALNCFENL HKDEKFLLDL YSGCGAMSTG 

       250        260        270        280        290        300 
FCMGASISGV KLITKWSVDI NKFACDSLKL NHPETEVRNE AAEDFLALLK EWKRLCEKFS 

       310        320        330        340        350        360 
LVSSTEPVES ISELEDEEVE ENDDIDEAST GAELEPGEFE VEKFLGIMFG DPQGTGEKTL 

       370        380        390        400        410        420 
QLMVRWKGYN SSYDTWEPYS GLGNCKEKLK EYVIDGFKSH LLPLPGTVYT VCGGPPCQGI 

       430        440        450        460        470        480 
SGYNRYRNNE APLEDQKNQQ LLVFLDIIDF LKPNYVLMEN VVDLLRFSKG FLARHAVASF 

       490        500        510        520        530        540 
VAMNYQTRLG MMAAGSYGLP QLRNRVFLWA AQPSEKLPPY PLPTHEVAKK FNTPKEFKDL 

       550        560        570        580        590        600 
QVGRIQMEFL KLDNALTLAD AISDLPPVTN YVANDVMDYN DAAPKTEFEN FISLKRSETL 

       610        620        630        640        650        660 
LPAFGGDPTR RLFDHQPLVL GDDDLERVSY IPKQKGANYR DMPGVLVHNN KAEINPRFRA 

       670        680        690        700        710        720 
KLKSGKNVVP AYAISFIKGK SKKPFGRLWG DEIVNTVVTR AEPHNQCVIH PMQNRVLSVR 

       730        740        750        760        770        780 
ENARLQGFPD CYKLCGTIKE KYIQVGNAVA VPVGVALGYA FGMASQGLTD DEPVIKLPFK 

       790 
YPECMQAKDQ I 

« Hide

Isoform 2.

Checksum: D08B184B4F3C6ADA
Show »

FASTA57064,557

References

« Hide 'large scale' references
[1]"A DNA methyltransferase homolog with a chromodomain exists in multiple polymorphic forms in Arabidopsis."
Henikoff S., Comai L.
Genetics 149:307-318(1998) [PubMed: 9584105] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA / MRNA] (ISOFORMS 1 AND 2), TISSUE SPECIFICITY.
Strain: cv. Columbia, cv. Kl-0, cv. Landsberg erecta, cv. Metz-0, cv. Nd-0, cv. Nd-1 and cv. RLD.
[2]"A 37.5 Kb sequence from Arabidopsis thaliana chromosome I."
Goodman H.M., Gallant P., Keifer-Higgins S., Rubenfield M., Church G.M.
Submitted (APR-1996) to the EMBL/GenBank/DDBJ databases
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA].
Strain: cv. Columbia.
[3]"Sequence and analysis of chromosome 1 of the plant Arabidopsis thaliana."
Theologis A., Ecker J.R., Palm C.J., Federspiel N.A., Kaul S., White O., Alonso J., Altafi H., Araujo R., Bowman C.L., Brooks S.Y., Buehler E., Chan A., Chao Q., Chen H., Cheuk R.F., Chin C.W., Chung M.K. expand/collapse author list , Conn L., Conway A.B., Conway A.R., Creasy T.H., Dewar K., Dunn P., Etgu P., Feldblyum T.V., Feng J.-D., Fong B., Fujii C.Y., Gill J.E., Goldsmith A.D., Haas B., Hansen N.F., Hughes B., Huizar L., Hunter J.L., Jenkins J., Johnson-Hopson C., Khan S., Khaykin E., Kim C.J., Koo H.L., Kremenetskaia I., Kurtz D.B., Kwan A., Lam B., Langin-Hooper S., Lee A., Lee J.M., Lenz C.A., Li J.H., Li Y.-P., Lin X., Liu S.X., Liu Z.A., Luros J.S., Maiti R., Marziali A., Militscher J., Miranda M., Nguyen M., Nierman W.C., Osborne B.I., Pai G., Peterson J., Pham P.K., Rizzo M., Rooney T., Rowley D., Sakano H., Salzberg S.L., Schwartz J.R., Shinn P., Southwick A.M., Sun H., Tallon L.J., Tambunga G., Toriumi M.J., Town C.D., Utterback T., Van Aken S., Vaysberg M., Vysotskaia V.S., Walker M., Wu D., Yu G., Fraser C.M., Venter J.C., Davis R.W.
Nature 408:816-820(2000) [PubMed: 11130712] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: cv. Columbia.
+Additional computationally mapped references.

Cross-references

Sequence databases

AF039364 mRNA. Translation: AAB95485.1.
AF039366 Genomic DNA. Translation: AAC02659.1.
AF039367 Genomic DNA. Translation: AAC02660.1.
AF039368 Genomic DNA. Translation: AAC02661.1.
AF039369 Genomic DNA. Translation: AAC02662.1.
AF039370 Genomic DNA. Translation: AAC02663.1.
AF039371 Genomic DNA. Translation: AAC02665.1.
AF039372 Genomic DNA. Translation: AAC02667.1.
AF039373 Genomic DNA. Translation: AAC02668.1.
U53501 Genomic DNA. Translation: AAA98912.1. Sequence problems.
AC011713 Genomic DNA. Translation: AAF14662.1.
IPIIPI00531370.
IPI00782822.
PIRH96839.
RefSeqNP_565245.1.
UniGeneAt.5460

3D structure databases

ModBaseSearch...

Proteomic databases

PRIDEO49139.

Genome annotation databases

GeneID844413.
GenomeReviewsGene locus AT1G80740 in contig CT485782_GR.
KEGGath:AT1G80740.
NMPDRfig|3702.1.peg.7665.

Organism-specific databases

TAIRAt1g80740.

Phylogenomic databases

OMAETEVRNE.

Enzyme and pathway databases

BRENDA2.1.1.37. 302.

Gene expression databases

GenevestigatorO49139.
GermOnlineAT1G80740. Arabidopsis thaliana.

Family and domain databases

InterProIPR001025. BAH.
IPR001525. C5_DNA_meth.
IPR017984. Chromo_dom_subgr.
IPR000953. Chromodomain.
[Graphical view]
PANTHERPTHR10629. C5_DNA_meth. 1 hit.
PfamPF01426. BAH. 1 hit.
PF00385. Chromo. 1 hit.
PF00145. DNA_methylase. 2 hits.
[Graphical view]
PRINTSPR00105. C5METTRFRASE.
PR00504. CHROMODOMAIN.
SMARTSM00439. BAH. 1 hit.
SM00298. CHROMO. 1 hit.
[Graphical view]
PROSITEPS51038. BAH. 1 hit.
PS00598. CHROMO_1. 1 hit.
PS50013. CHROMO_2. 1 hit.
[Graphical view]
ProtoNetSearch...

Entry information

Entry nameCMT1_ARATH
AccessionPrimary (citable) accession number: O49139
Secondary accession number(s): O49137 expand/collapse secondary AC list , O49138, O49141, O50057, O50073, Q38940, Q7G196
Entry history
Integrated into UniProtKB/Swiss-Prot: July 25, 2006
Last sequence update: July 25, 2006
Last modified: November 3, 2009
This is version 59 of the entry and version 2 of the sequence. [Complete history]
Entry statusReviewed (UniProtKB/Swiss-Prot)
Annotation projectPPAP (Plant Proteome Annotation Project)

Relevant documents

Arabidopsis thaliana

Arabidopsis thaliana: entries and gene names

SIMILARITY comments

Index of protein domains and families

Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Alternative products · Sequence annotation (Features) · Sequences · References · Cross-references · Entry information · Relevant documents