Reviewed, UniProtKB/Swiss-Prot Q94F87 (CMT2_ARATH)

Last modified February 9, 2010. Version 57.

Names and origin

Protein names
Recommended name:
    DNA (cytosine-5)-methyltransferase CMT2
    EC=2.1.1.37
Alternative name(s):
    Chromomethylase 2
    Protein CHROMOMETHYLASE 2
Gene names
Name: CMT2
Ordered Locus Names: At4g19020
ORF Names: F13C5.190
Organism: Arabidopsis thaliana (Mouse-ear cress) [Complete proteome]
Taxonomic identifier: 3702 [NCBI]
Taxonomic lineage: Eukaryota > Viridiplantae > Streptophyta > Embryophyta > Tracheophyta > Spermatophyta > Magnoliophyta > eudicotyledons > core eudicotyledons > rosids > malvids > Brassicales > Brassicaceae > Arabidopsis

Protein attributes

Sequence length: 1244 AA.
Sequence status: Complete.
Protein existence: Evidence at transcript level.

General annotation (Comments)

Function

May be involved in CpXpG methylation and in gene silencing (by similarity).

Catalytic activity

S-adenosyl-L-methionine + DNA = S-adenosyl-L-homocysteine + DNA containing 5-methylcytosine.

Subcellular location

Nucleus (by similarity).

Sequence similarities

Belongs to the C5-methyltransferase family.

Contains 1 BAH domain.

Contains 1 chromo domain.

Sequence caution

The sequence CAA16759.1 differs from that shown. Reason: Erroneous gene model prediction.

The sequence CAB78904.1 differs from that shown. Reason: Erroneous gene model prediction.

Sequence annotation (Features)

Feature key         Position(s)   Length  Description  Feature identifier

Molecule processing

Chain               1 – 1244      1244    DNA (cytosine-5)-methyltransferase CMT2  PRO_0000246692

Regions

Domain              499 – 638     140     BAH
Domain              786 – 851     66      Chromo
Compositional bias  771 – 777     7       Poly-Ser
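
The listed lengths follow from the positions, which are 1-based and inclusive. As a minimal sketch of that arithmetic (the helper names `span_length` and `check_features` are illustrative, not part of any UniProt tool), the three region features can be checked against the 1244-residue chain:

```python
# Region features as listed in this entry:
# (feature key, start, end, listed length, description).
FEATURES = [
    ("Domain", 499, 638, 140, "BAH"),
    ("Domain", 786, 851, 66, "Chromo"),
    ("Compositional bias", 771, 777, 7, "Poly-Ser"),
]


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive residue range."""
    return end - start + 1


def check_features(features, sequence_length):
    """Return (key, description) for features whose bounds or length look inconsistent."""
    problems = []
    for key, start, end, listed, desc in features:
        in_bounds = 1 <= start <= end <= sequence_length
        if not in_bounds or span_length(start, end) != listed:
            problems.append((key, desc))
    return problems


print(check_features(FEATURES, 1244))  # prints []
```

An empty list means every listed length equals end - start + 1 and every range lies within the chain.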

Experimental info

Sequence conflict   54 – 55       2       DN → ND in AAK69757. Ref.1
Sequence conflict   110 – 111     2       AL → PV in AAK69757. Ref.1
Sequence conflict   125           1       K → E in AAK69757. Ref.1
Sequence conflict   132 – 133     2       KF → NS in AAK69757. Ref.1
Sequence conflict   156           1       F → S in AAK69757. Ref.1
Sequence conflict   229           1       R → K in AAK69757. Ref.1
Sequence conflict   340           1       K → N in AAK69757. Ref.1
Sequence conflict   501           1       K → N in AAK69757. Ref.1
Sequence conflict   548           1       A → V in AAK69757. Ref.1
Sequence conflict   610           1       V → A in AAK69757. Ref.1
Sequence conflict   654           1       C → W in AAK69757. Ref.1
Sequence conflict   772           1       G → E in AAK69757. Ref.1
Sequence conflict   799           1       H → P in AAK69757. Ref.1
Sequence conflict   1081          1       K → N in AAK69757. Ref.1
Sequence conflict   1093          1       L → I in AAK69757. Ref.1
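
Each conflict row gives a 1-based position (or range), the residues in the displayed sequence, and the residues reported in AAK69757. The following is a small sketch of how one such row could be applied to a sequence string; the function `apply_conflict` is a hypothetical helper written for this illustration, and only the first 60 residues of the sequence from this entry are used to keep it short:

```python
# First 60 residues of the sequence shown in this entry (Q94F87-1).
PREFIX = "MLSPAKCESEEAQAPLDLHSSSRSEPECLSLVLWCPNPEEAAPSSTRELIKLPDNGEMSL"


def apply_conflict(seq: str, start: int, end: int, expected: str, replacement: str) -> str:
    """Replace residues start..end (1-based, inclusive) after verifying the original residues."""
    found = seq[start - 1:end]
    if found != expected:
        raise ValueError(f"expected {expected!r} at {start}-{end}, found {found!r}")
    return seq[:start - 1] + replacement + seq[end:]


# Conflict at 54 - 55: DN -> ND in AAK69757.
variant = apply_conflict(PREFIX, 54, 55, "DN", "ND")
print(variant[50:60])  # residues 51-60 with the AAK69757 residues substituted
```

Checking the expected residues before substituting catches off-by-one mistakes when converting the 1-based positions to 0-based string indices.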

Sequences

Sequence: Q94F87-1 [UniParc]
Length: 1,244
Mass (Da): 139,188
Last modified July 25, 2006. Version 2.
Checksum: CBE10F5E47809340

        10         20         30         40         50         60 
MLSPAKCESE EAQAPLDLHS SSRSEPECLS LVLWCPNPEE AAPSSTRELI KLPDNGEMSL 

        70         80         90        100        110        120 
RRSTTLNCNS PEENGGEGRV SQRKSSRGKS QPLLMLTNGC QLRRSPRFRA LHANFDNVCS 

       130        140        150        160        170        180 
VPVTKGGVSQ RKFSRGKSQP LLTLTNGCQL RRSPRFRAVD GNFDSVCSVP VTGKFGSRKR 

       190        200        210        220        230        240 
KSNSALDKKE SSDSEGLTFK DIAVIAKSLE MEIISECQYK NNVAEGRSRL QDPAKRKVDS 

       250        260        270        280        290        300 
DTLLYSSINS SKQSLGSNKR MRRSQRFMKG TENEGEENLG KSKGKGMSLA SCSFRRSTRL 

       310        320        330        340        350        360 
SGTVETGNTE TLNRRKDCGP ALCGAEQVRG TERLVQISKK DHCCEAMKKC EGDGLVSSKQ 

       370        380        390        400        410        420 
ELLVFPSGCI KKTVNGCRDR TLGKPRSSGL NTDDIHTSSL KISKNDTSNG LTMTTALVEQ 

       430        440        450        460        470        480 
DAMESLLQGK TSACGAADKG KTREMHVNST VIYLSDSDEP SSIEYLNGDN LTQVESGSAL 

       490        500        510        520        530        540 
SSGGNEGIVS LDLNNPTKST KRKGKRVTRT AVQEQNKRSI CFFIGEPLSC EEAQERWRWR 

       550        560        570        580        590        600 
YELKMEKATS EFSGFTEQRI QCVYKFFSFI MERQATNHDK RRLFYSTVMN DNPVDCLISK 

       610        620        630        640        650        660 
VTVLQVSPRV GLKPNSIKSD YYFDMEYCVE YSTFQTLRNR KCSSKTSENK LECCADVVPT 

       670        680        690        700        710        720 
ESTESILKKK SFSGELPVLD LYSGCGGMST GLSLGAKISG VDVVTKWAVD QNTAACKSLK 

       730        740        750        760        770        780 
LNHPNTQVRN DAAGDFLQLL KEWDKLCKRY VFNNDQRTDT LRSVNSTKET SGSSSSSDDD 

       790        800        810        820        830        840 
SDSEEYEVEK LVDICFGDHD KTGKNGLKFK VHWKGYRSDE DTWELAEELS NCQDAIREFV 

       850        860        870        880        890        900 
TSGFKSKILP LPGRVGVICG GPPCQGISGY NRHRNVDSPL NDERNQQIIV FMDIVEYLKP 

       910        920        930        940        950        960 
SYVLMENVVD ILRMDKGSLG RYALSRLVNM RYQARLGIMT AGCYGLSQFR SRVFMWGAVP 

       970        980        990       1000       1010       1020 
NKNLPPFPLP THDVIVRYGL PLEFERNVVA YAEGQPRKLE KALVLKDAIS DLPHVSNDED 

      1030       1040       1050       1060       1070       1080 
REKLPYESLP KTDFQRYIRS TKRDLTGSAI DNCNKRTMLL HDHRPFHINE DDYARVCQIP 

      1090       1100       1110       1120       1130       1140 
KRKGANFRDL PGLIVRNNTV CRDPSMEPVI LPSGKPLVPG YVFTFQQGKS KRPFARLWWD 

      1150       1160       1170       1180       1190       1200 
ETVPTVLTVP TCHSQALLHP EQDRVLTIRE SARLQGFPDY FQFCGTIKER YCQIGNAVAV 

      1210       1220       1230       1240 
SVSRALGYSL GMAFRGLARD EHLIKLPQNF SHSTYPQLQE TIPH 
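
The sequence is displayed in groups of ten residues under a position ruler. As a rough sketch of how that layout could be flattened into a plain residue string (the function name `flatten_sequence_block` is an assumption made for this example, and only the first two rows of the block are used as sample input):

```python
import re

# First two rows of the sequence block, copied as displayed (ruler line, then residues).
BLOCK = """\
        10         20         30         40         50         60
MLSPAKCESE EAQAPLDLHS SSRSEPECLS LVLWCPNPEE AAPSSTRELI KLPDNGEMSL

        70         80         90        100        110        120
RRSTTLNCNS PEENGGEGRV SQRKSSRGKS QPLLMLTNGC QLRRSPRFRA LHANFDNVCS
"""


def flatten_sequence_block(block: str) -> str:
    """Drop blank lines and numeric ruler lines, then remove spaces between residue groups."""
    residues = []
    for line in block.splitlines():
        stripped = line.strip()
        # Skip empty lines and ruler lines that contain only digits and spaces.
        if not stripped or re.fullmatch(r"[\d\s]+", stripped):
            continue
        residues.append(re.sub(r"\s+", "", stripped))
    return "".join(residues)


seq = flatten_sequence_block(BLOCK)
print(len(seq))  # 120
```

Applied to the full block, the same approach should yield a string whose length matches the 1,244 residues stated for this entry.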

References

[1] "Arabidopsis cmt3 chromomethylase mutations block non-CG methylation and silencing of an endogenous gene."
Bartee L., Malagnac F., Bender J.
Genes Dev. 15:1753-1758(2001) [PubMed: 11459824]
Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA].
Strain: cv. Wassilewskija.
[2] "Sequence and analysis of chromosome 4 of the plant Arabidopsis thaliana."
Mayer K.F.X., Schueller C., Wambutt R., Murphy G., Volckaert G., Pohl T., Duesterhoeft A., Stiekema W., Entian K.-D., Terryn N., Harris B., Ansorge W., Brandt P., Grivell L.A., Rieger M., Weichselgartner M., de Simone V., Obermaier B., Mache R., Mueller M., Kreis M., Delseny M., Puigdomenech P., Watson M., Schmidtheini T., Reichert B., Portetelle D., Perez-Alonso M., Boutry M., Bancroft I., Vos P., Hoheisel J., Zimmermann W., Wedler H., Ridley P., Langham S.-A., McCullagh B., Bilham L., Robben J., van der Schueren J., Grymonprez B., Chuang Y.-J., Vandenbussche F., Braeken M., Weltjens I., Voet M., Bastiaens I., Aert R., Defoor E., Weitzenegger T., Bothe G., Ramsperger U., Hilbert H., Braun M., Holzer E., Brandt A., Peters S., van Staveren M., Dirkse W., Mooijman P., Klein Lankhorst R., Rose M., Hauf J., Koetter P., Berneiser S., Hempel S., Feldpausch M., Lamberth S., Van den Daele H., De Keyser A., Buysshaert C., Gielen J., Villarroel R., De Clercq R., van Montagu M., Rogers J., Cronin A., Quail M.A., Bray-Allen S., Clark L., Doggett J., Hall S., Kay M., Lennard N., McLay K., Mayes R., Pettett A., Rajandream M.A., Lyne M., Benes V., Rechmann S., Borkova D., Bloecker H., Scharfe M., Grimm M., Loehnert T.-H., Dose S., de Haan M., Maarse A.C., Schaefer M., Mueller-Auer S., Gabel C., Fuchs M., Fartmann B., Granderath K., Dauner D., Herzl A., Neumann S., Argiriou A., Vitale D., Liguori R., Piravandi E., Massenet O., Quigley F., Clabauld G., Muendlein A., Felber R., Schnabl S., Hiller R., Schmidt W., Lecharny A., Aubourg S., Chefdor F., Cooke R., Berger C., Monfort A., Casacuberta E., Gibbons T., Weber N., Vandenbol M., Bargues M., Terol J., Torres A., Perez-Perez A., Purnelle B., Bent E., Johnson S., Tacon D., Jesse T., Heijnen L., Schwarz S., Scholler P., Heber S., Francs P., Bielke C., Frishman D., Haase D., Lemcke K., Mewes H.-W., Stocker S., Zaccaria P., Bevan M., Wilson R.K., de la Bastide M., Habermann K., Parnell L., Dedhia N., Gnoj L., Schutz K., Huang E., Spiegel L., Sekhon M., Murray J., Sheet P., Cordes M., Abu-Threideh J., Stoneking T., Kalicki J., Graves T., Harmon G., Edwards J., Latreille P., Courtney L., Cloud J., Abbott A., Scott K., Johnson D., Minx P., Bentley D., Fulton B., Miller N., Greco T., Kemp K., Kramer J., Fulton L., Mardis E., Dante M., Pepin K., Hillier L.W., Nelson J., Spieth J., Ryan E., Andrews S., Geisel C., Layman D., Du H., Ali J., Berghoff A., Jones K., Drone K., Cotton M., Joshu C., Antonoiu B., Zidanic M., Strong C., Sun H., Lamar B., Yordan C., Ma P., Zhong J., Preston R., Vil D., Shekher M., Matero A., Shah R., Swaby I.K., O'Shaughnessy A., Rodriguez M., Hoffman J., Till S., Granat S., Shohdy N., Hasegawa A., Hameed A., Lodhi M., Johnson A., Chen E., Marra M.A., Martienssen R., McCombie W.R.
Nature 402:769-777(1999) [PubMed: 10617198]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: cv. Columbia.

Cross-references

Sequence databases

EMBL / GenBank / DDBJ:
    AF383171 Genomic DNA. Translation: AAK69757.1.
    AL021711 Genomic DNA. Translation: CAA16759.1. Sequence problems.
    AL161549 Genomic DNA. Translation: CAB78904.1. Sequence problems.
IPI: IPI00539325.
PIR: T05039.

3D structure databases

ModBase: Search...

Proteomic databases

PRIDE: Q94F87.

Genome annotation databases

GenomeReviews: Gene locus AT4G19020 in contig CT486007_GR.

Organism-specific databases

TAIR: At4g19020.

Phylogenomic databases

eggNOG: euNOG04911.
InParanoid: Q94F87.
PhylomeDB: Q94F87.

Enzyme and pathway databases

BRENDA: 2.1.1.37. 302.

Gene expression databases

Genevestigator: Q94F87.
GermOnline: AT4G19020. Arabidopsis thaliana.

Family and domain databases

InterPro: IPR001025. BAH_dom.
    IPR001525. C5_DNA_meth.
    IPR000953. Chromodomain.
    IPR016197. Chromodomain-like.
PANTHER: PTHR10629. C5_DNA_meth. 1 hit.
Pfam: PF00385. Chromo. 1 hit.
    PF00145. DNA_methylase. 1 hit.
PRINTS: PR00105. C5METTRFRASE.
SMART: SM00439. BAH. 1 hit.
    SM00298. CHROMO. 1 hit.
PROSITE: PS51038. BAH. 1 hit.
    PS00598. CHROMO_1. False negative.
    PS50013. CHROMO_2. 1 hit.
ProtoNet: Search...

Entry information

Entry name: CMT2_ARATH
Accession
Primary (citable) accession number: Q94F87
Secondary accession number(s): O49415
Entry history
Integrated into UniProtKB/Swiss-Prot: July 25, 2006
Last sequence update: July 25, 2006
Last modified: February 9, 2010
This is version 57 of the entry and version 2 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation project: PPAP (Plant Proteome Annotation Project)

Relevant documents

Arabidopsis thaliana

Arabidopsis thaliana: entries and gene names

SIMILARITY comments

Index of protein domains and families