Skip Header

Contribute Send feedback
Read comments (?) or add your own

O23273 (O23273_ARATH) Unreviewed, UniProtKB/TrEMBL

Last modified May 1, 2013. Version 93. Feed History...

Clusters with 100%, 90%, 50% identity | Third-party data text xml rdf/xml gff fasta
to top of pageNames·Attributes·General annotation·Ontologies·Sequences·References·Cross-refs·Entry infoCustomize order

Names and origin

Protein namesRecommended name:
Cytosine-specific methyltransferase RuleBase RU000417

EC=2.1.1.37 RuleBase RU000417
Gene names
Name:dl3110w EMBL CAB10193.1
Synonyms:AT4g14140 EMBL CAB78456.1, DMT2 EMBL AEE83379.1
Ordered Locus Names:At4g14140 TAIR At4g14140
ORF Names:AT4G14140 EMBL AEE83379.1
OrganismArabidopsis thaliana (Mouse-ear cress) [Reference proteome]
Taxonomic identifier3702 [NCBI]
Taxonomic lineageEukaryotaViridiplantaeStreptophytaEmbryophytaTracheophytaSpermatophytaMagnoliophytaeudicotyledonscore eudicotyledonsrosidsmalvidsBrassicalesBrassicaceaeCamelineaeArabidopsis

Protein attributes

Sequence length1519 AA.
Sequence statusComplete.
Protein existenceInferred from homology

General annotation (Comments)

Catalytic activity

S-adenosyl-L-methionine + DNA = S-adenosyl-L-homocysteine + DNA containing 5-methylcytosine. RuleBase RU000417

Sequence similarities

Belongs to the C5-methyltransferase family. RuleBase RU000416

Sequences

Sequence LengthMass (Da)Tools
O23273 [UniParc].

Last modified January 1, 1998. Version 1.
Checksum: 8BD760A15FA90DA4

FASTA1,519171,586
        10         20         30         40         50         60 
MEMETKAGKQ KKRSVDSDDD VSKERRPKRA AACTNFKEKS LRISDKSETV EAKKEQILAE 

        70         80         90        100        110        120 
EIVAIQLTSS LESNDDPRPN RRLTDFVLHD SEGVPQPVEM LELGDIFIEG VVLPLGDEKK 

       130        140        150        160        170        180 
EEKGVRFQSF GRVENWNISG YEDGSPVIWI STALADYDCR KPSKKYKKLY DYFFEKACAC 

       190        200        210        220        230        240 
VEVFKSLSKN PDTSLDELLA AVSRSMSGSK IFSSGGAIQE FVISQGEFIY NQLAGLDETA 

       250        260        270        280        290        300 
KNHETCFVEN RVLVSLRDHE SNKIHKALSN VALRIDESKV VTSDHLVDGA EDEDVKYAKL 

       310        320        330        340        350        360 
IQEEEYRKSM ERSRNKRSST TSGGSSRFYI KISEDEIADD YPLPSYYKNT KEETDELVLF 

       370        380        390        400        410        420 
EAGYEVDTRD LPCRTLHNWT LYNSDSRMIS LEVLPMRPCA EIDVTVFGSG VVAEDDGSGF 

       430        440        450        460        470        480 
CLDDSESSTS TQSNDHDGMN IFLSQIKEWM IEFGAEMIFV TLRTDMAWYR LGKPSKQYAP 

       490        500        510        520        530        540 
WFGTVMKTVR VGISIFNMLM RESRVAKLSY ANVIKRLCGL EENDKAYISS KLLDVERYVV 

       550        560        570        580        590        600 
VHGQIILQLF EEYPDKDIKR CPFVTSLASK MQDIHHTKWI IKKKKKILQK GKNLNPRAGI 

       610        620        630        640        650        660 
APVVSRMKAM QATTTRLVNR IWGEFYSIYS PEVPSEAINA ENVEEEELEE VEEEDENEED 

       670        680        690        700        710        720 
DPEENELEAV EIQNSPTPKK IKGISEDMEI KWDGEILGKT SAGEPLYGRA FVGGDVVVVG 

       730        740        750        760        770        780 
SAVILEVDDQ DDTQLICFVE FMFESSNHSK MLHGKLLQRG SETVLGMAAN ERELFLTNEC 

       790        800        810        820        830        840 
LTVQLKDIKG TVSLEIRSRL WGHQYRKENI DVDKLDRARA EERKTNGLPT DYYCKSLYSP 

       850        860        870        880        890        900 
ERGGFFSLPR NDMGLGSGFC SSCKIRENEE ERSKTKLNDS KTGFLSNGIE YHNGDFVYVL 

       910        920        930        940        950        960 
PNYITKDGLK KGSRRTTLKC GRNVGLKAFV VCQLLDVIVL EESRKASKAS FQVKLTRFYR 

       970        980        990       1000       1010       1020 
PEDISEEKAY ASDIQELYYS QDTYILPPEA IQGKCEVRKK SDMPLCREYP ILDHIFFCEV 

      1030       1040       1050       1060       1070       1080 
FYDSSTGYLK QFPANMKLKF STIKDETLLR EKKGKGVETG TSSGMLMKPD EVPKEKPLAT 

      1090       1100       1110       1120       1130       1140 
LDIFAGCGGL SHGLENAGVS TTKWAIEYEE PAGHAFKQNH PEATVFVDNC NVILRAIMEK 

      1150       1160       1170       1180       1190       1200 
CGDVDDCVST VEAAELAAKL DENQKSTLPL PGQVDFINGG PPCQGFSGMN RFSHGSWSKV 

      1210       1220       1230       1240       1250       1260 
QCEMILAFLS FADYFRPKYF LLENVKKFVT YNKGRTFQLT MASLLEMGYQ VRFGILEAGT 

      1270       1280       1290       1300       1310       1320 
YGVSQPRKRV IIWAASPEEV LPEWPEPMHV FDNPGSKISL PRGLRYDAGC NTKFGAPFRS 

      1330       1340       1350       1360       1370       1380 
ITVRDTIGDL PPVENGESKI NKEYGTTPAS WFQKKIRGNM SVLTDHICKG LNELNLIRCK 

      1390       1400       1410       1420       1430       1440 
KIPKRPGADW RDLPDENVTL SNGLVEKLRP LALSKTAKNH NEWKGLYGRL DWQGNLPISI 

      1450       1460       1470       1480       1490       1500 
TDPQPMGKVG MCFHPEQDRI ITVRECARSQ GFPDSYEFSG TTKHKHRQIG NAVPPPLAFA 

      1510 
LGRKLKEALY LKSSLQHQS 

« Hide

References

« Hide 'large scale' references
[1]Bevan M., Stiekema W., Murphy G., Wambutt R., Pohl T., Terryn N., Kreis M., Kavanagh T., Entian K.D., Rieger M., James R., Puigdomenech P., Hatzopoulos P., Obermaier B., Duesterhoft A., Jones J., Palme K., Ansorge W. expand/collapse author list , Delseny M., Bancroft I., Mewes H.W., Schueller C., Chalwatzis N.
Submitted (JUL-1997) to the EMBL/GenBank/DDBJ databases
Cited for: NUCLEOTIDE SEQUENCE.
[2]"Analysis of 1.9 Mb of contiguous sequence from chromosome 4 of Arabidopsis thaliana."
Bevan M., Bancroft I., Bent E., Love K., Goodman H.M., Dean C., Bergkamp R., Dirkse W., van Staveren M., Stiekema W., Drost L., Ridley P., Hudson S.-A., Patel K., Murphy G., Piffanelli P., Wedler H., Wedler E. expand/collapse author list , Wambutt R., Weitzenegger T., Pohl T., Terryn N., Gielen J., Villarroel R., De Clercq R., van Montagu M., Lecharny A., Aubourg S., Gy I., Kreis M., Lao N., Kavanagh T., Hempel S., Kotter P., Entian K.-D., Rieger M., Schaefer M., Funk B., Mueller-Auer S., Silvey M., James R., Monfort A., Pons A., Puigdomenech P., Douka A., Voukelatou E., Milioni D., Hatzopoulos P., Piravandi E., Obermaier B., Hilbert H., Duesterhoeft A., Moores T., Jones J.D.G., Eneva T., Palme K., Benes V., Rechmann S., Ansorge W., Cooke R., Berger C., Delseny M., Voet M., Volckaert G., Mewes H.-W., Klosterman S., Schueller C., Chalwatzis N.
Nature 391:485-488(1998) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: cv. Columbia.
[3]"Sequence and analysis of chromosome 4 of the plant Arabidopsis thaliana."
EU, CSHL and WU Arabidopsis Sequencing Project
Mayer K.F.X., Schueller C., Wambutt R., Murphy G., Volckaert G., Pohl T., Duesterhoeft A., Stiekema W., Entian K.-D., Terryn N., Harris B., Ansorge W., Brandt P., Grivell L.A., Rieger M., Weichselgartner M., de Simone V., Obermaier B. expand/collapse author list , Mache R., Mueller M., Kreis M., Delseny M., Puigdomenech P., Watson M., Schmidtheini T., Reichert B., Portetelle D., Perez-Alonso M., Boutry M., Bancroft I., Vos P., Hoheisel J., Zimmermann W., Wedler H., Ridley P., Langham S.-A., McCullagh B., Bilham L., Robben J., van der Schueren J., Grymonprez B., Chuang Y.-J., Vandenbussche F., Braeken M., Weltjens I., Voet M., Bastiaens I., Aert R., Defoor E., Weitzenegger T., Bothe G., Ramsperger U., Hilbert H., Braun M., Holzer E., Brandt A., Peters S., van Staveren M., Dirkse W., Mooijman P., Klein Lankhorst R., Rose M., Hauf J., Koetter P., Berneiser S., Hempel S., Feldpausch M., Lamberth S., Van den Daele H., De Keyser A., Buysshaert C., Gielen J., Villarroel R., De Clercq R., van Montagu M., Rogers J., Cronin A., Quail M.A., Bray-Allen S., Clark L., Doggett J., Hall S., Kay M., Lennard N., McLay K., Mayes R., Pettett A., Rajandream M.A., Lyne M., Benes V., Rechmann S., Borkova D., Bloecker H., Scharfe M., Grimm M., Loehnert T.-H., Dose S., de Haan M., Maarse A.C., Schaefer M., Mueller-Auer S., Gabel C., Fuchs M., Fartmann B., Granderath K., Dauner D., Herzl A., Neumann S., Argiriou A., Vitale D., Liguori R., Piravandi E., Massenet O., Quigley F., Clabauld G., Muendlein A., Felber R., Schnabl S., Hiller R., Schmidt W., Lecharny A., Aubourg S., Chefdor F., Cooke R., Berger C., Monfort A., Casacuberta E., Gibbons T., Weber N., Vandenbol M., Bargues M., Terol J., Torres A., Perez-Perez A., Purnelle B., Bent E., Johnson S., Tacon D., Jesse T., Heijnen L., Schwarz S., Scholler P., Heber S., Francs P., Bielke C., Frishman D., Haase D., Lemcke K., Mewes H.-W., Stocker S., Zaccaria P., Bevan M., Wilson R.K., de la Bastide M., Habermann K., Parnell L., Dedhia N., Gnoj L., Schutz K., Huang E., Spiegel L., Sekhon M., Murray J., Sheet P., Cordes M., Abu-Threideh J., Stoneking T., Kalicki J., Graves T., Harmon G., Edwards J., Latreille P., Courtney L., Cloud J., Abbott A., Scott K., Johnson D., Minx P., Bentley D., Fulton B., Miller N., Greco T., Kemp K., Kramer J., Fulton L., Mardis E., Dante M., Pepin K., Hillier L.W., Nelson J., Spieth J., Ryan E., Andrews S., Geisel C., Layman D., Du H., Ali J., Berghoff A., Jones K., Drone K., Cotton M., Joshu C., Antonoiu B., Zidanic M., Strong C., Sun H., Lamar B., Yordan C., Ma P., Zhong J., Preston R., Vil D., Shekher M., Matero A., Shah R., Swaby I.K., O'Shaughnessy A., Rodriguez M., Hoffman J., Till S., Granat S., Shohdy N., Hasegawa A., Hameed A., Lodhi M., Johnson A., Chen E., Marra M.A., Martienssen R., McCombie W.R.
Nature 402:769-777(1999) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: cv. Columbia.
[4]EU Arabidopsis sequencing project
Submitted (JUN-1999) to the EMBL/GenBank/DDBJ databases
Cited for: NUCLEOTIDE SEQUENCE.
[5]EU Arabidopsis sequencing project
Submitted (MAR-2000) to the EMBL/GenBank/DDBJ databases
Cited for: NUCLEOTIDE SEQUENCE.
[6]The Arabidopsis Information Resource (TAIR)
Submitted (APR-2011) to the EMBL/GenBank/DDBJ databases
Cited for: GENOME REANNOTATION.
Strain: cv. Columbia.
+Additional computationally mapped references.

Cross-references

Sequence databases

EMBL
GenBank
DDBJ
CP002687 Genomic DNA. Translation: AEE83379.1.
Z97335 Genomic DNA. Translation: CAB10193.1.
AL161538 Genomic DNA. Translation: CAB78456.1.
IPIIPI00527201.
PIRG71402.
RefSeqNP_193150.1. NM_117491.1.
UniGeneAt.51020.

3D structure databases

HSSPHSSP built from PDB template 1DCT based on UniProtKB P20589.
SMRO23273. Positions 67-245, 886-1516.
ModBaseSearch...

Protein-protein interaction databases

STRING3702.AT4G14140.1-P.

Protein family/group databases

REBASE11752. M.AthMET1.

Protocols and materials databases

StructuralBiologyKnowledgebaseSearch...

Genome annotation databases

EnsemblPlantsAT4G14140.1; AT4G14140.1; AT4G14140.
GeneID827052.
KEGGath:AT4G14140.

Organism-specific databases

TAIRAt4g14140.

Phylogenomic databases

eggNOGCOG0270.
HOGENOMHOG000083447.
InParanoidO23273.
KOK00558.
ProtClustDBCLSN2685944.

Gene expression databases

GenevestigatorO23273.

Family and domain databases

InterProIPR001025. BAH_dom.
IPR018117. C5_DNA_meth_AS.
IPR001525. C5_MeTfrase.
IPR022702. Cytosine_MeTrfase1_RFD.
IPR017198. DNA_C5-MeTrfase_1_euk.
[Graphical view]
PANTHERPTHR10629. PTHR10629. 1 hit.
PfamPF01426. BAH. 2 hits.
PF00145. DNA_methylase. 2 hits.
PF12047. DNMT1-RFD. 2 hits.
[Graphical view]
PIRSFPIRSF037404. DNMT1. 1 hit.
PRINTSPR00105. C5METTRFRASE.
SMARTSM00439. BAH. 2 hits.
[Graphical view]
TIGRFAMsTIGR00675. dcm. 1 hit.
PROSITEPS51038. BAH. 2 hits.
PS00094. C5_MTASE_1. 1 hit.
PS00095. C5_MTASE_2. 1 hit.
[Graphical view]
ProtoNetSearch...

Entry information

Entry nameO23273_ARATH
AccessionPrimary (citable) accession number: O23273
Entry history
Integrated into UniProtKB/TrEMBL: January 1, 1998
Last sequence update: January 1, 1998
Last modified: May 1, 2013
This is version 93 of the entry and version 1 of the sequence. [Complete history]
Entry statusUnreviewed (UniProtKB/TrEMBL)