Reviewed, UniProtKB/Swiss-Prot P0CB22 (ATX2_ARATH)

Last modified November 24, 2009. Version 4.

Names and origin

Protein names
Recommended name:
    Histone-lysine N-methyltransferase ATX2
    EC=2.1.1.43
Alternative name(s):
    Trithorax-homolog protein 2
      Short name=TRX-homolog protein 2
    Protein SET DOMAIN GROUP 30
Gene names
Name: ATX2
Synonyms: SDG30, SET30
Ordered Locus Names: At1g05830
ORF Names: T20M3.10
Organism: Arabidopsis thaliana (Mouse-ear cress) [Complete proteome]
Taxonomic identifier: 3702 [NCBI]
Taxonomic lineage: Eukaryota > Viridiplantae > Streptophyta > Embryophyta > Tracheophyta > Spermatophyta > Magnoliophyta > eudicotyledons > core eudicotyledons > rosids > eurosids II > Brassicales > Brassicaceae > Arabidopsis

Protein attributes

Sequence length: 1083 AA.
Sequence status: Complete.
Sequence processing: The displayed sequence is not processed.
Protein existence: Evidence at transcript level.

General annotation (Comments)

Function

Histone methyltransferase (by similarity).

Catalytic activity

S-adenosyl-L-methionine + histone L-lysine = S-adenosyl-L-homocysteine + histone N(6)-methyl-L-lysine.

Subcellular location

Nucleus (by similarity).

Tissue specificity

Expressed in roots and flowers and, to a lower extent, in young seedlings. Ref.3

Sequence similarities

Belongs to the histone-lysine methyltransferase family. TRX/MLL subfamily.

Contains 2 PHD-type zinc fingers.

Contains 1 post-SET domain.

Contains 1 PWWP domain.

Contains 1 SET domain.

Sequence caution

The sequence AAF29390.1 differs from that shown. Reason: Erroneous gene model prediction. The predicted gene has been split into 2 genes: At1g05830 and At1g05835.

The sequence AK226560 differs from that shown. Reason: Frameshift at position 276.

Sequence annotation (Features)

Feature key          Position(s)    Length  Description                              Feature identifier

Molecule processing

Chain                1 – 1083       1083    Histone-lysine N-methyltransferase ATX2  PRO_0000233355

Regions

Domain               315 – 379      65      PWWP
Domain               918 – 1041     124     SET
Domain               1043 – 1059    17      Post-SET
Zinc finger          626 – 677      52      PHD-type 1
Zinc finger          740 – 767      28      PHD-type 2; degenerate
Compositional bias   41 – 48        8       Poly-Ser
Compositional bias   78 – 97        20      Arg-rich
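
Feature positions in this entry are 1-based and inclusive, so each listed length should equal end - start + 1. The following is a minimal sketch that re-checks the lengths transcribed from the feature list above and shows how such coordinates map onto a 0-based Python slice. The names `FEATURES`, `feature_length` and `to_slice` are illustrative only and are not part of any UniProt tool.

```python
# Feature rows transcribed from this entry:
# (key, start, end, listed length, description). Positions are 1-based, inclusive.
FEATURES = [
    ("Chain", 1, 1083, 1083, "Histone-lysine N-methyltransferase ATX2"),
    ("Domain", 315, 379, 65, "PWWP"),
    ("Domain", 918, 1041, 124, "SET"),
    ("Domain", 1043, 1059, 17, "Post-SET"),
    ("Zinc finger", 626, 677, 52, "PHD-type 1"),
    ("Zinc finger", 740, 767, 28, "PHD-type 2; degenerate"),
    ("Compositional bias", 41, 48, 8, "Poly-Ser"),
    ("Compositional bias", 78, 97, 20, "Arg-rich"),
]


def feature_length(start: int, end: int) -> int:
    """Return the length of a 1-based, inclusive interval."""
    if start < 1 or end < start:
        raise ValueError(f"invalid interval: {start}-{end}")
    return end - start + 1


def to_slice(start: int, end: int) -> slice:
    """Convert 1-based inclusive coordinates to a 0-based Python slice."""
    return slice(start - 1, end)


# Collect any rows whose listed length disagrees with the coordinates.
mismatches = [f for f in FEATURES if feature_length(f[1], f[2]) != f[3]]
print("rows with inconsistent length:", mismatches)
```

With the full sequence held in a string `seq`, `seq[to_slice(315, 379)]` would return the residues of the PWWP domain.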

Sequences

P0CB22-1 [UniParc]
Length: 1,083 AA
Mass (Da): 123,287
Last modified September 1, 2009. Version 1.
Checksum: 2F773B9406BE6102

        10         20         30         40         50         60 
MISMSCVPKE EEGEDTQIKT ELHDHAADNP VRYASLESVY SVSSSSSSLC CKTAAGSHKK 

        70         80         90        100        110        120 
VNALKLPMSD SFELQPHRRP EIVHVYCRRK RRRRRRRESF LELAILQNEG VERDDRIVKI 

       130        140        150        160        170        180 
ESAELDDEKE EENKKKKQKK RRIGNGELMK LGVDSTTLSV SATPPLRGCR IKAVCSGNKQ 

       190        200        210        220        230        240 
DGSSRSKRNT VKNQEKVVTA SATAKKWVRL SYDGVDPKHF IGLQCKVFWP LDAVWYPGSI 

       250        260        270        280        290        300 
VGYNVETKHH IVKYGDGDGE ELALRREKIK FLISRDDMEL LNMKFGTNDV VVDGQDYDEL 

       310        320        330        340        350        360 
VILAASFEEC QDFEPRDIIW AKLTGHAMWP AIIVDESVIV KRKGLNNKIS GGRSVLVQFF 

       370        380        390        400        410        420 
GTHDFARIQV KQAVSFLKGL LSRSPLKCKQ PRFEEAMEEA KMYLKEYKLP GRMDQLQKVA 

       430        440        450        460        470        480 
DTDCSERINS GEEDSSNSGD DYTKDGEVWL RPTELGDCLH RIGDLQIINL GRIVTDSEFF 

       490        500        510        520        530        540 
KDSKHTWPEG YTATRKFISL KDPNASAMYK MEVLRDAESK TRPVFRVTTN SGEQFKGDTP 

       550        560        570        580        590        600 
SACWNKIYNR IKKIQIASDN PDVLGEGLHE SGTDMFGFSN PEVDKLIQGL LQSRPPSKVS 

       610        620        630        640        650        660 
QRKYSSGKYQ DHPTGYRPVR VEWKDLDKCN VCHMDEEYEN NLFLQCDKCR MMVHTRCYGQ 

       670        680        690        700        710        720 
LEPHNGILWL CNLCRPVALD IPPRCCLCPV VGGAMKPTTD GRWAHLACAI WIPETCLLDV 

       730        740        750        760        770        780 
KKMEPIDGVK KVSKDRWKLL CSICGVSYGA CIQCSNNTCR VAYHPLCARA AGLCVELADE 

       790        800        810        820        830        840 
DRLFLLSMDD DEADQCIRLL SFCKRHRQTS NYHLETEYMI KPAHNIAEYL PPPNPSGCAR 

       850        860        870        880        890        900 
TEPYNYLGRR GRKEPEALAG ASSKRLFVEN QPYIVGGYSR HEFSTYERIY GSKMSQITTP 

       910        920        930        940        950        960 
SNILSMAEKY TFMKETYRKR LAFGKSGIHG FGIFAKLPHR AGDMVIEYTG ELVRPPIADK 

       970        980        990       1000       1010       1020 
REHLIYNSMV GAGTYMFRID NERVIDATRT GSIAHLINHS CEPNCYSRVI SVNGDEHIII 

      1030       1040       1050       1060       1070       1080 
FAKRDVAKWE ELTYDYRFFS IDERLACYCG FPRCRGVVND TEAEERQANI HASRCELKEW 


TES 
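
The sequence above is displayed in rows of 60 residues, grouped in blocks of 10, under ruler lines of position numbers. As a rough sketch, the display can be turned back into one contiguous string by dropping the ruler lines and the spaces. The function name `parse_sequence_display` is hypothetical, and only the first two rows are embedded here to keep the example short.

```python
def parse_sequence_display(text: str) -> str:
    """Join a blocked, ruler-annotated sequence display into one string."""
    residues = []
    for line in text.splitlines():
        tokens = line.split()
        # Skip blank lines and ruler lines that contain only position numbers.
        if not tokens or all(t.isdigit() for t in tokens):
            continue
        residues.append("".join(tokens))
    return "".join(residues)


# First two display rows of this entry (positions 1-120).
FIRST_ROWS = """\
        10         20         30         40         50         60
MISMSCVPKE EEGEDTQIKT ELHDHAADNP VRYASLESVY SVSSSSSSLC CKTAAGSHKK

        70         80         90        100        110        120
VNALKLPMSD SFELQPHRRP EIVHVYCRRK RRRRRRRESF LELAILQNEG VERDDRIVKI
"""

seq = parse_sequence_display(FIRST_ROWS)

# Positions are 1-based and inclusive, so subtract 1 from the start only.
poly_ser = seq[41 - 1:48]   # Poly-Ser compositional bias, positions 41-48
arg_rich = seq[78 - 1:97]   # Arg-rich compositional bias, positions 78-97

print(len(seq), poly_ser, arg_rich)
```

Applied to all rows of the display, the same function should yield a string of 1,083 residues, matching the sequence length stated for this entry.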

References

[1] "Sequence and analysis of chromosome 1 of the plant Arabidopsis thaliana."
Theologis A., Ecker J.R., Palm C.J., Federspiel N.A., Kaul S., White O., Alonso J., Altafi H., Araujo R., Bowman C.L., Brooks S.Y., Buehler E., Chan A., Chao Q., Chen H., Cheuk R.F., Chin C.W., Chung M.K., Conn L., Conway A.B., Conway A.R., Creasy T.H., Dewar K., Dunn P., Etgu P., Feldblyum T.V., Feng J.-D., Fong B., Fujii C.Y., Gill J.E., Goldsmith A.D., Haas B., Hansen N.F., Hughes B., Huizar L., Hunter J.L., Jenkins J., Johnson-Hopson C., Khan S., Khaykin E., Kim C.J., Koo H.L., Kremenetskaia I., Kurtz D.B., Kwan A., Lam B., Langin-Hooper S., Lee A., Lee J.M., Lenz C.A., Li J.H., Li Y.-P., Lin X., Liu S.X., Liu Z.A., Luros J.S., Maiti R., Marziali A., Militscher J., Miranda M., Nguyen M., Nierman W.C., Osborne B.I., Pai G., Peterson J., Pham P.K., Rizzo M., Rooney T., Rowley D., Sakano H., Salzberg S.L., Schwartz J.R., Shinn P., Southwick A.M., Sun H., Tallon L.J., Tambunga G., Toriumi M.J., Town C.D., Utterback T., Van Aken S., Vaysberg M., Vysotskaia V.S., Walker M., Wu D., Yu G., Fraser C.M., Venter J.C., Davis R.W.
Nature 408:816-820(2000) [PubMed: 11130712]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: cv. Columbia.
[2] "Large-scale analysis of RIKEN Arabidopsis full-length (RAFL) cDNAs."
Totoki Y., Seki M., Ishida J., Nakajima M., Enju A., Kamiya A., Narusaka M., Shin-i T., Nakagawa M., Sakamoto N., Oishi K., Kohara Y., Kobayashi M., Toyoda A., Sakaki Y., Sakurai T., Iida K., Akiyama K., Satou M., Toyoda T., Konagaya A., Carninci P., Kawai J., Hayashizaki Y., Shinozaki K.
Submitted (JUL-2006) to the EMBL/GenBank/DDBJ databases
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
Strain: cv. Columbia.
[3] "Two Arabidopsis homologs of the animal trithorax genes: a new structural domain is a signature feature of the trithorax gene family."
Alvarez-Venegas R., Avramova Z.
Gene 271:215-221(2001) [PubMed: 11418242]
Cited for: TISSUE SPECIFICITY.
[4] "The Arabidopsis thaliana genome contains at least 29 active genes encoding SET domain proteins that can be assigned to four evolutionarily conserved classes."
Baumbusch L.O., Thorstensen T., Krauss V., Fischer A., Naumann K., Assalkhou R., Schulz I., Reuter G., Aalen R.B.
Nucleic Acids Res. 29:4319-4333(2001) [PubMed: 11691919]
Cited for: NOMENCLATURE.

Web resources

PlantsUBQ

A functional genomics database for the ubiquitin/26S proteasome proteolytic pathway in plants

Cross-references

Sequence databases

AC009999 Genomic DNA. Translation: AAF29390.1. Sequence problems.
AK226560 mRNA. No translation available.
IPI: IPI00519225.
PIR: A86193.
RefSeq: NP_001077464.4.
    NP_172074.6.

Proteomic databases

PRIDE: P0CB22.

Genome annotation databases

GeneID: 837093.

Organism-specific databases

TAIR: At1g05830.

Enzyme and pathway databases

BRENDA: 2.1.1.43. 302.

Gene expression databases

Genevestigator: P0CB22.
GermOnline: AT1G05830. Arabidopsis thaliana.

Family and domain databases

InterPro: IPR003889. FYrich_C.
    IPR018516. FYrich_C_sg.
    IPR003888. FYrich_N.
    IPR018518. FYrich_N_sg.
    IPR003616. Post-SET_Zn_bd.
    IPR000313. PWWP.
    IPR001214. SET.
    IPR019786. Zinc_finger_PHD-type_CS.
    IPR011011. Znf_FYVE_PHD.
    IPR001965. Znf_PHD.
    IPR019787. Znf_PHD-finger.
Pfam: PF05965. FYRC. 1 hit.
    PF05964. FYRN. 1 hit.
    PF00628. PHD. 2 hits.
    PF00855. PWWP. 1 hit.
    PF00856. SET. 1 hit.
SMART: SM00542. FYRC. 1 hit.
    SM00541. FYRN. 1 hit.
    SM00249. PHD. 2 hits.
    SM00508. PostSET. 1 hit.
    SM00317. SET. 1 hit.
PROSITE: PS50868. POST_SET. 1 hit.
    PS50812. PWWP. 1 hit.
    PS50280. SET. 1 hit.
    PS01359. ZF_PHD_1. 1 hit.
    PS50016. ZF_PHD_2. 1 hit.

Entry information

Entry name: ATX2_ARATH
Accession
Primary (citable) accession number: P0CB22
Secondary accession number(s): Q8RXL4, Q9MA43
Entry history
Integrated into UniProtKB/Swiss-Prot: September 1, 2009
Last sequence update: September 1, 2009
Last modified: November 24, 2009
This is version 4 of the entry and version 1 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation project: PPAP (Plant Proteome Annotation Project)
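
When the primary and secondary accession numbers above are handled in scripts, it can help to check their format first. The sketch below uses the accession pattern described in the UniProt help pages, reproduced here on the assumption that it is still current, so it should be checked against the documentation before being relied on. `ACCESSION_RE` and `is_accession` are illustrative names.

```python
import re

# Assumed UniProtKB accession pattern (6 or 10 characters), taken from the
# UniProt help pages; verify against the current documentation.
ACCESSION_RE = re.compile(
    r"[OPQ][0-9][A-Z0-9]{3}[0-9]|[A-NR-Z][0-9](?:[A-Z][A-Z0-9]{2}[0-9]){1,2}"
)


def is_accession(value: str) -> bool:
    """Return True if the whole string looks like a UniProtKB accession."""
    return ACCESSION_RE.fullmatch(value) is not None


# Accessions listed in this entry, plus the entry name for contrast.
for candidate in ("P0CB22", "Q8RXL4", "Q9MA43", "ATX2_ARATH"):
    print(candidate, is_accession(candidate))
```

The entry name ATX2_ARATH is a separate identifier with its own format, so it is expected to fail this check.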

Relevant documents

Arabidopsis thaliana

Arabidopsis thaliana: entries and gene names

SIMILARITY comments

Index of protein domains and families