Reviewed,
UniProtKB/Swiss-Prot O96028 (NSD2_HUMAN)
Last modified
November 3, 2009.
Version 79.
History...
Clusters with 100%,
90%,
50% identity |
Documents (3) |
Third-party data |
Customize display | text xml rdf/xml gff fasta |
Names and origin
| Protein names | Recommended name: Probable histone-lysine N-methyltransferase NSD2 EC=2.1.1.43 Alternative name(s): Nuclear SET domain-containing protein 2 Wolf-Hirschhorn syndrome candidate 1 protein Multiple myeloma SET domain-containing protein Protein trithorax-5 | ||||
| Gene names |
| ||||
| Organism | Homo sapiens (Human) [Complete proteome] | ||||
| Taxonomic identifier | 9606 [NCBI] | ||||
| Taxonomic lineage | Eukaryota › Metazoa › Chordata › Craniata › Vertebrata › Euteleostomi › Mammalia › Eutheria › Euarchontoglires › Primates › Haplorrhini › Catarrhini › Hominidae › Homo |
Protein attributes
| Sequence length | 1365 AA. |
| Sequence status | Complete. |
| Sequence processing | The displayed sequence is not processed. |
| Protein existence | Evidence at protein level. |
General annotation (Comments)
| Function | Probable histone methyltransferase By similarity. May act as a transcription regulator that binds DNA and suppresses IL5 transcription. |
| Catalytic activity | S-adenosyl-L-methionine + histone L-lysine = S-adenosyl-L-homocysteine + histone N(6)-methyl-L-lysine. |
| Subcellular location | |
| Tissue specificity | Widely expressed. Ref.2 |
| Post-translational modification | Phosphorylated upon DNA damage, probably by ATM or ATR. Ref.16 Ref.17 Ref.18 Ref.19 |
| Involvement in disease | A chromosomal aberration involving WHSC1 is a cause of multiple myeloma tumors. Translocation t(4;14)(p16.3;q32.3) with IgH. Ref.15 A chromosomal aberration involving WHSC1 is a cause of Wolf-Hirschhorn syndrome (WHS) [MIM:194190]. WHS is caused by sub-telomeric deletions in the short arm of chromosome 4. WHS is characterized by profound mental retardation, heart defects, and facial clefting. WHSC1 is deleted in every case, however deletion of linked genes contributes to both the severity of the core characteristics and the presence of the additional syndromic problems. |
| Sequence similarities | Belongs to the histone-lysine methyltransferase family. SET2 subfamily. Contains 1 AWS domain. Contains 1 HMG box DNA-binding domain. Contains 4 PHD-type zinc fingers. Contains 1 post-SET domain. Contains 2 PWWP domains. Contains 1 SET domain. |
Ontologies
Alternative products
| This entry describes 7 isoforms produced by alternative splicing. [Align] [Select] | ||||||
| Isoform 1 (identifier: O96028-1) This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry. | ||||||
| Isoform 2 (identifier: O96028-2) The sequence of this isoform differs from the canonical sequence as follows: 1-652: Missing. 653-712: VSSKKSERGV...GRFTCSECAS → MAGSFCWRML...SLHPFFNFLL | ||||||
| Note: No experimental confirmation available. | ||||||
| Isoform 3 (identifier: O96028-3) The sequence of this isoform differs from the canonical sequence as follows: 628-647: VSDSPGDEPSESPYESADET → LLWEPTPVKLDLNPAALYCT 648-1365: Missing. | ||||||
| Isoform 4 (identifier: O96028-4) Also known as: REII-BP; IL-5 promoter REII-region-binding protein; The sequence of this isoform differs from the canonical sequence as follows: 1-781: Missing. | ||||||
| Isoform 5 (identifier: O96028-5) The sequence of this isoform differs from the canonical sequence as follows: 629-629: S → K 630-1365: Missing. | ||||||
| Isoform 6 (identifier: O96028-6) The sequence of this isoform differs from the canonical sequence as follows: 472-484: VAEHPDASGEEIE → STKLCFMLASFRI 485-1365: Missing. | ||||||
| Note: No experimental confirmation available. | ||||||
| Isoform 7 (identifier: O96028-7) The sequence of this isoform differs from the canonical sequence as follows: 255-273: QKKSARQYHVQFFGDAPER → IFKSKKFEHLKTSQIVLKD 274-1365: Missing. |
Sequence annotation (Features)
| Feature key | Position(s) | Length | Description | Graphical view | Feature identifier | ||||
Molecule processing | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| Chain | 1 – 1365 | 1365 | Probable histone-lysine N-methyltransferase NSD2 | PRO_0000259519 | |||||
Regions | |||||||||
| Domain | 222 – 286 | 65 | PWWP 1 | ||||||
| Domain | 880 – 942 | 63 | PWWP 2 | ||||||
| Domain | 1011 – 1061 | 51 | AWS | ||||||
| Domain | 1062 – 1184 | 123 | SET | ||||||
| Domain | 1187 – 1203 | 17 | Post-SET | ||||||
| DNA binding | 453 – 521 | 69 | HMG box Ref.4 | ||||||
| Zinc finger | 667 – 713 | 47 | PHD-type 1 | ||||||
| Zinc finger | 714 – 770 | 57 | PHD-type 2 | ||||||
| Zinc finger | 831 – 875 | 45 | PHD-type 3 | ||||||
| Zinc finger | 1239 – 1286 | 48 | PHD-type 4; atypical | ||||||
Amino acid modifications | |||||||||
| Modified residue | 102 | 1 | Phosphoserine Ref.17 | ||||||
| Modified residue | 110 | 1 | Phosphothreonine Ref.19 | ||||||
| Modified residue | 115 | 1 | Phosphothreonine Ref.19 | ||||||
| Modified residue | 121 | 1 | Phosphoserine Ref.19 | ||||||
| Modified residue | 437 | 1 | Phosphoserine Ref.18 | ||||||
| Modified residue | 544 | 1 | Phosphothreonine Ref.16 Ref.18 Ref.19 | ||||||
| Modified residue | 779 | 1 | N6-acetyllysine Ref.20 | ||||||
| Modified residue | 781 | 1 | N6-acetyllysine Ref.20 | ||||||
Natural variations | |||||||||
| Alternative sequence | 1 – 781 | 781 | Missing in isoform 4. | VSP_021413 | |||||
| Alternative sequence | 1 – 652 | 652 | Missing in isoform 2. | VSP_021414 | |||||
| Alternative sequence | 255 – 273 | 19 | QKKSA…DAPER → IFKSKKFEHLKTSQIVLKD in isoform 7. | VSP_021415 | |||||
| Alternative sequence | 274 – 1365 | 1092 | Missing in isoform 7. | VSP_021416 | |||||
| Alternative sequence | 472 – 484 | 13 | VAEHP…GEEIE → STKLCFMLASFRI in isoform 6. | VSP_021417 | |||||
| Alternative sequence | 485 – 1365 | 881 | Missing in isoform 6. | VSP_021418 | |||||
| Alternative sequence | 628 – 647 | 20 | VSDSP…SADET → LLWEPTPVKLDLNPAALYCT in isoform 3. | VSP_021419 | |||||
| Alternative sequence | 629 | 1 | S → K in isoform 5. | VSP_021420 | |||||
| Alternative sequence | 630 – 1365 | 736 | Missing in isoform 5. | VSP_021421 | |||||
| Alternative sequence | 648 – 1365 | 718 | Missing in isoform 3. | VSP_021422 | |||||
| Alternative sequence | 653 – 712 | 60 | VSSKK…SECAS → MAGSFCWRMLGLVSKVGNRA RCFSSVAASEEELLDFSGSE LQFNSCSLHLSLHPFFNFLL in isoform 2. | VSP_021423 | |||||
Experimental info | |||||||||
| Sequence conflict | 210 | 1 | T → A in AAU09264. Ref.5 | ||||||
Sequences
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||
References
| « Hide 'large scale' references | |
| [1] | "The t(4;14) translocation in myeloma dysregulates both FGFR3 and a novel gene, MMSET, resulting in IgH/MMSET hybrid transcripts." Chesi M., Nardini E., Lim R.S.C., Smith K.D., Kuehl W.M., Bergsagel P.L. Blood 92:3025-3034(1998) [PubMed: 9787135] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [MRNA] (ISOFORMS 1 AND 3), CHROMOSOMAL TRANSLOCATION WITH IGH. Tissue: Myeloma. |
| [2] | "WHSC1, a 90 kb SET domain-containing gene, expressed in early development and homologous to a Drosophila dysmorphy gene maps in the Wolf-Hirschhorn syndrome critical region and is fused to IgH in t(4;14) multiple myeloma." Stec I., Wright T.J., van Ommen G.-J.B., de Boer P.A., van Haeringen A., Moorman A.F.M., Altherr M.R., den Dunnen J.T. Hum. Mol. Genet. 7:1071-1082(1998) [PubMed: 9618163] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA / MRNA] (ISOFORMS 1; 3 AND 5), TISSUE SPECIFICITY, CHROMOSOMAL TRANSLOCATION WITH IGH. |
| [3] | Erratum Stec I., Wright T.J., van Ommen G.-J.B., de Boer P.A., van Haeringen A., Moorman A.F.M., Altherr M.R., den Dunnen J.T. Hum. Mol. Genet. 7:1527-1527(1998) |
| [4] | "A unique mRNA initiated within a middle intron of WHSC1/MMSET encodes a DNA binding protein that suppresses human IL-5 transcription." Garlisi C.G., Uss A.S., Xiao H., Tian F., Sheridan K.E., Wang L., Motasim Billah M., Egan R.W., Stranick K.S., Umland S.P. Am. J. Respir. Cell Mol. Biol. 24:90-98(2001) [PubMed: 11152655] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM 4), FUNCTION, DNA-BINDING. |
| [5] | "Overexpression of transcripts originating from the MMSET locus characterizes all t(4;14)(p16;q32)-positive multiple myeloma patients." Keats J.J., Maxwell C.A., Taylor B.J., Hendzel M.J., Chesi M., Bergsagel P.L., Larratt L.M., Mant M.J., Reiman T., Belch A.R., Pilarski L.M. Blood 105:4060-4069(2005) [PubMed: 15677557] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM 7), SUBCELLULAR LOCATION. |
| [6] | "Mammalian trithorax- and ASH1-like proteins: putative chromatin regulators which contain PHD fingers and SET domains." Angrand P.-O., Valvatne H., Jeanmougin F., Adamson A., van der Hoeven F., Olsen L., Tekotte H., Huang N., Poch O., Lamerdin J., Chambon P., Losson R., Stewart A., Aasland R. Submitted (JUN-1998) to the EMBL/GenBank/DDBJ databases Cited for: NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM 1). |
| [7] | "Prediction of the coding sequences of unidentified human genes. XIV. The complete sequences of 100 new cDNA clones from brain which code for large proteins in vitro." Kikuno R., Nagase T., Ishikawa K., Hirosawa M., Miyajima N., Tanaka A., Kotani H., Nomura N., Ohara O. DNA Res. 6:197-205(1999) [PubMed: 10470851] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 2). Tissue: Brain. |
| [8] | "The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)." The MGC Project Team Genome Res. 14:2121-2127(2004) [PubMed: 15489334] [Abstract] Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORMS 2; 4 AND 6), NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 496-1365 (ISOFORM 3). Tissue: Ovary and Spleen. |
| [9] | "Detection of t(4;14)(p16.3;q32) chromosomal translocation in multiple myeloma by reverse transcription-polymerase chain reaction analysis of IGH-MMSET fusion transcripts." Malgeri U., Baldini L., Perfetti V., Fabris S., Vignarelli M.C., Colombo G., Lotti V., Compasso S., Bogni S., Lombardi L., Maiolo A.T., Neri A. Cancer Res. 60:4058-4061(2000) [PubMed: 10945609] [Abstract] Cited for: CHROMOSOMAL TRANSLOCATION WITH IGH. |
| [10] | "Translocation T(4;14)(p16.3;q32) is a recurrent genetic lesion in primary amyloidosis." Perfetti V., Coluccia A.M., Intini D., Malgeri U., Vignarelli M.C., Casarini S., Merlini G., Neri A. Am. J. Pathol. 158:1599-1603(2001) [PubMed: 11337357] [Abstract] Cited for: CHROMOSOMAL TRANSLOCATION WITH IGH. |
| [11] | "A subset of multiple myeloma harboring the t(4;14)(p16;q32) translocation lacks FGFR3 expression but maintains an IGH/MMSET fusion transcript." Santra M., Zhan F., Tian E., Barlogie B., Shaughnessy J. Jr. Blood 101:2374-2376(2003) [PubMed: 12433679] [Abstract] Cited for: CHROMOSOMAL TRANSLOCATION WITH IGH. |
| [12] | "Identification of a novel IGH-MMSET fusion transcript in a human myeloma cell line with the t(4;14)(p16.3;q32) chromosomal translocation." Intini D., Fabris S., Storlazzi T., Otsuki T., Ciceri G., Verdelli D., Lombardi L., Rocchi M., Neri A. Br. J. Haematol. 126:437-439(2004) [PubMed: 15257719] [Abstract] Cited for: CHROMOSOMAL TRANSLOCATION WITH IGH. |
| [13] | "Identification of ID-1 as a potential target gene of MMSET in multiple myeloma." Hudlebusch H.R., Theilgaard-Moench K., Lodahl M., Johnsen H.E., Rasmussen T. Br. J. Haematol. 130:700-708(2005) [PubMed: 16115125] [Abstract] Cited for: FUNCTION. |
| [14] | "Transcription repression activity is associated with the type I isoform of the MMSET gene involved in t(4;14) in multiple myeloma." Todoerti K., Ronchetti D., Agnelli L., Castellani S., Marelli S., Deliliers G.L., Zanella A., Lombardi L., Neri A. Br. J. Haematol. 131:214-218(2005) [PubMed: 16197452] [Abstract] Cited for: SUBCELLULAR LOCATION. |
| [15] | "The etiology of Wolf-Hirschhorn syndrome." Bergemann A.D., Cole F., Hirschhorn K. Trends Genet. 21:188-195(2005) [PubMed: 15734578] [Abstract] Cited for: INVOLVEMENT IN WHS. |
| [16] | "Global, in vivo, and site-specific phosphorylation dynamics in signaling networks." Olsen J.V., Blagoev B., Gnad F., Macek B., Kumar C., Mortensen P., Mann M. Cell 127:635-648(2006) [PubMed: 17081983] [Abstract] Cited for: PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT THR-544, MASS SPECTROMETRY. Tissue: Epithelium. |
| [17] | "ATM and ATR substrate analysis reveals extensive protein networks responsive to DNA damage." Matsuoka S., Ballif B.A., Smogorzewska A., McDonald E.R. III, Hurov K.E., Luo J., Bakalarski C.E., Zhao Z., Solimini N., Lerenthal Y., Shiloh Y., Gygi S.P., Elledge S.J. Science 316:1160-1166(2007) [PubMed: 17525332] [Abstract] Cited for: PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-102, MASS SPECTROMETRY. |
| [18] | "Evaluation of the low-specificity protease elastase for large-scale phosphoproteome analysis." Wang B., Malik R., Nigg E.A., Korner R. Anal. Chem. 80:9526-9533(2008) [PubMed: 19007248] [Abstract] Cited for: PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-437 AND THR-544, MASS SPECTROMETRY. |
| [19] | "A quantitative atlas of mitotic phosphorylation." Dephoure N., Zhou C., Villen J., Beausoleil S.A., Bakalarski C.E., Elledge S.J., Gygi S.P. Proc. Natl. Acad. Sci. U.S.A. 105:10762-10767(2008) [PubMed: 18669648] [Abstract] Cited for: PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT THR-110; THR-115; SER-121 AND THR-544, MASS SPECTROMETRY. |
| [20] | "Lysine acetylation targets protein complexes and co-regulates major cellular functions." Choudhary C., Kumar C., Gnad F., Nielsen M.L., Rehman M., Walther T., Olsen J.V., Mann M. Science 325:834-840(2009) [PubMed: 19608861] [Abstract] Cited for: ACETYLATION [LARGE SCALE ANALYSIS] AT LYS-779 AND LYS-781, MASS SPECTROMETRY. |
| + | Additional computationally mapped references. |
Cross-references
Sequence databases | |
|---|---|
| AF071593 mRNA. Translation: AAC24150.1. AF071594 mRNA. Translation: AAC24151.1. AF083386 mRNA. Translation: AAD19343.1. AF083387 mRNA. Translation: AAD21770.1. AF083388 mRNA. Translation: AAD21771.1. AF083389 mRNA. Translation: AAD19344.1. AF083390 mRNA. Translation: AAD19345.1. AF083391 mRNA. Translation: AAD19346.1. AF178206 AF178200 Genomic DNA. Translation: AAF23369.1. AF178219 AF178201 Genomic DNA. Translation: AAF23370.1. AF330040 mRNA. Translation: AAK00344.1. AY694128 mRNA. Translation: AAU09264.1. AJ007042 mRNA. Translation: CAB45386.1. AB029013 mRNA. Translation: BAA83042.2. Different initiation. BC052254 mRNA. Translation: AAH52254.1. BC070176 mRNA. Translation: AAH70176.1. BC094825 mRNA. Translation: AAH94825.2. BC152412 mRNA. Translation: AAI52413.1. | |
| IPI | IPI00107486. IPI00107487. IPI00218240. IPI00334604. IPI00470433. IPI00790144. IPI00792674. |
| RefSeq | NP_001035889.1. NP_015627.1. NP_579877.1. NP_579878.1. NP_579889.1. NP_579890.1. |
| UniGene | Hs.113876 |
3D structure databases | |
| HSSP | HSSP built from PDB template 1F62 based on UniProtKB Q9UIG0. |
| ModBase | Search... |
Protein-protein interaction databases | |
| STRING | O96028. |
PTM databases | |
| PhosphoSite | O96028. |
Proteomic databases | |
| PRIDE | O96028. |
Genome annotation databases | |
| Ensembl | ENST00000312087; ENSP00000308780; ENSG00000109685; Homo sapiens. [Genome view] ENST00000350915; ENSP00000264753; ENSG00000109685; Homo sapiens. [Genome view] ENST00000353275; ENSP00000329167; ENSG00000109685; Homo sapiens. [Genome view] ENST00000382888; ENSP00000372344; ENSG00000109685; Homo sapiens. [Genome view] ENST00000382889; ENSP00000372345; ENSG00000109685; Homo sapiens. [Genome view] ENST00000382891; ENSP00000372347; ENSG00000109685; Homo sapiens. [Genome view] ENST00000382892; ENSP00000372348; ENSG00000109685; Homo sapiens. [Genome view] ENST00000382895; ENSP00000372351; ENSG00000109685; Homo sapiens. [Genome view] ENST00000398261; ENSP00000381311; ENSG00000109685; Homo sapiens. [Genome view] ENST00000420906; ENSP00000399251; ENSG00000109685; Homo sapiens. [Genome view] ENST00000436793; ENSP00000416725; ENSG00000109685; Homo sapiens. [Genome view] |
| GeneID | 7468. |
| KEGG | hsa:7468. |
| UCSC | uc003gdx.2. human. uc003gdz.2. human. uc003geg.1. human. uc003geh.1. human. uc003gei.2. human. |
Organism-specific databases | |
| CTD | 7468. |
| GeneCards | GC04P001840. |
| HGNC | HGNC:12766. WHSC1. |
| HPA | HPA015315. HPA015801. |
| MIM | 194190. phenotype. 602952. gene. |
| Orphanet | 280. Wolf-Hirschhorn syndrome. |
| PharmGKB | PA37369. |
| HUGE | Search... |
| GenAtlas | Search... |
Phylogenomic databases | |
| HOVERGEN | O96028. |
| OMA | CWRRVTE. |
Enzyme and pathway databases | |
| BRENDA | 2.1.1.43. 247. |
Gene expression databases | |
| ArrayExpress | O96028. |
| Bgee | O96028. |
| Genevestigator | O96028. |
| GermOnline | ENSG00000109685. Homo sapiens. |
Family and domain databases | |
| InterPro | IPR006560. AWS. IPR000910. HMG_HMG1/HMG2. IPR003616. Post-SET_Zn_bd. IPR000313. PWWP. IPR001214. SET. IPR019786. Zinc_finger_PHD-type_CS. IPR001965. Znf_PHD. IPR019787. Znf_PHD-finger. IPR001841. Znf_RING. [Graphical view] |
| Gene3D | G3DSA:1.10.30.10. HMG-box. 1 hit. |
| Pfam | PF00505. HMG_box. 1 hit. PF00628. PHD. 3 hits. PF00855. PWWP. 2 hits. PF00856. SET. 1 hit. [Graphical view] |
| SMART | SM00570. AWS. 1 hit. SM00398. HMG. 1 hit. SM00249. PHD. 4 hits. SM00508. PostSET. 1 hit. SM00293. PWWP. 2 hits. SM00184. RING. 2 hits. SM00317. SET. 1 hit. [Graphical view] |
| PROSITE | PS51215. AWS. 1 hit. PS50118. HMG_BOX_2. 1 hit. PS50868. POST_SET. 1 hit. PS50812. PWWP. 2 hits. PS50280. SET. 1 hit. PS01359. ZF_PHD_1. 2 hits. PS50016. ZF_PHD_2. 2 hits. [Graphical view] |
| ProtoNet | Search... |
Other Resources | |
| NextBio | 29246. |
| SOURCE | Search... |
Entry information
| Entry name | NSD2_HUMAN | ||||||||
| Accession | Primary (citable) accession number: O96028 Secondary accession number(s): A7MCZ1 Q9UPR2 | ||||||||
| Entry history |
| ||||||||
| Entry status | Reviewed (UniProtKB/Swiss-Prot) | ||||||||
| Annotation project | HPI (Human Proteome Initiative) | ||||||||
| Disclaimer | Any medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care. | ||||||||
Relevant documents
| Human chromosome 4 Human chromosome 4: entries, gene names and cross-references to MIM |
| MIM cross-references Online Mendelian Inheritance in Man (MIM) cross-references in UniProtKB/Swiss-Prot |
| SIMILARITY comments Index of protein domains and families |

Clusters with


