Protein

Histone-lysine N-methyltransferase

Gene

Nsd1

Organism
Mus musculus (Mouse)
Status
Unreviewed - Annotation score: 2 out of 5 - Protein predicted

Function

Catalytic activity

S-adenosyl-L-methionine + L-lysine-[histone] = S-adenosyl-L-homocysteine + N(6)-methyl-L-lysine-[histone]. (SAAS annotation)

GO - Molecular function

  1. histone-lysine N-methyltransferase activity Source: UniProtKB-EC
  2. transcription cofactor activity Source: Ensembl
  3. zinc ion binding Source: InterPro

GO - Biological process

  1. positive regulation of transcription, DNA-templated Source: Ensembl

Keywords - Molecular function

Methyltransferase (UniRule annotation, SAAS annotation), Transferase

Keywords - Ligand

Metal-binding, S-adenosyl-L-methionine (SAAS annotation), Zinc

Enzyme and pathway databases

Reactome: REACT_278269. PKMTs methylate histone lysines.

Names & Taxonomy

Protein names
Recommended name:
Histone-lysine N-methyltransferase (SAAS annotation), EC:2.1.1.43 (SAAS annotation)
Gene names
Name: Nsd1 (Imported)
Organism: Mus musculus (Mouse) (Imported)
Taxonomic identifier: 10090 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Glires > Rodentia > Sciurognathi > Muroidea > Muridae > Murinae > Mus > Mus
Proteomes: UP000000589. Component: Chromosome 13

Organism-specific databases

MGI: MGI:1276545. Nsd1.

Subcellular location

GO - Cellular component

  1. nucleus Source: UniProtKB-KW

Keywords - Cellular component

Nucleus (SAAS annotation)

PTM / Processing

Proteomic databases

MaxQB: E9QAE4.
PRIDE: E9QAE4.

Expression

Gene expression databases

Bgee: E9QAE4.
ExpressionAtlas: E9QAE4. baseline and differential.

Structure

3D structure databases

ProteinModelPortal: E9QAE4.
SMR: E9QAE4. Positions 1754-1851, 1853-2083, 2119-2211.
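
The SMR position ranges above can be turned into a rough coverage figure. The sketch below is only an illustration: it assumes the ranges are 1-based and inclusive, takes the sequence length of 2,691 from this entry, and uses a hypothetical helper name `covered_residues`.

```python
def covered_residues(ranges):
    """Count residues covered by inclusive, 1-based (start, end) ranges.

    A set is used so that overlapping ranges are not counted twice.
    """
    covered = set()
    for start, end in ranges:
        covered.update(range(start, end + 1))
    return len(covered)


# Position ranges listed for SMR in this entry.
smr_ranges = [(1754, 1851), (1853, 2083), (2119, 2211)]
sequence_length = 2691  # "Length" reported for E9QAE4-1

n_covered = covered_residues(smr_ranges)
fraction = n_covered / sequence_length
print(n_covered, f"{fraction:.1%}")  # 422 15.7%
```

Under these assumptions the modelled regions span 422 residues, a small part of the full-length protein near the C-terminal half.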

Family & Domains

Sequence similarities

Contains 1 post-SET domain. (UniRule annotation)

Keywords - Domain

Zinc-finger (UniRule annotation, SAAS annotation)

Phylogenomic databases

GeneTree: ENSGT00780000121845.
KO: K15588.
OMA: ECRTGIH.
OrthoDB: EOG7Z69BG.
TreeFam: TF329088.

Family and domain databases

Gene3D: 3.30.40.10. 3 hits.
InterPro: IPR006560. AWS_dom.
  IPR003616. Post-SET_dom.
  IPR000313. PWWP_dom.
  IPR001214. SET_dom.
  IPR019786. Zinc_finger_PHD-type_CS.
  IPR011011. Znf_FYVE_PHD.
  IPR001965. Znf_PHD.
  IPR019787. Znf_PHD-finger.
  IPR001841. Znf_RING.
  IPR013083. Znf_RING/FYVE/PHD.
Pfam: PF00628. PHD. 1 hit.
  PF00855. PWWP. 2 hits.
  PF00856. SET. 1 hit.
SMART: SM00570. AWS. 1 hit.
  SM00249. PHD. 5 hits.
  SM00508. PostSET. 1 hit.
  SM00293. PWWP. 2 hits.
  SM00184. RING. 1 hit.
  SM00317. SET. 1 hit.
SUPFAM: SSF57903. SSF57903. 3 hits.
PROSITE: PS51215. AWS. 1 hit.
  PS50868. POST_SET. 1 hit.
  PS50812. PWWP. 2 hits.
  PS50280. SET. 1 hit.
  PS01359. ZF_PHD_1. 2 hits.
  PS50016. ZF_PHD_2. 2 hits.
  PS50089. ZF_RING_2. 1 hit.

Sequence

Sequence status: Complete.

E9QAE4-1 [UniParc]

        10         20         30         40         50
MDRTCELSRR NCLLSFSNPV NLDASEDKDS PFGNGQSNFS EPLNGCTMQL
60 70 80 90 100
PTAASGTSQN AYGQDSPSCY IPLRRLQDLA SMINVEYLSG SADGSESFQD
110 120 130 140 150
PAKSDSRAQS PIVCTSLSPG GPTALAMKQE PTCNNSPELQ LRVTKTTKNG
160 170 180 190 200
FLHFENFTGV DDADVDSEMD PEQPVTEDES IEEIFEETQT NATCNYEPKS
210 220 230 240 250
ENGVEVAMGS EQDSMPESRH GAVERPFLPL APQTEKQKNK QRSEVDGSNE
260 270 280 290 300
KTALLPAPTS LGDTNVTVEE QFNSINLSFQ DDPDSSPSPL GNMLEIPGTS
310 320 330 340 350
SPSTSQELPF CQPKKKSTPL KYEVGDLIWA KFKRRPWWPC RICSDPLINT
360 370 380 390 400
HSKMKVANRR PYREYYVEAF GDPSEKAWVA GKAIVMFEGR HQFEELPVLR
410 420 430 440 450
KRGKQKEKGY RHKVPQKILS KWEASVGLAE QYDVPKGSKN QKCVSSSVKL
460 470 480 490 500
DSEEDMPFED CTNDPDSEHL LLNGCLKSLA FDSEHSADEK EKPCAKSRVR
510 520 530 540 550
KSSDNIKRTS VKKDLVPFES RKEERRGKIP DNLGLDFISG GVSDKQASNE
560 570 580 590 600
LSRIANSLTG SSTAPGSFLF SSSVQNTAKT DFETPDCDSL SGLSESALIS
610 620 630 640 650
KHSGEKKKLQ PGQVCSSKVQ LCYVGAGDEE KRSNSVSVST TSDDGCSDLD
660 670 680 690 700
PTEHNSGFQN SVLGITDAFD KTENALSVHK NETQYSRYPV TNRIKEKQKS
710 720 730 740 750
LITNSHADHL MGSTKTMEPE TAELSQVNLS DLKISSPIPK PQPEFRNDGL
760 770 780 790 800
TTKFSAPPGI RNENPLTKGG LANQTLLPLK CRQPKFRSIK CKHKESPAVA
810 820 830 840 850
ETSATSEDLS LKCCSSDTNG SPLANISKSG KGEGLKLLNN MHEKTRDSSD
860 870 880 890 900
IETAVVKHVL SELKELSYRS LSEDVSDSGT AKASKPLLFS SASSQNHIPI
910 920 930 940 950
EPDYKFSTLL MMLKDMHDSK TKEQRLMTAQ NLASYRTPDR GDCSSGSPVG
960 970 980 990 1000
TSKVLVLGSS TPNSEKPGDS TQDSVHQSPG GGDSALSGEL SSSLSSLASD
1010 1020 1030 1040 1050
KRELPACGKI RSNCIPRRNC GRAKPSSKLR ETISAQMVKP SVNPKALKTE
1060 1070 1080 1090 1100
RKRKFSRLPA VTLAANRLGN KESGSVNGPS RGGAEDPGKE EPLQQMDLLR
1110 1120 1130 1140 1150
NEDTHFSDVH FDSKAKQSDP DKNLEKEPSF ENRKGPELGS EMNTENDELH
1160 1170 1180 1190 1200
GVNQVVPKKR WQRLNQRRPK PGKRANRFRE KENSEGAFGV LLPADAVQKA
1210 1220 1230 1240 1250
REDYLEQRAP PTSKPEDSAA DPNHGSHSES VAPRLNVCEK SSVGMGDVEK
1260 1270 1280 1290 1300
ETGIPSLMPQ TKLPEPAIRS EKKRLRKPSK WLLEYTEEYD QIFAPKKKQK
1310 1320 1330 1340 1350
KVQEQVHKVS SRCEDESLLA RCQPSAQNKQ VDENSLISTK EEPPVLEREA
1360 1370 1380 1390 1400
PFLEGPLAQS DLGVTHAELP QLTLSVPVAP EASPRPALES EELLVKTPGN
1410 1420 1430 1440 1450
YESKRQRKPT KKLLESNDLD PGFMPKKGDL GLSRKCFEAS RSGNGIVESR
1460 1470 1480 1490 1500
ATSHLKEFSG GTTKIFDKPR KRKRQRLVTA RVHYKKVKKE DLTKDTPSSE
1510 1520 1530 1540 1550
GELLIHRTAA SPKEILEEGV EHDPGMSASK KLQVERGGGA ALKENVCQNC
1560 1570 1580 1590 1600
EKLGELLLCE AQCCGAFHLE CLGLPEMPRG KFICNECHTG IHTCFVCKQS
1610 1620 1630 1640 1650
GEDVKRCLLP LCGKFYHEEC VQKYPPTVTQ NKGFRCPLHI CITCHAANPA
1660 1670 1680 1690 1700
NVSASKGRLM RCVRCPVAYH ANDFCLAAGS KILASNSIIC PNHFTPRRGC
1710 1720 1730 1740 1750
RNHEHVNVSW CFVCSEGGSL LCCDSCPAAF HRECLNIDIP EGNWYCNDCK
1760 1770 1780 1790 1800
AGKKPHYREI VWVKVGRYRW WPAEICHPRA VPSNIDKMRH DVGEFPVLFF
1810 1820 1830 1840 1850
GSNDYLWTHQ ARVFPYMEGD VSSKDKMGKG VDGTYKKALQ EAAARFEELK
1860 1870 1880 1890 1900
AQKELRQLQE DRKNDKKPPP YKHIKVNRPI GRVQIFTADL SEIPRCNCKA
1910 1920 1930 1940 1950
TDENPCGIDS ECINRMLLYE CHPTVCPAGV RCQNQCFSKR QYPDVEIFRT
1960 1970 1980 1990 2000
LQRGWGLRTK TDIKKGEFVN EYVGELIDEE ECRARIRYAQ EHDITNFYML
2010 2020 2030 2040 2050
TLDKDRIIDA GPKGNYARFM NHCCQPNCET QKWSVNGDTR VGLFALSDIK
2060 2070 2080 2090 2100
AGTELTFNYN LECLGNGKTV CKCGAPNCSG FLGVRPKNQP IVTEEKSRKF
2110 2120 2130 2140 2150
KRKPHGKRRS QGEVTKERED ECFSCGDAGQ LVSCKKPGCP KVYHADCLNL
2160 2170 2180 2190 2200
TKRPAGKWEC PWHQCDVCGK EAASFCEMCP SSFCKQHREG MLFISKLDGR
2210 2220 2230 2240 2250
LSCTEHDPCG PNPLEPGEIR EYVPPTATSP PSPGTQPKEQ SSEMATQGPK
2260 2270 2280 2290 2300
KSDQPPTDAT QLLPLSKKAL TGSCQRPLLP ERPPERTDSS SHLLDRIRDL
2310 2320 2330 2340 2350
AGSGTKSQSL VSSQRPQDRP PAKEGPRPQP PDRASPMTRP SSSPSVSSLP
2360 2370 2380 2390 2400
LERPLRMTDS RLDKSIGAAS PKSQAVEKTP ASTGLRLSSP DRLLTTNSPK
2410 2420 2430 2440 2450
PQISDRPPEK SHASLTQRLP PPEKVLSAVV QSLVAKEKAL RPVDQNTQSK
2460 2470 2480 2490 2500
HRPAVVMDLI DLTPRQKERA ASPQEVTPQA DEKTAMLESS SWPSSKGLGH
2510 2520 2530 2540 2550
IPRATEKISV SESLQPSGKV AAPSEHPWQA VKSLTHARFL SPPSAKAFLY
2560 2570 2580 2590 2600
ESATQASGRT PVGAEQTPGP PSPAPGLVKQ VKQLSRGLTA KSGQSFRSLG
2610 2620 2630 2640 2650
KISASLPNEE KKLTTTEQSP WGLGKASPGA GLWPIVAGQT LAQACWSAGG
2660 2670 2680 2690
TQTLAQTCWS LGRGQDPKPE NAIQALNQAP SSRKCADSEK K
Length: 2,691
Mass (Da): 296,364
Last modified: April 5, 2011 - v1
Checksum: 9757638B75D6E773
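
The sequence above is displayed in blocks of ten residues with position rulers between the rows. As a minimal sketch of recovering a plain residue string from such a display, so that the reported length can be checked, the following assumes that ruler rows contain only digits and whitespace; the function name `parse_blocked_sequence` is hypothetical, and only the first two rows of the sequence are used as sample input.

```python
def parse_blocked_sequence(text):
    """Join a blocked sequence display into one residue string.

    Rows made up only of numbers (position rulers) and empty rows are
    skipped; spaces between ten-residue blocks are removed.
    """
    residues = []
    for line in text.splitlines():
        tokens = line.split()
        if not tokens or all(tok.isdigit() for tok in tokens):
            continue  # blank row or position ruler
        residues.append("".join(tokens))
    return "".join(residues)


# First two rows of the display above, including their rulers.
sample = """
        10         20         30         40         50
MDRTCELSRR NCLLSFSNPV NLDASEDKDS PFGNGQSNFS EPLNGCTMQL
60 70 80 90 100
PTAASGTSQN AYGQDSPSCY IPLRRLQDLA SMINVEYLSG SADGSESFQD
"""

sequence = parse_blocked_sequence(sample)
print(len(sequence))  # 100
```

Applied to the complete display, the same function should return a string whose length matches the reported 2,691 residues.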

Sequence databases

EMBL/GenBank/DDBJ: AC160958 Genomic DNA. No translation available.
RefSeq: NP_032765.3. NM_008739.3.
  XP_006517197.1. XM_006517134.2.
  XP_006517198.1. XM_006517135.2.
  XP_006517199.1. XM_006517136.1.
  XP_006517200.1. XM_006517137.1.
  XP_006517201.1. XM_006517138.2.
  XP_006517203.1. XM_006517140.2.
  XP_006517204.1. XM_006517141.1.
  XP_006517205.1. XM_006517142.2.
UniGene: Mm.12964.
  Mm.168965.

Genome annotation databases

Ensembl: ENSMUST00000099490; ENSMUSP00000097089; ENSMUSG00000021488.
GeneID: 18193.
KEGG: mmu:18193.
UCSC: uc007qqd.1. mouse.

Cross-references

Organism-specific databases

CTD: 64324.
MGI: MGI:1276545. Nsd1.

Miscellaneous databases

ChiTaRS: Nsd1. mouse.
NextBio: 293540.

Publications

  1. Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
    Strain: C57BL/6J (Imported).
  2. Ensembl
    Submitted (MAY-2011) to UniProtKB
    Cited for: IDENTIFICATION.
    Strain: C57BL/6J (Imported).

Entry information

Entry name: E9QAE4_MOUSE
Accession: Primary (citable) accession number: E9QAE4
Entry history
Integrated into UniProtKB/TrEMBL: April 5, 2011
Last sequence update: April 5, 2011
Last modified: April 1, 2015
This is version 39 of the entry and version 1 of the sequence.
Entry status: Unreviewed (UniProtKB/TrEMBL)

Miscellaneous

Caution

The sequence shown here is derived from an Ensembl automatic analysis pipeline and should be considered as preliminary data. (Imported)

Keywords - Technical term

Complete proteome, Reference proteome (Imported)

Similar proteins

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%: UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into a single UniRef entry.
90%: UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a. seed sequence).
50%: UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.
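
The identity and overlap thresholds described above can be expressed as a simple membership test. This is a minimal sketch, not the actual UniRef clustering procedure: it assumes that sequence identity and overlap with the seed sequence have already been computed from an alignment and are given as fractions, and the names `UNIREF_THRESHOLDS` and `joins_cluster` are hypothetical.

```python
# (minimum identity, minimum overlap with the seed sequence), as fractions,
# taken from the UniRef90 and UniRef50 descriptions above.
UNIREF_THRESHOLDS = {
    "UniRef90": (0.90, 0.80),
    "UniRef50": (0.50, 0.80),
}


def joins_cluster(identity, overlap, level):
    """Return True if a sequence meets the thresholds for the given level.

    identity and overlap are assumed to be precomputed against the
    cluster's seed (longest) sequence.
    """
    min_identity, min_overlap = UNIREF_THRESHOLDS[level]
    return identity >= min_identity and overlap >= min_overlap


# A sequence 93% identical to the seed and overlapping 85% of it.
print(joins_cluster(0.93, 0.85, "UniRef90"))  # True
# Same identity, but only 70% overlap with the seed.
print(joins_cluster(0.93, 0.70, "UniRef90"))  # False
```

UniRef100 is left out of the table because it is defined by identical sequences and sub-fragments rather than by a pair of thresholds.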