ID   F1QA79_DANRE            Unreviewed;      2055 AA.
AC   F1QA79; A0A8N7UUM6;
DT   03-MAY-2011, integrated into UniProtKB/TrEMBL.
DT   03-MAY-2011, sequence version 1.
DT   27-MAR-2024, entry version 97.
DE   SubName: Full=Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20 specific {ECO:0000313|RefSeq:XP_683890.4};
DE   SubName: Full=Nuclear receptor-binding SET domain protein 1a {ECO:0000313|Ensembl:ENSDARP00000078549};
GN   Name=nsd1a {ECO:0000313|Ensembl:ENSDARP00000078549,
GN   ECO:0000313|RefSeq:XP_683890.4, ECO:0000313|ZFIN:ZDB-GENE-080519-3};
OS   Danio rerio (Zebrafish) (Brachydanio rerio).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Actinopterygii; Neopterygii; Teleostei; Ostariophysi; Cypriniformes;
OC   Danionidae; Danioninae; Danio.
OX   NCBI_TaxID=7955 {ECO:0000313|Ensembl:ENSDARP00000078549};
RN   [1] {ECO:0000313|Ensembl:ENSDARP00000078549}
RP   IDENTIFICATION.
RC   STRAIN=Tuebingen {ECO:0000313|Ensembl:ENSDARP00000078549};
RG   Ensembl;
RL   Submitted (JUL-2011) to UniProtKB.
RN   [2] {ECO:0000313|Ensembl:ENSDARP00000078549, ECO:0000313|Proteomes:UP000000437}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Tuebingen {ECO:0000313|Ensembl:ENSDARP00000078549};
RX   PubMed=23594743; DOI=10.1038/nature12111;
RG   Genome Reference Consortium Zebrafish;
RA   Howe K., Clark M.D., Torroja C.F., Torrance J., Berthelot C., Muffato M.,
RA   Collins J.E., Humphray S., McLaren K., Matthews L., McLaren S., Sealy I.,
RA   Caccamo M., Churcher C., Scott C., Barrett J.C., Koch R., Rauch G.J.,
RA   White S., Chow W., Kilian B., Quintais L.T., Guerra-Assuncao J.A., Zhou Y.,
RA   Gu Y., Yen J., Vogel J.H., Eyre T., Redmond S., Banerjee R., Chi J., Fu B.,
RA   Langley E., Maguire S.F., Laird G.K., Lloyd D., Kenyon E., Donaldson S.,
RA   Sehra H., Almeida-King J., Loveland J., Trevanion S., Jones M., Quail M.,
RA   Willey D., Hunt A., Burton J., Sims S., McLay K., Plumb B., Davis J.,
RA   Clee C., Oliver K., Clark R., Riddle C., Elliot D., Eliott D.,
RA   Threadgold G., Harden G., Ware D., Begum S., Mortimore B., Mortimer B.,
RA   Kerry G., Heath P., Phillimore B., Tracey A., Corby N., Dunn M.,
RA   Johnson C., Wood J., Clark S., Pelan S., Griffiths G., Smith M.,
RA   Glithero R., Howden P., Barker N., Lloyd C., Stevens C., Harley J.,
RA   Holt K., Panagiotidis G., Lovell J., Beasley H., Henderson C., Gordon D.,
RA   Auger K., Wright D., Collins J., Raisen C., Dyer L., Leung K.,
RA   Robertson L., Ambridge K., Leongamornlert D., McGuire S., Gilderthorp R.,
RA   Griffiths C., Manthravadi D., Nichol S., Barker G., Whitehead S., Kay M.,
RA   Brown J., Murnane C., Gray E., Humphries M., Sycamore N., Barker D.,
RA   Saunders D., Wallis J., Babbage A., Hammond S., Mashreghi-Mohammadi M.,
RA   Barr L., Martin S., Wray P., Ellington A., Matthews N., Ellwood M.,
RA   Woodmansey R., Clark G., Cooper J., Cooper J., Tromans A., Grafham D.,
RA   Skuce C., Pandian R., Andrews R., Harrison E., Kimberley A., Garnett J.,
RA   Fosker N., Hall R., Garner P., Kelly D., Bird C., Palmer S., Gehring I.,
RA   Berger A., Dooley C.M., Ersan-Urun Z., Eser C., Geiger H., Geisler M.,
RA   Karotki L., Kirn A., Konantz J., Konantz M., Oberlander M.,
RA   Rudolph-Geiger S., Teucke M., Lanz C., Raddatz G., Osoegawa K., Zhu B.,
RA   Rapp A., Widaa S., Langford C., Yang F., Schuster S.C., Carter N.P.,
RA   Harrow J., Ning Z., Herrero J., Searle S.M., Enright A., Geisler R.,
RA   Plasterk R.H., Lee C., Westerfield M., de Jong P.J., Zon L.I.,
RA   Postlethwait J.H., Nusslein-Volhard C., Hubbard T.J., Roest Crollius H.,
RA   Rogers J., Stemple D.L.;
RT   "The zebrafish reference genome sequence and its relationship to the human
RT   genome.";
RL   Nature 496:498-503(2013).
RN   [3] {ECO:0000313|RefSeq:XP_683890.4}
RP   IDENTIFICATION.
RC   STRAIN=Tuebingen {ECO:0000313|RefSeq:XP_683890.4};
RG   RefSeq;
RL   Submitted (NOV-2023) to UniProtKB.
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; CU633762; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; CU655965; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; CU659412; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   RefSeq; XP_683890.4; XM_678798.8.
DR   PaxDb; 7955-ENSDARP00000078549; -.
DR   Ensembl; ENSDART00000084114; ENSDARP00000078549; ENSDARG00000060016.
DR   Ensembl; ENSDART00000084114.6; ENSDARP00000078549.4; ENSDARG00000060016.7.
DR   GeneID; 556086; -.
DR   KEGG; dre:556086; -.
DR   AGR; ZFIN:ZDB-GENE-080519-3; -.
DR   CTD; 556086; -.
DR   ZFIN; ZDB-GENE-080519-3; nsd1a.
DR   eggNOG; KOG1081; Eukaryota.
DR   HOGENOM; CLU_001380_0_0_1; -.
DR   OMA; MAYSPTQ; -.
DR   OrthoDB; 950362at2759; -.
DR   TreeFam; TF329088; -.
DR   Proteomes; UP000000437; Chromosome 14.
DR   Bgee; ENSDARG00000060016; Expressed in somite and 23 other cell types or tissues.
DR   GO; GO:0000785; C:chromatin; IBA:GO_Central.
DR   GO; GO:0005634; C:nucleus; IBA:GO_Central.
DR   GO; GO:0046975; F:histone H3K36 methyltransferase activity; IBA:GO_Central.
DR   GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR   GO; GO:0031507; P:heterochromatin formation; IGI:ZFIN.
DR   GO; GO:0006355; P:regulation of DNA-templated transcription; IBA:GO_Central.
DR   CDD; cd15648; PHD1_NSD1_2; 1.
DR   CDD; cd15653; PHD3_NSD1; 1.
DR   CDD; cd15656; PHD4_NSD1; 1.
DR   CDD; cd15659; PHD5_NSD1; 1.
DR   CDD; cd20161; PWWP_NSD1_rpt1; 1.
DR   Gene3D; 2.30.30.140; -; 2.
DR   Gene3D; 2.170.270.10; SET domain; 1.
DR   Gene3D; 3.30.40.10; Zinc/RING finger domain, C3HC4 (zinc finger); 4.
DR   InterPro; IPR006560; AWS_dom.
DR   InterPro; IPR041306; C5HCH.
DR   InterPro; IPR047426; PHD1_NSD1_2.
DR   InterPro; IPR047429; PHD3_NSD1.
DR   InterPro; IPR047430; PHD4_NSD1.
DR   InterPro; IPR047432; PHD5_NSD1.
DR   InterPro; IPR003616; Post-SET_dom.
DR   InterPro; IPR000313; PWWP_dom.
DR   InterPro; IPR001214; SET_dom.
DR   InterPro; IPR046341; SET_dom_sf.
DR   InterPro; IPR019786; Zinc_finger_PHD-type_CS.
DR   InterPro; IPR011011; Znf_FYVE_PHD.
DR   InterPro; IPR001965; Znf_PHD.
DR   InterPro; IPR019787; Znf_PHD-finger.
DR   InterPro; IPR013083; Znf_RING/FYVE/PHD.
DR   PANTHER; PTHR22884:SF511; HISTONE-LYSINE N-METHYLTRANSFERASE, H3 LYSINE-36 AND H4 LYSINE-20 SPECIFIC; 1.
DR   PANTHER; PTHR22884; SET DOMAIN PROTEINS; 1.
DR   Pfam; PF17907; AWS; 1.
DR   Pfam; PF17982; C5HCH; 1.
DR   Pfam; PF00628; PHD; 1.
DR   Pfam; PF00855; PWWP; 2.
DR   Pfam; PF00856; SET; 1.
DR   SMART; SM00570; AWS; 1.
DR   SMART; SM00249; PHD; 4.
DR   SMART; SM00508; PostSET; 1.
DR   SMART; SM00293; PWWP; 2.
DR   SMART; SM00317; SET; 1.
DR   SUPFAM; SSF57903; FYVE/PHD zinc finger; 3.
DR   SUPFAM; SSF82199; SET domain; 1.
DR   SUPFAM; SSF63748; Tudor/PWWP/MBT; 2.
DR   PROSITE; PS51215; AWS; 1.
DR   PROSITE; PS50868; POST_SET; 1.
DR   PROSITE; PS50812; PWWP; 2.
DR   PROSITE; PS50280; SET; 1.
DR   PROSITE; PS01359; ZF_PHD_1; 1.
DR   PROSITE; PS50016; ZF_PHD_2; 2.
PE   1: Evidence at protein level;
KW   Chromatin regulator {ECO:0000256|ARBA:ARBA00022853};
KW   Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW   Proteomics identification {ECO:0007829|PeptideAtlas:F1QA79};
KW   Reference proteome {ECO:0000313|Proteomes:UP000000437};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   S-adenosyl-L-methionine {ECO:0000256|ARBA:ARBA00022691};
KW   Transferase {ECO:0000256|ARBA:ARBA00022679};
KW   Zinc {ECO:0000256|ARBA:ARBA00022833};
KW   Zinc-finger {ECO:0000256|ARBA:ARBA00022771, ECO:0000256|PROSITE-
KW   ProRule:PRU00146}.
FT   DOMAIN          288..354
FT                   /note="PWWP"
FT                   /evidence="ECO:0000259|PROSITE:PS50812"
FT   DOMAIN          1226..1272
FT                   /note="PHD-type"
FT                   /evidence="ECO:0000259|PROSITE:PS50016"
FT   DOMAIN          1390..1434
FT                   /note="PHD-type"
FT                   /evidence="ECO:0000259|PROSITE:PS50016"
FT   DOMAIN          1439..1501
FT                   /note="PWWP"
FT                   /evidence="ECO:0000259|PROSITE:PS50812"
FT   DOMAIN          1573..1623
FT                   /note="AWS"
FT                   /evidence="ECO:0000259|PROSITE:PS51215"
FT   DOMAIN          1625..1742
FT                   /note="SET"
FT                   /evidence="ECO:0000259|PROSITE:PS50280"
FT   DOMAIN          1749..1765
FT                   /note="Post-SET"
FT                   /evidence="ECO:0000259|PROSITE:PS50868"
FT   REGION          1..22
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          181..227
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          260..280
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          423..505
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          533..601
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          621..731
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          776..851
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          918..977
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          989..1178
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1200..1219
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1768..1793
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1979..2000
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          2032..2055
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        181..211
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        429..455
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        465..498
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        549..582
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        583..597
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        634..676
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        688..707
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        708..726
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        821..838
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        918..932
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        955..977
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1001..1029
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1071..1127
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1135..1178
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1200..1217
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2032..2049
FT                   /note="Acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   2055 AA;  227772 MW;  FC5206F7F0DBDA1A CRC64;
     MNQSYKVAGR DGSECRGDPQ DSRALDVRIG SCGQQSTKAD QSSAMHFLNP GYKLSAPFAY
     NQRESTSNYS PLRRLQDLTT MVNHADGGLQ DKDFHGRNKL LMHSPIGGNE FGIGLYNTTN
     SQGNTAIQDL DKNGFSPVSS DSLEHFSPIS NGFLHFDSTL FDSGDSKDND EDVKGEVATF
     KDTSKKCQKR NVAKTETQSS HRLDANKHSS KNTAKSDPPA SPFHVPFTMM EDFDSMDDYS
     DSDCDLPSTE NCASIFNLSK RAHSPSDSHS NPSSSPKKKQ LLSVKYSIGD VVWAKFNRRP
     WWPCHVTTDP QIDVHTKMKD PSRRPCRLYF VRMFGEIVDQ AWVPAKATFP FEGGDQFEKL
     PVLRRRGRQK EKDYKYTVPK RLIPSWKASV LEAEASLTKQ LNSEPPLTLI QDVCVPITED
     KAHKKNPACP KSTTLSTPSS SLLSNGLNAS VKNRLKPGTA SKKKATKKMI QSQSIKLSET
     PSLFKSKSNP SSDKPSIATD PLDSPYSDID SVPRILCSKR IDESLQVELE KELSTKTTKV
     QACKKTSKNS VDKPGKKTDV KCLKNSKLKK STPKDKTLKA TSSRLKESSS PGSSDGSFAM
     HSHYLPASSG LIVRALAAGE ETKEKDLSHE PNSHSPSSEH SSQTNHELSK QNWEVKNNSE
     SSEESVDASD SPPNPTSIKR IKKRPQKNGI HKDPPPKSHE ESESKVNNES MFSDTSSSSI
     PSPSISPMDA FQDIKELSFR SLVKEECSSG ESPLRADSNY KFSTFLMLLK DLHDSREKEG
     KPLTLPPTSP SSLIKEEPSL IPVGEEPPNE VLCEKEFAEQ SSLPGKTSTP NNSKQSKAKL
     KSTVKKESQK PDVVIDSVSP LKKPISPVGL DVLDKSLPLL GDLPKSSDVS AAVTDVCAKG
     TISKVAPKKR WKTFESELGK SIKPKSDQVD STLDEVKSVS TEPNGVFKDG LQESAHSDVP
     NKKDASENKR LRKPSKRLIE CGEEYEQIFP SKKKTKKSTE SCKTGSCISN TTPEQPASDQ
     ITLDAVSSSS AEKPLDHVVA EEQKPPADVL VPPSAPKSPS CATSLETEPC EKKSAPLERK
     RPRKSVHKVL DCAIEGESVK IPKKGESRQH KTNPEESGVR DLEKQEEAGP PPSSPCAVSA
     QSEGELRTSS PIQPDVPSVS QIPVEGEQSS PSKPQINNED SLISEGFAAG LNDSAFSSRD
     SLSGDLSGLS TNKRSVERGG GASLKENVCQ VCEKTGELLL CEGQCCGAFH LQCIGLTETP
     KGRFICQECK MGVHTCFVCK KPDKEVRRCM IPVCGKFYHM DCILKYSPTV AQNRGFRCSI
     HVCLSCYITN PNNPGISKGR LTRCVRCPVA YHANDYCMAA GSVPLANNSF LCPNHFTPRK
     GCKNHEHINV SWCFVCSEGG SLLCCESCPA AFHRECLNIE MPQGSWFCND CRAGKKPHYK
     DILWVKVGRY RWWPAEVTQP KSVPENISRM KHEVGEFPVH FFGSKDYVWT YQARCFPYME
     GDANNKEKMG KGADAVYKKA LNEAADRFRE LLKEKEMRQL QEDRKNDKKP PPYKHIKVNK
     QIGKVLIITA DLSEIPRCNC KATDENPCGI DSECINRMLL YECHSQVCPA GERCQNQSFT
     KRQYTEVEIF RTLSRGWGLR SISDIKKGAF VNEYVGEVID EEECRSRIKN AQDNDICNFY
     MLTLDKDRII DAGPKGNESR FMNHSCQPNC ETQKWTVNGD TRVGLFALED IPKGVELTFN
     YNLECLGNGK TVCKCGAPNC SGFLGVRPKN QPPSDDKTRK LRRKVSGKRK SQSEVTKERE
     DECFYCGDGG QIVSCKKPGC PKVYHADCLN LSKRPAGRWE CPWHQCNECG REAASYCEMC
     PNSYCEQHRE GMLFISKLDG KLSCSEHDPC GPDPLEPGEI REYVPSTSIV GQAPNMSASG
     RITQPALPPA APLFIPAQNR PAFQSQRDPY GDEVVDNILP SSSPKDIKEE DMSDGEVVFE
     EGEEEEEDLE LVDDEDDEEE IDCRGMGLED EEEEEEFEDY EDDFVNVDFV GTEDGDEVEG
     DEQETSWDEL VEAEK
//