ID   H2R328_PANTR            Unreviewed;      2665 AA.
AC   H2R328;
DT   21-MAR-2012, integrated into UniProtKB/TrEMBL.
DT   28-FEB-2018, sequence version 2.
DT   27-MAR-2024, entry version 85.
DE   SubName: Full=Nuclear receptor binding SET domain protein 1 {ECO:0000313|Ensembl:ENSPTRP00000044805.5};
GN   Name=NSD1 {ECO:0000313|Ensembl:ENSPTRP00000044805.5,
GN   ECO:0000313|VGNC:VGNC:6953};
OS   Pan troglodytes (Chimpanzee).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC   Pan.
OX   NCBI_TaxID=9598 {ECO:0000313|Ensembl:ENSPTRP00000044805.5, ECO:0000313|Proteomes:UP000002277};
RN   [1] {ECO:0000313|Ensembl:ENSPTRP00000044805.5, ECO:0000313|Proteomes:UP000002277}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=16136131; DOI=10.1038/nature04072;
RG   Chimpanzee sequencing and analysis consortium;
RT   "Initial sequence of the chimpanzee genome and comparison with the human
RT   genome.";
RL   Nature 437:69-87(2005).
RN   [2] {ECO:0000313|Ensembl:ENSPTRP00000044805.5}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (NOV-2023) to UniProtKB.
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AACZ04060243; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AACZ04060244; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   Ensembl; ENSPTRT00000042784.5; ENSPTRP00000044805.5; ENSPTRG00000017575.6.
DR   VGNC; VGNC:6953; NSD1.
DR   eggNOG; KOG1081; Eukaryota.
DR   GeneTree; ENSGT00940000155027; -.
DR   HOGENOM; CLU_000756_0_0_1; -.
DR   TreeFam; TF329088; -.
DR   Proteomes; UP000002277; Chromosome 5.
DR   Bgee; ENSPTRG00000017575; Expressed in lymph node and 21 other cell types or tissues.
DR   GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR   GO; GO:0046975; F:histone H3K36 methyltransferase activity; IEA:UniProt.
DR   GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR   CDD; cd15648; PHD1_NSD1_2; 1.
DR   CDD; cd15650; PHD2_NSD1; 1.
DR   CDD; cd15653; PHD3_NSD1; 1.
DR   CDD; cd15656; PHD4_NSD1; 1.
DR   CDD; cd15659; PHD5_NSD1; 1.
DR   CDD; cd20164; PWWP_NSD1_rpt2; 1.
DR   CDD; cd19210; SET_NSD1; 1.
DR   Gene3D; 2.30.30.140; -; 1.
DR   Gene3D; 2.170.270.10; SET domain; 1.
DR   Gene3D; 3.30.40.10; Zinc/RING finger domain, C3HC4 (zinc finger); 4.
DR   InterPro; IPR006560; AWS_dom.
DR   InterPro; IPR041306; C5HCH.
DR   InterPro; IPR047426; PHD1_NSD1_2.
DR   InterPro; IPR047428; PHD2_NSD1.
DR   InterPro; IPR047429; PHD3_NSD1.
DR   InterPro; IPR047430; PHD4_NSD1.
DR   InterPro; IPR047432; PHD5_NSD1.
DR   InterPro; IPR003616; Post-SET_dom.
DR   InterPro; IPR000313; PWWP_dom.
DR   InterPro; IPR047423; PWWP_NSD1_rpt2.
DR   InterPro; IPR001214; SET_dom.
DR   InterPro; IPR046341; SET_dom_sf.
DR   InterPro; IPR047433; SET_NSD1.
DR   InterPro; IPR019786; Zinc_finger_PHD-type_CS.
DR   InterPro; IPR011011; Znf_FYVE_PHD.
DR   InterPro; IPR001965; Znf_PHD.
DR   InterPro; IPR019787; Znf_PHD-finger.
DR   InterPro; IPR013083; Znf_RING/FYVE/PHD.
DR   PANTHER; PTHR22884:SF312; HISTONE-LYSINE N-METHYLTRANSFERASE, H3 LYSINE-36 SPECIFIC; 1.
DR   PANTHER; PTHR22884; SET DOMAIN PROTEINS; 1.
DR   Pfam; PF17907; AWS; 1.
DR   Pfam; PF17982; C5HCH; 1.
DR   Pfam; PF00628; PHD; 1.
DR   Pfam; PF00855; PWWP; 1.
DR   Pfam; PF00856; SET; 1.
DR   SMART; SM00570; AWS; 1.
DR   SMART; SM00249; PHD; 5.
DR   SMART; SM00508; PostSET; 1.
DR   SMART; SM00293; PWWP; 1.
DR   SMART; SM00317; SET; 1.
DR   SUPFAM; SSF57903; FYVE/PHD zinc finger; 3.
DR   SUPFAM; SSF82199; SET domain; 1.
DR   SUPFAM; SSF63748; Tudor/PWWP/MBT; 1.
DR   PROSITE; PS51215; AWS; 1.
DR   PROSITE; PS50868; POST_SET; 1.
DR   PROSITE; PS50812; PWWP; 1.
DR   PROSITE; PS50280; SET; 1.
DR   PROSITE; PS01359; ZF_PHD_1; 1.
DR   PROSITE; PS50016; ZF_PHD_2; 2.
PE   4: Predicted;
KW   Chromatin regulator {ECO:0000256|ARBA:ARBA00022853};
KW   Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW   Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW   Reference proteome {ECO:0000313|Proteomes:UP000002277};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   S-adenosyl-L-methionine {ECO:0000256|ARBA:ARBA00022691};
KW   Transferase {ECO:0000256|ARBA:ARBA00022679};
KW   Zinc {ECO:0000256|ARBA:ARBA00022833};
KW   Zinc-finger {ECO:0000256|ARBA:ARBA00022771, ECO:0000256|PROSITE-
KW   ProRule:PRU00146}.
FT   DOMAIN          1512..1558
FT                   /note="PHD-type"
FT                   /evidence="ECO:0000259|PROSITE:PS50016"
FT   DOMAIN          1676..1720
FT                   /note="PHD-type"
FT                   /evidence="ECO:0000259|PROSITE:PS50016"
FT   DOMAIN          1725..1787
FT                   /note="PWWP"
FT                   /evidence="ECO:0000259|PROSITE:PS50812"
FT   DOMAIN          1859..1909
FT                   /note="AWS"
FT                   /evidence="ECO:0000259|PROSITE:PS51215"
FT   DOMAIN          1911..2028
FT                   /note="SET"
FT                   /evidence="ECO:0000259|PROSITE:PS50280"
FT   DOMAIN          2035..2051
FT                   /note="Post-SET"
FT                   /evidence="ECO:0000259|PROSITE:PS50868"
FT   REGION          207..250
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          282..301
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          455..482
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          840..859
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          904..1000
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1037..1057
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1080..1102
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1212..1241
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1264..1313
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1351..1397
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1433..1504
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          2060..2080
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          2182..2391
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          2433..2468
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          2565..2585
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          2634..2665
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        842..859
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        904..956
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1264..1287
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1369..1383
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1435..1450
FT                   /note="Basic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1451..1469
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1479..1497
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2186..2201
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2222..2237
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2249..2263
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2300..2318
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2364..2378
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2644..2659
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   2665 AA;  292965 MW;  1B8224112F651802 CRC64;
     MDQTCELPRR NCLLPFSNPV NLDAPEDKDS PFGNGQSNFS EPLNGCTMQL STVSGTSQNA
     YGQDSPSCYI PLRRLQDLAS MINVEYLNGS ADGSESFQDP EKSDSRAQTP IVCTSLSPGG
     PTALAMKQEP SCNNSPELQV KVTKTIKNGF LHFENFTCVD DADVDSEMDP EQPVTEDESI
     EEIFEETQTN ATCNYETKSE NGVKVAMGSE QDSTPESRHG AVKSPFLPLA PQTETQKNKQ
     RNEVDGSNEK AALLPAPFSL GDTNITIEEQ LNSINLSFQD DPDSSTSTLG NMLELPGTSS
     SSTSQELPFV SSFWETCNKF VFVSNRRPYR QYYVEAFGDP SERAWVAGKA IVMFEGRHQF
     EELPVLRRRG KQKEKGYRHK VPQKILSKWE ASVGLAEQYD VPKGSKNRKC IPGSIKLDSE
     EDMPFEDCTN DPESEHDLLL NGCLKSLAFD SEHSADEKEK PCAKSRARKS SDNPKRTSVK
     KGHIQFEAHK DERRGKIPEN LGLNFISGDI SDTQASNELS RIANSLTGSN TAPGSFLFSS
     CGKNTAKKEF ETSNGDSLLG LPEGALISKC SREKNKPQRS LVCGSKVKLC YIGAGDEEKR
     SDSISICTTS DDGSSDLDPI EHSSESDNSV LEIPDAFDRT ENMLSMQKNE KIKYSRFAAT
     NTRVKAKQKP LISNSHTDHL MGCTKSAEPG TETSQVNLSD LKASTLVHKP QSDFTNDALS
     PKFNMSSSIS SENSLIKGGA ANQALLHSKS KQPKFRSIKC KHKENPVMVE PPVINEECSL
     KCCSSDTKGS PLASISKSGK VDGLKLLNNM HEKTRDSSDI ETAVVKHVLS ELKELSYRSL
     GEDVSDSGTS KPSKPLLFSS ASSQNHIPIE PDYKFSTLLM MLKDMHDSKT KEQRLMTAQN
     LVSYRSPGRG DCSTNSPVGV SKVLVSGGST HNSEKKGDGT QNSANPSPSG GDSALSGELS
     ASLPGLVSDK RDLPASGKSR SDCVTRRNCG RSKPSSKLRD AFSAQMVKNT VNRKALKTER
     KRKLNQLPSV TLDAVLQGDR EHGGSLRGGA EDPSKEDPLQ IMGHLTSEDG DHFSDVHFDS
     KVKQSDPGKI SEKGLSFENG KGPELDSVMN SENDELNGVN QVVPKKRWQR LNQRRTKPRK
     RMNRFKEKEN SECAFRVLLP SDPVQEGRDE FPEHRTPPSA SILEEPLTEQ NHADCLDSVG
     PRLNVCDKSS ASIGDMEKEP GIPSLTPQAE LPEPAVRSEK KRLRKPSKWL LEYTEEYDQI
     FAPKKKQKKV QEQVHKVSSR CEEESLLARG RSSAQNKQVD ENSLISTKEE PPVLEREAPF
     LEGPLAQSEL GGGHAELPQL TLSVPVAPEV SPRPALESEE LLVKTPGNYE SKRQRKPTKK
     LLESNDLDPG FMPKKGDLGL SKKCYEAGHL ENGITESCAT SYSKDFGGGT TKIFDKPRKR
     KRQRHAAAKM QCKKVKNDDS SKEIPGSEGE LMPHRTATSP KETVEEGVEH DPGMPASKKM
     QGERGGGAAL KENVCQNCEK LGELLLCEAQ CCGAFHLECL GLTEMPRGKF ICNECRTGIH
     TCFVCKQSGE DVKRCLLPLC GKFYHEECVQ KYPPTVMQNK GFRCSLHICI TCHAANPANV
     SASKGRLMRC VRCPVAYHAN DFCLAAGSKI LASNSIICPN HFTPRRGCRN HEHVNVSWCF
     VCSEGGSLLC CDSCPAAFHR ECLNIDIPEG NWYCNDCKAG KKPHYREIVW VKVGRYRWWP
     AEICHPRAVP SNIDKMRHDV GEFPVLFFGS NDYLWTHQAR VFPYMEGDVS SKDKMGKGVD
     GTYKKALQEA AARFEELKAQ KELRQLQEDR KNDKKPPPYK HIKVNRPIGR VQIFTADLSE
     IPRCNCKATD ENPCGIDSEC INRMLLYECH PTVCPAGGRC QNQCFSKRQY PEVEIFRTLQ
     RGWGLRTKTD IKKGEFVNEY VGELIDEEEC RARIRYAQEH DITNFYMLTL DKDRIIDAGP
     KGNYARFMNH CCQPNCETQK WSVNGDTRVG LFALSDIKAG TELTFNYNLE CLGNGKTVCK
     CGAPNCSGFL GVRPKNQPIA TEEKSKKFKK KQQGKRRTQG EITKEREDEC FSCGDAGQLV
     SCKKPGCPKV YHADCLNLTK RPAGKWECPW HQCDICGKEA ASFCEMCPSS FCKQHREGML
     FISKLDGRLS CTEHDPCGPN PLEPGEIREY VPPPVPLPPG PSTHLAEQST GMAAQAPKMS
     DKPPADTNQT LSLSKKALAG TCQRPLLPER PLERTDSRPQ PLDKVRDLAG SGTKSQSLVS
     SQRPLDRPPA VAGPRPQLSD KPSPVTSPSS SPSVRSQPLE RPLGTADPRL DKSIGAASPR
     PQSLEKTPVP TGLRLPPPDR LLITSSPKPQ TSDRPTDKPH ASLSQRLPPP EKVLSAVVQT
     LVAKEKALRP VDQNTQSKNR AALVMDLIDL TPRQKERAAS PHEVTPQADE KMPVLESSSW
     PASKGLGHMP RAVEKGCVSD PLQTSGKAAA PSEDPWQAVK SLTQARLLSQ PPAKAFLYEP
     TTQASGRASA GAEQTPGPLS QSLGLVKQAK QMVGGQQLPA LAAKSGQSFR SLGKAPASLP
     TEEKKLVTTE QSPWALGKAS SRAGLWPIVA GQTLAQSCWS AGSTQTLAQT CWSLGRGQDP
     KPEQNTLPAL NQAPSSHKCA ESEQK
//