Protein
Submitted name:

Nuclear receptor binding SET domain protein 1

Gene

NSD1

Organism
Pan troglodytes (Chimpanzee)
Status
Unreviewed - Annotation score: - Protein predicted

Function

Catalytic activity

S-adenosyl-L-methionine + L-lysine-[histone] = S-adenosyl-L-homocysteine + N6-methyl-L-lysine-[histone]. (SAAS annotation)

GO - Molecular function

Keywords

Molecular function: Methyltransferase (SAAS annotation), Transferase
Ligand: Metal-binding, S-adenosyl-L-methionine (SAAS annotation), Zinc

Names & Taxonomy

Protein names
Submitted name:
Nuclear receptor binding SET domain protein 1 (Imported)
Gene names
Name: NSD1 (Imported)
Organism: Pan troglodytes (Chimpanzee) (Imported)
Taxonomic identifier: 9598 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Primates > Haplorrhini > Catarrhini > Hominidae > Pan
Proteomes
  • UP000002277 Component: Chromosome 5

Organism-specific databases

VGNC: VGNC:6953 NSD1

Subcellular location

Keywords - Cellular component

Nucleus (SAAS annotation)

Expression

Gene expression databases

Bgee: ENSPTRG00000017575

Interaction

Protein-protein interaction databases

STRING: 9598.ENSPTRP00000044805

Family & Domains

Domains and Repeats

Feature key  Position(s)  Description                     Length
Domain       1512 – 1558  PHD-type (InterPro annotation)  47
Domain       1676 – 1720  PHD-type (InterPro annotation)  45
Domain       1725 – 1787  PWWP (InterPro annotation)      63
Domain       1859 – 1909  AWS (InterPro annotation)       51
Domain       1911 – 2028  SET (InterPro annotation)       118
Domain       2035 – 2051  Post-SET (InterPro annotation)  17
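
The positions in the domain listing above are 1-based and inclusive, so each reported length equals end - start + 1. The following is a minimal sketch of that arithmetic, along with a helper for cutting a feature out of a sequence string. The names `DOMAINS`, `feature_length`, and `slice_feature` are illustrative assumptions and are not part of any UniProt tooling.

```python
# Domain coordinates copied from the listing above (1-based, inclusive).
DOMAINS = [
    ("PHD-type", 1512, 1558),
    ("PHD-type", 1676, 1720),
    ("PWWP", 1725, 1787),
    ("AWS", 1859, 1909),
    ("SET", 1911, 2028),
    ("Post-SET", 2035, 2051),
]


def feature_length(start: int, end: int) -> int:
    """Length of a feature whose start and end positions are both inclusive."""
    return end - start + 1


def slice_feature(sequence: str, start: int, end: int) -> str:
    """Extract a 1-based inclusive feature from a 0-based Python string."""
    return sequence[start - 1:end]


if __name__ == "__main__":
    for name, start, end in DOMAINS:
        print(f"{name:<10}{start:>6} - {end:<6}{feature_length(start, end):>5}")
```

Given the full protein sequence as a single string, `slice_feature(seq, 1911, 2028)` would return the 118 residues annotated as the SET domain.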

Coiled coil

Feature key  Position(s)  Description          Length
Coiled coil  1804 – 1827  (Sequence analysis)  24

Keywords - Domain

Coiled coil (Sequence analysis), Zinc-finger (SAAS annotation)

Phylogenomic databases

eggNOG
  • KOG1081 Eukaryota
  • COG2940 LUCA
GeneTree: ENSGT00780000121845
KO: K15588
OMA: VQKYPPT
OrthoDB: EOG091G00XD
TreeFam: TF329088

Family and domain databases

Gene3D: 3.30.40.10, 4 hits
InterPro
  • IPR006560 AWS_dom
  • IPR003616 Post-SET_dom
  • IPR000313 PWWP_dom
  • IPR001214 SET_dom
  • IPR019786 Zinc_finger_PHD-type_CS
  • IPR011011 Znf_FYVE_PHD
  • IPR001965 Znf_PHD
  • IPR019787 Znf_PHD-finger
  • IPR013083 Znf_RING/FYVE/PHD
Pfam
  • PF00855 PWWP, 1 hit
  • PF00856 SET, 1 hit
SMART
  • SM00570 AWS, 1 hit
  • SM00249 PHD, 5 hits
  • SM00508 PostSET, 1 hit
  • SM00293 PWWP, 1 hit
  • SM00317 SET, 1 hit
SUPFAM: SSF57903 SSF57903, 3 hits
PROSITE
  • PS51215 AWS, 1 hit
  • PS50868 POST_SET, 1 hit
  • PS50812 PWWP, 1 hit
  • PS50280 SET, 1 hit
  • PS01359 ZF_PHD_1, 1 hit
  • PS50016 ZF_PHD_2, 2 hits

Sequence

Sequence status: Complete.

H2R328-1 [UniParc]

        10         20         30         40         50
MDQTCELPRR NCLLPFSNPV NLDAPEDKDS PFGNGQSNFS EPLNGCTMQL
60 70 80 90 100
STVSGTSQNA YGQDSPSCYI PLRRLQDLAS MINVEYLNGS ADGSESFQDP
110 120 130 140 150
EKSDSRAQTP IVCTSLSPGG PTALAMKQEP SCNNSPELQV KVTKTIKNGF
160 170 180 190 200
LHFENFTCVD DADVDSEMDP EQPVTEDESI EEIFEETQTN ATCNYETKSE
210 220 230 240 250
NGVKVAMGSE QDSTPESRHG AVKSPFLPLA PQTETQKNKQ RNEVDGSNEK
260 270 280 290 300
AALLPAPFSL GDTNITIEEQ LNSINLSFQD DPDSSTSTLG NMLELPGTSS
310 320 330 340 350
SSTSQELPFV SSFWETCNKF VFVSNRRPYR QYYVEAFGDP SERAWVAGKA
360 370 380 390 400
IVMFEGRHQF EELPVLRRRG KQKEKGYRHK VPQKILSKWE ASVGLAEQYD
410 420 430 440 450
VPKGSKNRKC IPGSIKLDSE EDMPFEDCTN DPESEHDLLL NGCLKSLAFD
460 470 480 490 500
SEHSADEKEK PCAKSRARKS SDNPKRTSVK KGHIQFEAHK DERRGKIPEN
510 520 530 540 550
LGLNFISGDI SDTQASNELS RIANSLTGSN TAPGSFLFSS CGKNTAKKEF
560 570 580 590 600
ETSNGDSLLG LPEGALISKC SREKNKPQRS LVCGSKVKLC YIGAGDEEKR
610 620 630 640 650
SDSISICTTS DDGSSDLDPI EHSSESDNSV LEIPDAFDRT ENMLSMQKNE
660 670 680 690 700
KIKYSRFAAT NTRVKAKQKP LISNSHTDHL MGCTKSAEPG TETSQVNLSD
710 720 730 740 750
LKASTLVHKP QSDFTNDALS PKFNMSSSIS SENSLIKGGA ANQALLHSKS
760 770 780 790 800
KQPKFRSIKC KHKENPVMVE PPVINEECSL KCCSSDTKGS PLASISKSGK
810 820 830 840 850
VDGLKLLNNM HEKTRDSSDI ETAVVKHVLS ELKELSYRSL GEDVSDSGTS
860 870 880 890 900
KPSKPLLFSS ASSQNHIPIE PDYKFSTLLM MLKDMHDSKT KEQRLMTAQN
910 920 930 940 950
LVSYRSPGRG DCSTNSPVGV SKVLVSGGST HNSEKKGDGT QNSANPSPSG
960 970 980 990 1000
GDSALSGELS ASLPGLVSDK RDLPASGKSR SDCVTRRNCG RSKPSSKLRD
1010 1020 1030 1040 1050
AFSAQMVKNT VNRKALKTER KRKLNQLPSV TLDAVLQGDR EHGGSLRGGA
1060 1070 1080 1090 1100
EDPSKEDPLQ IMGHLTSEDG DHFSDVHFDS KVKQSDPGKI SEKGLSFENG
1110 1120 1130 1140 1150
KGPELDSVMN SENDELNGVN QVVPKKRWQR LNQRRTKPRK RMNRFKEKEN
1160 1170 1180 1190 1200
SECAFRVLLP SDPVQEGRDE FPEHRTPPSA SILEEPLTEQ NHADCLDSVG
1210 1220 1230 1240 1250
PRLNVCDKSS ASIGDMEKEP GIPSLTPQAE LPEPAVRSEK KRLRKPSKWL
1260 1270 1280 1290 1300
LEYTEEYDQI FAPKKKQKKV QEQVHKVSSR CEEESLLARG RSSAQNKQVD
1310 1320 1330 1340 1350
ENSLISTKEE PPVLEREAPF LEGPLAQSEL GGGHAELPQL TLSVPVAPEV
1360 1370 1380 1390 1400
SPRPALESEE LLVKTPGNYE SKRQRKPTKK LLESNDLDPG FMPKKGDLGL
1410 1420 1430 1440 1450
SKKCYEAGHL ENGITESCAT SYSKDFGGGT TKIFDKPRKR KRQRHAAAKM
1460 1470 1480 1490 1500
QCKKVKNDDS SKEIPGSEGE LMPHRTATSP KETVEEGVEH DPGMPASKKM
1510 1520 1530 1540 1550
QGERGGGAAL KENVCQNCEK LGELLLCEAQ CCGAFHLECL GLTEMPRGKF
1560 1570 1580 1590 1600
ICNECRTGIH TCFVCKQSGE DVKRCLLPLC GKFYHEECVQ KYPPTVMQNK
1610 1620 1630 1640 1650
GFRCSLHICI TCHAANPANV SASKGRLMRC VRCPVAYHAN DFCLAAGSKI
1660 1670 1680 1690 1700
LASNSIICPN HFTPRRGCRN HEHVNVSWCF VCSEGGSLLC CDSCPAAFHR
1710 1720 1730 1740 1750
ECLNIDIPEG NWYCNDCKAG KKPHYREIVW VKVGRYRWWP AEICHPRAVP
1760 1770 1780 1790 1800
SNIDKMRHDV GEFPVLFFGS NDYLWTHQAR VFPYMEGDVS SKDKMGKGVD
1810 1820 1830 1840 1850
GTYKKALQEA AARFEELKAQ KELRQLQEDR KNDKKPPPYK HIKVNRPIGR
1860 1870 1880 1890 1900
VQIFTADLSE IPRCNCKATD ENPCGIDSEC INRMLLYECH PTVCPAGGRC
1910 1920 1930 1940 1950
QNQCFSKRQY PEVEIFRTLQ RGWGLRTKTD IKKGEFVNEY VGELIDEEEC
1960 1970 1980 1990 2000
RARIRYAQEH DITNFYMLTL DKDRIIDAGP KGNYARFMNH CCQPNCETQK
2010 2020 2030 2040 2050
WSVNGDTRVG LFALSDIKAG TELTFNYNLE CLGNGKTVCK CGAPNCSGFL
2060 2070 2080 2090 2100
GVRPKNQPIA TEEKSKKFKK KQQGKRRTQG EITKEREDEC FSCGDAGQLV
2110 2120 2130 2140 2150
SCKKPGCPKV YHADCLNLTK RPAGKWECPW HQCDICGKEA ASFCEMCPSS
2160 2170 2180 2190 2200
FCKQHREGML FISKLDGRLS CTEHDPCGPN PLEPGEIREY VPPPVPLPPG
2210 2220 2230 2240 2250
PSTHLAEQST GMAAQAPKMS DKPPADTNQT LSLSKKALAG TCQRPLLPER
2260 2270 2280 2290 2300
PLERTDSRPQ PLDKVRDLAG SGTKSQSLVS SQRPLDRPPA VAGPRPQLSD
2310 2320 2330 2340 2350
KPSPVTSPSS SPSVRSQPLE RPLGTADPRL DKSIGAASPR PQSLEKTPVP
2360 2370 2380 2390 2400
TGLRLPPPDR LLITSSPKPQ TSDRPTDKPH ASLSQRLPPP EKVLSAVVQT
2410 2420 2430 2440 2450
LVAKEKALRP VDQNTQSKNR AALVMDLIDL TPRQKERAAS PHEVTPQADE
2460 2470 2480 2490 2500
KMPVLESSSW PASKGLGHMP RAVEKGCVSD PLQTSGKAAA PSEDPWQAVK
2510 2520 2530 2540 2550
SLTQARLLSQ PPAKAFLYEP TTQASGRASA GAEQTPGPLS QSLGLVKQAK
2560 2570 2580 2590 2600
QMVGGQQLPA LAAKSGQSFR SLGKAPASLP TEEKKLVTTE QSPWALGKAS
2610 2620 2630 2640 2650
SRAGLWPIVA GQTLAQSCWS AGSTQTLAQT CWSLGRGQDP KPEQNTLPAL
2660
NQAPSSHKCA ESEQK
Length: 2,665
Mass (Da): 292,965
Last modified: February 28, 2018 - v2
Checksum: 1B8224112F651802
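
The sequence listing above interleaves ruler lines of position numbers with rows of ten-residue blocks. A minimal sketch of turning such a listing back into one contiguous string follows, so that its length can be compared with the reported value. The function name `parse_sequence_block` and the `SAMPLE` constant are illustrative assumptions. The sample holds only the first two rows of the listing.

```python
import re


def parse_sequence_block(text: str) -> str:
    """Join residues from a ruler-annotated sequence listing into one string."""
    residues = []
    for line in text.splitlines():
        tokens = line.split()
        # Ruler lines contain only position numbers (10, 20, ...); skip them.
        if not tokens or all(tok.isdigit() for tok in tokens):
            continue
        residues.extend(tokens)
    sequence = "".join(residues).upper()
    if not re.fullmatch(r"[A-Z]+", sequence):
        raise ValueError("unexpected non-letter characters in sequence block")
    return sequence


# First two rows of the listing above, rulers included.
SAMPLE = """\
        10         20         30         40         50
MDQTCELPRR NCLLPFSNPV NLDAPEDKDS PFGNGQSNFS EPLNGCTMQL
60 70 80 90 100
STVSGTSQNA YGQDSPSCYI PLRRLQDLAS MINVEYLNGS ADGSESFQDP
"""

if __name__ == "__main__":
    seq = parse_sequence_block(SAMPLE)
    print(len(seq), seq[:10])
```

Running the same function over the complete listing gives a string whose length can be checked against the stated 2,665 residues. A mismatch would point to a copy or extraction error.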
Sequence databases

EMBL / GenBank / DDBJ
AACZ04060243 Genomic DNA No translation available.
AACZ04060244 Genomic DNA No translation available.

Genome annotation databases

Ensembl: ENSPTRT00000042784; ENSPTRP00000044805; ENSPTRG00000017575
KEGG: ptr:471754
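
The three Ensembl identifiers above follow the Ensembl stable ID layout: the `ENS` prefix, a species prefix (`PTR` for chimpanzee), a feature-type letter, and an 11-digit number. The sketch below labels each identifier as a gene, transcript, or protein. It assumes a three-letter species prefix, which holds for these IDs but not for every species, and it handles only the `G`, `T`, and `P` type letters. The names `STABLE_ID` and `classify` are hypothetical.

```python
import re

# Feature-type letters in Ensembl stable IDs (only the subset used here).
FEATURE_TYPES = {"G": "gene", "T": "transcript", "P": "protein"}

# ENS + 3-letter species prefix (assumption) + type letter + 11 digits.
STABLE_ID = re.compile(r"ENS(?P<species>[A-Z]{3})(?P<type>[GTP])(?P<number>\d{11})")


def classify(stable_id: str) -> tuple[str, str]:
    """Return (species prefix, feature type) for an Ensembl stable ID."""
    match = STABLE_ID.fullmatch(stable_id.strip())
    if match is None:
        raise ValueError(f"not a recognised stable ID: {stable_id!r}")
    return match["species"], FEATURE_TYPES[match["type"]]


if __name__ == "__main__":
    line = "ENSPTRT00000042784; ENSPTRP00000044805; ENSPTRG00000017575"
    for item in line.split(";"):
        print(item.strip(), classify(item))
```

The GeneTree identifier listed under the phylogenomic databases uses a different layout and is deliberately rejected by this pattern.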

Entry information

Entry name: H2R328_PANTR
Accession: Primary (citable) accession number: H2R328
Entry history: Integrated into UniProtKB/TrEMBL: March 21, 2012
Last sequence update: February 28, 2018
Last modified: April 25, 2018
This is version 51 of the entry and version 2 of the sequence.
Entry status: Unreviewed (UniProtKB/TrEMBL)
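
For an unreviewed (TrEMBL) entry such as this one, the entry name is the accession number followed by an underscore and a species mnemonic. A minimal sketch of splitting the entry name and checking the accession part follows. The regular expression is the accession format published in the UniProt help pages, and the function name `split_entry_name` is an illustrative assumption.

```python
import re

# Accession number pattern as documented in the UniProt help pages.
ACCESSION = re.compile(
    r"[OPQ][0-9][A-Z0-9]{3}[0-9]|[A-NR-Z][0-9]([A-Z][A-Z0-9]{2}[0-9]){1,2}"
)


def split_entry_name(entry_name: str) -> tuple[str, str, bool]:
    """Split 'PREFIX_MNEMONIC' and report whether PREFIX is accession-shaped."""
    prefix, _, mnemonic = entry_name.partition("_")
    return prefix, mnemonic, ACCESSION.fullmatch(prefix) is not None


if __name__ == "__main__":
    print(split_entry_name("H2R328_PANTR"))
```

A prefix that does not match the accession pattern suggests a reviewed (Swiss-Prot) style entry name, where the prefix is a protein mnemonic instead.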

Miscellaneous

Keywords - Technical term

Complete proteome, Reference proteome (Imported)