Protein

Histone-lysine N-methyltransferase SETD1B-A

Gene

setd1ba

Organism
Danio rerio (Zebrafish) (Brachydanio rerio)
Status
Reviewed - Annotation score: - Experimental evidence at protein level

Function

Histone methyltransferase that specifically methylates 'Lys-4' of histone H3, when part of the SET1 histone methyltransferase (HMT) complex, but not if the neighboring 'Lys-9' residue is already methylated. H3 'Lys-4' methylation represents a specific tag for epigenetic transcriptional activation (By similarity).

Catalytic activity

S-adenosyl-L-methionine + L-lysine-[histone] = S-adenosyl-L-homocysteine + N6-methyl-L-lysine-[histone].

Keywords

Molecular function: Activator, Chromatin regulator, Methyltransferase, RNA-binding, Transferase
Biological process: Transcription, Transcription regulation
Ligand: S-adenosyl-L-methionine

Names & Taxonomy

Protein names
Recommended name:
Histone-lysine N-methyltransferase SETD1B-A (EC:2.1.1.43)
Alternative name(s):
SET domain-containing protein 1B-A
Gene names
Name: setd1ba
Synonyms: setd1b
ORF Names: si:dkey-237o15.4
Organism: Danio rerio (Zebrafish) (Brachydanio rerio)
Taxonomic identifier: 7955 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Actinopterygii > Neopterygii > Teleostei > Ostariophysi > Cypriniformes > Cyprinidae > Danio
Proteomes
  • UP000000437 Component: Unplaced

Organism-specific databases

ZFIN: ZDB-GENE-050309-289 setd1ba

Subcellular location

Keywords - Cellular component

Chromosome, Nucleus

PTM / Processing

Molecule processing

Feature key | Position(s) | Description | Length
Chain (PRO_0000316996) | 1 – 1844 | Histone-lysine N-methyltransferase SETD1B-A | 1844

Amino acid modifications

Feature key | Position(s) | Description | Length
Modified residue | 1138 | Phosphoserine (1 publication) | 1

Keywords - PTM

Phosphoprotein

Proteomic databases

PaxDb: Q1LY77
PRIDE: Q1LY77

PTM databases

iPTMnet: Q1LY77

Interaction

Protein-protein interaction databases

STRING: 7955.ENSDARP00000080600

Structure

3D structure databases

ProteinModelPortal: Q1LY77
SMR: Q1LY77

Family & Domains

Domains and Repeats

Feature key | Position(s) | Description | Length
Domain | 128 – 216 | RRM (PROSITE-ProRule annotation) | 89
Domain | 1705 – 1822 | SET (PROSITE-ProRule annotation) | 118
Domain | 1828 – 1844 | Post-SET (PROSITE-ProRule annotation) | 17
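
The Length column follows from the inclusive residue coordinates (end minus start, plus one). A minimal Python sketch of that arithmetic, using only the positions listed for the three domains; the names `FEATURES` and `span_length` are illustrative assumptions and not part of any UniProt API:

```python
# Domain coordinates copied from the entry (1-based, inclusive).
FEATURES = {
    "RRM": (128, 216),
    "SET": (1705, 1822),
    "Post-SET": (1828, 1844),
}


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive residue range."""
    return end - start + 1


# Compute the length of every listed domain.
lengths = {name: span_length(*pos) for name, pos in FEATURES.items()}
print(lengths)
```

The computed values can be compared against the Length column as a quick consistency check on the coordinates.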

Compositional bias

Feature key | Position(s) | Description | Length
Compositional bias | 375 – 786 | Pro-rich | 412
Compositional bias | 952 – 1116 | Glu-rich | 165
Compositional bias | 1000 – 1080 | Ser-rich | 81
Compositional bias | 1172 – 1435 | Pro-rich | 264
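
Some of the compositionally biased regions share residues. A minimal Python sketch, assuming only the coordinates listed for these four regions, that reports which pairs overlap; the names `REGIONS` and `overlap` are illustrative, not UniProt terminology:

```python
from itertools import combinations

# Compositional bias regions copied from the entry (1-based, inclusive).
REGIONS = [
    ("Pro-rich", 375, 786),
    ("Glu-rich", 952, 1116),
    ("Ser-rich", 1000, 1080),
    ("Pro-rich", 1172, 1435),
]


def overlap(a_start, a_end, b_start, b_end):
    """Return the inclusive overlap of two ranges, or None if disjoint."""
    start, end = max(a_start, b_start), min(a_end, b_end)
    return (start, end) if start <= end else None


# Collect every pair of regions that shares at least one residue.
overlaps = []
for a, b in combinations(REGIONS, 2):
    shared = overlap(a[1], a[2], b[1], b[2])
    if shared is not None:
        overlaps.append((a[0], b[0], shared))

print(overlaps)
```

With the coordinates as listed, the only overlapping pair is the Ser-rich region, which lies entirely inside the Glu-rich region.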

Sequence similarities

Belongs to the class V-like SAM-binding methyltransferase superfamily (PROSITE-ProRule annotation).

Phylogenomic databases

eggNOG: KOG1080 Eukaryota
        COG2940 LUCA
HOGENOM: HOG000168216
HOVERGEN: HBG055596
InParanoid: Q1LY77
KO: K11422
PhylomeDB: Q1LY77

Family and domain databases

CDD: cd12549 RRM_Set1B, 1 hit
Gene3D: 3.30.70.330, 1 hit
InterPro:
  IPR024657 COMPASS_Set1_N-SET
  IPR012677 Nucleotide-bd_a/b_plait_sf
  IPR003616 Post-SET_dom
  IPR035979 RBD_domain_sf
  IPR000504 RRM_dom
  IPR034468 Set1B_RRM
  IPR001214 SET_dom
  IPR037842 SETD1B
PANTHER: PTHR22884:SF475, 1 hit
Pfam:
  PF11764 N-SET, 1 hit
  PF00076 RRM_1, 1 hit
  PF00856 SET, 1 hit
SMART:
  SM01291 N-SET, 1 hit
  SM00508 PostSET, 1 hit
  SM00360 RRM, 1 hit
  SM00317 SET, 1 hit
SUPFAM: SSF54928, 1 hit
PROSITE:
  PS50868 POST_SET, 1 hit
  PS50102 RRM, 1 hit
  PS50280 SET, 1 hit

Sequence

Sequence status: Complete.

Q1LY77-1 [UniParc]

        10         20         30         40         50
MCWKVEIVVY CKRQKPQTRG TQYVPGERNK LNEDHGRRQS SSLANGMDNS
60 70 80 90 100
HPICSSGEKR SHHWRSYKLI IDPALKKGSH KVYRYDGHQF STPSFGMSPV
110 120 130 140 150
DIVRDPRIGR LWTKYKETDL PVPKFKIDEC YVGRVPPKEV TFAKLNDNVR
160 170 180 190 200
EGFLTDMCKK FGDIEEVEIL YNPKNKKHLG IAKVVFETVK AAKDAVQNLH
210 220 230 240 250
NTSVMGNIIH VELDPKGENR QRYFQRLING SYTPLTLPVG GEEACDVSPR
260 270 280 290 300
SLAEALMACE PSRRLFEGGS SVVAGTTPSG TNTPMSLDTA YSSLRQDTPQ
310 320 330 340 350
SQGTPHTPRP SGTPFSQDSS YSSRQGTPAF QANRAESSGG YKSRRHETKF
360 370 380 390 400
QDAYNRRPER RYVHGPTQRG NTEQPPSFKQ HQPPEPPSPA FTHTPPPPTS
410 420 430 440 450
ANFKTAYSQY QPPIPQEYTV ASYHQPVQRE LDYRRPPQAP PPPSTDFLPV
460 470 480 490 500
RDRPTTPPIP EPPPAPETQP TTPPSSTPEP CPSPTQESER NSLDSRIEML
510 520 530 540 550
LKPFLNERGD SDAEVRMDGS PISSSSSQLS PIPPQRPSRP SSTGLEDISP
560 570 580 590 600
TPLPDSEDDE PIRGTASLLA NSRGMSPTNM HSKSCVGEPR TAIDKMDTGH
610 620 630 640 650
QSSGEDMEIS DDEMPGTPIA SGDCDKNIVV NSALSLIQTI PMPPPGFPPL
660 670 680 690 700
PHAAGFPLPP HHLPHHSTVS HLPSHHPMLH PLHSYGMMHF LPVDLLSSLP
710 720 730 740 750
QLLQMPFQMQ TQMLSRMAQS QHPYAYPYPA PSANPAAMPF GGPYPPLSVV
760 770 780 790 800
SAPADTLHGQ PWPLPSMPQF NPAVPPPGYE PQKEDPHKAT IDGVLMAIVK
810 820 830 840 850
ELKAIMKKDL NRKMVEVVAF RKFDEWWDKQ ELSAKATLTP VKTGEGKDEE
860 870 880 890 900
KERAKPKETM SSHLPWNKGE GLGFEGMGLG IGLRGIRLPS FKVKRKQPPE
910 920 930 940 950
PTSTSDNKRV RPSTPVDDEL EDEESERMGR TDGSRVDPAG SSSKRRPARP
960 970 980 990 1000
LELDSEGEEE EETSGKEESS LSDHEEEPVD DASERLSSGK DLEEEDEKKS
1010 1020 1030 1040 1050
ESHSSESESS DSSDDEASSS SSSKSGSDSS GSESSSDYES SSEEEEEEEE
1060 1070 1080 1090 1100
EEERIVGMDD EEDVDARTST SSSTTSTSSS DEEEVVEVKA PSTPTGPPPE
1110 1120 1130 1140 1150
EEPNELGRLE AVDEAEIDHK PSMVSLIKTK VEEVRPPSPK GLPADELDVD
1160 1170 1180 1190 1200
LEVKIPVPKT EASLEEVGNL RPPTPTGSFA DSDQDTRPKI PTEDFPRTPG
1210 1220 1230 1240 1250
HEGPVPLESE TTVPRSLPTP SMHLPLPPSH VPDPQSLLPP PETLPDMPVR
1260 1270 1280 1290 1300
GRLPTEEDIP RTPGRDLMDR ARGLGKLQST DTVPVTPGSD TPLTGNSLSS
1310 1320 1330 1340 1350
PHILGSPFSY PAQSPVLSAG IPRTPGRDLT FAPAFPDSAG LSAGLPIHRK
1360 1370 1380 1390 1400
ASSEILEEKP LFKEPLLSAS PQASLPNNAA SSPFPGPPLP TASLPEPALP
1410 1420 1430 1440 1450
PQGSPPASIE NSFPASPKEL PVPMIDVPVP LDDTPSKKKL VRSKNKKGIQ
1460 1470 1480 1490 1500
DSEEPQVTLI EASSLPELPV NNQYPDLPSE SIKEEDGEPA FSEKEESQVP
1510 1520 1530 1540 1550
TIIPKVEETS FYVEEPIQKT RRQRRGWQEL LLSMHSPVAS PRRPSFMPRS
1560 1570 1580 1590 1600
DFEEMTILYD IWNDGIDEED IRYLKITYDK MLQQDNAHDW LNDTLWVHHP
1610 1620 1630 1640 1650
PTNMGSATGV KKKRKEDGIR DHVTGCARSE GYYKIDKKDK MKYLNSSRLQ
1660 1670 1680 1690 1700
SEEPDVDTQG KSIPAQPQVS TRAGSERRSE QRRLLSSFSC DSDLLKFNQL
1710 1720 1730 1740 1750
KFRKKKIRFC RSHIHDWGLF AMEPIAADEM VIEYVGQNIR QVIADMREKR
1760 1770 1780 1790 1800
YEDEGIGSSY MFRVDHDTII DATKCGNFAR FINHSCNPNC YAKVITVESQ
1810 1820 1830 1840
KKIVIYSRQP INVNEEITYD YKFPIEDEKI PCLCGAENCR GTLN
Length: 1,844
Mass (Da): 204,141
Last modified: February 20, 2007 - v2
Checksum: 020BC92CCB797E27
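
The sequence block interleaves position rulers with rows of ten-residue groups, so it has to be cleaned before the stated length can be checked. A minimal Python sketch, assuming ruler lines contain only digits and whitespace; `parse_sequence_block` is a hypothetical helper, and `EXCERPT` holds just the first two rows of the entry:

```python
def parse_sequence_block(text: str) -> str:
    """Join residue rows into one string, skipping numeric ruler lines."""
    residues = []
    for line in text.splitlines():
        tokens = line.split()
        # Skip blank lines and ruler lines (all tokens are digits).
        if not tokens or all(token.isdigit() for token in tokens):
            continue
        residues.append("".join(tokens))
    return "".join(residues)


# First two rows of the sequence block, copied from the entry.
EXCERPT = """\
        10         20         30         40         50
MCWKVEIVVY CKRQKPQTRG TQYVPGERNK LNEDHGRRQS SSLANGMDNS
60 70 80 90 100
HPICSSGEKR SHHWRSYKLI IDPALKKGSH KVYRYDGHQF STPSFGMSPV
"""

seq = parse_sequence_block(EXCERPT)
print(len(seq))
```

Run on the full block instead of the excerpt, the resulting string should have the 1,844 residues stated in the entry.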

Sequence databases

EMBL / GenBank / DDBJ:
  BX088560 Genomic DNA. Translation: CAK10781.2
  DQ851809 mRNA. Translation: ABI34481.1
RefSeq: NP_001038599.2, NM_001045134.2
UniGene: Dr.80156

Genome annotation databases

GeneID: 567970
KEGG: dre:567970

Entry information

Entry name: SE1BA_DANRE
Accession: Primary (citable) accession number: Q1LY77
Secondary accession number(s): A5XCC0
Entry history: Integrated into UniProtKB/Swiss-Prot: February 5, 2008
Last sequence update: February 20, 2007
Last modified: March 28, 2018
This is version 85 of the entry and version 2 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Chordata Protein Annotation Program

Miscellaneous

Keywords - Technical term

Complete proteome, Reference proteome