ID SE1BA_DANRE Reviewed; 1844 AA. AC Q1LY77; A5XCC0; DT 05-FEB-2008, integrated into UniProtKB/Swiss-Prot. DT 20-FEB-2007, sequence version 2. DT 16-JUN-2009, entry version 27. DE RecName: Full=Histone-lysine N-methyltransferase SETD1B-A; DE EC=2.1.1.43; DE AltName: Full=SET domain-containing protein 1B-A; GN Name=setd1ba; Synonyms=setd1b; ORFNames=si:dkey-237o15.4; OS Danio rerio (Zebrafish) (Brachydanio rerio). OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; Cypriniformes; OC Cyprinidae; Danio. OX NCBI_TaxID=7955; RN [1] RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. RC STRAIN=Tuebingen; RG The Danio rerio sequencing project at the Sanger Institute; RL Submitted (JUN-2007) to the EMBL/GenBank/DDBJ databases. RN [2] RP NUCLEOTIDE SEQUENCE [MRNA] OF 1585-1740. RX PubMed=18231586; DOI=10.1371/journal.pone.0001499; RA Sun X.-J., Xu P.-F., Zhou T., Hu M., Fu C.-T., Zhang Y., Jin Y., RA Chen Y., Chen S.-J., Huang Q.-H., Liu T.X., Chen Z.; RT "Genome-wide survey and developmental expression mapping of zebrafish RT SET domain-containing genes."; RL PLoS ONE 3:E1499-E1499(2008). RN [3] RP PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-1138, AND MASS RP SPECTROMETRY. RC TISSUE=Embryo; RX PubMed=18307296; DOI=10.1021/pr700667w; RA Lemeer S., Pinkse M.W.H., Mohammed S., van Breukelen B., RA den Hertog J., Slijper M., Heck A.J.R.; RT "Online automated in vivo zebrafish phosphoproteomics: from large- RT scale analysis down to a single embryo."; RL J. Proteome Res. 7:1555-1564(2008). CC -!- FUNCTION: Histone methyltransferase that specifically methylates CC 'Lys-4' of histone H3, when part of the SET1 histone CC methyltransferase (HMT) complex, but not if the neighboring 'Lys- CC 9' residue is already methylated. H3 'Lys-4' methylation CC represents a specific tag for epigenetic transcriptional CC activation (By similarity). CC -!- CATALYTIC ACTIVITY: S-adenosyl-L-methionine + histone L-lysine = CC S-adenosyl-L-homocysteine + histone N(6)-methyl-L-lysine. CC -!- SUBCELLULAR LOCATION: Nucleus speckle (By similarity). CC -!- SIMILARITY: Contains 1 post-SET domain. CC -!- SIMILARITY: Contains 1 RRM (RNA recognition motif) domain. CC -!- SIMILARITY: Contains 1 SET domain. CC ----------------------------------------------------------------------- CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms CC Distributed under the Creative Commons Attribution-NoDerivs License CC ----------------------------------------------------------------------- DR EMBL; BX088560; CAK10781.2; -; Genomic_DNA. DR EMBL; DQ851809; ABI34481.1; -; mRNA. DR IPI; IPI00608851; -. DR RefSeq; NP_001038599.2; -. DR UniGene; Dr.149131; -. DR UniGene; Dr.80156; -. DR Ensembl; ENSDARG00000060847; Danio rerio. DR Ensembl; ENSDARG00000078930; Danio rerio. DR GeneID; 567970; -. DR KEGG; dre:567970; -. DR ZFIN; ZDB-GENE-050309-289; setd1ba. DR HOVERGEN; Q1LY77; -. DR BRENDA; 2.1.1.43; 96826. DR Bgee; Q1LY77; -. DR GO; GO:0016607; C:nuclear speck; IEA:UniProtKB-SubCell. DR GO; GO:0018024; F:histone-lysine N-methyltransferase activity; IEA:EC. DR GO; GO:0000166; F:nucleotide binding; IEA:InterPro. DR GO; GO:0003723; F:RNA binding; IEA:UniProtKB-KW. DR GO; GO:0016568; P:chromatin modification; IEA:UniProtKB-KW. DR GO; GO:0006355; P:regulation of transcription, DNA-dependent; IEA:UniProtKB-KW. DR GO; GO:0006350; P:transcription; IEA:UniProtKB-KW. DR InterPro; IPR012677; a_b_plait_nuc_bd. DR InterPro; IPR015722; MLL. DR InterPro; IPR003616; Post-SET_Zn_bd. DR InterPro; IPR000504; RRM_RNP1. DR InterPro; IPR001214; SET. DR Gene3D; G3DSA:3.30.70.330; a_b_plait_nuc_bd; 1. DR PANTHER; PTHR22884:SF10; MLL; 1. DR Pfam; PF00076; RRM_1; 1. DR Pfam; PF00856; SET; 1. DR SMART; SM00508; PostSET; 1. DR SMART; SM00360; RRM; 1. DR SMART; SM00317; SET; 1. DR PROSITE; PS50868; POST_SET; 1. DR PROSITE; PS50102; RRM; 1. DR PROSITE; PS50280; SET; 1. PE 1: Evidence at protein level; KW Activator; Chromatin regulator; Methyltransferase; Nucleus; KW Phosphoprotein; RNA-binding; S-adenosyl-L-methionine; Transcription; KW Transcription regulation; Transferase. FT CHAIN 1 1844 Histone-lysine N-methyltransferase FT SETD1B-A. FT /FTId=PRO_0000316996. FT DOMAIN 128 216 RRM. FT DOMAIN 1704 1826 SET. FT DOMAIN 1828 1844 Post-SET. FT COMPBIAS 375 786 Pro-rich. FT COMPBIAS 952 1116 Glu-rich. FT COMPBIAS 1000 1080 Ser-rich. FT COMPBIAS 1172 1435 Pro-rich. FT MOD_RES 1138 1138 Phosphoserine. SQ SEQUENCE 1844 AA; 204141 MW; 020BC92CCB797E27 CRC64; MCWKVEIVVY CKRQKPQTRG TQYVPGERNK LNEDHGRRQS SSLANGMDNS HPICSSGEKR SHHWRSYKLI IDPALKKGSH KVYRYDGHQF STPSFGMSPV DIVRDPRIGR LWTKYKETDL PVPKFKIDEC YVGRVPPKEV TFAKLNDNVR EGFLTDMCKK FGDIEEVEIL YNPKNKKHLG IAKVVFETVK AAKDAVQNLH NTSVMGNIIH VELDPKGENR QRYFQRLING SYTPLTLPVG GEEACDVSPR SLAEALMACE PSRRLFEGGS SVVAGTTPSG TNTPMSLDTA YSSLRQDTPQ SQGTPHTPRP SGTPFSQDSS YSSRQGTPAF QANRAESSGG YKSRRHETKF QDAYNRRPER RYVHGPTQRG NTEQPPSFKQ HQPPEPPSPA FTHTPPPPTS ANFKTAYSQY QPPIPQEYTV ASYHQPVQRE LDYRRPPQAP PPPSTDFLPV RDRPTTPPIP EPPPAPETQP TTPPSSTPEP CPSPTQESER NSLDSRIEML LKPFLNERGD SDAEVRMDGS PISSSSSQLS PIPPQRPSRP SSTGLEDISP TPLPDSEDDE PIRGTASLLA NSRGMSPTNM HSKSCVGEPR TAIDKMDTGH QSSGEDMEIS DDEMPGTPIA SGDCDKNIVV NSALSLIQTI PMPPPGFPPL PHAAGFPLPP HHLPHHSTVS HLPSHHPMLH PLHSYGMMHF LPVDLLSSLP QLLQMPFQMQ TQMLSRMAQS QHPYAYPYPA PSANPAAMPF GGPYPPLSVV SAPADTLHGQ PWPLPSMPQF NPAVPPPGYE PQKEDPHKAT IDGVLMAIVK ELKAIMKKDL NRKMVEVVAF RKFDEWWDKQ ELSAKATLTP VKTGEGKDEE KERAKPKETM SSHLPWNKGE GLGFEGMGLG IGLRGIRLPS FKVKRKQPPE PTSTSDNKRV RPSTPVDDEL EDEESERMGR TDGSRVDPAG SSSKRRPARP LELDSEGEEE EETSGKEESS LSDHEEEPVD DASERLSSGK DLEEEDEKKS ESHSSESESS DSSDDEASSS SSSKSGSDSS GSESSSDYES SSEEEEEEEE EEERIVGMDD EEDVDARTST SSSTTSTSSS DEEEVVEVKA PSTPTGPPPE EEPNELGRLE AVDEAEIDHK PSMVSLIKTK VEEVRPPSPK GLPADELDVD LEVKIPVPKT EASLEEVGNL RPPTPTGSFA DSDQDTRPKI PTEDFPRTPG HEGPVPLESE TTVPRSLPTP SMHLPLPPSH VPDPQSLLPP PETLPDMPVR GRLPTEEDIP RTPGRDLMDR ARGLGKLQST DTVPVTPGSD TPLTGNSLSS PHILGSPFSY PAQSPVLSAG IPRTPGRDLT FAPAFPDSAG LSAGLPIHRK ASSEILEEKP LFKEPLLSAS PQASLPNNAA SSPFPGPPLP TASLPEPALP PQGSPPASIE NSFPASPKEL PVPMIDVPVP LDDTPSKKKL VRSKNKKGIQ DSEEPQVTLI EASSLPELPV NNQYPDLPSE SIKEEDGEPA FSEKEESQVP TIIPKVEETS FYVEEPIQKT RRQRRGWQEL LLSMHSPVAS PRRPSFMPRS DFEEMTILYD IWNDGIDEED IRYLKITYDK MLQQDNAHDW LNDTLWVHHP PTNMGSATGV KKKRKEDGIR DHVTGCARSE GYYKIDKKDK MKYLNSSRLQ SEEPDVDTQG KSIPAQPQVS TRAGSERRSE QRRLLSSFSC DSDLLKFNQL KFRKKKIRFC RSHIHDWGLF AMEPIAADEM VIEYVGQNIR QVIADMREKR YEDEGIGSSY MFRVDHDTII DATKCGNFAR FINHSCNPNC YAKVITVESQ KKIVIYSRQP INVNEEITYD YKFPIEDEKI PCLCGAENCR GTLN //