SubmitCancel

Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.

Q8CFT2

- SET1B_MOUSE

UniProt

Q8CFT2 - SET1B_MOUSE

(max 400 entries)x

Your basket is currently empty.

Select item(s) and click on "Add to basket" to create your own collection here
(400 entries max)

Protein
Histone-lysine N-methyltransferase SETD1B
Gene
Setd1b, Kiaa1076, Set1b
Organism
Mus musculus (Mouse)
Status
Reviewed - Annotation score: 4 out of 5 - Experimental evidence at protein leveli

Functioni

Histone methyltransferase that specifically methylates 'Lys-4' of histone H3, when part of the SET1 histone methyltransferase (HMT) complex, but not if the neighboring 'Lys-9' residue is already methylated. H3 'Lys-4' methylation represents a specific tag for epigenetic transcriptional activation. The non-overalpping localization with SETD1B suggests that SETD1A and SETD1B make non-redundant contributions to the epigenetic control of chromatin structure and gene expression By similarity.

Catalytic activityi

S-adenosyl-L-methionine + L-lysine-[histone] = S-adenosyl-L-homocysteine + N(6)-methyl-L-lysine-[histone].

GO - Molecular functioni

  1. RNA binding Source: UniProtKB-KW
  2. histone methyltransferase activity (H3-K4 specific) Source: Ensembl
  3. nucleotide binding Source: InterPro

GO - Biological processi

  1. regulation of transcription, DNA-templated Source: UniProtKB-KW
  2. transcription, DNA-templated Source: UniProtKB-KW
Complete GO annotation...

Keywords - Molecular functioni

Activator, Chromatin regulator, Methyltransferase, Transferase

Keywords - Biological processi

Transcription, Transcription regulation

Keywords - Ligandi

RNA-binding, S-adenosyl-L-methionine

Names & Taxonomyi

Protein namesi
Recommended name:
Histone-lysine N-methyltransferase SETD1B (EC:2.1.1.43)
Alternative name(s):
SET domain-containing protein 1B
Gene namesi
Name:Setd1b
Synonyms:Kiaa1076, Set1b
OrganismiMus musculus (Mouse)
Taxonomic identifieri10090 [NCBI]
Taxonomic lineageiEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresGliresRodentiaSciurognathiMuroideaMuridaeMurinaeMusMus
ProteomesiUP000000589: Chromosome 5

Organism-specific databases

MGIiMGI:2652820. Setd1b.

Subcellular locationi

Nucleus speckle By similarity. Chromosome By similarity
Note: Localizes to a largely non-overlapping set of euchromatic nuclear speckles with SETD1A, suggesting that SETD1A and SETD1B each bind to a unique set of target genes By similarity.

GO - Cellular componenti

  1. Set1C/COMPASS complex Source: UniProtKB
  2. chromosome Source: UniProtKB-SubCell
  3. nuclear speck Source: UniProtKB-SubCell
Complete GO annotation...

Keywords - Cellular componenti

Chromosome, Nucleus

PTM / Processingi

Molecule processing

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Chaini1 – 19851985Histone-lysine N-methyltransferase SETD1B
PRO_0000316994Add
BLAST

Amino acid modifications

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Modified residuei1678 – 16781Phosphoserine By similarity
Modified residuei1682 – 16821Phosphoserine By similarity

Keywords - PTMi

Phosphoprotein

Proteomic databases

PaxDbiQ8CFT2.
PRIDEiQ8CFT2.

PTM databases

PhosphoSiteiQ8CFT2.

Expressioni

Tissue specificityi

Widely expressed.1 Publication

Gene expression databases

ArrayExpressiQ8CFT2.
BgeeiQ8CFT2.
CleanExiMM_SETD1B.
GenevestigatoriQ8CFT2.

Interactioni

Subunit structurei

Component of the SET1 complex, at least composed of the catalytic subunit (SETD1A or SETD1B), WDR5, WDR82, RBBP5, ASH2L/ASH2, CXXC1/CFP1, HCFC1 and DPY30. Interacts with HCFC1. and ASH2L/ASH2. Interacts (via the RRM domain) with WDR82. Interacts (via the RRM domain) with hyperphosphorylated C-terminal domain (CTD) of RNA polymerase II large subunit (POLR2A) only in the presence of WDR82. Binds specifically to CTD heptad repeats phosphorylated on 'Ser-5' of each heptad. Interacts with RBM15 By similarity.

Structurei

3D structure databases

ProteinModelPortaliQ8CFT2.
SMRiQ8CFT2. Positions 97-202.

Family & Domainsi

Domains and Repeats

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Domaini92 – 18089RRM
Add
BLAST
Domaini1846 – 1963118SET
Add
BLAST
Domaini1969 – 198517Post-SET
Add
BLAST

Compositional bias

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Compositional biasi373 – 16901318Pro-rich
Add
BLAST
Compositional biasi1031 – 1179149Ser-rich
Add
BLAST
Compositional biasi1059 – 1331273Glu-rich
Add
BLAST

Sequence similaritiesi

Contains 1 post-SET domain.
Contains 1 SET domain.

Phylogenomic databases

eggNOGiCOG2940.
GeneTreeiENSGT00740000115089.
HOVERGENiHBG055596.
InParanoidiQ8CFT2.
KOiK11422.
OMAiQHWPPLP.
OrthoDBiEOG7GQXTT.
TreeFamiTF106436.

Family and domain databases

Gene3Di3.30.70.330. 2 hits.
InterProiIPR024657. COMPASS_Set1_N-SET.
IPR012677. Nucleotide-bd_a/b_plait.
IPR003616. Post-SET_dom.
IPR000504. RRM_dom.
IPR001214. SET_dom.
[Graphical view]
PfamiPF11764. N-SET. 1 hit.
PF00076. RRM_1. 1 hit.
PF00856. SET. 1 hit.
[Graphical view]
SMARTiSM00508. PostSET. 1 hit.
SM00360. RRM. 1 hit.
SM00317. SET. 1 hit.
[Graphical view]
PROSITEiPS50868. POST_SET. 1 hit.
PS50102. RRM. 1 hit.
PS50280. SET. 1 hit.
[Graphical view]

Sequences (2)i

Sequence statusi: Complete.

This entry describes 2 isoformsi produced by alternative splicing. Align

Isoform 1 (identifier: Q8CFT2-1) [UniParc]FASTAAdd to Basket

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.

« Hide

MENSHPHHHH QQPPPQPGPS GERRNHHWRS YKLMIDPALK KGHHKLYRYD     50
GQHFSLAMSS NRPVEIVEDP RVVGIWTKNK ELELSVPKFK IDEFYVGPVP 100
PKQVTFAKLN DNVRENFLRD MCKKYGEVEE VEILYNPKTK KHLGIAKVVF 150
ATVRGAKEAV QHLHSTSVMG NIIHVELDTK GETRMRFYEL LVTGRYTPQT 200
LPVGELDAIS PIVSETLQLS DALKRLKDGS LSAGCGSGSS SVTPNSGGTP 250
FSQDTAYSSC RLDTPNSYGQ GTPITPRLGT PFSQDSSYSS RQPTPSYLFS 300
QDPTATFKAR RHESKFTDAY NRRHEHHYVH NSAVAGATAP FRGSSDLSFG 350
TVGSSGTPFK AQSQDATTFA HTPPPAQTAT ASGFKSAFSP YQTPAPPFPP 400
PPEEPTATAA FGSRDSGEFR RAPAPPPLPP AEPPAKEKPG TPPGPPPPDS 450
NSMELGGRPT FGWSPEPCDS PGTPTLESSP AGPEKPHDSL DSRIEMLLKE 500
QRTKLPFLRE QDSDTEIQME GSPISSSSSQ LSPLSHFGTN SQPGFRGPSP 550
PSSRPSSTGL EDISPTPLPD SDEDEDLGLG LGPRPPPEPG PPDPMGLLGQ 600
TAEVDLDLAG DRTPTSERMD EGQQSSGEDM EISDDEMPSA PITSADCPKP 650
MVVTPGAGAV AAPNVLAPNL PLPPPPGFPP LPPPPPPPPP QPGFPMPPPL 700
PPPPPPPPPA HPAVTVPPPP LPAPPGVPPP PILPPLPPFP PGLFPVMQVD 750
MSHVLGGQWG GMPMSFQMQT QMLSRLMTGQ GACPYPPFMA AAAAAASAGL 800
QFVNLPPYRS PFSLSNSGPG RGQHWPPLPK FDPSVPPPGY IPRQEDPHKA 850
TVDGVLLVVL KELKAIMKRD LNRKMVEVVA FRAFDEWWDK KERMAKASLT 900
PVKSGEHKDE DRPKPKDRIA SCLLESWGKG EGLGYEGLGL GIGLRGAIRL 950
PSFKVKRKEP PDTASSGDQK RLRPSTSVDE EDEESERERD RDIADAPCEL 1000
TKRDPKSVGV RRRPGRPLEL DSGGEEDEKE SLSASSSSSA SSSSGSSTTS 1050
PSSSASDKEE EDRESTEEEE EEEEEEAEEE EEEGPRSRIS SPSSSSSSDK 1100
DDEDDNEADS DGQIDSDIDD QGAPLSEASE KDNGDSEEEE TESITTSKAP 1150
AESSSSSSES SGSSEFESSS ESESSSSSSE DEEEMTVPGV EEEEEEEEEE 1200
EKETAMAAAT VVAMAEESMP PAGGQDFEQD RAEVPLGPRG PMRESLGTEE 1250
EVDIEAEDEV PEMQAPELEE PPLPMGARKL EGSPEPPEEP GPNTQGDMLL 1300
SPELPARETE EAQLPSPPEH GPESDLDMEP EPPPMLSLPL QPPLPPPRLL 1350
RPPSPPPEPE TPEPPKPPVP LEPPPEDHPP RTPGLCGSLA KSQSTETVPA 1400
TPGGEPPLSG SSSGLSLSSP QVPGSPFSYP SPSPGLSSGG LPRTPGRDFS 1450
FTPTFPEPSG PLLLPVCPLP TGRRDERTGP LASPVLLETG LPLPLPLPLP 1500
LPLALPVPVL RAQPRPPPQL PPLLPATLAP CPTPIKRKPG RPRRSPPSML 1550
SLDGPLVRPP PGPALGRDLL LLPGQPPAPI FPSAHDPRAV TLDFRNTGIP 1600
APPPPLPPQP PPPPPPPPVE STKLPFKELD NQWPSEAIPP GPRRDEVTEE 1650
YVDLAKVRGP WRRPPKKRHE DLVAPSASPE PSPPQPLFRP RSEFEEMTIL 1700
YDIWNGGIDE EDIRFLCVTY ERLLQQDNGM DWLNDTLWVY HPSTSLSSAK 1750
KKKREDGIRE HVTGCARSEG FYTIDKKDKL RYLNSSRAST DEPPMDTQGM 1800
SIPAQPHAST RAGSERRSEQ RRLLSSFTGS CDSDLLKFNQ LKFRKKKLKF 1850
CKSHIHDWGL FAMEPIAADE MVIEYVGQNI RQVIADMREK RYEDEGIGSS 1900
YMFRVDHDTI IDATKCGNFA RFINHSCNPN CYAKVITVES QKKIVIYSKQ 1950
HINVNEEITY DYKFPIEDVK IPCLCGSENC RGTLN 1985
Length:1,985
Mass (Da):215,352
Last modified:February 5, 2008 - v2
Checksum:i760EA261769292EC
GO
Isoform 2 (identifier: Q8CFT2-2) [UniParc]FASTAAdd to Basket

The sequence of this isoform differs from the canonical sequence as follows:
     1139-1179: Missing.

Note: No experimental confirmation available.

Show »
Length:1,944
Mass (Da):211,286
Checksum:iE2BD1B70C876448C
GO

Alternative sequence

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Alternative sequencei1139 – 117941Missing in isoform 2.
VSP_030851Add
BLAST

Sequence conflict

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Sequence conflicti1222 – 12221A → V in BAC65717. 1 Publication
Sequence conflicti1411 – 14111S → G in BAC65717. 1 Publication

Sequence databases

Select the link destinations:
EMBL
GenBank
DDBJ
Links Updated
AC158114 Genomic DNA. No translation available.
BC038367 mRNA. Translation: AAH38367.2.
BC040775 mRNA. Translation: AAH40775.1.
BC041681 mRNA. Translation: AAH41681.1.
AK122435 mRNA. Translation: BAC65717.1.
CCDSiCCDS59684.1. [Q8CFT2-1]
RefSeqiNP_001035488.2. NM_001040398.2. [Q8CFT2-1]
UniGeneiMm.250391.

Genome annotation databases

EnsembliENSMUST00000056053; ENSMUSP00000134686; ENSMUSG00000038384. [Q8CFT2-1]
ENSMUST00000163030; ENSMUSP00000133933; ENSMUSG00000038384. [Q8CFT2-1]
ENSMUST00000174836; ENSMUSP00000134461; ENSMUSG00000038384. [Q8CFT2-2]
GeneIDi208043.
KEGGimmu:208043.

Keywords - Coding sequence diversityi

Alternative splicing

Cross-referencesi

Sequence databases

Select the link destinations:
EMBL
GenBank
DDBJ
Links Updated
AC158114 Genomic DNA. No translation available.
BC038367 mRNA. Translation: AAH38367.2 .
BC040775 mRNA. Translation: AAH40775.1 .
BC041681 mRNA. Translation: AAH41681.1 .
AK122435 mRNA. Translation: BAC65717.1 .
CCDSi CCDS59684.1. [Q8CFT2-1 ]
RefSeqi NP_001035488.2. NM_001040398.2. [Q8CFT2-1 ]
UniGenei Mm.250391.

3D structure databases

ProteinModelPortali Q8CFT2.
SMRi Q8CFT2. Positions 97-202.
ModBasei Search...
MobiDBi Search...

PTM databases

PhosphoSitei Q8CFT2.

Proteomic databases

PaxDbi Q8CFT2.
PRIDEi Q8CFT2.

Protocols and materials databases

Structural Biology Knowledgebase Search...

Genome annotation databases

Ensembli ENSMUST00000056053 ; ENSMUSP00000134686 ; ENSMUSG00000038384 . [Q8CFT2-1 ]
ENSMUST00000163030 ; ENSMUSP00000133933 ; ENSMUSG00000038384 . [Q8CFT2-1 ]
ENSMUST00000174836 ; ENSMUSP00000134461 ; ENSMUSG00000038384 . [Q8CFT2-2 ]
GeneIDi 208043.
KEGGi mmu:208043.

Organism-specific databases

CTDi 23067.
MGIi MGI:2652820. Setd1b.
Rougei Search...

Phylogenomic databases

eggNOGi COG2940.
GeneTreei ENSGT00740000115089.
HOVERGENi HBG055596.
InParanoidi Q8CFT2.
KOi K11422.
OMAi QHWPPLP.
OrthoDBi EOG7GQXTT.
TreeFami TF106436.

Miscellaneous databases

PROi Q8CFT2.
SOURCEi Search...

Gene expression databases

ArrayExpressi Q8CFT2.
Bgeei Q8CFT2.
CleanExi MM_SETD1B.
Genevestigatori Q8CFT2.

Family and domain databases

Gene3Di 3.30.70.330. 2 hits.
InterProi IPR024657. COMPASS_Set1_N-SET.
IPR012677. Nucleotide-bd_a/b_plait.
IPR003616. Post-SET_dom.
IPR000504. RRM_dom.
IPR001214. SET_dom.
[Graphical view ]
Pfami PF11764. N-SET. 1 hit.
PF00076. RRM_1. 1 hit.
PF00856. SET. 1 hit.
[Graphical view ]
SMARTi SM00508. PostSET. 1 hit.
SM00360. RRM. 1 hit.
SM00317. SET. 1 hit.
[Graphical view ]
PROSITEi PS50868. POST_SET. 1 hit.
PS50102. RRM. 1 hit.
PS50280. SET. 1 hit.
[Graphical view ]
ProtoNeti Search...

Publicationsi

« Hide 'large scale' publications
  1. Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
    Strain: C57BL/6J.
  2. "The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)."
    The MGC Project Team
    Genome Res. 14:2121-2127(2004) [PubMed] [Europe PMC] [Abstract]
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 883-1985 (ISOFORM 1).
    Strain: FVB/N.
    Tissue: Eye and Mammary tumor.
  3. "Prediction of the coding sequences of mouse homologues of KIAA gene: II. The complete nucleotide sequences of 400 mouse KIAA-homologous cDNAs identified by screening of terminal sequences of cDNA clones randomly sampled from size-fractionated libraries."
    Okazaki N., Kikuno R., Ohara R., Inamoto S., Aizawa H., Yuasa S., Nakajima D., Nagase T., Ohara O., Koga H.
    DNA Res. 10:35-48(2003) [PubMed] [Europe PMC] [Abstract]
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 1090-1985 (ISOFORM 2).
  4. "Identification and characterization of the human Set1B histone H3-Lys4 methyltransferase complex."
    Lee J.-H., Tate C.M., You J.-S., Skalnik D.G.
    J. Biol. Chem. 282:13419-13428(2007) [PubMed] [Europe PMC] [Abstract]
    Cited for: TISSUE SPECIFICITY.
  5. Cited for: IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
    Tissue: Liver.

Entry informationi

Entry nameiSET1B_MOUSE
AccessioniPrimary (citable) accession number: Q8CFT2
Secondary accession number(s): Q80TK9, Q8CFQ8, Q8CGD1
Entry historyi
Integrated into UniProtKB/Swiss-Prot: February 5, 2008
Last sequence update: February 5, 2008
Last modified: July 9, 2014
This is version 99 of the entry and version 2 of the sequence. [Complete history]
Entry statusiReviewed (UniProtKB/Swiss-Prot)
Annotation programChordata Protein Annotation Program

Miscellaneousi

Keywords - Technical termi

Complete proteome, Reference proteome

Documents

  1. MGD cross-references
    Mouse Genome Database (MGD) cross-references in UniProtKB/Swiss-Prot
  2. SIMILARITY comments
    Index of protein domains and families

External Data

Dasty 3

Similar proteinsi