Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein

Probable histone-lysine N-methyltransferase set-23

Gene

set-23

Organism
Caenorhabditis elegans
Status
Reviewed-Annotation score: Annotation score: 4 out of 5-Protein inferred from homologyi

Functioni

Probable histone methyltransferase (By similarity). Required for embryonic development.By similarity1 Publication

Catalytic activityi

S-adenosyl-L-methionine + L-lysine-[histone] = S-adenosyl-L-homocysteine + N(6)-methyl-L-lysine-[histone].

Sites

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Metal bindingi27Zinc 1By similarity1
Metal bindingi27Zinc 2By similarity1
Metal bindingi29Zinc 1By similarity1
Metal bindingi33Zinc 1By similarity1
Metal bindingi33Zinc 3By similarity1
Metal bindingi39Zinc 1By similarity1
Metal bindingi41Zinc 2By similarity1
Metal bindingi65Zinc 2By similarity1
Metal bindingi65Zinc 3By similarity1
Metal bindingi69Zinc 2By similarity1
Metal bindingi71Zinc 3By similarity1
Metal bindingi78Zinc 3By similarity1
Binding sitei141S-adenosyl-L-methioninePROSITE-ProRule annotation1
Binding sitei143S-adenosyl-L-methioninePROSITE-ProRule annotation1
Binding sitei170S-adenosyl-L-methioninePROSITE-ProRule annotation1
Metal bindingi176Zinc 4By similarity1
Metal bindingi225Zinc 4By similarity1
Metal bindingi227Zinc 4By similarity1
Metal bindingi232Zinc 4By similarity1

GO - Molecular functioni

GO - Biological processi

Complete GO annotation...

Keywords - Molecular functioni

Chromatin regulator, Developmental protein, Methyltransferase, Transferase

Keywords - Ligandi

Metal-binding, S-adenosyl-L-methionine, Zinc

Names & Taxonomyi

Protein namesi
Recommended name:
Probable histone-lysine N-methyltransferase set-23 (EC:2.1.1.43)
Alternative name(s):
SET-domain containing protein 23
Gene namesi
Name:set-23
ORF Names:Y41D4B.12
OrganismiCaenorhabditis elegans
Taxonomic identifieri6239 [NCBI]
Taxonomic lineageiEukaryotaMetazoaEcdysozoaNematodaChromadoreaRhabditidaRhabditoideaRhabditidaePeloderinaeCaenorhabditis
Proteomesi
  • UP000001940 Componenti: Chromosome IV

Organism-specific databases

WormBaseiY41D4B.12a; CE27623; WBGene00021515; set-23.
Y41D4B.12b; CE31647; WBGene00021515; set-23.

Subcellular locationi

GO - Cellular componenti

Complete GO annotation...

Keywords - Cellular componenti

Chromosome, Nucleus

Pathology & Biotechi

Disruption phenotypei

Embryonic lethal.1 Publication

PTM / Processingi

Molecule processing

Feature keyPosition(s)DescriptionActionsGraphical viewLength
ChainiPRO_00002875651 – 244Probable histone-lysine N-methyltransferase set-23Add BLAST244

Proteomic databases

PaxDbiQ95Y12.

Expressioni

Gene expression databases

BgeeiWBGene00021515.

Interactioni

Protein-protein interaction databases

IntActiQ95Y12. 1 interactor.
STRINGi6239.Y41D4B.12a.

Structurei

3D structure databases

ProteinModelPortaliQ95Y12.
SMRiQ95Y12.
ModBaseiSearch...
MobiDBiSearch...

Family & Domainsi

Domains and Repeats

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Domaini25 – 86Pre-SETPROSITE-ProRule annotationAdd BLAST62
Domaini89 – 213SETPROSITE-ProRule annotationAdd BLAST125
Domaini221 – 237Post-SETPROSITE-ProRule annotationAdd BLAST17

Region

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Regioni101 – 103S-adenosyl-L-methionine bindingBy similarity3
Regioni173 – 174S-adenosyl-L-methionine bindingBy similarity2

Domaini

In the pre-SET domain, Cys residues bind 3 zinc ions that are arranged in a triangular cluster; some of these Cys residues contribute to the binding of two zinc ions within the cluster.By similarity

Sequence similaritiesi

Belongs to the class V-like SAM-binding methyltransferase superfamily. Histone-lysine methyltransferase family. Suvar3-9 subfamily.PROSITE-ProRule annotation
Contains 1 post-SET domain.PROSITE-ProRule annotation
Contains 1 pre-SET domain.PROSITE-ProRule annotation
Contains 1 SET domain.PROSITE-ProRule annotation

Phylogenomic databases

eggNOGiKOG1082. Eukaryota.
COG2940. LUCA.
GeneTreeiENSGT00780000121845.
InParanoidiQ95Y12.
KOiK11433.
OMAiCEEREFC.
OrthoDBiEOG091G0Y4N.
PhylomeDBiQ95Y12.

Family and domain databases

InterProiIPR003616. Post-SET_dom.
IPR007728. Pre-SET_dom.
IPR001214. SET_dom.
[Graphical view]
PfamiPF05033. Pre-SET. 1 hit.
PF00856. SET. 1 hit.
[Graphical view]
SMARTiSM00317. SET. 1 hit.
[Graphical view]
PROSITEiPS50868. POST_SET. 1 hit.
PS50867. PRE_SET. 1 hit.
PS50280. SET. 1 hit.
[Graphical view]

Sequences (2)i

Sequence statusi: Complete.

This entry describes 2 isoformsi produced by alternative splicing. AlignAdd to basket

Isoform a (identifier: Q95Y12-1) [UniParc]FASTAAdd to basket

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.

« Hide

        10         20         30         40         50
MNYEKIDSTI PGPGISETDW NDVFEGCNCE AECSSAAGCS CLINKIDNYT
60 70 80 90 100
VDGKINKSSE LLIECSDQCA CILLPTSCRN RVVQCGPQKK LEIFSTCEMA
110 120 130 140 150
KGFGVRAGEQ IAAGEFVCEY AGECIGEQEV ERRCREFRGD DNYTLTLKEF
160 170 180 190 200
FGGKPVKTFV DPRLRGNIGR FLNHSCEPNC EIILARLGRM IPAAGIFAKR
210 220 230 240
DIVRGEELCY DYGHSAIEGE NRKLCLCKSE KCRKYLPMSA TPIE
Length:244
Mass (Da):27,108
Last modified:December 1, 2001 - v1
Checksum:iAB291915D129C2A3
GO
Isoform b (identifier: Q95Y12-2) [UniParc]FASTAAdd to basket

The sequence of this isoform differs from the canonical sequence as follows:
     1-85: Missing.
     86-95: GPQKKLEIFS → MCLHTAPNFI

Note: No experimental confirmation available.
Show »
Length:159
Mass (Da):17,895
Checksum:iAF09BB931CC5CA44
GO

Alternative sequence

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Alternative sequenceiVSP_0255641 – 85Missing in isoform b. CuratedAdd BLAST85
Alternative sequenceiVSP_02556586 – 95GPQKKLEIFS → MCLHTAPNFI in isoform b. Curated10

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
FO080782 Genomic DNA. Translation: CCD66712.1.
FO080782 Genomic DNA. Translation: CCD66713.1.
RefSeqiNP_741320.1. NM_171270.1. [Q95Y12-1]
NP_741321.1. NM_171271.3. [Q95Y12-2]
UniGeneiCel.32662.

Genome annotation databases

EnsemblMetazoaiY41D4B.12a; Y41D4B.12a; WBGene00021515. [Q95Y12-1]
GeneIDi176969.
KEGGicel:CELE_Y41D4B.12.

Keywords - Coding sequence diversityi

Alternative splicing

Cross-referencesi

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
FO080782 Genomic DNA. Translation: CCD66712.1.
FO080782 Genomic DNA. Translation: CCD66713.1.
RefSeqiNP_741320.1. NM_171270.1. [Q95Y12-1]
NP_741321.1. NM_171271.3. [Q95Y12-2]
UniGeneiCel.32662.

3D structure databases

ProteinModelPortaliQ95Y12.
SMRiQ95Y12.
ModBaseiSearch...
MobiDBiSearch...

Protein-protein interaction databases

IntActiQ95Y12. 1 interactor.
STRINGi6239.Y41D4B.12a.

Proteomic databases

PaxDbiQ95Y12.

Protocols and materials databases

Structural Biology KnowledgebaseSearch...

Genome annotation databases

EnsemblMetazoaiY41D4B.12a; Y41D4B.12a; WBGene00021515. [Q95Y12-1]
GeneIDi176969.
KEGGicel:CELE_Y41D4B.12.

Organism-specific databases

CTDi176969.
WormBaseiY41D4B.12a; CE27623; WBGene00021515; set-23.
Y41D4B.12b; CE31647; WBGene00021515; set-23.

Phylogenomic databases

eggNOGiKOG1082. Eukaryota.
COG2940. LUCA.
GeneTreeiENSGT00780000121845.
InParanoidiQ95Y12.
KOiK11433.
OMAiCEEREFC.
OrthoDBiEOG091G0Y4N.
PhylomeDBiQ95Y12.

Miscellaneous databases

PROiQ95Y12.

Gene expression databases

BgeeiWBGene00021515.

Family and domain databases

InterProiIPR003616. Post-SET_dom.
IPR007728. Pre-SET_dom.
IPR001214. SET_dom.
[Graphical view]
PfamiPF05033. Pre-SET. 1 hit.
PF00856. SET. 1 hit.
[Graphical view]
SMARTiSM00317. SET. 1 hit.
[Graphical view]
PROSITEiPS50868. POST_SET. 1 hit.
PS50867. PRE_SET. 1 hit.
PS50280. SET. 1 hit.
[Graphical view]
ProtoNetiSearch...

Entry informationi

Entry nameiSET23_CAEEL
AccessioniPrimary (citable) accession number: Q95Y12
Secondary accession number(s): Q8MXT0
Entry historyi
Integrated into UniProtKB/Swiss-Prot: May 15, 2007
Last sequence update: December 1, 2001
Last modified: November 2, 2016
This is version 97 of the entry and version 1 of the sequence. [Complete history]
Entry statusiReviewed (UniProtKB/Swiss-Prot)
Annotation programCaenorhabditis annotation project

Miscellaneousi

Keywords - Technical termi

Complete proteome, Reference proteome

Documents

  1. Caenorhabditis elegans
    Caenorhabditis elegans: entries, gene names and cross-references to WormBase
  2. SIMILARITY comments
    Index of protein domains and families

Similar proteinsi

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a seed sequence).
50%UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.