Skip Header

You are using a version of Internet Explorer that may not display all features of this website. Please upgrade to a modern browser.
Contribute Send feedback
Read comments (?) or add your own

Q95Y12 (SET23_CAEEL) Reviewed, UniProtKB/Swiss-Prot

Last modified April 16, 2014. Version 82. Feed History...

Clusters with 100%, 90%, 50% identity | Documents (2) | Third-party data text xml rdf/xml gff fasta
to top of pageNames·Attributes·General annotation·Ontologies·Alt products·Sequence annotation·Sequences·References·Cross-refs·Entry info·DocumentsCustomize order

Names and origin

Protein namesRecommended name:
Probable histone-lysine N-methyltransferase set-23

EC=2.1.1.43
Alternative name(s):
SET-domain containing protein 23
Gene names
Name:set-23
ORF Names:Y41D4B.12
OrganismCaenorhabditis elegans [Reference proteome]
Taxonomic identifier6239 [NCBI]
Taxonomic lineageEukaryotaMetazoaEcdysozoaNematodaChromadoreaRhabditidaRhabditoideaRhabditidaePeloderinaeCaenorhabditis

Protein attributes

Sequence length244 AA.
Sequence statusComplete.
Protein existenceInferred from homology

General annotation (Comments)

Function

Probable histone methyltransferase By similarity. Required for embryonic development. Ref.2

Catalytic activity

S-adenosyl-L-methionine + L-lysine-[histone] = S-adenosyl-L-homocysteine + N(6)-methyl-L-lysine-[histone].

Subcellular location

Nucleus By similarity. Chromosome By similarity.

Disruption phenotype

Embryonic lethal. Ref.2

Sequence similarities

Belongs to the class V-like SAM-binding methyltransferase superfamily. Histone-lysine methyltransferase family. Suvar3-9 subfamily.

Contains 1 post-SET domain.

Contains 1 pre-SET domain.

Contains 1 SET domain.

Ontologies

Keywords
   Cellular componentChromosome
Nucleus
   Coding sequence diversityAlternative splicing
   LigandS-adenosyl-L-methionine
   Molecular functionChromatin regulator
Developmental protein
Methyltransferase
Transferase
   Technical termComplete proteome
Reference proteome
Gene Ontology (GO)
   Biological_processmulticellular organismal development

Inferred from electronic annotation. Source: UniProtKB-KW

   Cellular_componentchromosome

Inferred from electronic annotation. Source: UniProtKB-SubCell

nucleus

Inferred from electronic annotation. Source: UniProtKB-SubCell

   Molecular_functionhistone-lysine N-methyltransferase activity

Inferred from electronic annotation. Source: UniProtKB-EC

zinc ion binding

Inferred from electronic annotation. Source: InterPro

Complete GO annotation...

Alternative products

This entry describes 2 isoforms produced by alternative splicing. [Align] [Select]
Isoform a (identifier: Q95Y12-1)

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.
Isoform b (identifier: Q95Y12-2)

The sequence of this isoform differs from the canonical sequence as follows:
     1-85: Missing.
     86-95: GPQKKLEIFS → MCLHTAPNFI
Note: No experimental confirmation available.

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Chain1 – 244244Probable histone-lysine N-methyltransferase set-23
PRO_0000287565

Regions

Domain25 – 8662Pre-SET
Domain89 – 213125SET
Domain221 – 23717Post-SET

Natural variations

Alternative sequence1 – 8585Missing in isoform b.
VSP_025564
Alternative sequence86 – 9510GPQKKLEIFS → MCLHTAPNFI in isoform b.
VSP_025565

Sequences

Sequence LengthMass (Da)Tools
Isoform a [UniParc].

Last modified December 1, 2001. Version 1.
Checksum: AB291915D129C2A3

FASTA24427,108
        10         20         30         40         50         60 
MNYEKIDSTI PGPGISETDW NDVFEGCNCE AECSSAAGCS CLINKIDNYT VDGKINKSSE 

        70         80         90        100        110        120 
LLIECSDQCA CILLPTSCRN RVVQCGPQKK LEIFSTCEMA KGFGVRAGEQ IAAGEFVCEY 

       130        140        150        160        170        180 
AGECIGEQEV ERRCREFRGD DNYTLTLKEF FGGKPVKTFV DPRLRGNIGR FLNHSCEPNC 

       190        200        210        220        230        240 
EIILARLGRM IPAAGIFAKR DIVRGEELCY DYGHSAIEGE NRKLCLCKSE KCRKYLPMSA 


TPIE 

« Hide

Isoform b [UniParc].

Checksum: AF09BB931CC5CA44
Show »

FASTA15917,895

References

« Hide 'large scale' references
[1]"Genome sequence of the nematode C. elegans: a platform for investigating biology."
The C. elegans sequencing consortium
Science 282:2012-2018(1998) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA], ALTERNATIVE SPLICING.
Strain: Bristol N2.
[2]"Two C. elegans histone methyltransferases repress lin-3 EGF transcription to inhibit vulval development."
Andersen E.C., Horvitz H.R.
Development 134:2991-2999(2007) [PubMed] [Europe PMC] [Abstract]
Cited for: FUNCTION, DISRUPTION PHENOTYPE.
+Additional computationally mapped references.

Cross-references

Sequence databases

EMBL
GenBank
DDBJ
FO080782 Genomic DNA. Translation: CCD66712.1.
FO080782 Genomic DNA. Translation: CCD66713.1.
RefSeqNP_741320.1. NM_171270.1.
NP_741321.1. NM_171271.3.
UniGeneCel.32662.

3D structure databases

ProteinModelPortalQ95Y12.
SMRQ95Y12. Positions 10-241.
ModBaseSearch...
MobiDBSearch...

Protein-protein interaction databases

IntActQ95Y12. 1 interaction.

Proteomic databases

PRIDEQ95Y12.

Protocols and materials databases

StructuralBiologyKnowledgebaseSearch...

Genome annotation databases

EnsemblMetazoaY41D4B.12a; Y41D4B.12a; Y41D4B.12. [Q95Y12-1]
GeneID176969.
KEGGcel:CELE_Y41D4B.12.

Organism-specific databases

CTD176969.
WormBaseY41D4B.12a; CE27623; WBGene00021515; set-23.
Y41D4B.12b; CE31647; WBGene00021515; set-23.

Phylogenomic databases

eggNOGCOG2940.
InParanoidQ95Y12.
KOK11433.
OMAQEVERRC.
OrthoDBEOG744T8D.
PhylomeDBQ95Y12.

Family and domain databases

InterProIPR003616. Post-SET_dom.
IPR007728. Pre-SET_dom.
IPR001214. SET_dom.
[Graphical view]
PfamPF05033. Pre-SET. 1 hit.
PF00856. SET. 1 hit.
[Graphical view]
SMARTSM00508. PostSET. 1 hit.
SM00317. SET. 1 hit.
[Graphical view]
PROSITEPS50868. POST_SET. 1 hit.
PS50867. PRE_SET. 1 hit.
PS50280. SET. 1 hit.
[Graphical view]
ProtoNetSearch...

Other

NextBio894788.
PROQ95Y12.

Entry information

Entry nameSET23_CAEEL
AccessionPrimary (citable) accession number: Q95Y12
Secondary accession number(s): Q8MXT0
Entry history
Integrated into UniProtKB/Swiss-Prot: May 15, 2007
Last sequence update: December 1, 2001
Last modified: April 16, 2014
This is version 82 of the entry and version 1 of the sequence. [Complete history]
Entry statusReviewed (UniProtKB/Swiss-Prot)
Annotation programCaenorhabditis annotation project

Relevant documents

SIMILARITY comments

Index of protein domains and families

Caenorhabditis elegans

Caenorhabditis elegans: entries, gene names and cross-references to WormBase