SubmitCancel

Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.

Q18221

- SET2_CAEEL

UniProt

Q18221 - SET2_CAEEL

(max 400 entries)x

Your basket is currently empty.

Select item(s) and click on "Add to basket" to create your own collection here
(400 entries max)

Protein
Probable histone-lysine N-methyltransferase set-2
Gene
set-2, C26E6.9
Organism
Caenorhabditis elegans
Status
Reviewed - Annotation score: 5 out of 5 - Experimental evidence at transcript leveli

Functioni

Probable histone methyltransferase involved in chromatin modification and/or remodeling in meiotic germ cells. May act redundantly with mes-3 and mes-4 proteins. Required for RNAi. Functions as an antagonist of hpl-1 and hpl-2 activity in growth and somatic gonad development.4 Publications

Catalytic activityi

S-adenosyl-L-methionine + L-lysine-[histone] = S-adenosyl-L-homocysteine + N(6)-methyl-L-lysine-[histone].

GO - Molecular functioni

  1. RNA binding Source: UniProtKB-KW
  2. histone methyltransferase activity (H3-K4 specific) Source: WormBase
  3. nucleotide binding Source: InterPro
Complete GO annotation...

GO - Biological processi

  1. determination of adult lifespan Source: WormBase
  2. histone H3-K4 methylation Source: WormBase
  3. regulation of transcription, DNA-templated Source: UniProtKB-KW
  4. transcription, DNA-templated Source: UniProtKB-KW
Complete GO annotation...

Keywords - Molecular functioni

Chromatin regulator, Developmental protein, Methyltransferase, Transferase

Keywords - Biological processi

Transcription, Transcription regulation

Keywords - Ligandi

RNA-binding, S-adenosyl-L-methionine

Names & Taxonomyi

Protein namesi
Recommended name:
Probable histone-lysine N-methyltransferase set-2 (EC:2.1.1.43)
Alternative name(s):
SET domain-containing protein 2
Gene namesi
Name:set-2
ORF Names:C26E6.9
OrganismiCaenorhabditis elegans
Taxonomic identifieri6239 [NCBI]
Taxonomic lineageiEukaryotaMetazoaEcdysozoaNematodaChromadoreaRhabditidaRhabditoideaRhabditidaePeloderinaeCaenorhabditis
ProteomesiUP000001940: Chromosome III

Organism-specific databases

WormBaseiC26E6.9a; CE27735; WBGene00004782; set-2.
C26E6.9b; CE01158; WBGene00004782; set-2.
C26E6.9c; CE27736; WBGene00004782; set-2.

Subcellular locationi

Nucleus
Note: Localized in mitotic and mid-late-stage meiotic nuclei but is undetectable in early pachytene nuclei.1 Publication

GO - Cellular componenti

  1. nucleus Source: WormBase
Complete GO annotation...

Keywords - Cellular componenti

Nucleus

PTM / Processingi

Molecule processing

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Chaini1 – 15071507Probable histone-lysine N-methyltransferase set-2
PRO_0000097695Add
BLAST

Proteomic databases

PaxDbiQ18221.
PRIDEiQ18221.

Expressioni

Tissue specificityi

Expressed in all cells of embryo. In L1 larva, it is predominantly expressed in Z2 and Z3 primordial germ cells. In adults, it is predominantly expressed in the germline.1 Publication

Developmental stagei

Expressed throughout embryogenesis.

Interactioni

Protein-protein interaction databases

BioGridi40896. 2 interactions.

Structurei

3D structure databases

ProteinModelPortaliQ18221.
SMRiQ18221. Positions 124-222, 1340-1507.

Family & Domainsi

Domains and Repeats

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Domaini128 – 19972RRM
Add
BLAST
Domaini1368 – 1485118SET
Add
BLAST
Domaini1491 – 150717Post-SET
Add
BLAST

Compositional bias

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Compositional biasi296 – 35459Pro-rich
Add
BLAST
Compositional biasi554 – 664111Pro-rich
Add
BLAST
Compositional biasi870 – 1011142Ser-rich
Add
BLAST

Sequence similaritiesi

Contains 1 post-SET domain.
Contains 1 SET domain.

Phylogenomic databases

eggNOGiCOG2940.
GeneTreeiENSGT00740000115089.
HOGENOMiHOG000021414.
InParanoidiQ18221.
KOiK11422.
OMAiYCTIPPK.

Family and domain databases

Gene3Di3.30.70.330. 1 hit.
InterProiIPR012677. Nucleotide-bd_a/b_plait.
IPR003616. Post-SET_dom.
IPR000504. RRM_dom.
IPR001214. SET_dom.
[Graphical view]
PfamiPF00076. RRM_1. 1 hit.
PF00856. SET. 1 hit.
[Graphical view]
SMARTiSM00508. PostSET. 1 hit.
SM00360. RRM. 1 hit.
SM00317. SET. 1 hit.
[Graphical view]
PROSITEiPS50868. POST_SET. 1 hit.
PS50280. SET. 1 hit.
[Graphical view]

Sequences (3)i

Sequence statusi: Complete.

This entry describes 3 isoformsi produced by alternative splicing. Align

Isoform a (identifier: Q18221-1) [UniParc]FASTAAdd to Basket

Also known as: L

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.

« Hide

MSTHDMNHHP PRKSHSKRDK PSSSNSGPKI ENHKCKWAWQ KVFETGKSFL     50
RRDGFPQDCK SKEDFERIKR TGVRKTSENM LEDPRKNFES LQQSSVYQTN 100
SFRNPRYLCR AHLRVDSYYC TIPPKREVSL FNMDDNCTEV LLRDFAKDCG 150
KVEKAYVCIH PETKRHMKMA YVKFATVKEA HNFYSMYHAQ NLLATKCTPR 200
IDPFLSILNE EYEVATNGQV LPILPDDLAS IDPSVLRDLR ANFLRDQNEK 250
YELAMRNTYE DEGGMLSGVI MDTSDHYERD YTMDHDVGPS SMKMSPIPPP 300
PIKEESPPPP PPPPVASVSN LAPVPSVQLP YYNNIQPSSS TMHMPEFRPT 350
EPPPSYSRED PYRSTSRSSL SRHRNRSRSP SDGMDRSGRS SSRRTHRRPE 400
SRNGSKNANG DVVKYETYKM EKRKIKYEGG NKKYEQVHIK ERTAVIRGKN 450
QLENVSSESA SGSSSVDTYP DFSDEERKKK KRPKSPNRSK KDSRAFGWDS 500
TDESDEDTRR RRSGRSQNRS SERKFQTTSS SSTRRELSST HTNSVPNLKS 550
HETPPPPPPK GHPSVHLQTP YQHVQPQMIP ATYYNLPPQH MAPPPITTSL 600
PPFCDFSQPP PGFTPTFKPI TNAPLPTPYQ ASNIPQPGLV QIAALSAAPE 650
PFSSIPGPPP GPAPIQEDVG RAESPEKPSL SERFSGIFGP TQREEPAQVE 700
VEYDYPLKHS ESHDDRHSLE DMDVEVSSDG ETVSNVEKIE CMEEKKRQDL 750
ERIAIARTPI VKKCKKRMMD ELSRKVAEDI RQQIMRQCFA ALDEKLHLKA 800
IADEEKRKKE REEKARQEAE KPSNHLIADM MTLYNNQSFA SSSRGFYRKQ 850
KPIPKSHPKH QEHHHHAKAS VSTPVHSSST SRNSSVAPTP QRTVSTSSSS 900
SSAATSARVS EDESDSDSTP GEVQRRKTSV LSNDKRRRRA SFSSTSIQSS 950
PERQRDVSSS SRTSSSSSTS SMKQEETADE KSRKRKLIMS SDESSTTGST 1000
ATSVVSSRQS SLEPQQEKTD GEPPKKKSQT DFISERVSKI EGEERPLPEP 1050
VETSGPIIGD SSYLPYKIVH WEKAGIIEMN LPANSIRAHE YHPFTTEHCY 1100
FGIDDPRQPK IQIFDHSPCK SEPGSEPLKI TPAPWGPIDN VAETGPLIYM 1150
DVVTAPKTVQ KKQKPRKQVF EKDPYEYYEP PPTKRPAPPP RFKKTFKPRS 1200
EEEKKKIIGD CEDLPDLEDQ WYLRAALNEM QSEVKSADEL PWKKMLTFKE 1250
MLRSEDPLLR LNPIRSKKGL PDAFYEDEEL DGVIPVAAGC SRARPYEKMT 1300
MKQKRSLVRR PDNESHPTAI FSERDETAIR HQHLASKDMR LLQRRLLTSL 1350
GDANNDFFKI NQLKFRKKMI KFARSRIHGW GLYAMESIAP DEMIVEYIGQ 1400
TIRSLVAEER EKAYERRGIG SSYLFRIDLH HVIDATKRGN FARFINHSCQ 1450
PNCYAKVLTI EGEKRIVIYS RTIIKKGEEI TYDYKFPIED DKIDCLCGAK 1500
TCRGYLN 1507
Length:1,507
Mass (Da):171,683
Last modified:April 23, 2003 - v2
Checksum:iE7D9689DA720C34A
GO
Isoform b (identifier: Q18221-2) [UniParc]FASTAAdd to Basket

Also known as: S

The sequence of this isoform differs from the canonical sequence as follows:
     1-768: Missing.
     769-831: MDELSRKVAE...PSNHLIADMM → MYNNSAPYLN...PQRVYRSINS

Show »
Length:739
Mass (Da):83,999
Checksum:i6588F8D83C72BABD
GO
Isoform c (identifier: Q18221-3) [UniParc]FASTAAdd to Basket

The sequence of this isoform differs from the canonical sequence as follows:
     831-831: M → MPSQ

Note: No experimental confirmation available.

Show »
Length:1,510
Mass (Da):171,995
Checksum:iF301028054840D2E
GO

Alternative sequence

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Alternative sequencei1 – 768768Missing in isoform b.
VSP_007217Add
BLAST
Alternative sequencei769 – 83163MDELS…IADMM → MYNNSAPYLNHSSLNTVRKK VVTVRRVLPSLPPPPPPPPS LYPPCSVFKVPYIPQRVYRS INS in isoform b.
VSP_007218Add
BLAST
Alternative sequencei831 – 8311M → MPSQ in isoform c.
VSP_038347

Sequence databases

Select the link destinations:
EMBL
GenBank
DDBJ
Links Updated
FO080680 Genomic DNA. Translation: CCD65735.1.
FO080680 Genomic DNA. Translation: CCD65734.1.
FO080680 Genomic DNA. Translation: CCD65736.1.
PIRiA88445.
RefSeqiNP_498039.1. NM_065638.3. [Q18221-3]
NP_498040.1. NM_065639.4. [Q18221-1]
NP_498041.1. NM_065640.3. [Q18221-2]
UniGeneiCel.8145.

Genome annotation databases

EnsemblMetazoaiC26E6.9a; C26E6.9a; WBGene00004782. [Q18221-1]
GeneIDi175662.
KEGGicel:CELE_C26E6.9.
UCSCiC26E6.9a. c. elegans. [Q18221-1]

Keywords - Coding sequence diversityi

Alternative splicing

Cross-referencesi

Sequence databases

Select the link destinations:
EMBL
GenBank
DDBJ
Links Updated
FO080680 Genomic DNA. Translation: CCD65735.1 .
FO080680 Genomic DNA. Translation: CCD65734.1 .
FO080680 Genomic DNA. Translation: CCD65736.1 .
PIRi A88445.
RefSeqi NP_498039.1. NM_065638.3. [Q18221-3 ]
NP_498040.1. NM_065639.4. [Q18221-1 ]
NP_498041.1. NM_065640.3. [Q18221-2 ]
UniGenei Cel.8145.

3D structure databases

ProteinModelPortali Q18221.
SMRi Q18221. Positions 124-222, 1340-1507.
ModBasei Search...

Protein-protein interaction databases

BioGridi 40896. 2 interactions.

Proteomic databases

PaxDbi Q18221.
PRIDEi Q18221.

Protocols and materials databases

Structural Biology Knowledgebase Search...

Genome annotation databases

EnsemblMetazoai C26E6.9a ; C26E6.9a ; WBGene00004782 . [Q18221-1 ]
GeneIDi 175662.
KEGGi cel:CELE_C26E6.9.
UCSCi C26E6.9a. c. elegans. [Q18221-1 ]

Organism-specific databases

CTDi 175662.
WormBasei C26E6.9a ; CE27735 ; WBGene00004782 ; set-2.
C26E6.9b ; CE01158 ; WBGene00004782 ; set-2.
C26E6.9c ; CE27736 ; WBGene00004782 ; set-2.

Phylogenomic databases

eggNOGi COG2940.
GeneTreei ENSGT00740000115089.
HOGENOMi HOG000021414.
InParanoidi Q18221.
KOi K11422.
OMAi YCTIPPK.

Miscellaneous databases

NextBioi 889112.
PROi Q18221.

Family and domain databases

Gene3Di 3.30.70.330. 1 hit.
InterProi IPR012677. Nucleotide-bd_a/b_plait.
IPR003616. Post-SET_dom.
IPR000504. RRM_dom.
IPR001214. SET_dom.
[Graphical view ]
Pfami PF00076. RRM_1. 1 hit.
PF00856. SET. 1 hit.
[Graphical view ]
SMARTi SM00508. PostSET. 1 hit.
SM00360. RRM. 1 hit.
SM00317. SET. 1 hit.
[Graphical view ]
PROSITEi PS50868. POST_SET. 1 hit.
PS50280. SET. 1 hit.
[Graphical view ]
ProtoNeti Search...

Publicationsi

« Hide 'large scale' publications
  1. "Genome sequence of the nematode C. elegans: a platform for investigating biology."
    The C. elegans sequencing consortium
    Science 282:2012-2018(1998) [PubMed] [Europe PMC] [Abstract]
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA], ALTERNATIVE SPLICING.
    Strain: Bristol N2.
  2. "Depletion of a novel SET-domain protein enhances the sterility of mes-3 and mes-4 mutants of Caenorhabditis elegans."
    Xu L., Strome S.
    Genetics 159:1019-1029(2001) [PubMed] [Europe PMC] [Abstract]
    Cited for: FUNCTION, SUBCELLULAR LOCATION, TISSUE SPECIFICITY, ALTERNATIVE SPLICING.
  3. "A targeted RNAi screen for genes involved in chromosome morphogenesis and nuclear organization in the Caenorhabditis elegans germline."
    Colaiacovo M.P., Stanfield G.M., Reddy K.C., Reinke V., Kim S.K., Villeneuve A.M.
    Genetics 162:113-128(2002) [PubMed] [Europe PMC] [Abstract]
    Cited for: FUNCTION.
  4. "Telomeric position effect variegation in Saccharomyces cerevisiae by Caenorhabditis elegans linker histones suggests a mechanistic connection between germ line and telomeric silencing."
    Jedrusik M.A., Schulze E.
    Mol. Cell. Biol. 23:3681-3691(2003) [PubMed] [Europe PMC] [Abstract]
    Cited for: FUNCTION.
  5. "Antagonistic functions of SET-2/SET1 and HPL/HP1 proteins in C. elegans development."
    Simonet T., Dulermo R., Schott S., Palladino F.
    Dev. Biol. 312:367-383(2007) [PubMed] [Europe PMC] [Abstract]
    Cited for: FUNCTION.

Entry informationi

Entry nameiSET2_CAEEL
AccessioniPrimary (citable) accession number: Q18221
Secondary accession number(s): Q95QU6, Q95QU7
Entry historyi
Integrated into UniProtKB/Swiss-Prot: April 23, 2003
Last sequence update: April 23, 2003
Last modified: June 11, 2014
This is version 111 of the entry and version 2 of the sequence. [Complete history]
Entry statusiReviewed (UniProtKB/Swiss-Prot)
Annotation programCaenorhabditis annotation project

Miscellaneousi

Keywords - Technical termi

Complete proteome, Reference proteome

Documents

  1. Caenorhabditis elegans
    Caenorhabditis elegans: entries, gene names and cross-references to WormBase
  2. SIMILARITY comments
    Index of protein domains and families

External Data

Dasty 3

Similar proteinsi