Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein

Histone-lysine N-methyltransferase, H3 lysine-4 specific

Gene

SET1

Organism
Saccharomyces cerevisiae (strain ATCC 204508 / S288c) (Baker's yeast)
Status
Reviewed-Annotation score: Annotation score: 5 out of 5-Experimental evidence at protein leveli

Functioni

Catalytic component of the COMPASS (Set1C) complex that specifically mono-, di- and trimethylates histone H3 to form H3K4me1/2/3, which subsequently plays a role in telomere length maintenance and transcription elongation regulation.12 Publications

Catalytic activityi

S-adenosyl-L-methionine + L-lysine-[histone] = S-adenosyl-L-homocysteine + N(6)-methyl-L-lysine-[histone].1 Publication

GO - Molecular functioni

  • histone-lysine N-methyltransferase activity Source: UniProtKB-EC
  • histone methyltransferase activity Source: SGD
  • protein-lysine N-methyltransferase activity Source: SGD
  • RNA binding Source: SGD

GO - Biological processi

  • ascospore formation Source: SGD
  • chromatin silencing at silent mating-type cassette Source: SGD
  • chromatin silencing at telomere Source: SGD
  • histone H3-K4 dimethylation Source: CACAO
  • histone H3-K4 methylation Source: SGD
  • histone H3-K4 trimethylation Source: CACAO
  • peptidyl-lysine dimethylation Source: SGD
  • positive regulation of histone acetylation Source: SGD
  • regulation of meiotic DNA double-strand break formation Source: SGD
  • regulation of transcription from RNA polymerase II promoter in response to stress Source: SGD
  • telomere maintenance Source: SGD
Complete GO annotation...

Keywords - Molecular functioni

Chromatin regulator, Methyltransferase, Transferase

Keywords - Ligandi

S-adenosyl-L-methionine

Enzyme and pathway databases

BioCyciYEAST:G3O-31161-MONOMER.
ReactomeiR-SCE-3214841. PKMTs methylate histone lysines.

Names & Taxonomyi

Protein namesi
Recommended name:
Histone-lysine N-methyltransferase, H3 lysine-4 specific (EC:2.1.1.43)
Alternative name(s):
COMPASS component SET1
Lysine N-methyltransferase 2
SET domain-containing protein 1
Gene namesi
Name:SET1
Synonyms:KMT2, YTX1
Ordered Locus Names:YHR119W
OrganismiSaccharomyces cerevisiae (strain ATCC 204508 / S288c) (Baker's yeast)
Taxonomic identifieri559292 [NCBI]
Taxonomic lineageiEukaryotaFungiDikaryaAscomycotaSaccharomycotinaSaccharomycetesSaccharomycetalesSaccharomycetaceaeSaccharomyces
Proteomesi
  • UP000002311 Componenti: Chromosome VIII

Organism-specific databases

EuPathDBiFungiDB:YHR119W.
SGDiS000001161. SET1.

Subcellular locationi

GO - Cellular componenti

  • chromosome, telomeric region Source: GOC
  • Set1C/COMPASS complex Source: SGD
Complete GO annotation...

Keywords - Cellular componenti

Chromosome, Nucleus

PTM / Processingi

Molecule processing

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Chaini1 – 10801080Histone-lysine N-methyltransferase, H3 lysine-4 specificPRO_0000186086Add
BLAST

Amino acid modifications

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Modified residuei625 – 6251PhosphoserineCombined sources
Modified residuei875 – 8751PhosphothreonineCombined sources

Keywords - PTMi

Phosphoprotein

Proteomic databases

MaxQBiP38827.
PRIDEiP38827.

PTM databases

iPTMnetiP38827.

Interactioni

Subunit structurei

Component of the COMPASS (Set1C) complex which consists of SET12, BRE22, SPP12, SDC11, SHG11, SWD11, SWD21, and SWD31. Interacts with MEC3.4 Publications

Binary interactionsi

WithEntry#Exp.IntActNotes
BRE2P431326EBI-16977,EBI-27115
MEC3Q025743EBI-16977,EBI-10658
SHG1P383375EBI-16977,EBI-21106
SWD2P361046EBI-16977,EBI-26608

Protein-protein interaction databases

BioGridi36552. 138 interactions.
DIPiDIP-4616N.
IntActiP38827. 30 interactions.
MINTiMINT-552558.

Structurei

Secondary structure

1
1080
Legend: HelixTurnBeta strand
Show more details
Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Beta strandi249 – 25810Combined sources
Helixi265 – 2739Combined sources
Beta strandi279 – 2857Combined sources
Turni287 – 2893Combined sources
Beta strandi292 – 2998Combined sources
Helixi310 – 32011Combined sources
Turni321 – 3244Combined sources
Beta strandi326 – 3283Combined sources
Beta strandi331 – 3377Combined sources
Helixi342 – 36423Combined sources

3D structure databases

Select the link destinations:
PDBei
RCSB PDBi
PDBji
Links Updated
EntryMethodResolution (Å)ChainPositionsPDBsum
2J8AX-ray3.00A247-375[»]
ProteinModelPortaliP38827.
SMRiP38827. Positions 247-366.
ModBaseiSearch...
MobiDBiSearch...

Miscellaneous databases

EvolutionaryTraceiP38827.

Family & Domainsi

Domains and Repeats

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Domaini938 – 1055118SETPROSITE-ProRule annotationAdd
BLAST
Domaini1064 – 108017Post-SETPROSITE-ProRule annotationAdd
BLAST

Sequence similaritiesi

Belongs to the class V-like SAM-binding methyltransferase superfamily.PROSITE-ProRule annotation
Contains 1 post-SET domain.PROSITE-ProRule annotation
Contains 1 SET domain.PROSITE-ProRule annotation

Phylogenomic databases

GeneTreeiENSGT00760000119228.
HOGENOMiHOG000066111.
InParanoidiP38827.
KOiK11422.
OMAiCEPSCTA.
OrthoDBiEOG092C3T9B.

Family and domain databases

InterProiIPR024657. COMPASS_Set1_N-SET.
IPR017111. Hist_H3-K4_MeTrfase_1_fun.
IPR003616. Post-SET_dom.
IPR024636. SET_assoc.
IPR001214. SET_dom.
[Graphical view]
PfamiPF11764. N-SET. 1 hit.
PF00856. SET. 1 hit.
PF11767. SET_assoc. 1 hit.
[Graphical view]
PIRSFiPIRSF037104. Histone_H3-K4_mtfrase_Set1_fun. 1 hit.
SMARTiSM01291. N-SET. 1 hit.
SM00508. PostSET. 1 hit.
SM00317. SET. 1 hit.
[Graphical view]
PROSITEiPS50868. POST_SET. 1 hit.
PS51572. SAM_MT43_1. 1 hit.
PS50280. SET. 1 hit.
[Graphical view]

Sequencei

Sequence statusi: Complete.

P38827-1 [UniParc]FASTAAdd to basket

« Hide

        10         20         30         40         50
MSNYYRRAHA SSGSYRQPQE QPQYSRSGHY QYSNGHSHQQ YSSQYNQRRR
60 70 80 90 100
YNHNDGTRRR YNDDRPHSSN NASTRQYYAT NNSQSGPYVN KKSDISSRRG
110 120 130 140 150
MSQSRYSNSN VHNTLASSSG SLPTESALLL QQRPPSVLRY NTDNLKSKFH
160 170 180 190 200
YFDPIKGEFF NKDKMLSWKA TDKEFSETGY YVVKELQDGQ FKFKIKHRHP
210 220 230 240 250
EIKASDPRNE NGIMTSGKVA THRKCRNSLI LLPRISYDRY SLGPPPSCEI
260 270 280 290 300
VVYPAQDSTT TNIQDISIKN YFKKYGEISH FEAFNDPNSA LPLHVYLIKY
310 320 330 340 350
ASSDGKINDA AKAAFSAVRK HESSGCFIMG FKFEVILNKH SILNNIISKF
360 370 380 390 400
VEINVKKLQK LQENLKKAKE KEAENEKAKE LQGKDITLPK EPKVDTLSHS
410 420 430 440 450
SGSEKRIPYD LLGVVNNRPV LHVSKIFVAK HRFCVEDFKY KLRGYRCAKF
460 470 480 490 500
IDHPTGIYII FNDIAHAQTC SNAESGNLTI MSRSRRIPIL IKFHLILPRF
510 520 530 540 550
QNRTRFNKSS SSSNSTNVPI KYESKEEFIE ATAKQILKDL EKTLHVDIKK
560 570 580 590 600
RLIGPTVFDA LDHANFPELL AKRELKEKEK RQQIASKIAE DELKRKEEAK
610 620 630 640 650
RDFDLFGLYG GYAKSNKRNL KRHNSLALDH TSLKRKKLSN GIKPMAHLLN
660 670 680 690 700
EETDSKETTP LNDEGITRVS KEHDEEDENM TSSSSEEEEE EAPDKKFKSE
710 720 730 740 750
SEPTTPESDH LHGIKPLVPD QNGSSDVLDA SSMYKPTATE IPEPVYPPEE
760 770 780 790 800
YDLKYSQTLS SMDLQNAIKD EEDMLILKQL LSTYTPTVTP ETSAALEYKI
810 820 830 840 850
WQSRRKVLEE EKASDWQIEL NGTLFDSELQ PGSSFKAEGF RKIADKLKIN
860 870 880 890 900
YLPHRRRVHQ PLNTVNIHNE RNEYTPELCQ REESSNKEPS DSVPQEVSSS
910 920 930 940 950
RDNRASNRRF QQDIEAQKAA IGTESELLSL NQLNKRKKPV MFARSAIHNW
960 970 980 990 1000
GLYALDSIAA KEMIIEYVGE RIRQPVAEMR EKRYLKNGIG SSYLFRVDEN
1010 1020 1030 1040 1050
TVIDATKKGG IARFINHCCD PNCTAKIIKV GGRRRIVIYA LRDIAASEEL
1060 1070 1080
TYDYKFEREK DDEERLPCLC GAPNCKGFLN
Length:1,080
Mass (Da):123,912
Last modified:February 1, 1995 - v1
Checksum:iB7FA5D60F71063FD
GO

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
U00059 Genomic DNA. Translation: AAB68867.1.
BK006934 Genomic DNA. Translation: DAA06813.1.
PIRiS48961.
RefSeqiNP_011987.1. NM_001179249.1.

Genome annotation databases

EnsemblFungiiYHR119W; YHR119W; YHR119W.
GeneIDi856519.
KEGGisce:YHR119W.

Cross-referencesi

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
U00059 Genomic DNA. Translation: AAB68867.1.
BK006934 Genomic DNA. Translation: DAA06813.1.
PIRiS48961.
RefSeqiNP_011987.1. NM_001179249.1.

3D structure databases

Select the link destinations:
PDBei
RCSB PDBi
PDBji
Links Updated
EntryMethodResolution (Å)ChainPositionsPDBsum
2J8AX-ray3.00A247-375[»]
ProteinModelPortaliP38827.
SMRiP38827. Positions 247-366.
ModBaseiSearch...
MobiDBiSearch...

Protein-protein interaction databases

BioGridi36552. 138 interactions.
DIPiDIP-4616N.
IntActiP38827. 30 interactions.
MINTiMINT-552558.

PTM databases

iPTMnetiP38827.

Proteomic databases

MaxQBiP38827.
PRIDEiP38827.

Protocols and materials databases

Structural Biology KnowledgebaseSearch...

Genome annotation databases

EnsemblFungiiYHR119W; YHR119W; YHR119W.
GeneIDi856519.
KEGGisce:YHR119W.

Organism-specific databases

EuPathDBiFungiDB:YHR119W.
SGDiS000001161. SET1.

Phylogenomic databases

GeneTreeiENSGT00760000119228.
HOGENOMiHOG000066111.
InParanoidiP38827.
KOiK11422.
OMAiCEPSCTA.
OrthoDBiEOG092C3T9B.

Enzyme and pathway databases

BioCyciYEAST:G3O-31161-MONOMER.
ReactomeiR-SCE-3214841. PKMTs methylate histone lysines.

Miscellaneous databases

EvolutionaryTraceiP38827.
PROiP38827.

Family and domain databases

InterProiIPR024657. COMPASS_Set1_N-SET.
IPR017111. Hist_H3-K4_MeTrfase_1_fun.
IPR003616. Post-SET_dom.
IPR024636. SET_assoc.
IPR001214. SET_dom.
[Graphical view]
PfamiPF11764. N-SET. 1 hit.
PF00856. SET. 1 hit.
PF11767. SET_assoc. 1 hit.
[Graphical view]
PIRSFiPIRSF037104. Histone_H3-K4_mtfrase_Set1_fun. 1 hit.
SMARTiSM01291. N-SET. 1 hit.
SM00508. PostSET. 1 hit.
SM00317. SET. 1 hit.
[Graphical view]
PROSITEiPS50868. POST_SET. 1 hit.
PS51572. SAM_MT43_1. 1 hit.
PS50280. SET. 1 hit.
[Graphical view]
ProtoNetiSearch...

Entry informationi

Entry nameiSET1_YEAST
AccessioniPrimary (citable) accession number: P38827
Secondary accession number(s): D3DL69
Entry historyi
Integrated into UniProtKB/Swiss-Prot: February 1, 1995
Last sequence update: February 1, 1995
Last modified: September 7, 2016
This is version 152 of the entry and version 1 of the sequence. [Complete history]
Entry statusiReviewed (UniProtKB/Swiss-Prot)
Annotation programFungal Protein Annotation Program

Miscellaneousi

Miscellaneous

Present with 172 molecules/cell in log phase SD medium.1 Publication

Keywords - Technical termi

3D-structure, Complete proteome, Reference proteome

Documents

  1. PDB cross-references
    Index of Protein Data Bank (PDB) cross-references
  2. SIMILARITY comments
    Index of protein domains and families
  3. Yeast
    Yeast (Saccharomyces cerevisiae): entries, gene names and cross-references to SGD
  4. Yeast chromosome VIII
    Yeast (Saccharomyces cerevisiae) chromosome VIII: entries and gene names

Similar proteinsi

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a seed sequence).
50%UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.