Protein

Histone-lysine N-methyltransferase

Gene

PGUG_01518

Organism
Meyerozyma guilliermondii (strain ATCC 6260 / CBS 566 / DSM 6381 / JCM 1539 / NBRC 10279 / NRRL Y-324) (Yeast) (Candida guilliermondii)
Status
Unreviewed. Annotation score: 3 out of 5. Protein inferred from homology.

Function

Catalytic activity

S-adenosyl-L-methionine + L-lysine-[histone] = S-adenosyl-L-homocysteine + N(6)-methyl-L-lysine-[histone]. (PROSITE-ProRule annotation, SAAS annotation)

GO - Molecular function

GO - Biological process

Keywords

Molecular function: Methyltransferase (PROSITE-ProRule annotation, SAAS annotation), Transferase
Ligand: S-adenosyl-L-methionine (PROSITE-ProRule annotation, SAAS annotation)

Names & Taxonomy

Protein names
Recommended name: Histone-lysine N-methyltransferase (EC 2.1.1.43) (PROSITE-ProRule annotation)
Gene names
ORF names: PGUG_01518 (Imported)
Organism: Meyerozyma guilliermondii (strain ATCC 6260 / CBS 566 / DSM 6381 / JCM 1539 / NBRC 10279 / NRRL Y-324) (Yeast) (Candida guilliermondii) (Imported)
Taxonomic identifier: 294746 [NCBI]
Taxonomic lineage: Eukaryota > Fungi > Dikarya > Ascomycota > Saccharomycotina > Saccharomycetes > Saccharomycetales > Debaryomycetaceae > Meyerozyma
Proteomes
  • UP000001997 Component: Unassembled WGS sequence

Subcellular location

  • Nucleus (SAAS annotation)

GO - Cellular component

Keywords - Cellular component

Nucleus (SAAS annotation)

Interaction

Protein-protein interaction databases

STRING: 294746.XP_001485847.1.

Family & Domains

Domains and Repeats

Feature key   Position(s)   Description                       Length
Domain        50 – 106      AWS (InterPro annotation)         57
Domain        108 – 225     SET (InterPro annotation)         118
Domain        232 – 248     Post-SET (InterPro annotation)    17
Domain        464 – 498     WW (InterPro annotation)          35
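The Length column follows from the positions, which are 1-based and inclusive, so length = end - start + 1. The following is a minimal Python sketch under that assumption, using the coordinates listed in the table; the names `DOMAINS`, `feature_length` and `slice_feature` are illustrative only and do not belong to any UniProt tooling.

```python
# Domain coordinates copied from the table above (1-based, inclusive).
DOMAINS = {
    "AWS": (50, 106),
    "SET": (108, 225),
    "Post-SET": (232, 248),
    "WW": (464, 498),
}


def feature_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive start and end positions."""
    return end - start + 1


def slice_feature(sequence: str, start: int, end: int) -> str:
    """Extract a feature from a sequence using 1-based inclusive positions."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError("feature coordinates fall outside the sequence")
    # Python slices are 0-based and end-exclusive, hence start - 1.
    return sequence[start - 1:end]


lengths = {name: feature_length(s, e) for name, (s, e) in DOMAINS.items()}
print(lengths)
```

Passing the full 707-residue sequence to `slice_feature` together with a pair from `DOMAINS` would return the residues of that domain.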

Coiled coil

Feature key   Position(s)   Description         Length
Coiled coil   527 – 552     Sequence analysis   26

Sequence similarities

Belongs to the class V-like SAM-binding methyltransferase superfamily. Histone-lysine methyltransferase family. SET2 subfamily. (PROSITE-ProRule annotation)

Keywords - Domain

Coiled coil (Sequence analysis)

Phylogenomic databases

eggNOG:
  KOG4442. Eukaryota.
  COG2940. LUCA.
InParanoid: A5DE17.
KO: K11423.
OrthoDB: EOG092C3T9B.

Family and domain databases

Gene3D: 1.20.930.10. 1 hit.
InterPro:
  IPR006560. AWS_dom.
  IPR025788. Hist-Lys_N-MeTrfase_SET2_fun.
  IPR003616. Post-SET_dom.
  IPR001214. SET_dom.
  IPR013257. SRI.
  IPR017923. TFIIS_N.
  IPR001202. WW_dom.
Pfam:
  PF08711. Med26. 1 hit.
  PF00856. SET. 1 hit.
  PF08236. SRI. 1 hit.
  PF00397. WW. 1 hit.
SMART:
  SM00570. AWS. 1 hit.
  SM00508. PostSET. 1 hit.
  SM00317. SET. 1 hit.
  SM00456. WW. 1 hit.
SUPFAM:
  SSF47676. SSF47676. 1 hit.
  SSF51045. SSF51045. 1 hit.
PROSITE:
  PS51215. AWS. 1 hit.
  PS50868. POST_SET. 1 hit.
  PS51568. SAM_MT43_SET2_1. 1 hit.
  PS50280. SET. 1 hit.
  PS01159. WW_DOMAIN_1. 1 hit.
  PS50020. WW_DOMAIN_2. 1 hit.

Sequence

Sequence status: Complete.

A5DE17-1 [UniParc]

        10         20         30         40         50
MSGSASPVKK SRTPPLFLDS QDKTDEARTS FQEISECIYA NKQLGHSGQN
        60         70         80         90        100
EQMTCDCHEK WDQATQRNLA CGEHSECINR ATSVECVNKS CGCGDDCQNQ
       110        120        130        140        150
RFQKKEYANV SVIQTELKGY GLRANEEINE SGFIYEYVGE VINEESFRKR
       160        170        180        190        200
MVEYDEKKFP HFYFMMLKKD SFIDATIKGS LARFCNHSCS PNAYVDKWVV
       210        220        230        240        250
GDKLRMGIFA KRLIQAGEEI TFDYNVDRYG AQSQPCYCGE PNCIKVMGGK
       260        270        280        290        300
TQTDAALLLP DGISEALGVT HQMERQWLKE NKHLRSKQQK DDSIINEAFV
       310        320        330        340        350
KSIEVTPIED SEVSKVMGAL MKEQDVNITR KLVERIYLTE EPTIHSSIVR
       360        370        380        390        400
MHGYKTLSQV LQTCDNDTST IIQTLTILSK WPKVTRNKIS SSQIEDVVRQ
       410        420        430        440        450
LHKQSKNKKI TQLSSDLLKE WGNLQMAYRI PKNVNGERGS SSPSVYGRGA
       460        470        480        490        500
RSKSPEPTEP APEEPLPEGW NMTIDPNTQK PYYYHMQLGI SRWDRPVSEP
       510        520        530        540        550
PKGPSFPKGP KNEPQRKRTG NFAEDILARQ EEERLKQERE NQFKEIQQKE
       560        570        580        590        600
RMLQDVILQS QRQAEEKRQA DEKARLEKLA KSRERHQKHQ KHKKNKSDNN
       610        620        630        640        650
VSIEQQWAKL FAKHIPNMIK KHEKTIGHDN VKGCAKDLVK ILASKEIKKH
       660        670        680        690        700
PDTAPPTELD RAKLKKIKEF SSMFMDKFLQ KYEAKRDGKR KQDNGEDEAE

AKKHKKE
Length: 707
Mass (Da): 81,329
Last modified: July 22, 2008 - v2
Checksum: BBE0963B4FA7346B
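The sequence listing above interleaves position rulers with groups of ten residues, so it cannot be used directly as a sequence string. The sketch below shows one way to recover the bare residues, assuming that every uppercase letter in the block is a residue and everything else (digits, spaces, line breaks) is layout. The function name `parse_sequence_block` is hypothetical, and only the first two rows of the listing are included for brevity.

```python
import re


def parse_sequence_block(text: str) -> str:
    """Return the bare residue string from a ruler-annotated sequence block."""
    # Keep uppercase letters only; rulers, spaces and newlines are dropped.
    return re.sub(r"[^A-Z]", "", text)


# First two rows of the listing (residues 1-100), copied from the entry.
BLOCK = """
        10         20         30         40         50
MSGSASPVKK SRTPPLFLDS QDKTDEARTS FQEISECIYA NKQLGHSGQN
        60         70         80         90        100
EQMTCDCHEK WDQATQRNLA CGEHSECINR ATSVECVNKS CGCGDDCQNQ
"""

fragment = parse_sequence_block(BLOCK)
print(len(fragment), fragment[:10])
```

Applied to the complete listing, the same function should return a string whose length matches the reported Length of 707.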

Sequence databases

EMBL / GenBank / DDBJ: CH408156 Genomic DNA. Translation: EDK37420.2.
RefSeq: XP_001485847.1. XM_001485797.1.

Genome annotation databases

EnsemblFungi: EDK37420; EDK37420; PGUG_01518.
GeneID: 5128463.
KEGG: pgu:PGUG_01518.

Entry information

Entry name: A5DE17_PICGU
Accession: Primary (citable) accession number: A5DE17
Entry history: Integrated into UniProtKB/TrEMBL: June 12, 2007
Last sequence update: July 22, 2008
Last modified: April 12, 2017
This is version 75 of the entry and version 2 of the sequence.
Entry status: Unreviewed (UniProtKB/TrEMBL)

Miscellaneous

Keywords - Technical term

Complete proteome, Reference proteome (Imported)

Similar proteins

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%: UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%: UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a. seed sequence).
50%: UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.
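The UniRef90 and UniRef50 criteria above reduce to two thresholds, one on sequence identity and one on overlap with the seed sequence. The sketch below encodes only those stated thresholds. It assumes identity and overlap percentages have already been computed by some alignment step, which the entry does not describe, and `UNIREF_THRESHOLDS` and `qualifies` are illustrative names rather than part of the UniRef pipeline.

```python
# Thresholds taken from the UniRef90 and UniRef50 descriptions above.
# UniRef100 is omitted because it uses a different rule (identical
# sequences and sub-fragments of 11 or more residues).
UNIREF_THRESHOLDS = {
    "UniRef90": {"min_identity": 90.0, "min_overlap": 80.0},
    "UniRef50": {"min_identity": 50.0, "min_overlap": 80.0},
}


def qualifies(level: str, identity_pct: float, overlap_pct: float) -> bool:
    """Check precomputed identity/overlap (in percent) against a UniRef level."""
    limits = UNIREF_THRESHOLDS[level]
    return (identity_pct >= limits["min_identity"]
            and overlap_pct >= limits["min_overlap"])


# Hypothetical values: 92% identity over 85% of the seed sequence.
print(qualifies("UniRef90", 92.0, 85.0))
```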