Skip Header

 
Contribute Send feedback
Read comments (0) or add your own

Reviewed, UniProtKB/Swiss-Prot Q4PB36 (SET1_USTMA)

Last modified June 16, 2009. Version 35. Feed History...

Clusters with 100%, 90%, 50% identity | Documents (1) | Third-party data | Customize display text xml rdf/xml gff fasta
Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Sequence annotation (Features) · Sequences · References · Cross-references · Entry information · Relevant documents

Names and origin

Protein namesRecommended name:
    Histone-lysine N-methyltransferase, H3 lysine-4 specific
    EC=2.1.1.43
Alternative name(s):
    COMPASS component SET1
    SET domain-containing protein 1
Gene names
Name: SET1
ORF Names: UM02677
OrganismUstilago maydis (Smut fungus) [Complete proteome]
Taxonomic identifier5270 [NCBI]
Taxonomic lineageEukaryotaFungiDikaryaBasidiomycotaUstilaginomycotinaUstilaginomycetesUstilaginalesUstilaginaceaeUstilago

Protein attributes

Sequence length1468 AA.
Sequence statusComplete.
Sequence processingThe displayed sequence is not processed.
Protein existenceInferred from homology.

General annotation (Comments)

Function

Catalytic component of the COMPASS (Set1C) complex that specifically mono-, di- and trimethylates histone H3 to form H3K4me1/2/3, which subsequently plays a role in telomere length maintenance and transcription elongation regulation By similarity.

Catalytic activity

S-adenosyl-L-methionine + histone L-lysine = S-adenosyl-L-homocysteine + histone N(6)-methyl-L-lysine.

Subunit structure

Component of the COMPASS (Set1C) complex By similarity.

Subcellular location

Nucleus Probable.

Sequence similarities

Contains 1 post-SET domain.

Contains 1 SET domain.

Ontologies

Keywords
   Cellular componentChromosomal protein
Nucleus
   LigandRNA-binding
S-adenosyl-L-methionine
   Molecular functionChromatin regulator
Methyltransferase
Transferase
   Technical termComplete proteome
Gene Ontology (GO)
   Biological processchromatin modification

Inferred from electronic annotation. Source: UniProtKB-KW

   Cellular componentchromosome

Inferred from electronic annotation. Source: UniProtKB-KW

nucleus

Inferred from electronic annotation. Source: UniProtKB-SubCell

   Molecular functionRNA binding

Inferred from electronic annotation. Source: UniProtKB-KW

histone-lysine N-methyltransferase activity

Inferred from electronic annotation. Source: EC

Complete GO annotation...

Sequence annotation (Features)

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifier

Molecule processing

Chain1 – 14681468Histone-lysine N-methyltransferase, H3 lysine-4 specific
PRO_0000269777

Regions

Domain1326 – 1448123SET
Domain1453 – 146816Post-SET
Compositional bias27 – 275249Arg-rich
Compositional bias641 – 68545Pro-rich

Sequences

Sequence LengthMass (Da)Tools
Q4PB36-1 [UniParc].

Last modified July 19, 2005. Version 1.
Checksum: 7CC6632973149DF9

FASTA1,468164,530
        10         20         30         40         50         60 
MPYSSQQNGY TSASTSRLSE QTSSHSRSSR EDRHLTEKGR RPPSPEARHR SDRDYDRRRS 

        70         80         90        100        110        120 
TEYVRDDDYR RSSRSSHDSR YADAYDHWRS ARSAYSPTPR DDRRDEARND LSSTKRHRSP 

       130        140        150        160        170        180 
EHSTSRLRHR SPESAHRRQN GTANRLDSKP DRGGDRKTGE ALDSGRSRWS QRAYEYDDWR 

       190        200        210        220        230        240 
NERPSARYER YRHDREPHRS RREDEYETKR SRDDSNGNSI YAPTRRSRSR SRSRSRSRDR 

       250        260        270        280        290        300 
YRSRDHSRER RRERSRDRSN GTYSSRDDRR PKADRSAHTI KRDEHSTRLN GTSEDSKDLR 

       310        320        330        340        350        360 
HESQRRVSAS VQSASEGPAS TPVARAVYIK HAEVDQEAPA PPTTRDYHSC PQRWPDQADS 

       370        380        390        400        410        420 
AVRASSAPNG SATAPSRSDR PPANGSSGRH SPRSLPTREK AEEARTSSTR RPSSQTNDNV 

       430        440        450        460        470        480 
NNSRDPLTQR KATSERSFGH VLLPHELPVE CRGKNYMATA TYKEGVKSIY KSAADKHLVD 

       490        500        510        520        530        540 
VDTRDPRRLG KKSSRYRESL HSASFRWDSN SRGKKPLPPP RNLVLTNLSG LLQPHQILLH 

       550        560        570        580        590        600 
ILPHGRIESS KLEIDPKIGQ SLGIFRVTFA HDFDEHGKPL ESMPAGQNPQ HGAKVAKAAC 

       610        620        630        640        650        660 
LALNGRMIGQ TRAQAFLDRD GEVIAERIKA KLAENEHKLR PTIVPPAPPA AASSSPATPS 

       670        680        690        700        710        720 
TTKQSMPPPQ VPRGPKVFMP AAPSPSYASS PASARANTDR YEYSATSHSR YRSSYEESRK 

       730        740        750        760        770        780 
LASSETYHRR RGTEEYDTYN RSKPYADAQV PAGSRSETRK DIKRPDEEIL NELRDKKRPY 

       790        800        810        820        830        840 
VHIPRPKNCD IDVTSVEAQL RSTAPIWVRE GQKGFYAAFH TSKEANQCKV VNETLTIGGY 

       850        860        870        880        890        900 
TLQVDVRSAP SQHAPSQQIR TPSGKHASVP LSMPAPPKQE RKAIDTGLRP PTADEKLKVD 

       910        920        930        940        950        960 
WSAAELQDAV FRMLQKELAD TFVRDVKSRV VGPYLTAYLK PDGEGGKMLA KATMKKPVIP 

       970        980        990       1000       1010       1020 
TSINDHGTTL FEATGEARLP SFRKLAGAHP KKKASDADTT TSQAKRDQTD AKKKRGHTHR 

      1030       1040       1050       1060       1070       1080 
SKVHRDRDVS SSENESDDME RGMVVAARRN SYTRSKSSTK RRGAAAWLLE ASDAEAGTDD 

      1090       1100       1110       1120       1130       1140 
VDSTETDALS RSVSASVEPT GEEQIEVDVG AKAKKIPKVK AATVSKKKGT TAARKKLDVA 

      1150       1160       1170       1180       1190       1200 
PPEAVVEADQ GSETATPETD VPIKTAAAKA KVKPAKTSAK AKSALVDPFE AGLVEDSEDC 

      1210       1220       1230       1240       1250       1260 
HYLRLALEHL SRTGELASEH TLPDEIELEV EAEEQAMAAG GIPKHSTGSA RTEGYYRIPP 

      1270       1280       1290       1300       1310       1320 
EQKAMHLPDR NKATEDVDTS SNAQILQSAR NNRADSRRLV LGIEQHKRET ATDTDIFKFN 

      1330       1340       1350       1360       1370       1380 
QLRTRKKQLK FAKSPIHDWG LYAMELIPAG DMVIEYVGEV VRQQVADERE KQYERQGNFS 

      1390       1400       1410       1420       1430       1440 
TYLFRVDDDL VVDATHKGNI ARLMNHCCTP NCNAKILTLN GEKRIVLFAK TAIRAGEELT 

      1450       1460 
YDYKFQSSAD DEDAIPCLCG SPGCRRFL 

« Hide

Cross-references

Sequence databases

AACP01000088 Genomic DNA. Translation: EAK83847.1.
RefSeqXP_758824.1.

3D structure databases

ModBaseSearch...

Genome annotation databases

GeneID3630740.
KEGGuma:UM02677.1.

Enzyme and pathway databases

BRENDA2.1.1.43. 2320.

Family and domain databases

InterProIPR015722. MLL.
IPR003616. Post-SET_Zn_bd.
IPR001214. SET.
[Graphical view]
PANTHERPTHR22884:SF10. MLL. 1 hit.
PfamPF00856. SET. 1 hit.
[Graphical view]
SMARTSM00508. PostSET. 1 hit.
SM00317. SET. 1 hit.
[Graphical view]
PROSITEPS50868. POST_SET. 1 hit.
PS50280. SET. 1 hit.
[Graphical view]
ProtoNetSearch...

Entry information

Entry nameSET1_USTMA
AccessionPrimary (citable) accession number: Q4PB36
Entry history
Integrated into UniProtKB/Swiss-Prot: January 9, 2007
Last sequence update: July 19, 2005
Last modified: June 16, 2009
This is version 35 of the entry and version 1 of the sequence. [Complete history]
Entry statusReviewed (UniProtKB/Swiss-Prot)
Annotation projectFPAP (Fungal Proteome Annotation Project)

Relevant documents

SIMILARITY comments

Index of protein domains and families

Names and origin · Protein attributes · General annotation (Comments) · Ontologies · Sequence annotation (Features) · Sequences · References · Cross-references · Entry information · Relevant documents