Skip Header

You are using a version of Internet Explorer that may not display all features of this website. Please upgrade to a modern browser.
Contribute Send feedback
Read comments (?) or add your own

A7TGI1 (A7TGI1_VANPO) Unreviewed, UniProtKB/TrEMBL

Last modified April 16, 2014. Version 42. Feed History...

Clusters with 100%, 90%, 50% identity | Third-party data text xml rdf/xml gff fasta
to top of pageNames·Attributes·General annotation·Ontologies·Sequences·References·Cross-refs·Entry infoCustomize order

Names and origin

Protein namesRecommended name:
Histone-lysine N-methyltransferase, H3 lysine-4 specific PIRNR PIRNR037104

EC=2.1.1.43 PIRNR PIRNR037104
Gene names
ORF Names:Kpol_1048p18 EMBL EDO18588.1
OrganismVanderwaltozyma polyspora (strain ATCC 22028 / DSM 70294) (Kluyveromyces polysporus) [Complete proteome]
Taxonomic identifier436907 [NCBI]
Taxonomic lineageEukaryotaFungiDikaryaAscomycotaSaccharomycotinaSaccharomycetesSaccharomycetalesSaccharomycetaceaeVanderwaltozyma

Protein attributes

Sequence length1074 AA.
Sequence statusComplete.
Protein existenceInferred from homology

General annotation (Comments)

Function

Catalytic component of the COMPASS (Set1C) complex that specifically mono-, di- and trimethylates histone H3 to form H3K4me1/2/3, which subsequently plays a role in telomere length maintenance and transcription elongation regulation By similarity. PIRNR PIRNR037104

Catalytic activity

S-adenosyl-L-methionine + L-lysine-[histone] = S-adenosyl-L-homocysteine + N(6)-methyl-L-lysine-[histone]. PIRNR PIRNR037104 SAAS SAAS001214

Subunit structure

Component of the COMPASS (Set1C) complex By similarity. PIRNR PIRNR037104

Subcellular location

Nucleus By similarity PIRNR PIRNR037104.

Sequence similarities

Contains 1 SET domain. RuleBase RU004538

Contains SET domain. SAAS SAAS001214

Contains post-SET domain. SAAS SAAS001214

Sequences

Sequence LengthMass (Da)Tools
A7TGI1 [UniParc].

Last modified October 2, 2007. Version 1.
Checksum: 5869A9997B385594

FASTA1,074122,913
        10         20         30         40         50         60 
MSGSYRRPYQ SSGPSHRQYQ YNQQQRPYAN QYNGRRNNYD FYNDHRQGHY GGHSYNSSYG 

        70         80         90        100        110        120 
GNGYPYRHGM SNSPMHSNNS SNSSNSNNNN NNNNNINNIH NTDNGGAMDS NDYYNSIGDI 

       130        140        150        160        170        180 
RSYSGVSRNN ESKEFLKLVR PQPVIRYDTN FYKSKYHYFD PVTKMLIHQQ EMSNWRNNEE 

       190        200        210        220        230        240 
MPTNGFVLVH ENHNGAIRSI MRGIDYNGCS KDPRTSSNFT SNTSRSKVKN TLKTLPSIPY 

       250        260        270        280        290        300 
DKFSVGPPPP TEIVIYPAST INMISELSVK NYFKEFGEIS HFEVFNDPNS ALSLNVYLVR 

       310        320        330        340        350        360 
YNSPDGKLKN AIKAAKYAVQ KHEEKGCSIL GCHFNVVLNK DNVIKSIISK FVQENLKKAK 

       370        380        390        400        410        420 
DINIDERNKN DKERKIVDIA PKNNFQDRRI PLDIKDIVNN RPVLFVPKSF TSIHSFRVED 

       430        440        450        460        470        480 
FKLKLRKYRW ARILDHQAGI YIVFNDLVHA RSCMNAESGI MTLISRSRNI PIEVRLKLLA 

       490        500        510        520        530        540 
PEPIPTPVSK PTKSIVGELK FPTKPAAKVY SSKKDIVATA MKMITRDLEN ALSIDIRRKI 

       550        560        570        580        590        600 
IGPVVFDGLN SSNYPELVKK KELNELEKLN KKKDLETKKT EAKEERSKFD LFDLYGGYVK 

       610        620        630        640        650        660 
SNKKRRASDN LEKANRKRRG SIDNKPTAHM LNEDTISKES TPLEYSASLS RFSGDEALLS 

       670        680        690        700        710        720 
EESSIESEDE VTESTKSEEE TIKEEETKFD EEKEAGIGKE LNVEEKEGEE EEFSHAKVEN 

       730        740        750        760        770        780 
ELSNQIYAPI ASVYPEPVYS YELFELESES AVISVEDLQS VVKDEEDMKY LRKVCETDSI 

       790        800        810        820        830        840 
PTVSAQIAAS FEYELWKLRR FNKMKAKISE KHLELNEVPY DSSLDNGDKA FKAVGFKKVP 

       850        860        870        880        890        900 
DRIKSCYLPH RRKLHQPLNT VNIHNDTLEK QRSNQADDMD KADSSEVSAE VSSSRVNRAI 

       910        920        930        940        950        960 
NRRFQQDIEA QKAAIGTESE LLSLNQLNKR KKPVTFARSA IHNWGLYALE PIAAKEMIIE 

       970        980        990       1000       1010       1020 
YVGERIRQPV AEMRERRYIK NGIGSSYLFR VDENTVIDAT KRGGIARFIN HCCDPSCTAK 

      1030       1040       1050       1060       1070 
IIKVGGMKRI VIYALRDIAS NEELTYDYKF EREMDDKERL PCLCGAATCK GFLN 

« Hide

References

[1]"Independent sorting-out of thousands of duplicated gene pairs in two yeast species descended from a whole-genome duplication."
Scannell D.R., Frank A.C., Conant G.C., Byrne K.P., Woolfit M., Wolfe K.H.
Proc. Natl. Acad. Sci. U.S.A. 104:8397-8402(2007) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: ATCC 22028 / DSM 70294.

Cross-references

Sequence databases

EMBL
GenBank
DDBJ
DS480387 Genomic DNA. Translation: EDO18588.1.
RefSeqXP_001646446.1. XM_001646396.1.

3D structure databases

ProteinModelPortalA7TGI1.
SMRA7TGI1. Positions 251-363.
ModBaseSearch...
MobiDBSearch...

Protein-protein interaction databases

STRING436907.A7TGI1.

Protocols and materials databases

StructuralBiologyKnowledgebaseSearch...

Genome annotation databases

GeneID5546888.
KEGGvpo:Kpol_1048p18.

Phylogenomic databases

eggNOGCOG2940.
KOK11422.
OrthoDBEOG7P8PH3.

Family and domain databases

InterProIPR024657. COMPASS_Set1_N-SET.
IPR017111. Hist_H3-K4_MeTrfase_1_fun.
IPR015722. Histone-lysine_MeTfrase.
IPR003616. Post-SET_dom.
IPR024636. SET_assoc.
IPR001214. SET_dom.
[Graphical view]
PANTHERPTHR22884:SF10. PTHR22884:SF10. 1 hit.
PfamPF11764. N-SET. 1 hit.
PF00856. SET. 1 hit.
PF11767. SET_assoc. 1 hit.
[Graphical view]
PIRSFPIRSF037104. Histone_H3-K4_mtfrase_Set1_fun. 1 hit.
SMARTSM00508. PostSET. 1 hit.
SM00317. SET. 1 hit.
[Graphical view]
PROSITEPS50868. POST_SET. 1 hit.
PS51572. SAM_MT43_1. 1 hit.
PS50280. SET. 1 hit.
[Graphical view]
ProtoNetSearch...

Entry information

Entry nameA7TGI1_VANPO
AccessionPrimary (citable) accession number: A7TGI1
Entry history
Integrated into UniProtKB/TrEMBL: October 2, 2007
Last sequence update: October 2, 2007
Last modified: April 16, 2014
This is version 42 of the entry and version 1 of the sequence. [Complete history]
Entry statusUnreviewed (UniProtKB/TrEMBL)