Skip Header

You are using a version of Internet Explorer that may not display all features of this website. Please upgrade to a modern browser.
Contribute Send feedback
Read comments (?) or add your own

F5HIR5 (F5HIR5_ANOGA) Unreviewed, UniProtKB/TrEMBL

Last modified July 9, 2014. Version 19. Feed History...

Clusters with 100%, 90%, 50% identity | Third-party data text xml rdf/xml gff fasta
to top of pageNames·Attributes·General annotation·Ontologies·Sequences·References·Cross-refs·Entry infoCustomize order

Names and origin

Protein namesRecommended name:
Histone-lysine N-methyltransferase, H3 lysine-79 specific RuleBase RU271113

EC=2.1.1.43 RuleBase RU271113
Alternative name(s):
Histone H3-K79 methyltransferase RuleBase RU271113
Gene names
ORF Names:AgaP_AGAP003282 EMBL EGK96176.1
OrganismAnopheles gambiae (African malaria mosquito) [Reference proteome]
Taxonomic identifier7165 [NCBI]
Taxonomic lineageEukaryotaMetazoaEcdysozoaArthropodaHexapodaInsectaPterygotaNeopteraEndopterygotaDipteraNematoceraCulicoideaCulicidaeAnophelinaeAnopheles

Protein attributes

Sequence length2545 AA.
Sequence statusComplete.
Protein existenceInferred from homology

General annotation (Comments)

Catalytic activity

S-adenosyl-L-methionine + L-lysine-[histone] = S-adenosyl-L-homocysteine + N(6)-methyl-L-lysine-[histone]. RuleBase RU271113

Subcellular location

Nucleus By similarity RuleBase RU271113.

Miscellaneous

In contrast to other lysine histone methyltransferases, it does not contain a SET domain, suggesting the existence of another mechanism for methylation of lysine residues of histones By similarity. RuleBase RU271113

Sequence similarities

Belongs to the class I-like SAM-binding methyltransferase superfamily. DOT1 family. RuleBase RU271113

Contains 1 DOT1 domain. RuleBase RU271113

Caution

The sequence shown here is derived from an EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is preliminary data. EMBL EGK96176.1

Ontologies

Sequences

Sequence LengthMass (Da)Tools
F5HIR5 [UniParc].

Last modified July 27, 2011. Version 1.
Checksum: F8BF3D8C7076E487

FASTA2,545268,957
        10         20         30         40         50         60 
MATPNYKELK LQSPAGAEPF LYNFPFSTVM GTGHDSGAEL IENVRWVCED MPEIKSAIEE 

        70         80         90        100        110        120 
IDLNNLDTNN YDAMKNLCDR FNRAIDSVAA LEKGTSLSNQ RFTYPSRGLL KHILQQVYNQ 

       130        140        150        160        170        180 
AVVEPEKLNQ YEPFSPEVYG ETSFDLICQM IDQVKITADD VFVDLGSGVG QVVLQMAAST 

       190        200        210        220        230        240 
PVKVCFGIEK ADVPSKYAEG MNTTFKLWMR WFGKKYGDYE LIKGDFLADE YREKITSATI 

       250        260        270        280        290        300 
VFVNNFAFGP NVDHQLKERF ADLRDGARIV SSKSFCPLNF RITDRNLSDI GTIMHVSEMS 

       310        320        330        340        350        360 
PLRGSVSWTG KPVSYYLHII DRTKLERYFQ RLKTKGTENH TDGTSGGGSG SHSTRSSRSR 

       370        380        390        400        410        420 
KDNNTNHHHH HPKVITNDDS TSESDTDVVG PTTRKAWSDW NSGKEGKTSP SEEENNNSPV 

       430        440        450        460        470        480 
LRNGRIPVAT KKRRKITRTK AAGKKAELAA ASAAAAAAAA AAAAAANRDM GVGTSAASAA 

       490        500        510        520        530        540 
AAAAAAMAGG KKRGRVKGKG RQRRPLNIAG LDLLHNETLL STSEQMIGKR LPPAPGCVDQ 

       550        560        570        580        590        600 
QLTSLAGDMQ HNELDIPEAP SETPYALQIL LDLYKTQFMK TIEAMRKPSY KDNVQQQFDR 

       610        620        630        640        650        660 
EKERNQRLMN RAGQLEKQIK VLIDDSVALL KARMNELGIS TTSQNDLLCK AKEIVGRHKE 

       670        680        690        700        710        720 
LQVMAAKLQN QVNVIEQEQK RLVMQHLTKL TVEQQQQHQQ QQQHQQLHPH QPYIKQEDAE 

       730        740        750        760        770        780 
LTSSSSNELV LKAIASTLSQ RKKLYAQVSN LETELNLIEK LTEERKSMVV GLSAAAPNSN 

       790        800        810        820        830        840 
HNSASSTAAA AAAAATSTII SVARATEVRD REQYGHAGQL HPATIGGGRE REHIHPDGTT 

       850        860        870        880        890        900 
TSKHTSDPVR TLPAGAAAGP VGVAGSAAPL PPPPMAGVGS ASVTVPSTPT KQGSGGGSSR 

       910        920        930        940        950        960 
SAQRKSRENR TRSQEWPEIP DIGKIEENNP EILAQKILET GRQIEAGKLF AAGKHASKER 

       970        980        990       1000       1010       1020 
SSEGKLAPVV TAGHGSSAAA GTAAPQHIAH PPTVGAPAPQ QQPYPHHHQQ PHSQPSQPPV 

      1030       1040       1050       1060       1070       1080 
HPHLHHPDTA LMPAPASTIN KAHHHHRSNS GGPSGAVPPP VPTGGGSLPK CFVPGTASNE 

      1090       1100       1110       1120       1130       1140 
HHSGGGRNAP EGASGTGRGG GSGGGGGGGK LQDSHKVVNF EDRLKSIITS VLQGSPKTGN 

      1150       1160       1170       1180       1190       1200 
TASSASAPVS LTVTGPPPAG PSPGAGHRDA HHRSGSGGHQ PLTMEPAGSP LKATSASGAG 

      1210       1220       1230       1240       1250       1260 
YGSAAGPGKT TVYLQSSPGA GSHHPHQMVV AQDMSARSST GGRGPSPSAH NPAHQQQIHP 

      1270       1280       1290       1300       1310       1320 
HQVSQSQYQQ QQQLLHHPHH PHAQQSLHHQ QQQHQHYQMQ QQQQHQHQQQ QQGMLNVITS 

      1330       1340       1350       1360       1370       1380 
GAHHLNASTS ISTSPVPTSP YKMHPTGPPA SSIAGSASTK ISPSSKYPYP KGGPVAGMSH 

      1390       1400       1410       1420       1430       1440 
HSPGSSALST HQQIIQQQER ERAMLYAAAA HGGGLPIDHP HHPLHQHHHR GMPPSLVDGK 

      1450       1460       1470       1480       1490       1500 
MLEFKAPENF RYDPRASGPS GAAGNGPPML DTSAVSLQSH SRSSSTNSLD SIPAADYGPA 

      1510       1520       1530       1540       1550       1560 
SAGASAVGGG GARYGGQQQP IPLVTHSPGV GSQGSGQQHG GNNSRPGSTS SQPDYTQVSP 

      1570       1580       1590       1600       1610       1620 
AKMALRRHLS QEKLTHPSAG GPAAGGPVSG GTGAGGSGGG LGTVKTIGDL VNGEIERTLE 

      1630       1640       1650       1660       1670       1680 
ISNQSIINAA INMSSHQQQQ QSSAASGNSS APSASGNNTV INTHVQRPER VSIRLLEEAG 

      1690       1700       1710       1720       1730       1740 
HAAAGGPPPP GSSGGTYSPI SRPGSVGDSG SKSPVHHLHG QSNLASLVQV SAYNSKNHKG 

      1750       1760       1770       1780       1790       1800 
AGTTAVPSTI VSPRGGGSSQ QQQHQQYSTA QGSGAAGGPG AVYQQSTSRG HDRHHSGEPM 

      1810       1820       1830       1840       1850       1860 
PYMALPRADM KPYLESYFTD EHNKRQQQLH QQQQQHQQHP QHQQQHQQQH LQHPVQSPSP 

      1870       1880       1890       1900       1910       1920 
SAMSASAMGL HHHQQQQMHH QHAPLAMHRA SHPVDLHRGS VVMNEPGMLS RSRGEMIPMD 

      1930       1940       1950       1960       1970       1980 
DNRMDRLNGG PPLEGLAASL QARVIATLKI KEEDEERHRR DLSIHHSTSG TINSSSNSIN 

      1990       2000       2010       2020       2030       2040 
STNSNIHGGS LQIVQTAHIK SEKYTSSSSS SSTSSTSSTS SHHHHALKRT SPIVEHPAGT 

      2050       2060       2070       2080       2090       2100 
RPPKMLYTTS ASAGGMDCGP DADMLHVPRG TVPGSSVVGH SAVRGGLLVA PPLVMSPEIN 

      2110       2120       2130       2140       2150       2160 
SLSSVVDDGR HHHQLHVRHN HHSRNDVDDD VVIGDDESSW HDRVSSGFDR LVAFASTELD 

      2170       2180       2190       2200       2210       2220 
KTRRSNEDAP ASSASCTTSP DSGINQSDHS RTFLSSSSSS SQLDVPPGSA SVGSSSSSSI 

      2230       2240       2250       2260       2270       2280 
SSSSSSNSST SSIGSVTGGH GGGSGGSHGV GGHAGVPGGL MMKHMVPIIK SSPAEPVDSP 

      2290       2300       2310       2320       2330       2340 
PLSDVGLPRT PSPTSSPPLL FGHPTASSTA IAGGATTSAV STVMQPSLLH PPVVGGHAPV 

      2350       2360       2370       2380       2390       2400 
AGSGNNNGVV PPAAGSAAPV NNSNSGNSSS SSSSLGIPLK YQRQSKSSSS SSEKHYKKKF 

      2410       2420       2430       2440       2450       2460 
RERNWEEYEE SLSGGRNSAI SGSGDVPMDQ QDYHHTVASL NAPHSGSMDV GNASAPSSST 

      2470       2480       2490       2500       2510       2520 
SMSAEPIGSA VGDKRNDDHA AQHHHQQQQQ QQQQQQQQHH HKHKSAKFRP KGKDWNWDDE 

      2530       2540 
HLNASSGSAT STRGAGRSGT NTAST 

« Hide

References

[1]"The genome sequence of the malaria mosquito Anopheles gambiae."
Holt R.A., Subramanian G.M., Halpern A., Sutton G.G., Charlab R., Nusskern D.R., Wincker P., Clark A.G., Ribeiro J.M.C., Wides R., Salzberg S.L., Loftus B.J., Yandell M.D., Majoros W.H., Rusch D.B., Lai Z., Kraft C.L., Abril J.F. expand/collapse author list , Anthouard V., Arensburger P., Atkinson P.W., Baden H., de Berardinis V., Baldwin D., Benes V., Biedler J., Blass C., Bolanos R., Boscus D., Barnstead M., Cai S., Center A., Chaturverdi K., Christophides G.K., Chrystal M.A.M., Clamp M., Cravchik A., Curwen V., Dana A., Delcher A., Dew I., Evans C.A., Flanigan M., Grundschober-Freimoser A., Friedli L., Gu Z., Guan P., Guigo R., Hillenmeyer M.E., Hladun S.L., Hogan J.R., Hong Y.S., Hoover J., Jaillon O., Ke Z., Kodira C.D., Kokoza E., Koutsos A., Letunic I., Levitsky A.A., Liang Y., Lin J.-J., Lobo N.F., Lopez J.R., Malek J.A., McIntosh T.C., Meister S., Miller J.R., Mobarry C., Mongin E., Murphy S.D., O'Brochta D.A., Pfannkoch C., Qi R., Regier M.A., Remington K., Shao H., Sharakhova M.V., Sitter C.D., Shetty J., Smith T.J., Strong R., Sun J., Thomasova D., Ton L.Q., Topalis P., Tu Z.J., Unger M.F., Walenz B., Wang A.H., Wang J., Wang M., Wang X., Woodford K.J., Wortman J.R., Wu M., Yao A., Zdobnov E.M., Zhang H., Zhao Q., Zhao S., Zhu S.C., Zhimulev I., Coluzzi M., della Torre A., Roth C.W., Louis C., Kalush F., Mural R.J., Myers E.W., Adams M.D., Smith H.O., Broder S., Gardner M.J., Fraser C.M., Birney E., Bork P., Brey P.T., Venter J.C., Weissenbach J., Kafatos F.C., Collins F.H., Hoffman S.L.
Science 298:129-149(2002) [PubMed] [Europe PMC] [Abstract]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: PEST EMBL EGK96176.1.

Cross-references

Sequence databases

EMBL
GenBank
DDBJ
AAAB01008794 Genomic DNA. Translation: EGK96176.1.
RefSeqXP_003436434.1. XM_003436386.1.

3D structure databases

ModBaseSearch...
MobiDBSearch...

Protocols and materials databases

StructuralBiologyKnowledgebaseSearch...

Genome annotation databases

EnsemblMetazoaAGAP003282-RB; AGAP003282-PB; AGAP003282.
GeneID1269184.
KEGGaga:AgaP_AGAP003282.

Phylogenomic databases

OrthoDBEOG7W6WJV.
PhylomeDBF5HIR5.

Family and domain databases

Gene3D3.40.50.150. 1 hit.
InterProIPR013110. DOT1.
IPR025789. Histone_H3-K79_MeTrfase.
IPR029063. SAM-dependent_MTases-like.
[Graphical view]
PfamPF08123. DOT1. 1 hit.
[Graphical view]
SUPFAMSSF53335. SSF53335. 1 hit.
PROSITEPS51569. DOT1. 1 hit.
[Graphical view]
ProtoNetSearch...

Entry information

Entry nameF5HIR5_ANOGA
AccessionPrimary (citable) accession number: F5HIR5
Entry history
Integrated into UniProtKB/TrEMBL: July 27, 2011
Last sequence update: July 27, 2011
Last modified: July 9, 2014
This is version 19 of the entry and version 1 of the sequence. [Complete history]
Entry statusUnreviewed (UniProtKB/TrEMBL)