Skip Header

 
Contribute Send feedback

Unreviewed, UniProtKB/TrEMBL Q9U999 (Q9U999_DROME)

Last modified October 14, 2008. Version 37. Feed History...

Clusters with 100%, 90%, 50% identity | Third-party data | Customize display text xml rdf/xml gff fasta
Names and origin · Protein attributes · Ontologies · Sequences · References · Cross-references · Entry information

Names and origin

Protein namesSubmitted name:
    Huntingtin homolog EMBL AAD51369.1
Gene names
Name: htt FlyBase FBgn0027655
Synonyms: huntingtin FlyBase FBgn0027655
ORF Names: CG9995 FlyBase FBgn0027655
OrganismDrosophila melanogaster (Fruit fly) [Complete proteome] EMBL AAD51369.1
Taxonomic identifier7227 [NCBI]
Taxonomic lineageEukaryotaMetazoaArthropodaHexapodaInsectaPterygotaNeopteraEndopterygotaDipteraBrachyceraMuscomorphaEphydroideaDrosophilidaeDrosophilaSophophora

Protein attributes

Sequence length3584 AA.
Sequence statusComplete.
Sequence processingThe displayed sequence is not processed.
Protein existenceEvidence at transcript level.

Ontologies

Keywords

   Technical termComplete proteome

Gene Ontology (GO)

   Cellular componentcytoplasm

Inferred from electronic annotation. Source: InterPro

nucleus

Inferred from electronic annotation. Source: InterPro

   Molecular functionbinding

Inferred from electronic annotation. Source: InterPro

Complete GO annotation...

Sequences

Sequence LengthMass (Da)Tools
Q9U999-1 [UniParc].

Last modified May 1, 2000. Version 1.
Checksum: DB7C7FB7764BCA6E

FASTA3,584395,858
        10         20         30         40         50         60 
MDKSRSSAYD KFVGFVEQLR NTECSQKQKI TCFQQIAECI MSPSLAGHIN YAAHCGTATN 

        70         80         90        100        110        120 
VLLLFCEDVD SVVRMSAEEN LNKILRSLEK TRVSRILMDL YGEIKRNGNQ RSLRICLNLF 

       130        140        150        160        170        180 
SYYAPQIKEK HIKWYAVRLL QCMTTISQRK ETLLQETLCD FVKHFSRHIQ QGLSDSESCK 

       190        200        210        220        230        240 
LFETFLDQIS SDCAVKRRCS AQNCMSLIEN ARNRSLMARH GVNKVMELLL TDQQANSVLG 

       250        260        270        280        290        300 
ALGLLRLLLP QLIRGYPGDS HEDSESLAGK KQQQQQTTTS DCRQIIEIYD YCLHLLSTQH 

       310        320        330        340        350        360 
TANHAIINAT LEVINGILQA VDAASDGQCS QSLGQSLRQL LCNQQLQHNE YLRRRKSLKN 

       370        380        390        400        410        420 
QIFQLKNYEV ATSQHQLEDE DENEDVDELV VGATAMQMKK NSNAKLQQAK CREQQQHQHQ 

       430        440        450        460        470        480 
QQLEVDNSSL GINAGEDAPT EAPSSVADEG GEPESTKLRC HIRNAARSIS ECVASDEDKQ 

       490        500        510        520        530        540 
GQGHRQQRDE DGVVVAEDDD DDDDDDDDDD DMELLSAECD DFTTLSQLNE QQQALSAALK 

       550        560        570        580        590        600 
LPTTTAASSG GAATSQDDKL IDVDADVGGL PKPQHQSSLQ NLLAGSDDKS QHLSDIDNES 

       610        620        630        640        650        660 
FNSIDFDAEI TIAGSKEQQQ QHPPADDSVE SGDATAIGTF FNNLLSHSNA ASESVSKLFR 

       670        680        690        700        710        720 
QSSGSKSTPS KSASTPAPAD KSDAISAASL TLSLTSLASS NLEPPERQPL IAETPTPVED 

       730        740        750        760        770        780 
SCSITASHTA STALMMDAPA VEVAASKPET PQLRGTPNAN PFLVENSPLR QTVVGRALIT 

       790        800        810        820        830        840 
VKIGSILEQS LVYYTARLVA ARFLLSGQAA GLQPDSISRV SIKSLSLAVI AQCVRLAPKI 

       850        860        870        880        890        900 
LQLSLEISEQ ELQLLEEATS QIGSGDSTQV SSPQSSDNSQ VGGEKPPLDS SLVPTSLEEN 

       910        920        930        940        950        960 
LLLLDIKDDH FGPSTCPAYL QSATPTLSRS ADASVLLLEV GTTSSRSAKK SEEMLSKSEI 

       970        980        990       1000       1010       1020 
IESSYRPTVA VEDVPPLSMP PRPPKRTKST RSRVGVLGTS STTESSSPQS RQKLSDILLF 

      1030       1040       1050       1060       1070       1080 
HDHCDPILRG GVQQVVGNFL QSSGAGLFLD LQRGLGLQHL LAILLKGFED EIHTVVIQAL 

      1090       1100       1110       1120       1130       1140 
NAFDKIFPNV VSKYLTEPPC HYHAHQQQQQ QQKEQQQQEQ DNQKLEQDLQ RHSSGQQKRS 

      1150       1160       1170       1180       1190       1200 
GQAQTFGQQT FAKDQDNALS SQRQQQRRPN DAGTCANSSA TDNDELLAAL LNDFQLQSTG 

      1210       1220       1230       1240       1250       1260 
MRQQPKNNST DIGQSGNEPD LEPNPNAAVE PFCVFAISPK LLLSKLRLCH HNKYWLVQNK 

      1270       1280       1290       1300       1310       1320 
YAEVISNLNY VLLRSYYANF RCAIDNKNSG ARKQDSKWPP MDASSVCHSV RDAEGEDIVC 

      1330       1340       1350       1360       1370       1380 
TYEAQFLAEL LHLLGDDDAR VREHAACCLC RFIMQTARQD PSQDQGAGGG GGDDIEGNGN 

      1390       1400       1410       1420       1430       1440 
VNVETQQTNF NLLWDFFDYR IFGSMSVTLR NLFRASSTIV PPLAELDALA TSNSAPSYPD 

      1450       1460       1470       1480       1490       1500 
TGSTSGSSTS TSASSGGSAA AVSAASAYFE ASYGIGIAEG HVFALASASQ RQIAQEEKVL 

      1510       1520       1530       1540       1550       1560 
AKVLYRLTNK LMTLNDKNVQ FGIIYALRLL LRHFNFVDYQ QAWLEFNFVE ICISYAYYNN 

      1570       1580       1590       1600       1610       1620 
ATAADLGCQN DLIDVMGKLM AGAMLSSGEP NTAHLDFLLR HSVKMLNIYY HLVTNQRPPT 

      1630       1640       1650       1660       1670       1680 
AGSQSGSSSS KQPKSELFAR EQPAATLQAL GYFAGDYVYM KLYNILRGAN DSYKITINQE 

      1690       1700       1710       1720       1730       1740 
AGSLLICLLK TCLHAVSLCL EGMASASPPE LKLIEEILHY LTRLINYAPA ECVACLRQLL 

      1750       1760       1770       1780       1790       1800 
KYLFAQNYAS QVRLQPSAGI GGNGTGIGHH AAFMRPYFAA KGRGHGASST LLPTINSKPA 

      1810       1820       1830       1840       1850       1860 
VAVGSQRGAP TDARQPIDAG PLQDMGMLFV HGLQPPTPPA GDCVRLIKLF EPMVIYCLTL 

      1870       1880       1890       1900       1910       1920 
FMKSNALVQA PILRLLSQLL DLNVTYSILD SKNVIFDQVL SNMDLIEGGI DRNAFIMVPP 

      1930       1940       1950       1960       1970       1980 
MLRFLVQLTH KSDRQLITIP KIISITNNLL ANGSVRVVAL LALKTLSYEL FFMHSQLEEA 

      1990       2000       2010       2020       2030       2040 
LDTEGHNSGR DACQSPLSAA PTPETREALL AQRRELDTQR EVVLGMLEKF IEARPSQQVL 

      2050       2060       2070       2080       2090       2100 
ALLLLFERSV QQLDTPPYRS AQDADAVYGT LCRGLCSRQW RLHNAGDLRL LESCFRNNGN 

      2110       2120       2130       2140       2150       2160 
HVLADSKRFL QLLQLFIEQG VGNFGDLALA MVMLSNVILK TEEIYLVNHI KLYLKNNPTA 

      2170       2180       2190       2200       2210       2220 
ERRLQALMPS SPSAAPHWQD EAPSTSSAAA AARAAAASFS AGRSSISEIN YFAKVLCEKL 

      2230       2240       2250       2260       2270       2280 
LACLEVLLGL EPSSSSHAYC QLTGRFMDAL LNVCCRSRHK DALQSVFRLV LAESEFLCKY 

      2290       2300       2310       2320       2330       2340 
YSLLLMSAAG LVGSHSLDAV LLACLRLVLA MRLEEPVALL EQAAQLPLKT NLQRALLREV 

      2350       2360       2370       2380       2390       2400 
CRASAGCDWS AQQVRRLFEG RYLNFLIADH LEFICELCQE RPECGSLLQV ALFRNAHRLS 

      2410       2420       2430       2440       2450       2460 
RQSVRIVLRL LGRLCEPAER EASGDVGSDA DPTLQAMQLL SRLHQLYEGE RSLQLPIERM 

      2470       2480       2490       2500       2510       2520 
ARRLSAQSGQ PARNVIYERL VEGDLAGGED QDALRTLLLK DLECRQDNET ATPSRIIDES 

      2530       2540       2550       2560       2570       2580 
WLFAQLIKFA TQHADAPQQQ KQLMLLLLEI QSEPKLQRLL RSLGTEHEAK LLRHAIAGSL 

      2590       2600       2610       2620       2630       2640 
AAMMSAFRQK CIQHAPHINY MQPTPLARVS CALLMSRVAS TEATKCRNPP TGEQLDVARA 

      2650       2660       2670       2680       2690       2700 
VGALMACIRN AEQTALIYID ARLMEKFVVE HLLRREHLPQ LLAYLGWLAG AAKQILAMPT 

      2710       2720       2730       2740       2750       2760 
RQESEQDALG VLLATVNTLL QQPRVWRELN ASSDPSLRCE LLDLLDSVAR CILQDTIFYR 

      2770       2780       2790       2800       2810       2820 
RHRRDRNKAK GPAPQAIFLA KLIETQIEIE SLASGRVLAG VEEARLQFAG QDLARFQVAL 

      2830       2840       2850       2860       2870       2880 
SLVTSIGISL LRTHQFYAYA VTPHELIQQP GDQQQEQQAD GKLPSIPVDS LSDVDVLRQF 

      2890       2900       2910       2920       2930       2940 
VKRLSIFGFT TRQQFEEYFM TCLLLINKLY DEHMVDQQEQ FQIKQVCLQA ILELLMTYKT 

      2950       2960       2970       2980       2990       3000 
FPIVGMANGQ FHHTTRWQRI TCDSISLKKL HKVQLLVDAC NVFYQPNLER QLAYDNVIGT 

      3010       3020       3030       3040       3050       3060 
RTFAPNQYDL NFSWAQMEDQ AAAGVGVGVG VGLSGGEQAN TADIKQSCDA DVPDMAMRNY 

      3070       3080       3090       3100       3110       3120 
RHFTQLSGID FRSSTQLVFD VLQQMIELNH ILVLPNLVKF CEICESRDHI KWIKERCLKL 

      3130       3140       3150       3160       3170       3180 
QEQVAMDDTI SHQHIIYLLC RSQALLIPSL GELQVLCSLI GNVYLKSTHS FIRIATLQGL 

      3190       3200       3210       3220       3230       3240 
LCLLECCSKT NTTMGRLSEE LALLRSLIVG YINRHGIIDE SPLPFSVEHT KLVWTLNYSL 

      3250       3260       3270       3280       3290       3300 
IEWTSKFVPQ CHLLSNTIIA ANNFLKTTAD EELYLCVLHG LERMVVNSGV PPPGIQPTGK 

      3310       3320       3330       3340       3350       3360 
DAAAGEPGAE GSKAGVGVVV TPQMRHKIEK LALELLKMEN EKFSIPALKL LLSCMYVGSA 

      3370       3380       3390       3400       3410       3420 
AQLENTELSN GIVQDDPEII AQQNDKVDIL LHCIKSSTRD AAWIYGQVLC QIIRDLVPPN 

      3430       3440       3450       3460       3470       3480 
EILTKVIKEF LAINHPHCDV IAMIVYQVFR SAIDSSYLQM LQDWLICTLP TFLDQPEQQG 

      3490       3500       3510       3520       3530       3540 
VWGLSVIFLS ASINLHLIKL FPLVLGIGAS NSAAAATTAT TATTEAEAAA PAMARKLGQH 

      3550       3560       3570       3580 
EIALFVTAAQ DFHAKLSGEQ RQRFREAFGS FKRSQVYGRM LQCL 

« Hide

References

[1]"Drosophila HD homolog."
Takano H., Bernards A., Gusella J.F.
Submitted (AUG-1999) to the EMBL/GenBank/DDBJ databases
Cited for: NUCLEOTIDE SEQUENCE.

Cross-references

Sequence databases

AF177386 mRNA. Translation: AAD51369.1.

3D structure databases

ModBaseSearch...

Organism-specific databases

FlyBaseFBgn0027655. htt.

Phylogenomic databases

HOGENOMQ9U999.

Gene expression databases

ArrayExpressQ9U999.

Family and domain databases

InterProIPR000357. HEAT.
IPR000091. Huntingtin.
[Graphical view]
PANTHERPTHR10170. Huntingtin. 1 hit.
PfamPF02985. HEAT. 2 hits.
[Graphical view]

Entry information

Entry nameQ9U999_DROME
AccessionPrimary (citable) accession number: Q9U999
Entry history
Integrated into UniProtKB/TrEMBL: May 1, 2000
Last sequence update: May 1, 2000
Last modified: October 14, 2008
This is version 37 of the entry and version 1 of the sequence. [Complete history]
Entry statusUnreviewed (UniProtKB/TrEMBL)
Names and origin · Protein attributes · Ontologies · Sequences · References · Cross-references · Entry information