Q9V3N4 (Q9V3N4_DROME) Unreviewed, UniProtKB/TrEMBL

Last modified January 25, 2012. Version 76.

Names and origin

Protein names

Huntington disease protein homolog EMBL AAF03255.1
Gene names
Name: htt FlyBase FBgn0027655
Synonyms: Hsap\HD EMBL AAF03255.1, huntingtin FlyBase FBgn0027655
ORF Names: CG9995 FlyBase FBgn0027655, Dmel_CG9995 EMBL AAF56808.1
Organism: Drosophila melanogaster (Fruit fly)
Taxonomic identifier: 7227 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Arthropoda > Hexapoda > Insecta > Pterygota > Neoptera > Endopterygota > Diptera > Brachycera > Muscomorpha > Ephydroidea > Drosophilidae > Drosophila > Sophophora

Protein attributes

Sequence length: 3583 AA.
Sequence status: Complete.
Protein existence: Evidence at transcript level

Ontologies

Keywords
   Technical term: Complete proteome, Reference proteome
Gene Ontology (GO)
   Biological process: establishment of mitotic spindle orientation
      Inferred from mutant phenotype. Source: FlyBase
   Cellular component: cytoplasm
      Inferred from direct assay. Source: FlyBase
   Cellular component: nucleus
      Inferred from electronic annotation. Source: InterPro
   Molecular function: binding
      Inferred from electronic annotation. Source: InterPro

Sequences

Sequence: Q9V3N4 [UniParc]
Length: 3,583 AA
Mass: 395,867 Da
Last modified May 1, 2000. Version 1.
Checksum: 98C83978C4F0A888
        10         20         30         40         50         60 
MDKSRSSAYD KFVGFVEQLR NTECSQKQKI TCFQQIAECI MSPSLAGHIN YAAHCGTATN 

        70         80         90        100        110        120 
VLLLFCEDVD SVVRMSAEEN LNKILRSLEK TRVSRILMDL YGEIKRNGNQ RSLRICLNLF 

       130        140        150        160        170        180 
SYYAPQIKEK HIKWYAVRLL QCMTTISQRK ETLLQETLCD FVKHFSRHIQ QGLSDSESCK 

       190        200        210        220        230        240 
LFETFLDQIS SDCAVKRRCS AQNCMSLIEN ARNRSLMARH GVNKVMELLL TDQQANSVLG 

       250        260        270        280        290        300 
ALGLLRLLLP QLIRGYPGDS HEDSESLAGK KQQQQQTTTS DCRQIIEIYD YCLHLLSTQH 

       310        320        330        340        350        360 
TANHAIINAT LEVINGILQA VDAASDGQCS QSLGQSLRQL LCNQQLQHNE YLRRRKSLKN 

       370        380        390        400        410        420 
QIFQLKNYEV ATSQHQLEDE DENEDVDELV VGATAMQMKK NSNAKLQQAK CREQQQHQHQ 

       430        440        450        460        470        480 
QQLEVDNSSL GINAGEDAPT EAPSSVADEG GEPESTKLRC HIRNAARSIS ECVASDEDKQ 

       490        500        510        520        530        540 
GQGHRQQRDE DGVVVAEDDD DDDDDDDDDD DMELLSAECD DFTTLSQLNE QQQALSAALK 

       550        560        570        580        590        600 
LPTTTAASSG GAATSQDDKL IDVDADVGGL PKPQHQSSLQ NLLAGSDDKS QHLSDIDNES 

       610        620        630        640        650        660 
FNSIDFDAEI TIAGSKEQQQ QHPPADDSVE SGDATAIGTF FNNLLSHSNA ASESVSKLFR 

       670        680        690        700        710        720 
QSSGSKSTPS KSASTPAPAD KSDAISAASL TLSLTSLASS NLEPPERQPL IAETPTPVED 

       730        740        750        760        770        780 
SCSITASHTA STALMMDAPA VEVAASKPET PQLRGTPNAN PFLVENSPLR QTVVGRALIT 

       790        800        810        820        830        840 
VKIGSILEQS LVYYTARLVA ARFLLSGQAA GLQPDSISRV SIKSLSLAVI AQCVRLAPKI 

       850        860        870        880        890        900 
LQLSLEISEQ ELQLLEEATS QIGSGDSTQV SSPQSSDNSQ VGGEKPPLDS SLVPTSLEEN 

       910        920        930        940        950        960 
LLLLDIKDDH FGPSTCPAYL QSATPTLSRS ADASVLLLEG GTTSSRSAKK SEEMLSKSEI 

       970        980        990       1000       1010       1020 
IESSYRPTVA VEDVPPLSMP PRPPKRTKST RSRVGVLGTS STTESSSPQS RQKLSDILLF 

      1030       1040       1050       1060       1070       1080 
HDHCDPILRG GVQQVVGNFL QSSGAGLFLD LQRGLGLQHL LAILLKGFED EIHTVVIQAL 

      1090       1100       1110       1120       1130       1140 
NAFDKIFPNV VSKYLTEPPC HYHAHQQQQQ QQKEQQQQEQ DNQKLEQDLQ RHSSGQQKRS 

      1150       1160       1170       1180       1190       1200 
GQAQTFGQQT FAKDQDNALS SQRQQQRRPN DAGTCANSSA TDNDELLAAL LNDFQLQSTG 

      1210       1220       1230       1240       1250       1260 
MRQQPKNNST DTGQSGNEPD LEPNPNAAVE PFCVFAISPK LLLSKLRLCH HNKYWLVQNK 

      1270       1280       1290       1300       1310       1320 
YAEVISNLNY VLLRSYYANF RCAIDNKNSG ARKQDSKWPP MDASSVCHSV RDAEGEDIVC 

      1330       1340       1350       1360       1370       1380 
TYEAQFLAEL LHLLGDDDAR VREHAACCLC RFIMQTARQD PSQDQGAGGG GGDDIEGNGN 

      1390       1400       1410       1420       1430       1440 
VNVETQQTNF NLLWDFFDYR IFGSMSVTLR NLFRASSTIV PPLAELDALA TSNSAPSYPD 

      1450       1460       1470       1480       1490       1500 
TGSTSGSSTS TSASSGGSAA AVSAASAYFE ASYGIGIAEG HVFALASASQ RQIAQEEKVL 

      1510       1520       1530       1540       1550       1560 
AKVLYRLTNK LMTLNDKNVQ FGIIYALRLL LRHFNFVDYQ QVWLEFNFVE ICISYAYYNN 

      1570       1580       1590       1600       1610       1620 
ATAADLGCQN DLIDVMGKLM AGAMLSSGEP NTAHLDFLLR HSVKMLNIYY HLVTNQRPPT 

      1630       1640       1650       1660       1670       1680 
AGSQSGSSSS KQPKSELFAR EQPAATLQAL GYFAGDYVYM KLYNILRGAN DSYKITINQE 

      1690       1700       1710       1720       1730       1740 
AGSLLICLLK TCLHAVSLCL EGMASASPPE LKLIEEILHY LTRLINYAPA ECVACLRQLL 

      1750       1760       1770       1780       1790       1800 
KYLFAQNYAS QVRLQPSAIG GNGSEIGHHA AFMRPYFAAK GRGHGASSTL LPTINSKPAV 

      1810       1820       1830       1840       1850       1860 
AVGSQRGAPT DARQPIDAGP LQDMGMLFVH GLQPPTPPAG DCVRLIKLFE PMVIYCLTLF 

      1870       1880       1890       1900       1910       1920 
MKSNALVQAP ILRLLSQLLD LNVTYSILDS KNVIFDQVLS NMDLIEGGID RNAFIMVPPM 

      1930       1940       1950       1960       1970       1980 
LRFLVQLTHK SDRQLITIPK IISITNNLLA NGSVRVVALL ALKTLSYELF FMHSQLEEAL 

      1990       2000       2010       2020       2030       2040 
DTEGHNSGRD ACQSPLSAAP TPETREALLA QRRELDTQRE VVLGMLEKFI EARPSQQVLA 

      2050       2060       2070       2080       2090       2100 
LLLLFERSVQ QLDTPPYRSA QDADAVYGTL CRGLCSRQWR LHNAGDLRLL ESCFRNNGNH 

      2110       2120       2130       2140       2150       2160 
VLADSKRFLQ LLQLFIEQGV GNFGDLALAM VMLSNVILKT EEIYLVNHIK LYLKNNPTAE 

      2170       2180       2190       2200       2210       2220 
RRLQALMPSS PSAAPHWQDE APSTSSAAAA ARAAAASFSA GRSSISEINY FAKVLCEKLL 

      2230       2240       2250       2260       2270       2280 
ACLEVLLGLE PSSSSHAYCQ LTGRFMDALL NVCCRSRHKD ALQSVFRLVL AESEFLCKYY 

      2290       2300       2310       2320       2330       2340 
SLLLMSAAGL VGSYLLDAVL LACLRLVLAM RLEEPVALLE QAAQLPLKTN LQRALLREVC 

      2350       2360       2370       2380       2390       2400 
RASAGCDWSA QQVRRLFEGR YLNFLIADHL EFICELCQER PECGSLLQVA LFRNAHRLSR 

      2410       2420       2430       2440       2450       2460 
QSVRIVLRLL GRLCEPAERE ASGDVGSDAD PTLQAMQLLS RLHQLYEGER SLQLPIERMA 

      2470       2480       2490       2500       2510       2520 
RRLSAQSGQP ARNVIYERLV EGDLAGGEDQ DALRTLLLKD LECRQDNETA TPSRIIDESW 

      2530       2540       2550       2560       2570       2580 
LFAQLIKFAT QHADAPQQQK QLMLLLLEIQ SEPKLQRLLR SLGTEHEAKL LRHAIAGSLA 

      2590       2600       2610       2620       2630       2640 
AMMSAFRQKC IQHAPHINYM QPTPLARVSC ALLMSRVAST EATKCRNPPT GEQLDVARAV 

      2650       2660       2670       2680       2690       2700 
GALMACIRNA EQTALIYIDA RLMEKFVVEH LLRREHLPQL LAYLGWLAGA AKQILAMPTR 

      2710       2720       2730       2740       2750       2760 
QESEQDALGV LLATVNTLLQ QPRVWRELNA SSDPSLRCEL LDLLDSVARC ILQDTIFYRR 

      2770       2780       2790       2800       2810       2820 
HRRDRNKAKG PAPQAIFLAK LIETQIEIES LASGRVLAGV EEARLQFAGQ DLARFQVALS 

      2830       2840       2850       2860       2870       2880 
LVTSIGISLL RTHQFYAYAV TPHELIQQPG DQQQEQQADG KLPSIPVDSL SDVDVLRQFV 

      2890       2900       2910       2920       2930       2940 
KRLSIFGFTT RQQFEEYFMT CLLLINKLYD EHMVDQQEQF QIKQVCLQAI LELLMTYKTF 

      2950       2960       2970       2980       2990       3000 
PIVGLANGQF HHTTRWQRIT CDSISLKKLH KVQLLVDACN VFYQPNLERQ LAYDNVIGTR 

      3010       3020       3030       3040       3050       3060 
TFAPNQYDLN FSWAQMEDQA AAGVGVGVGV GLSGGEQANT ADIKQSCDAD VPDMAMRNYR 

      3070       3080       3090       3100       3110       3120 
HFTQLSGIDF RSSTQLVFDV LQQMIELNHI LVLPNLVKFC EICESRDHIK WIKERCLKLQ 

      3130       3140       3150       3160       3170       3180 
EQVAMDDTIS HQHIIYLLCR SQALLIPSLG ELQVLCSLIG NVYLKSTHSF IRIATLQGLL 

      3190       3200       3210       3220       3230       3240 
CLLECCSKTN TTMGRLSEEL ALLRSLIVGY INRHGIIDES PLPFSVEHTK LVWTLNYSLI 

      3250       3260       3270       3280       3290       3300 
EWTSKFVPQC HLLSNTIIAA NNFLKTTADE ELYLCVLHGL ERMVVNSGVP PPGIQPTGKD 

      3310       3320       3330       3340       3350       3360 
AAAGEPGAEG SKAGVGVVVT PQMRHKIEKL ALELLKMENE KFSIPALKLL LSCMYVGSAA 

      3370       3380       3390       3400       3410       3420 
QLENTELSNG IVQDDPEIIA QQNDKVDILL HCIKSSTRDA AWIYGQVLCQ IIRDLVPPNE 

      3430       3440       3450       3460       3470       3480 
ILTKVIKEFL AINHPHCDVI AMIVYQVFRS AIDSSYLQML QDWLICTLPT FLDQPEQQGV 

      3490       3500       3510       3520       3530       3540 
WGLSVIFLSA SINLHLIKLF PLVLGIGASN SAAAATTATT ATTEAEAAAP AMARKLGQHE 

      3550       3560       3570       3580 
IALFVTAAQD FHAKLSGEQR QRFREAFGSF KRSQVYGRML QCL 
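
The listing above uses a display layout (a ruler line of positions, then six space-separated groups of ten residues per row) and is not a plain sequence string. A minimal sketch of how such a block could be flattened for downstream use is shown below. The function name `flatten_sequence_block` is hypothetical, and the sample contains only the first two rows of this entry.

```python
import re


def flatten_sequence_block(block: str) -> str:
    """Join a position-ruled, grouped sequence listing into one string.

    Hypothetical helper: assumes ruler lines contain only integers and
    sequence lines contain only uppercase residue letters.
    """
    residues = []
    for line in block.splitlines():
        tokens = line.split()
        # Skip blank lines and ruler lines (position numbers only).
        if not tokens or all(tok.isdigit() for tok in tokens):
            continue
        for tok in tokens:
            if not re.fullmatch(r"[A-Z]+", tok):
                raise ValueError(f"unexpected token in sequence line: {tok!r}")
            residues.append(tok)
    return "".join(residues)


# First two rows of the Q9V3N4 listing, copied from the entry above.
sample = "\n".join([
    "        10         20         30         40         50         60",
    "MDKSRSSAYD KFVGFVEQLR NTECSQKQKI TCFQQIAECI MSPSLAGHIN YAAHCGTATN",
    "",
    "        70         80         90        100        110        120",
    "VLLLFCEDVD SVVRMSAEEN LNKILRSLEK TRVSRILMDL YGEIKRNGNQ RSLRICLNLF",
])

sequence = flatten_sequence_block(sample)
print(len(sequence))   # 120 residues in this two-row sample
print(sequence[:10])   # MDKSRSSAYD
```

Applied to the full listing, the flattened string should have the 3583 residues stated under Protein attributes; a mismatch would suggest rows were lost or altered in copying.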

References

[1] "The genome sequence of Drosophila melanogaster."
Adams M.D., Celniker S.E., Holt R.A., Evans C.A., Gocayne J.D., Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A., Galle R.F., George R.A., Lewis S.E., Richards S., Ashburner M., Henderson S.N., Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q., Chen L.X., Brandon R.C., Rogers Y.H., Blazej R.G., Champe M., Pfeiffer B.D., Wan K.H., Doyle C., Baxter E.G., Helt G., Nelson C.R., Gabor G.L., Abril J.F., Agbayani A., An H.J., Andrews-Pfannkoch C., Baldwin D., Ballew R.M., Basu A., Baxendale J., Bayraktaroglu L., Beasley E.M., Beeson K.Y., Benos P.V., Berman B.P., Bhandari D., Bolshakov S., Borkova D., Botchan M.R., Bouck J., Brokstein P., Brottier P., Burtis K.C., Busam D.A., Butler H., Cadieu E., Center A., Chandra I., Cherry J.M., Cawley S., Dahlke C., Davenport L.B., Davies P., de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I., Dietz S.M., Dodson K., Doup L.E., Downes M., Dugan-Rocha S., Dunkov B.C., Dunn P., Durbin K.J., Evangelista C.C., Ferraz C., Ferriera S., Fleischmann W., Fosler C., Gabrielian A.E., Garg N.S., Gelbart W.M., Glasser K., Glodek A., Gong F., Gorrell J.H., Gu Z., Guan P., Harris M., Harris N.L., Harvey D., Heiman T.J., Hernandez J.R., Houck J., Hostin D., Houston K.A., Howland T.J., Wei M.H., Ibegwam C., Jalali M., Kalush F., Karpen G.H., Ke Z., Kennison J.A., Ketchum K.A., Kimmel B.E., Kodira C.D., Kraft C., Kravitz S., Kulp D., Lai Z., Lasko P., Lei Y., Levitsky A.A., Li J., Li Z., Liang Y., Lin X., Liu X., Mattei B., McIntosh T.C., McLeod M.P., McPherson D., Merkulov G., Milshina N.V., Mobarry C., Morris J., Moshrefi A., Mount S.M., Moy M., Murphy B., Murphy L., Muzny D.M., Nelson D.L., Nelson D.R., Nelson K.A., Nixon K., Nusskern D.R., Pacleb J.M., Palazzolo M., Pittman G.S., Pan S., Pollard J., Puri V., Reese M.G., Reinert K., Remington K., Saunders R.D., Scheeler F., Shen H., Shue B.C., Siden-Kiamos I., Simpson M., Skupski M.P., Smith T., Spier E., Spradling A.C., Stapleton M., Strong R., Sun E., Svirskas R., Tector C., Turner R., Venter E., Wang A.H., Wang X., Wang Z.Y., Wassarman D.A., Weinstock G.M., Weissenbach J., Williams S.M., Woodage T., Worley K.C., Wu D., Yang S., Yao Q.A., Ye J., Yeh R.F., Zaveri J.S., Zhan M., Zhang G., Zhao Q., Zheng L., Zheng X.H., Zhong F.N., Zhong W., Zhou X., Zhu S., Zhu X., Smith H.O., Gibbs R.A., Myers E.W., Rubin G.M., Venter J.C.
Science 287:2185-2195(2000) [PubMed: 10731132]
Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
Strain: Berkeley.
[2] "A putative Drosophila homolog of the Huntington's disease gene."
Li Z., Karlovich C.A., Fish M.P., Scott M.P., Myers R.M.
Hum. Mol. Genet. 8:1807-1815(1999) [PubMed: 10441347]
Cited for: NUCLEOTIDE SEQUENCE.
[3] "Annotation of the Drosophila melanogaster euchromatic genome: a systematic review."
Misra S., Crosby M.A., Mungall C.J., Matthews B.B., Campbell K.S., Hradecky P., Huang Y., Kaminker J.S., Millburn G.H., Prochnik S.E., Smith C.D., Tupy J.L., Whitfield E.J., Bayraktaroglu L., Berman B.P., Bettencourt B.R., Celniker S.E., de Grey A.D.N.J., Drysdale R.A., Harris N.L., Richter J., Russo S., Schroeder A.J., Shu S.Q., Stapleton M., Yamada C., Ashburner M., Gelbart W.M., Rubin G.M., Lewis S.E.
Genome Biol. 3:RESEARCH0083.1-RESEARCH0083.22(2002) [PubMed: 12537572]
Cited for: NUCLEOTIDE SEQUENCE.
Strain: Berkeley.

Cross-references

Sequence databases

EMBL / GenBank / DDBJ:
   AF146362 mRNA. Translation: AAF03255.1.
   AF147779 Genomic DNA. Translation: AAF03256.1.
   AE014297 Genomic DNA. Translation: AAF56808.1.
RefSeq: NP_651629.1. NM_143372.1.
UniGene: Dm.1357.

Protein-protein interaction databases

STRING: Q9V3N4.

Proteomic databases

PRIDE: Q9V3N4.

Genome annotation databases

EnsemblMetazoa: FBtr0085310; FBpp0084679; FBgn0027655.
GeneID: 43392.
KEGG: dme:Dmel_CG9995.
NMPDR: fig|7227.3.peg.15114.
UCSC: CG9995-RA. d. melanogaster.

Organism-specific databases

CTD: 3064.
FlyBase: FBgn0027655. htt.

Phylogenomic databases

GeneTree: EMGT00050000010324.
InParanoid: Q9V3N4.
OMA: KFCEICE.
PhylomeDB: Q9V3N4.

Gene expression databases

ArrayExpress: Q9V3N4.
Bgee: Q9V3N4.

Family and domain databases

InterPro: IPR011989. ARM-like.
   IPR016024. ARM-type_fold.
   IPR000357. HEAT.
   IPR000091. Huntingtin.
   IPR024613. Huntingtin_middle-repeat.
Gene3D: G3DSA:1.25.10.10. ARM-like. 5 hits.
KO: K04533.
PANTHER: PTHR10170. Huntingtin. 1 hit.
Pfam: PF12372. DUF3652. 2 hits.
   PF02985. HEAT. 1 hit.
SUPFAM: SSF48371. ARM-type_fold. 1 hit.

Other

NextBio: 833691.

Entry information

Entry name: Q9V3N4_DROME
Accession
Primary (citable) accession number: Q9V3N4
Entry history
Integrated into UniProtKB/TrEMBL: May 1, 2000
Last sequence update: May 1, 2000
Last modified: January 25, 2012
This is version 76 of the entry and version 1 of the sequence.
Entry status: Unreviewed (UniProtKB/TrEMBL)