Protein
Submitted name:

Huntingtin

Gene

Htt

Organism
Mus musculus (Mouse)
Status
Unreviewed - Annotation score: 3 out of 5 - Experimental evidence at protein level

Function

GO - Biological process

Complete GO annotation...

Names & Taxonomy

Protein names
Submitted name: Huntingtin (Imported)
Submitted name: Huntington disease gene homolog, isoform CRA_a (Imported)
Gene names
Name: Htt (Imported)
Synonyms: Hdh (Imported)
ORF Names: mCG_2547 (Imported)
Organism: Mus musculus (Mouse) (Imported)
Taxonomic identifier: 10090 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Glires > Rodentia > Sciurognathi > Muroidea > Muridae > Murinae > Mus > Mus
Proteomes
  • UP000000589 Component: Chromosome 5

Organism-specific databases

MGI: MGI:96067. Htt.

Subcellular location

GO - Cellular component

Complete GO annotation...

Expression

Gene expression databases

Bgee: ENSMUSG00000029104.

Interaction

Protein-protein interaction databases

STRING: 10090.ENSMUSP00000078945.

Family & Domains

Phylogenomic databases

eggNOG: ENOG410IDZV. Eukaryota.
  ENOG410XSEC. LUCA.
GeneTree: ENSGT00390000015863.
KO: K04533.
OMA: PIRRKGK.
OrthoDB: EOG091G004G.
TreeFam: TF323608.

Family and domain databases

Gene3D: 1.25.10.10. 8 hits.
InterPro: IPR011989. ARM-like.
  IPR016024. ARM-type_fold.
  IPR000091. Huntingtin.
  IPR028426. Huntingtin_fam.
  IPR024613. Huntingtin_middle-repeat.
PANTHER: PTHR10170. PTHR10170. 1 hit.
Pfam: PF12372. DUF3652. 1 hit.
PRINTS: PR00375. HUNTINGTIN.
SUPFAM: SSF48371. SSF48371. 8 hits.

Sequence

Sequence status: Complete.

G3X9H5-1 [UniParc]

        10         20         30         40         50
MATLEKLMKA FESLKSFQQQ QQQQPPPQAP PPPPPPPPPQ PPQPPPQGQP
60 70 80 90 100
PPPPPPLPGP AEEPLHRPKK ELSATKKDRV NHCLTICENI VAQSLRNSPE
110 120 130 140 150
FQKLLGIAME LFLLCSDDAE SDVRMVADEC LNKVIKALMD SNLPRLQLEL
160 170 180 190 200
YKEIKKNGAP RSLRAALWRF AELAHLVRPQ KCRPYLVNLL PCLTRTSKRP
210 220 230 240 250
EESVQETLAA AVPKIMASFG NFANDNEIKV LLKAFIANLK SSSPTVRRTA
260 270 280 290 300
AGSAVSICQH SRRTQYFYNW LLNVLLGLLV PMEEEHSTLL ILGVLLTLRC
310 320 330 340 350
LVPLLQQQVK DTSLKGSFGV TRKEMEVSPS TEQLVQVYEL TLHHTQHQDH
360 370 380 390 400
NVVTGALELL QQLFRTPPPE LLQALTTPGG LGQLTLVQEE ARGRGRSGSI
410 420 430 440 450
VELLAGGGSS CSPVLSRKQK GKVLLGEEEA LEDDSESRSD VSSSAFAASV
460 470 480 490 500
KSEIGGELAA SSGVSTPGSV GHDIITEQPR SQHTLQADSV DLSGCDLTSA
510 520 530 540 550
ATDGDEEDIL SHSSSQFSAV PSDPAMDLND GTQASSPISD SSQTTTEGPD
560 570 580 590 600
SAVTPSDSSE IVLDGADSQY LGMQIGQPQE DDEEGAAGVL SGEVSDVFRN
610 620 630 640 650
SSLALQQAHL LERMGHSRQP SDSSIDKYVT RDEVAEASDP ESKPCRIKGD
660 670 680 690 700
IGQPNDDDSA PLVHCVRLLS ASFLLTGEKK ALVPDRDVRV SVKALALSCI
710 720 730 740 750
GAAVALHPES FFSRLYKVPL NTTESTEEQY VSDILNYIDH GDPQVRGATA
760 770 780 790 800
ILCGTLVYSI LSRSRLRVGD WLGNIRTLTG NTFSLVDCIP LLQKTLKDES
810 820 830 840 850
SVTCKLACTA VRHCVLSLCS SSYSDLGLQL LIDMLPLKNS SYWLVRTELL
860 870 880 890 900
DTLAEIDFRL VSFLEAKAES LHRGAHHYTG FLKLQERVLN NVVIYLLGDE
910 920 930 940 950
DPRVRHVAAT SLTRLVPKLF YKCDQGQADP VVAVARDQSS VYLKLLMHET
960 970 980 990 1000
QPPSHFSVST ITRIYRGYSL LPSITDVTME NNLSRVVAAV SHELITSTTR
1010 1020 1030 1040 1050
ALTFGCCEAL CLLSAAFPVC TWSLGWHCGV PPLSASDESR KSCTVGMASM
1060 1070 1080 1090 1100
ILTLLSSAWF PLDLSAHQDA LILAGNLLAA SAPKSLRSSW TSEEEANSAA
1110 1120 1130 1140 1150
TRQEEIWPAL GDRTLVPLVE QLFSHLLKVI NICAHVLDDV TPGPAIKAAL
1160 1170 1180 1190 1200
PSLTNPPSLS PIRRKGKEKE PGEQASTPMS PKKVGEASAA SRQSDTSGPV
1210 1220 1230 1240 1250
TASKSSSLGS FYHLPSYLKL HDVLKATHAN YKVTLDLQNS TEKFGGFLRS
1260 1270 1280 1290 1300
ALDVLSQILE LATLQDIGKC VEEVLGYLKS CFSREPMMAT VCVQQLLKTL
1310 1320 1330 1340 1350
FGTNLASQFD GLSSNPSKSQ CRAQRLGSSS VRPGLYHYCF MAPYTHFTQA
1360 1370 1380 1390 1400
LADASLRNMV QAEQERDASG WFDVLQKVSA QLKTNLTSVT KNRADKNAIH
1410 1420 1430 1440 1450
NHIRLFEPLV IKALKQYTTT TSVQLQKQVL DLLAQLVQLR VNYCLLDSDQ
1460 1470 1480 1490 1500
VFIGFVLKQF EYIEVGQFRE SEAIIPNIFF FLVLLSYERY HSKQIIGIPK
1510 1520 1530 1540 1550
IIQLCDGIMA SGRKAVTHAI PALQPIVHDL FVLRGTNKAD AGKELETQKE
1560 1570 1580 1590 1600
VVVSMLLRLI QYHQVLEMFI LVLQQCHKEN EDKWKRLSRQ VADIILPMLA
1610 1620 1630 1640 1650
KQQMHIDSHE ALGVLNTLFE ILAPSSLRPV DMLLRSMFIT PSTMASVSTV
1660 1670 1680 1690 1700
QLWISGILAI LRVLISQSTE DIVLCRIQEL SFSPHLLSCP VINRLRGGGG
1710 1720 1730 1740 1750
NVTLGECSEG KQKSLPEDTF SRFLLQLVGI LLEDIVTKQL KVDMSEQQHT
1760 1770 1780 1790 1800
FYCQELGTLL MCLIHIFKSG MFRRITAAAT RLFTSDGCEG SFYTLESLNA
1810 1820 1830 1840 1850
RVRSMVPTHP ALVLLWCQIL LLINHTDHRW WAEVQQTPKR HSLSCTKSLN
1860 1870 1880 1890 1900
PQKSGEEEDS GSAAQLGMCN REIVRRGALI LFCDYVCQNL HDSEHLTWLI
1910 1920 1930 1940 1950
VNHIQDLISL SHEPPVQDFI SAIHRNSAAS GLFIQAIQSR CENLSTPTTL
1960 1970 1980 1990 2000
KKTLQCLEGI HLSQSGAVLT LYVDRLLGTP FRALARMVDT LACRRVEMLL
2010 2020 2030 2040 2050
AANLQSSMAQ LPEEELNRIQ EHLQNSGLAQ RHQRLYSLLD RFRLSTVQDS
2060 2070 2080 2090 2100
LSPLPPVTSH PLDGDGHTSL ETVSPDKDWY LQLVRSQCWT RSDSALLEGA
2110 2120 2130 2140 2150
ELVNRIPAED MNDFMMSSEF NLSLLAPCLS LGMSEIANGQ KSPLFEAARG
2160 2170 2180 2190 2200
VILNRVTSVV QQLPAVHQVF QPFLPIEPTA YWNKLNDLLG DTTSYQSLTI
2210 2220 2230 2240 2250
LARALAQYLV VLSKVPAHLH LPPEKEGDTV KFVVMTVEAL SWHLIHEQIP
2260 2270 2280 2290 2300
LSLDLQAGLD CCCLALQVPG LWGVLSSPEY VTHACSLIHC VRFILEAIAV
2310 2320 2330 2340 2350
QPGDQLLGPE SRSHTPRAVR KEEVDSDIQN LSHVTSACEM VADMVESLQS
2360 2370 2380 2390 2400
VLALGHKRNS TLPSFLTAVL KNIVISLARL PLVNSYTRVP PLVWKLGWSP
2410 2420 2430 2440 2450
KPGGDFGTVF PEIPVEFLQE KEILKEFIYR INTLGWTNRT QFEETWATLL
2460 2470 2480 2490 2500
GVLVTQPLVM EQEESPPEED TERTQIHVLA VQAITSLVLS AMTVPVAGNP
2510 2520 2530 2540 2550
AVSCLEQQPR NKPLKALDTR FGRKLSMIRG IVEQEIQEMV SQRENTATHH
2560 2570 2580 2590 2600
SHQAWDPVPS LLPATTGALI SHDKLLLQIN PEREPGNMSY KLGQVSIHSV
2610 2620 2630 2640 2650
WLGNNITPLR EEEWDEEEEE ESDVPAPTSP PVSPVNSRKH RAGVDIHSCS
2660 2670 2680 2690 2700
QFLLELYSRW ILPSSAARRT PVILISEVVR SLLVVSDLFT ERTQFEMMYL
2710 2720 2730 2740 2750
TLTELRRVHP SEDEILIQYL VPATCKAAAV LGMDKTVAEP VSRLLESTLR
2760 2770 2780 2790 2800
SSHLPSQIGA LHGILYVLEC DLLDDTAKQL IPVVSDYLLS NLKGIAHCVN
2810 2820 2830 2840 2850
IHSQQHVLVM CATAFYLMEN YPLDVGPEFS ASVIQMCGVM LSGSEESTPS
2860 2870 2880 2890 2900
IIYHCALRGL ERLLLSEQLS RLDTESLVKL SVDRVNVQSP HRAMAALGLM
2910 2920 2930 2940 2950
LTCMYTGKEK ASPGRASDPS PATPDSESVI VAMERVSVLF DRIRKGFPCE
2960 2970 2980 2990 3000
ARVVARILPQ FLDDFFPPQD VMNKVIGEFL SNQQPYPQFM ATVVYKVFQT
3010 3020 3030 3040 3050
LHSAGQSSMV RDWVMLSLSN FTQRTPVAMA MWSLSCFLVS ASTSPWVSAI
3060 3070 3080 3090 3100
LPHVISRMGK LEQVDVNLFC LVATDFYRHQ IEEEFDRRAF QSVFEVVAAP
3110 3120
GSPYHRLLAC LQNVHKVTTC
Length: 3,120
Mass (Da): 344,788
Last modified: November 16, 2011 - v1
Checksum: 53CCDC45D53B8BA3
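
The sequence above is laid out in blocks of ten residues with position rulers between the rows, so it needs cleaning before it can be passed to other tools. The following is a minimal sketch of one way to do that. The function name `parse_sequence_block` is hypothetical, and the sample contains only the first two rows of the entry, copied verbatim.

```python
def parse_sequence_block(block: str) -> str:
    """Return the bare residue string from a ruler-annotated sequence block.

    Assumption: ruler lines contain only whitespace-separated integers,
    and every other non-blank line contains residues in groups.
    """
    residues = []
    for line in block.splitlines():
        tokens = line.split()
        # Skip blank lines and ruler lines (position numbers only).
        if not tokens or all(tok.isdigit() for tok in tokens):
            continue
        # Join the ten-residue groups on this row into one string.
        residues.append("".join(tokens))
    return "".join(residues)


# First two rows of the sequence above, used here as a small sample.
sample = """\
        10         20         30         40         50
MATLEKLMKA FESLKSFQQQ QQQQPPPQAP PPPPPPPPPQ PPQPPPQGQP
60 70 80 90 100
PPPPPPLPGP AEEPLHRPKK ELSATKKDRV NHCLTICENI VAQSLRNSPE
"""

first_100 = parse_sequence_block(sample)
print(len(first_100))
```

Applied to the full block, the resulting string length can be compared against the stated length of 3,120 as a quick sanity check on the copy.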

Sequence databases

EMBL/GenBank/DDBJ:
  AC133204 Genomic DNA. No translation available.
  AC151669 Genomic DNA. No translation available.
  CH466524 Genomic DNA. Translation: EDL37467.1.
RefSeq: NP_034544.1. NM_010414.3.
UniGene: Mm.209071.

Genome annotation databases

Ensembl: ENSMUST00000080036; ENSMUSP00000078945; ENSMUSG00000029104.
GeneID: 15194.
KEGG: mmu:15194.
UCSC: uc008xdc.3. mouse.

Cross-references

Organism-specific databases

CTD: 3064.
MGI: MGI:96067. Htt.

Miscellaneous databases

ChiTaRS: Htt. mouse.

Entry information

Entry name: G3X9H5_MOUSE
Accession: Primary (citable) accession number: G3X9H5
Entry history:
  Integrated into UniProtKB/TrEMBL: November 16, 2011
  Last sequence update: November 16, 2011
  Last modified: November 30, 2016
  This is version 50 of the entry and version 1 of the sequence.
Entry status: Unreviewed (UniProtKB/TrEMBL)

Miscellaneous

Keywords - Technical term

Complete proteome, Proteomics identification (Combined sources), Reference proteome (Imported)

Similar proteins

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%: UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%: UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a. seed sequence).
50%: UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.
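
The UniRef90 and UniRef50 criteria above each combine an identity threshold with an 80% overlap threshold. The sketch below only illustrates that two-part test and is not how UniRef itself computes clusters. The `Alignment` class, the `meets_threshold` function, and the example values are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Alignment:
    """Hypothetical summary of a candidate aligned to a cluster seed."""
    identity: float  # fraction of identical residues, 0.0 to 1.0
    overlap: float   # fraction of the seed covered by the alignment, 0.0 to 1.0


def meets_threshold(aln: Alignment, min_identity: float,
                    min_overlap: float = 0.80) -> bool:
    """Check the stated criteria: both identity and overlap must be met."""
    return aln.identity >= min_identity and aln.overlap >= min_overlap


# Made-up example values, not taken from this entry.
candidate = Alignment(identity=0.93, overlap=0.85)
print(meets_threshold(candidate, min_identity=0.90))  # UniRef90-style check
print(meets_threshold(candidate, min_identity=0.50))  # UniRef50-style check
```

Both conditions must hold together: a highly identical but short fragment fails the overlap requirement even when its identity is above the threshold.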