Protein

Dynein heavy chain 1, axonemal

Gene

Dnah1

Organism
Rattus norvegicus (Rat)
Status
Reviewed - Annotation score: 4 out of 5 - Experimental evidence at transcript level

Function

Force-generating protein of respiratory cilia. Produces force towards the minus ends of microtubules. Dynein has ATPase activity; the force-producing power stroke is thought to occur on release of ADP. Involved in sperm motility; implicated in sperm flagellar assembly (By similarity).

Regions

Feature key         Position(s)   Length  Description
Nucleotide binding  1787 – 1794   8       ATP (Sequence analysis)
Nucleotide binding  2054 – 2061   8       ATP (Sequence analysis)
Nucleotide binding  2460 – 2467   8       ATP (Sequence analysis)
Nucleotide binding  2819 – 2826   8       ATP (Sequence analysis)
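
The Length column follows from the coordinates, which are 1-based and inclusive: length = end - start + 1. A minimal sketch that checks this for the four ATP-binding sites listed above (the helper name `span_length` is illustrative and not part of any UniProt tool):

```python
# Positions are 1-based and inclusive, as in the feature table above.
ATP_SITES = [(1787, 1794), (2054, 2061), (2460, 2467), (2819, 2826)]


def span_length(start: int, end: int) -> int:
    """Number of residues covered by an inclusive 1-based range."""
    return end - start + 1


lengths = [span_length(start, end) for start, end in ATP_SITES]
print(lengths)
```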

GO - Molecular function

GO - Biological process

Keywords - Molecular function

Motor protein

Keywords - Ligand

ATP-binding, Nucleotide-binding

Names & Taxonomy

Protein names
Recommended name:
Dynein heavy chain 1, axonemal
Alternative name(s):
Axonemal beta dynein heavy chain 1
Ciliary dynein heavy chain 1
Gene names
Name: Dnah1
Synonyms: Dlp1
Organism: Rattus norvegicus (Rat)
Taxonomic identifier: 10116 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Glires > Rodentia > Sciurognathi > Muroidea > Muridae > Murinae > Rattus
Proteomes
  • UP000002494 Component: Chromosome 16

Organism-specific databases

RGD: 621795. Dnah1.

Subcellular location

GO - Cellular component

Keywords - Cellular component

Cell projection, Cilium, Cytoplasm, Cytoskeleton, Dynein, Microtubule

PTM / Processing

Molecule processing

Feature key  Position(s)  Length  Description                     Feature identifier
Chain        1 – 4516     4516    Dynein heavy chain 1, axonemal  PRO_0000318937

Proteomic databases

PaxDb: Q63164.
PRIDE: Q63164.

PTM databases

iPTMnet: Q63164.
PhosphoSite: Q63164.

Expression

Tissue specificity

Expressed in brain. (1 publication)

Gene expression databases

Bgee: ENSRNOG00000026914.
ExpressionAtlas: Q63164. baseline.
Genevisible: Q63164. RN.

Interaction

Subunit structure

Consists of at least two heavy chains and a number of intermediate and light chains.

Protein-protein interaction databases

STRING: 10116.ENSRNOP00000059841.

Family & Domains

Region

Feature key  Position(s)   Length  Description
Region       1 – 1748      1748    Stem (By similarity)
Region       1749 – 1956   208     AAA 1 (By similarity)
Region       2016 – 2249   234     AAA 2 (By similarity)
Region       2422 – 2682   261     AAA 3 (By similarity)
Region       2780 – 2972   193     AAA 4 (By similarity)
Region       2987 – 3285   299     Stalk (By similarity)
Region       3388 – 3618   231     AAA 5 (By similarity)
Region       3831 – 4050   220     AAA 6 (By similarity)

Coiled coil

Feature key  Position(s)   Length  Description
Coiled coil  1174 – 1204   31      Sequence analysis
Coiled coil  1422 – 1445   24      Sequence analysis
Coiled coil  3069 – 3090   22      Sequence analysis
Coiled coil  3126 – 3150   25      Sequence analysis
Coiled coil  3289 – 3399   111     Sequence analysis
Coiled coil  3619 – 3686   68      Sequence analysis

Motif

Feature key  Position(s)   Length  Description
Motif        1787 – 1794   8       GPAGTGKT motif
Motif        1837 – 1843   7       CFDEFNR motif (By similarity)
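
Both motifs can be read directly off the canonical sequence by slicing with the 1-based, inclusive positions given above. A minimal sketch, using only residues 1751-1850 of isoform 1 as copied from this entry (the names `FRAGMENT`, `FRAGMENT_START` and `residues` are illustrative):

```python
# Residues 1751-1850 of isoform 1, copied from the canonical sequence in this entry.
FRAGMENT_START = 1751
FRAGMENT = (
    "YEYLGNSGRLVITPLTDRCYLTLTGALHLKFGGAPAGPAGTGKTETTKDL"
    "GKALAIQTVVFNCSDQLDFMAMGKFFKGLASAGAWACFDEFNRIDIEVLS"
)


def residues(start: int, end: int) -> str:
    """Return residues start..end (1-based, inclusive) from the fragment."""
    lo = start - FRAGMENT_START        # convert to a 0-based offset
    hi = end - FRAGMENT_START + 1      # Python slices exclude the end index
    if lo < 0 or hi > len(FRAGMENT):
        raise ValueError("range falls outside the fragment")
    return FRAGMENT[lo:hi]


gpagtgkt = residues(1787, 1794)
cfdefnr = residues(1837, 1843)
print(gpagtgkt, cfdefnr)
```

The only subtlety is the offset conversion: feature coordinates count from 1 and include the end residue, while Python slices count from 0 and exclude it.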

Domain

Dynein heavy chains probably consist of an N-terminal stem (which binds cargo and interacts with other dynein components), and the head or motor domain. The motor contains six tandemly linked AAA domains in the head, which form a ring. A stalk-like structure (formed by two of the coiled coil domains) protrudes between AAA 4 and AAA 5 and terminates in a microtubule-binding site. A seventh domain may also contribute to this ring; it is not clear whether the N-terminus or the C-terminus forms this extra domain. There are four well-conserved and two non-conserved ATPase sites, one per AAA domain. Probably only one of these (within AAA 1) actually hydrolyzes ATP; the others may serve a regulatory function (By similarity).
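
The layout described above can be checked against the Region coordinates listed in this entry. The sketch below uses only those coordinates and confirms that the regions run from N- to C-terminus without overlap, with the stalk between AAA 4 and AAA 5 (variable names are illustrative):

```python
# Region coordinates (1-based, inclusive) from the Region table of this entry.
REGIONS = {
    "Stem": (1, 1748),
    "AAA 1": (1749, 1956),
    "AAA 2": (2016, 2249),
    "AAA 3": (2422, 2682),
    "AAA 4": (2780, 2972),
    "Stalk": (2987, 3285),
    "AAA 5": (3388, 3618),
    "AAA 6": (3831, 4050),
}

# Sort by start position to recover the N- to C-terminal order.
ordered = sorted(REGIONS, key=lambda name: REGIONS[name][0])

# Each region must end before the next one starts.
non_overlapping = all(
    REGIONS[a][1] < REGIONS[b][0] for a, b in zip(ordered, ordered[1:])
)

# Regions immediately before and after the stalk.
i = ordered.index("Stalk")
stalk_neighbours = (ordered[i - 1], ordered[i + 1])
print(ordered, non_overlapping, stalk_neighbours)
```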

Sequence similarities

Belongs to the dynein heavy chain family. (Curated)

Keywords - Domain

Coiled coil

Phylogenomic databases

eggNOG: KOG3595. Eukaryota.
eggNOG: COG5245. LUCA.
GeneTree: ENSGT00760000118964.
HOGENOM: HOG000237308.
HOVERGEN: HBG107830.
InParanoid: Q63164.
KO: K10408.
OMA: CWLRKLP.
OrthoDB: EOG091G00FO.
PhylomeDB: Q63164.

Family and domain databases

Gene3D: 3.40.50.300. 4 hits.
InterPro: IPR011704. ATPase_dyneun-rel_AAA.
InterPro: IPR024743. Dynein_HC_stalk.
InterPro: IPR024317. Dynein_heavy_chain_D4_dom.
InterPro: IPR004273. Dynein_heavy_dom.
InterPro: IPR013602. Dynein_heavy_dom-2.
InterPro: IPR027417. P-loop_NTPase.
Pfam: PF07728. AAA_5. 1 hit.
Pfam: PF12780. AAA_8. 1 hit.
Pfam: PF08393. DHC_N2. 1 hit.
Pfam: PF03028. Dynein_heavy. 1 hit.
Pfam: PF12777. MT. 1 hit.
SUPFAM: SSF52540. SSF52540. 5 hits.

Sequences (2)

Sequence status: Complete.

This entry describes 2 isoforms produced by alternative splicing.

Isoform 1 (identifier: Q63164-1)

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.

        10         20         30         40         50
MVTLSISDTL HTPAFVENRH LAITGMPTRK ANNGFPSPDD NNVWGQFWKD
60 70 80 90 100
LQTPLLVVVY VHGAQCQLLL PIENRRVSSG SDKSLKNGME ECDKEEASTS
110 120 130 140 150
SQGPGYCPAN VPENHDLEKM LKESSRNPEK TSPNPELKTP PLPLSDLGQP
160 170 180 190 200
RKSPLAGTDK KYPIMKQRGF YSDILSPGTL DQLGDVCCGP YLSQNLIRQA
210 220 230 240 250
DLDKFTPKGQ CHEHWAAESF VIPEDFQERV EQQSIGTTTR LLTQTDFPLQ
260 270 280 290 300
SYEPKVQVPF QVLPGRCPRK IEIERRKQQY LRLDIEQLLA NEGIDSNKLM
310 320 330 340 350
PRHPDLQQPQ TIEQGRDPLF PIYLPLKVFD NEEFDCRTPT EWLNMGLEPD
360 370 380 390 400
AQYRKPVPGK ALLPTDDALG HVHRAGCKQV SFSKCINTGI SLPEAGKKQS
410 420 430 440 450
VSFTLDSNDR NNLTKLSPVL QCLAIKPWEQ GHVKCTEFQL PRFTGALALS
460 470 480 490 500
RNTRAKRGSQ EASTRDHLDF EDPKSQELDY RWCEVGVLDY DEEKKLYLVQ
510 520 530 540 550
KTDKRGLVRD EMGMPIVNGG VTPEGRPPLL DTQYWVPRIQ LLFCAEDPRV
560 570 580 590 600
FTQRVVQANA LRKNTEALLL YNLYVDCMPT EGQRVINEQS LSKIKQWALS
610 620 630 640 650
TPRMRKGPSV LEHLSSLARE VNLDYERSMN KINFDQIVSS KPETFSFVTL
660 670 680 690 700
PEKEEEKVPN QGLVSVPEYP FREQKEDFTF VSLLTRSEVI TALSKVRAEC
710 720 730 740 750
NKVTSMSLFH SNLSKYSRLE EFEQIQSQTF SQVQMFLKDS WISTLKVAMR
760 770 780 790 800
SSLRDMSKGW YNLYETNWEV YLMSKLRKLM ELIKYMLQDT LRFLVQDSLG
810 820 830 840 850
SFAQFIGDAC CSVLECIDDM DWGEDLINSP YKPRKNPLFI VDLVLDNTGV
860 870 880 890 900
HYSTPLEQFE VILLNLFDKG ILATHAVPQL EKLVMEDIFI SGDPLLESVG
910 920 930 940 950
LHEPLVEELR ANITNAMHKA MIPLQAYAKE YRKTYQTQCP SAEEVREVVI
960 970 980 990 1000
THLKEKEILD NSLPSSIIIG PFYINVDNVK QSLSKKRKAL ATSMLDILAK
1010 1020 1030 1040 1050
NLHNEVDSIC EEFRSISRKI YEKPNSIEEL AELRDWMKGI PEKLVILEVR
1060 1070 1080 1090 1100
QALALAARSL VEPEVGPALE IPFNNPLPSM TRHFLLMQER IVKVMSDYEV
1110 1120 1130 1140 1150
MDEFFYNLTT DDFNDKWAAN NWPTKILGQI DMVRQQHVED EEKFRKIQLM
1160 1170 1180 1190 1200
DQNNFQEKLE GLQLVVAGFS THVEIARAHE IANEVRRVKK QLKDCQQLAM
1210 1220 1230 1240 1250
LYNNRERIFG LPITNYDKLS RMVKEFQPYL DLWTTASDWL RWSESWMNDP
1260 1270 1280 1290 1300
LSAIDAEQLE KNVIESFKTM HKCVKQFKDI PACQDVALDI RARIDEFKPY
1310 1320 1330 1340 1350
IPLIQGLRNP GMRNRHWEVL SNEININVRP KANLTFARCL EMNLQDYIES
1360 1370 1380 1390 1400
ISKVAEVAGK EYAIEQALDK MEKEWASILF NVLPYKETDT YILKSPDEAS
1410 1420 1430 1440 1450
QLLDDHIVMT QSMSFSPYKK PFEQRINSWE TKLKLTQEVL EEWLNCQRSW
1460 1470 1480 1490 1500
LYLEPIFSSE DITRQLPVES KRYQTMERIW RKIMKNAYEN REVINVCSDQ
1510 1520 1530 1540 1550
RLLDSLRDCN KLLDLVQKGL SEYLETKRTA FPRFYFLSDD ELLEILSQTK
1560 1570 1580 1590 1600
DPTAVQPHLR KCFENIARLL FQEDLEITHM YSAEGEEVKL SFSIYPSSNV
1610 1620 1630 1640 1650
EDWLLEVERS MKASVHDIIE MAIKAYPTML RTEWVLSWPG QVTIAGCQTY
1660 1670 1680 1690 1700
WTLEVAQALE ASSISSSLFP QLSKQLSDLV ALVRGKLSRM QRMVLSALIV
1710 1720 1730 1740 1750
IEVHAKDVVS KLIDENVVSV HDFEWISQLR YYWTNNDLYI RAVNAEFIYG
1760 1770 1780 1790 1800
YEYLGNSGRL VITPLTDRCY LTLTGALHLK FGGAPAGPAG TGKTETTKDL
1810 1820 1830 1840 1850
GKALAIQTVV FNCSDQLDFM AMGKFFKGLA SAGAWACFDE FNRIDIEVLS
1860 1870 1880 1890 1900
VVAQQITTIQ KAQQQRSREY IKSLGGAVMC LPFLCVLYVQ ALFRPVAMMV
1910 1920 1930 1940 1950
PDYAMIAEIS LYSFGFNEAN VLAKKITTTF KLSSEQLSSQ DHYDFGMRAV
1960 1970 1980 1990 2000
KTVISAAGNL KRENPTMNEE LICLRAIRDV NVPKFLQEDL KLFSGIVSDL
2010 2020 2030 2040 2050
FPTTKEEETD YGILDQAIRK ACEKNNLKDV EGFLIKCIQL YETTVVRHGL
2060 2070 2080 2090 2100
MLVGPTGSGK SNCYRILAAA MTSLKGKPSI SGGVYEAVNY YVLNPKSITM
2110 2120 2130 2140 2150
GQLYGEFDLL THEWTDGIFP SLIRAGAIAS DTNKKWYMFD GPVDAVWIEN
2160 2170 2180 2190 2200
MNTVLDDNKK LCLSSGEIIK LTEAMTMMFE VQDLAVASPA TVSRCGMVYL
2210 2220 2230 2240 2250
EPSILGLMPF VECWLKHLPS IIKPYEEQFK TLFVKFLESS IAFVRTTVKE
2260 2270 2280 2290 2300
VVASTNSNLT MSLLKLLDCF FKPFLPREGL KKIPSEKLSH IPELIEPWFI
2310 2320 2330 2340 2350
FSLVWSVGAT GDHSSRLNFS QWLKIKMVFE QIKLAFPEEG LVYDYRLDDA
2360 2370 2380 2390 2400
GISSTEDDDE DDEESKQVAT ALPAQEGGMV KPPGLWDLPP RDSGHLEKAT
2410 2420 2430 2440 2450
LTASRSEAVA WVKWMDYSVP FTMMPDTNYC NIIVPTMDTM QMSYLLGMLI
2460 2470 2480 2490 2500
TNHKPVLCIG PTGTGKTLTV SNKLLKNLPL EYISHFLTFS ARTSANQTQD
2510 2520 2530 2540 2550
LIDSKLDKRR KGVFGPPLGR NFIFFIDDLN MPALETYGAQ PPIELLRQWM
2560 2570 2580 2590 2600
DHGGWYDRKI IGAFKNLVDI NFVCAMGPPG GGRNAITPRL TRHFNYLSFI
2610 2620 2630 2640 2650
EMDEVSKKRI FSIILGCWMD GLLGEKSYRE PVPGAPNIVH MTEPLVNATI
2660 2670 2680 2690 2700
SIYAIITSQL LPTPAKSHYT FNLRDLSKVF QGILMAEPAK VEDKVQLLRL
2710 2720 2730 2740 2750
WYHENCRVFR DRLVNEEDRG WFDGLLEMKM EDLGVAFNKV CPFQPILYGD
2760 2770 2780 2790 2800
FMSPGSDVKS YELITSENKM MQVIEEYMED YNQINTAKLK LVLFVDAMSH
2810 2820 2830 2840 2850
ICRISRTLRQ ALGNALLLGV GGSGRSSLTR LASHMAEYEC FQVELSKNYG
2860 2870 2880 2890 2900
MSEWREDVKK ILLKAGMQNL PITFLFSDTQ IKNESFLEDI NNILNSGDIP
2910 2920 2930 2940 2950
NLYSADEQDQ IVNTMRPYIQ EQGLQPTKAN LMAAYTGRVR NNIHMVLCMS
2960 2970 2980 2990 3000
PIGEVFRARL RQFPSLVNCC TIDWFNEWPA EALQSVATRF LHEIPELECS
3010 3020 3030 3040 3050
SEVIEGLIHV CVYIHQSVAK KCVEYLAELA RHNYVTPKSY LELLNIFSIL
3060 3070 3080 3090 3100
IGQKKMELKT AKHRMKSGLD KLLRTSEDVA KMQEELEIMR PLLEEAAKDT
3110 3120 3130 3140 3150
LLTMDQIKVD TAIAEETRKS VQAEEIKANE KASKAQAIAD DAQKDLDEAL
3160 3170 3180 3190 3200
PALDAALASL RNLNKNDVTE VRAMQRPPPG VKLVIEAVCI MKGIKPKKVP
3210 3220 3230 3240 3250
GEKPGSKVDD YWEPGKGLLQ DPGRFLESLF KFDKDNIGEA VIKAIQPYID
3260 3270 3280 3290 3300
NEEFQPAAIA KVSKACTSIC QWVRAMHKYH FVAKAVEPKR QALREAQDDL
3310 3320 3330 3340 3350
EVTQRILEEA KHHLREVEDG ISTLQAKYRE CVTKKEELEM KCEQCEQRLG
3360 3370 3380 3390 3400
RADKSQSPGQ PPGAHPTRLL LQLINGLADE KVRWQDTVEN LENMLDNIFG
3410 3420 3430 3440 3450
DVLVAAGFVA YLGPFTGQYR AALYEYWVNQ LTVYGVPHTS KPTLISTLGN
3460 3470 3480 3490 3500
PVKIRSWQIA GLPNDTLSVE NGVINQFSQR WTHFIDPQGQ ANKWIKNMEK
3510 3520 3530 3540 3550
ESGLDVFKLS DRDFLRSMEN AIRFGKPCLL ENVGEELDPA LEPVLLKQTY
3560 3570 3580 3590 3600
KQQGNTVLKL GDTVIPYHED FRMYITTKLP NPHYSPEVST KLTLINFTLS
3610 3620 3630 3640 3650
PSGLEDQLLG QVVAEERPDL EEAKNQLIIS NAKMRQELKD IEDQILYRLS
3660 3670 3680 3690 3700
SSEGNPVDDV ELIKVLEASK MKAAEIQAKV RIAEQTEKDI DLTRMEYIPV
3710 3720 3730 3740 3750
AVRTQILFFC VSDLANVDPM YQYSLEWFLN IFLSGIANSE RADNLKKRIV
3760 3770 3780 3790 3800
NINRYLTFSL YSNVCRSLFE KHKLMFAFLL CVRIMMNEGK INQGEWRYLL
3810 3820 3830 3840 3850
SGGSIQTMSE NPAPHWLSDR AWRDILALSN LPAFSTFSTD FVQHLPKFQA
3860 3870 3880 3890 3900
IFDSAEPHRE PLPGIWNTYL DEFQKLLILR CLRGDKVTNA MQDFVANHLE
3910 3920 3930 3940 3950
PRFIEPQTAN LSAVFKESNS TTPLIFVLSP GTDPAADLYK FAEEMKFSKK
3960 3970 3980 3990 4000
LSAISLGQGQ GPRAEAMMRN SIERGKWVFF QNCHLAPSWM PALERLIEHI
4010 4020 4030 4040 4050
NPDKVHRDFR LWLTSLPSNK FPVSILQNGS KMTIEPPRGV KANLLKSYNS
4060 4070 4080 4090 4100
LSDDFLHSCQ KVVEFKSLLL SLCLFHGNAL ERRKFGPLGF NIPYEFTDGD
4110 4120 4130 4140 4150
LRICISQLKM FLDEYEDIPY KVLKYTAGEI NYGGRVTDDW DRRCVMNILE
4160 4170 4180 4190 4200
DFYNPAVLSP EHRYSKSGIY HQIPPTYDLN GYLSYIKSLP LNDMPEIFGL
4210 4220 4230 4240 4250
HDNANITFAQ NETFALFGAI LQLQPKSSSM GGQSREELVE DVAEDILVQV
4260 4270 4280 4290 4300
PKPVDLEEVV NKYPVLYEES MNTVLVQEVI RYNKLLMVIT QTLSDMLKAI
4310 4320 4330 4340 4350
KGLVVMSLEL ELMSTSLYNN AVPELWKSKA YPSLKPLASW IMDLLQRLNF
4360 4370 4380 4390 4400
LHSWIKNGIP SVFWISGFFF PQAFLTGTLQ NFARKFVISI DTITFDFKVL
4410 4420 4430 4440 4450
SYASSEIAER PSTGCYIYGL FLEGARWDPF DFQLAESRPK ELYTEMAVIW
4460 4470 4480 4490 4500
LLPVANRKVQ NQDFYLCPIY KTLTRAGTLS TTGHSTNYVI AVEIPSNQPQ
4510
RHWIKRGVAL ICALDY
Length: 4,516
Mass (Da): 515,021
Last modified: February 26, 2008 - v2
Checksum: F46AC5E8DCCC0BAF
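
The sequence display above interleaves position rulers with groups of ten residues. To work with it programmatically, the ruler lines and spaces have to be stripped first. A minimal sketch, assuming the layout shown in this entry and using only its first two rows (the function name `parse_sequence_block` is illustrative):

```python
def parse_sequence_block(text: str) -> str:
    """Join residue groups, skipping ruler lines that contain only numbers."""
    groups = []
    for line in text.splitlines():
        tokens = line.split()
        if not tokens or all(tok.isdigit() for tok in tokens):
            continue  # blank line or position ruler
        groups.extend(tokens)
    return "".join(groups)


# First two rows of isoform 1 as displayed in this entry.
BLOCK = """
        10         20         30         40         50
MVTLSISDTL HTPAFVENRH LAITGMPTRK ANNGFPSPDD NNVWGQFWKD
60 70 80 90 100
LQTPLLVVVY VHGAQCQLLL PIENRRVSSG SDKSLKNGME ECDKEEASTS
"""

sequence = parse_sequence_block(BLOCK)
print(len(sequence))
```

Applied to the full display, the same function should yield a string whose length matches the stated length of 4,516.
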
Isoform 2 (identifier: Q63164-2)

The sequence of this isoform differs from the canonical sequence as follows:
     1867-1890: SREYIKSLGGAVMCLPFLCVLYVQ → VERFMFEGVEIPLVPSCAVFITMNPGYAGRTELPDNLK

Note: No experimental confirmation available.
Length: 4,530
Mass (Da): 516,573
Checksum: 2590FAEDB47EE081
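
The isoform 2 length follows from the substitution: 24 residues are removed and 38 inserted, so 4,516 - 24 + 38 = 4,530. A minimal sketch that applies the replacement to residues 1861-1900 of the canonical sequence, copied from this entry (variable names are illustrative):

```python
CANONICAL_LENGTH = 4516

# Residues 1861-1900 of isoform 1, copied from the canonical sequence.
FRAGMENT_START = 1861
FRAGMENT = "KAQQQRSREYIKSLGGAVMCLPFLCVLYVQALFRPVAMMV"

# Alternative sequence for isoform 2 (positions are 1-based, inclusive).
VSP_START, VSP_END = 1867, 1890
ORIGINAL = "SREYIKSLGGAVMCLPFLCVLYVQ"
REPLACEMENT = "VERFMFEGVEIPLVPSCAVFITMNPGYAGRTELPDNLK"

lo = VSP_START - FRAGMENT_START
hi = VSP_END - FRAGMENT_START + 1

# Sanity check: the canonical residues at 1867-1890 match the stated original.
assert FRAGMENT[lo:hi] == ORIGINAL

isoform2_fragment = FRAGMENT[:lo] + REPLACEMENT + FRAGMENT[hi:]
isoform2_length = CANONICAL_LENGTH - len(ORIGINAL) + len(REPLACEMENT)
print(isoform2_fragment, isoform2_length)
```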

Alternative sequence

Feature key           Position(s)   Length  Description                                                                         Feature identifier
Alternative sequence  1867 – 1890   24      SREYI…VLYVQ → VERFMFEGVEIPLVPSCAVFITMNPGYAGRTELPDNLK in isoform 2. (1 publication)  VSP_031309

Sequence databases

EMBL / GenBank / DDBJ:
AABR03100255 Genomic DNA. No translation available.
AABR03101829 Genomic DNA. No translation available.
AABR03101330 Genomic DNA. No translation available.
D26492 mRNA. Translation: BAA05500.1.
PIR: I70171.
RefSeq: NP_001028827.2. NM_001033655.2.
UniGene: Rn.136817.

Genome annotation databases

Ensembl: ENSRNOT00000035009; ENSRNOP00000032434; ENSRNOG00000026914. [Q63164-1]
GeneID: 171339.
KEGG: rno:171339.
UCSC: RGD:621795. rat. [Q63164-1]

Keywords - Coding sequence diversity

Alternative splicing

Cross-references

Organism-specific databases

CTD: 25981.
RGD: 621795. Dnah1.

Miscellaneous databases

PRO: Q63164.

Entry information

Entry name: DYH1_RAT
Accession: Primary (citable) accession number: Q63164
Entry history
Integrated into UniProtKB/Swiss-Prot: February 26, 2008
Last sequence update: February 26, 2008
Last modified: September 7, 2016
This is version 88 of the entry and version 2 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Chordata Protein Annotation Program

Miscellaneous

Keywords - Technical term

Complete proteome, Reference proteome

Documents

  1. SIMILARITY comments
    Index of protein domains and families

Similar proteins

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%: UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%: UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a. seed sequence).
50%: UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.
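
The UniRef90 and UniRef50 membership criteria described above amount to two thresholds measured against the seed (longest) sequence. The sketch below encodes only those stated thresholds. It is not the actual UniRef clustering procedure, which this entry does not describe, and the function and constant names are hypothetical:

```python
# Thresholds taken from the descriptions above; names are illustrative.
UNIREF_MIN_IDENTITY = {"UniRef90": 0.90, "UniRef50": 0.50}
MIN_OVERLAP = 0.80


def meets_thresholds(level: str, identity: float, overlap: float) -> bool:
    """True if identity and overlap (both as fractions, relative to the
    seed sequence) satisfy the stated criteria for the given level."""
    return identity >= UNIREF_MIN_IDENTITY[level] and overlap >= MIN_OVERLAP


print(meets_thresholds("UniRef90", 0.93, 0.85))
print(meets_thresholds("UniRef90", 0.93, 0.70))
```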