Protein

Protein dispatched homolog 1

Gene

Disp1

Organism
Mus musculus (Mouse)
Status
Reviewed - Annotation score: 5 out of 5 - Experimental evidence at protein level

Function

Functions in hedgehog (Hh) signaling. Regulates the release and extracellular accumulation of cholesterol-modified hedgehog proteins and is hence required for effective production of the Hh signal. (3 publications)

GO - Molecular function

  • hedgehog receptor activity Source: InterPro
  • peptide transporter activity Source: MGI

GO - Biological process

  • determination of left/right symmetry Source: MGI
  • diaphragm development Source: MGI
  • dorsal/ventral pattern formation Source: MGI
  • embryonic pattern specification Source: MGI
  • patched ligand maturation Source: MGI
  • pattern specification process Source: MGI
  • peptide transport Source: MGI
  • smoothened signaling pathway Source: MGI

Keywords - Molecular function

Developmental protein

Protein family/group databases

TCDB: 2.A.6.9.2. The resistance-nodulation-cell division (RND) superfamily.

Names & Taxonomy

Protein names
Recommended name:
Protein dispatched homolog 1
Alternative name(s):
Mdispa
Gene names
Name: Disp1
Synonyms: Disp, Dispa, Icb, Icbins
Organism: Mus musculus (Mouse)
Taxonomic identifier: 10090 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Glires > Rodentia > Sciurognathi > Muroidea > Muridae > Murinae > Mus > Mus
Proteomes
  • UP000000589 Component: Chromosome 1

Organism-specific databases

MGI: MGI:1916147. Disp1.

Subcellular location

Topology

Feature key      Position(s)    Length   Description
Transmembrane    189 – 209      21       Helical (Sequence analysis)
Transmembrane    499 – 519      21       Helical (Sequence analysis)
Transmembrane    524 – 544      21       Helical (Sequence analysis)
Transmembrane    548 – 568      21       Helical (Sequence analysis)
Transmembrane    603 – 623      21       Helical (Sequence analysis)
Transmembrane    637 – 657      21       Helical (Sequence analysis)
Transmembrane    717 – 737      21       Helical (Sequence analysis)
Transmembrane    986 – 1006     21       Helical (Sequence analysis)
Transmembrane    1008 – 1028    21       Helical (Sequence analysis)
Transmembrane    1038 – 1058    21       Helical (Sequence analysis)
Transmembrane    1081 – 1101    21       Helical (Sequence analysis)
Transmembrane    1109 – 1129    21       Helical (Sequence analysis)
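
Feature positions in this entry are 1-based and inclusive, so the Length column should equal end - start + 1. The following is a minimal Python sketch of that check for the twelve predicted transmembrane helices. The `Feature` helper is hypothetical and is not part of any UniProt tooling.

```python
from typing import NamedTuple


class Feature(NamedTuple):
    """A sequence feature with 1-based, inclusive coordinates (hypothetical helper)."""
    key: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive coordinates: both the start and end residues are counted.
        return self.end - self.start + 1


# Transmembrane positions copied from the Topology table of this entry.
TM_POSITIONS = [
    (189, 209), (499, 519), (524, 544), (548, 568),
    (603, 623), (637, 657), (717, 737), (986, 1006),
    (1008, 1028), (1038, 1058), (1081, 1101), (1109, 1129),
]

helices = [Feature("Transmembrane", s, e) for s, e in TM_POSITIONS]
lengths = [h.length for h in helices]

# Features are listed N- to C-terminal; check that none of them overlap.
non_overlapping = all(a.end < b.start for a, b in zip(helices, helices[1:]))

print(len(helices), set(lengths), non_overlapping)
```

The same arithmetic applies to every other feature table in the entry, for example the SSD domain at 485 – 657.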

Keywords - Cellular component

Membrane

Pathology & Biotech

Disruption phenotype

Death at or soon after E9.5, probably due to abnormal embryonic turning and looping of the heart. Embryos also display defects in development of the forebrain and branchial arches. (1 publication)

Mutagenesis

Feature key    Position(s)    Length   Description
Mutagenesis    571 – 572      2        DD → AA: Loss of function; when associated with A-1049. (1 publication)
Mutagenesis    571 – 572      2        DD → NN: Loss of function; when associated with N-1049. (1 publication)
Mutagenesis    829 – 829      1        C → F in icb; loss of function. (1 publication)
Mutagenesis    1049 – 1049    1        D → A: Loss of function; when associated with 571-A-A-572. (1 publication)
Mutagenesis    1049 – 1049    1        D → N: Loss of function; when associated with 571-N-N-572. (1 publication)
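
Each mutagenesis row reads as "wild-type residues → mutant residues" at the given 1-based position. The Python sketch below applies such a substitution and first checks that the expected wild-type residues are really present. The `substitute` function and its `offset` parameter are illustrative assumptions. The fragment is residues 561-580 of the canonical sequence shown later in this entry.

```python
def substitute(seq: str, start: int, wild_type: str, mutant: str, offset: int = 1) -> str:
    """Replace `wild_type` by `mutant` at 1-based position `start`.

    `offset` is the sequence position of seq[0], so a fragment of the
    protein can be used instead of the full-length sequence.
    """
    if len(wild_type) != len(mutant):
        raise ValueError("substitution must preserve length")
    i = start - offset
    found = seq[i:i + len(wild_type)]
    if i < 0 or found != wild_type:
        raise ValueError(f"expected {wild_type!r} at {start}, found {found!r}")
    return seq[:i] + mutant + seq[i + len(wild_type):]


# Residues 561-580 of isoform 1, copied from the sequence in this entry.
fragment = "ALIILVGIGADDAFVLCDVW"

# DD -> AA at 571-572, one of the two changes in the loss-of-function combination.
dd_to_aa = substitute(fragment, 571, "DD", "AA", offset=561)
print(dd_to_aa)
```

Checking the wild-type residues before editing catches off-by-one errors, which are easy to make when switching between 1-based annotation coordinates and 0-based string indices.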

PTM / Processing

Molecule processing

Feature key    Position(s)    Length   Description                     Feature identifier
Chain          1 – 1521       1521     Protein dispatched homolog 1    PRO_0000310694

Amino acid modifications

Feature key      Position(s)    Length   Description
Glycosylation    14 – 14        1        N-linked (GlcNAc...) (Sequence analysis)
Glycosylation    58 – 58        1        N-linked (GlcNAc...) (Sequence analysis)
Glycosylation    390 – 390      1        N-linked (GlcNAc...) (Sequence analysis)
Glycosylation    581 – 581      1        N-linked (GlcNAc...) (Sequence analysis)
Glycosylation    1455 – 1455    1        N-linked (GlcNAc...) (Sequence analysis)
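
These sites are marked "Sequence analysis", meaning they are predicted rather than experimentally observed. Potential N-linked sites are commonly screened with the sequon motif N-X-S/T, where X is any residue except proline. Whether the annotation pipeline used exactly this rule is an assumption here. The Python sketch below scans the first 60 residues of isoform 1, copied from this entry, for that motif.

```python
import re

# N, then any residue except P, then S or T. The lookahead keeps the match
# zero-width, so overlapping candidate sites are all reported.
SEQUON = re.compile(r"(?=N[^P][ST])")


def find_sequons(seq: str) -> list[int]:
    """Return 1-based positions of Asn residues that start an N-X-S/T sequon."""
    return [m.start() + 1 for m in SEQUON.finditer(seq)]


# Residues 1-60 of isoform 1, copied from the sequence in this entry.
n_terminus = "MAVISGSDSVLLSNGSISTSTSNPSPLSPSDGDLPAQHLGPRETPRTKASPNGCLQLNGT"

sites = find_sequons(n_terminus)
print(sites)
```

In this fragment the motif recovers the two annotated sites at 14 and 58 and skips Asn-23, which is followed by proline. A motif match is only a necessary condition. Over the full sequence, other matches may go unannotated, for instance where the residue is not predicted to face the lumen or the extracellular side.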

Keywords - PTM

Glycoprotein

Proteomic databases

PaxDb: Q3TDN0.
PRIDE: Q3TDN0.

PTM databases

PhosphoSite: Q3TDN0.

Expression

Developmental stage

Expression overlaps with that of SHH and IHH and is restricted to tissues that require Hh signaling. In contrast, PubMed:12372301 reported a more ubiquitous expression, detected throughout the embryo at E7.5 and maintained during embryonic development. (3 publications)

Gene expression databases

Bgee: Q3TDN0.
CleanEx: MM_DISP1.
ExpressionAtlas: Q3TDN0. baseline and differential.
Genevisible: Q3TDN0. MM.

Interaction

Protein-protein interaction databases

STRING: 10090.ENSMUSP00000003035.

Structure

3D structure databases

ProteinModelPortal: Q3TDN0.

Family & Domains

Domains and Repeats

Feature key    Position(s)    Length   Description
Domain         485 – 657      173      SSD (PROSITE-ProRule annotation)

Sequence similarities

Belongs to the dispatched family. (Curated)
Contains 1 SSD (sterol-sensing) domain. (PROSITE-ProRule annotation)

Keywords - Domain

Transmembrane, Transmembrane helix

Phylogenomic databases

eggNOG: KOG3664. Eukaryota.
ENOG410XT7M. LUCA.
GeneTree: ENSGT00530000063208.
HOVERGEN: HBG101595.
InParanoid: Q3TDN0.
OMA: LLNIFTC.
OrthoDB: EOG7DRJ26.
PhylomeDB: Q3TDN0.
TreeFam: TF324144.

Family and domain databases

InterPro: IPR003392. Patched.
IPR000731. SSD.
Pfam: PF02460. Patched. 2 hits.
PROSITE: PS50156. SSD. 1 hit.

Sequences (2)

Sequence status: Complete.

This entry describes 2 isoforms produced by alternative splicing.

Isoform 1 (identifier: Q3TDN0-1)

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.

        10         20         30         40         50
MAVISGSDSV LLSNGSISTS TSNPSPLSPS DGDLPAQHLG PRETPRTKAS
60 70 80 90 100
PNGCLQLNGT VKSSFLPLDN QRTPQTPTQC CHPCPYHHPV SSHSNHQECH
110 120 130 140 150
PEAGLAASPA LASCRMQPHS EYSASLCPNH SPVYQAAHCL QPSPSFCLHH
160 170 180 190 200
PWPDHFQHQP VRQHLTIIRP SRPFKFPRSY AALLADWPVV VLGMCTLLIV
210 220 230 240 250
VCALVGVLVP ELPDFSDPLL GFEPRGTTIG QRLVTWNNMM RNTGYKATLA
260 270 280 290 300
NYPYKYAEEQ ARSHRDDRWS DDHHERERRE VDWNFQKDSF FCDVPSDGYS
310 320 330 340 350
RVVFASAGGE TLWNLPAIKS MCDVDNSRIR SHPQFSDLCQ RTTAVSCCPS
360 370 380 390 400
WTLGNYIAIL NNRSSCQKIV ERDVSHTLKL LRTCAKHYQN GTLGPDCWDK
410 420 430 440 450
AARRKDQLKC TNVPRKCTKY NAVYQILHYL VDKDFMTPKT ADYAVPALKY
460 470 480 490 500
SMLFSPTEKG ESMMNIYLDN FENWNSSDGI TTVTGIEFGI KHSLFQDYLL
510 520 530 540 550
MDTVYPAIAI AIVLLIMCVY TKSMFITLMT MFAIISSLIV SYFLYRVVFN
560 570 580 590 600
FEFFPFMNLT ALIILVGIGA DDAFVLCDVW NYTKFDKPRA ETSEAVSVTL
610 620 630 640 650
QHAALSMFVT SFTTAAAFYA NYVSNITAIR CFGVYAGTAI LVNYVLMVTW
660 670 680 690 700
LPAVIVLHER YLLNIFTCFR KPQPQAYDKS CWAVLCQKCR RVLFAVSEAS
710 720 730 740 750
RIFFEKVLPC IVIKFRYLWL IWFLALTVGG AYIVCVNPKM KLPSLELSEF
760 770 780 790 800
QVFRSSHPFE RYDAEFKKLF MFERVHHGEE LHMPITVIWG VSPEDSGDPL
810 820 830 840 850
NPKSKGELTL DSTFNIASPA SQAWILHFCQ KLRNQTFFHQ TEQQDFTSCF
860 870 880 890 900
IETFKQWMEN QDCDEPALYP CCSHCSFPYK QEVFELCIKK AIMELDRSTG
910 920 930 940 950
YHLNNKTPGP RFDINDTIRA VVLEFQSTFL FTLAYEKMQQ FYKEVDSWIS
960 970 980 990 1000
HELSSAPEGL SRGWFVSNLE FYDLQDSLSD GTLIAMGLSV AVAFSVMLLT
1010 1020 1030 1040 1050
TWNIIISLYA IVSIAGTIFV TVGSLVLLGW ELNVLESVTI SVAVGLSVDF
1060 1070 1080 1090 1100
AVHYGVAYRL APDPDREGKV IFSLSRMGSA IAMAALTTFV AGAMMMPSTV
1110 1120 1130 1140 1150
LAYTQLGTFM MLVMCVSWAF ATFFFQCLCR CLGPQGTCGQ IPFPTKLQCS
1160 1170 1180 1190 1200
PFSHTLSARP GDRGPSKTHA ASAYSVDARG QKSQLEHEFY ELQPLASHSC
1210 1220 1230 1240 1250
TSSEKTTYEE PHTCSEFFNG QAKNLRMPVP AAYSSELTKS PSSEPGSALL
1260 1270 1280 1290 1300
QSCLEQDTVC HFSLNPRCNC RDAYTHLQYG LPEIHCQQMG DSLCHKCAST
1310 1320 1330 1340 1350
AGGFVQIQSS VAPLKASHQA AEGLLHPAQH MLPPGMQNSR PRNFFLHSVQ
1360 1370 1380 1390 1400
HFQAQENLGR TSTHSTDERL PRTAELSPPP SDSRSTESFQ RACCHPENNQ
1410 1420 1430 1440 1450
RRLCKSRDPG DTEGSGGTKS KVSGLPNQTD KEEKQVEPSL LQTDETVNSE
1460 1470 1480 1490 1500
HLNHNESNFT FSHLPGEAGC RSCPNSPQSC RSIMRSKCGT EDCQTPNLEA
1510 1520
NVPAVPTHSD LSGESLLIKT L
Length: 1,521
Mass (Da): 170,130
Last modified: November 13, 2007 - v2
Checksum: 7B1E9A0F7873BB30
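
The sequence above is displayed in blocks of ten residues with position rulers interleaved between the lines. To use it programmatically, the rulers and spacing need to be stripped first. The following is a minimal Python sketch, shown on the first 100 residues only. The helper name `parse_sequence_block` is an assumption made for illustration.

```python
def parse_sequence_block(text: str) -> str:
    """Recover a plain one-letter sequence from a ruler-annotated display block.

    Keeps alphabetic characters only, which drops ruler digits, spaces and newlines.
    """
    return "".join(ch for ch in text if ch.isalpha()).upper()


# First two display rows of isoform 1, copied from this entry.
block = """\
        10         20         30         40         50
MAVISGSDSV LLSNGSISTS TSNPSPLSPS DGDLPAQHLG PRETPRTKAS
60 70 80 90 100
PNGCLQLNGT VKSSFLPLDN QRTPQTPTQC CHPCPYHHPV SSHSNHQECH
"""

sequence = parse_sequence_block(block)
print(len(sequence), sequence[:10])
```

Applied to the complete block, the result should have the stated length of 1,521 residues, which serves as a quick check that no line was lost in copying.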

Isoform 2 (identifier: Q3TDN0-2)

The sequence of this isoform differs from the canonical sequence as follows:
     329-345: IRSHPQFSDLCQRTTAV → VCKTQLKSIQHNKVMLR
     346-1521: Missing.

Length: 345
Mass (Da): 38,420
Checksum: 9CF7B836F0B954E8
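
The two differences listed for isoform 2 can be applied to the canonical sequence mechanically. The Python sketch below does this with a hypothetical `apply_isoform_edits` helper. To stay short, it uses a stand-in for the canonical sequence: only residues 321-350 are real and copied from this entry, and every other position is padded with the placeholder "X".

```python
def apply_isoform_edits(canonical: str, edits: list[tuple[int, int, str]]) -> str:
    """Apply (start, end, replacement) edits with 1-based, inclusive coordinates.

    An empty replacement means "Missing". Edits are applied from the
    C-terminus backwards so that earlier coordinates stay valid.
    """
    seq = canonical
    for start, end, replacement in sorted(edits, reverse=True):
        seq = seq[:start - 1] + replacement + seq[end:]
    return seq


# Residues 321-350 of isoform 1 (real); all other positions are "X" padding.
window = "MCDVDNSRIRSHPQFSDLCQRTTAVSCCPS"
canonical_like = "X" * 320 + window + "X" * (1521 - 350)

# Sanity check: the segment replaced in isoform 2 sits at 329-345.
assert canonical_like[328:345] == "IRSHPQFSDLCQRTTAV"

isoform2_like = apply_isoform_edits(
    canonical_like,
    [
        (329, 345, "VCKTQLKSIQHNKVMLR"),  # 329-345 replaced
        (346, 1521, ""),                  # 346-1521 missing
    ],
)
print(len(isoform2_like), isoform2_like[-17:])
```

The resulting length of 345 agrees with the length stated for isoform 2. With the real canonical sequence in place of the padded stand-in, the same call would give the full isoform 2 sequence.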

Experimental Info

Feature key          Position(s)    Length   Description
Sequence conflict    292 – 292      1        C → W in BAE23607 (PubMed:16141072). (Curated)
Sequence conflict    688 – 688      1        K → M in AAN52161 (PubMed:12372301). (Curated)
Sequence conflict    949 – 949      1        I → V in BAE23607 (PubMed:16141072). (Curated)
Sequence conflict    1309 – 1309    1        S → N in AAN52161 (PubMed:12372301). (Curated)
Sequence conflict    1309 – 1309    1        S → N in AAN64660 (PubMed:12421714). (Curated)
Sequence conflict    1431 – 1431    1        K → E in AAN52161 (PubMed:12372301). (Curated)
Sequence conflict    1431 – 1431    1        K → E in AAN64660 (PubMed:12421714). (Curated)
Sequence conflict    1431 – 1431    1        K → E in BAE41571 (PubMed:16141072). (Curated)

Alternative sequence

Feature key             Position(s)    Length   Description                                                             Feature identifier
Alternative sequence    329 – 345      17       IRSHP…RTTAV → VCKTQLKSIQHNKVMLR in isoform 2. (1 publication)           VSP_029322
Alternative sequence    346 – 1521     1176     Missing in isoform 2. (1 publication)                                   VSP_029323

Sequence databases

EMBL / GenBank / DDBJ:
AY150698 mRNA. Translation: AAN52161.1.
AY150577 mRNA. Translation: AAN64660.1.
AK004521 mRNA. Translation: BAB23344.1.
AK138276 mRNA. Translation: BAE23607.1.
AK170113 mRNA. Translation: BAE41571.1.
BC043102 mRNA. Translation: AAH43102.1.
BC059225 mRNA. Translation: AAH59225.1.
AY144589 mRNA. Translation: AAN08631.1.
CCDS: CCDS56661.1. [Q3TDN0-1]
RefSeq: NP_001265147.1. NM_001278218.1. [Q3TDN0-1]
NP_001265148.1. NM_001278219.1. [Q3TDN0-1]
NP_001265149.1. NM_001278220.1. [Q3TDN0-1]
NP_081142.3. NM_026866.3. [Q3TDN0-1]
UniGene: Mm.327216.
Mm.358721.

Genome annotation databases

Ensembl: ENSMUST00000003035; ENSMUSP00000003035; ENSMUSG00000030768. [Q3TDN0-1]
ENSMUST00000171366; ENSMUSP00000126742; ENSMUSG00000030768. [Q3TDN0-1]
ENSMUST00000195372; ENSMUSP00000141747; ENSMUSG00000030768. [Q3TDN0-1]
GeneID: 68897.
KEGG: mmu:68897.
UCSC: uc008ick.2. mouse. [Q3TDN0-1]
uc008ico.2. mouse. [Q3TDN0-2]

Keywords - Coding sequence diversity

Alternative splicing

Cross-references

Organism-specific databases

CTD: 84976.
MGI: MGI:1916147. Disp1.

Miscellaneous databases

ChiTaRS: Disp1. mouse.
NextBio: 328139.
PRO: Q3TDN0.

Publications

  1. "Hedgehog-mediated patterning of the mammalian embryo requires transporter-like function of dispatched."
    Ma Y., Erkner A., Gong R., Yao S., Taipale J., Basler K., Beachy P.A.
    Cell 111:63-75(2002)
    Cited for: NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM 1), FUNCTION, DEVELOPMENTAL STAGE, MUTAGENESIS OF 571-ASP-ASP-572 AND ASP-1049.
    Strain: 129.
    Tissue: Testis.
  2. "Mouse dispatched mutants fail to distribute hedgehog proteins and are defective in hedgehog signaling."
    Kawakami T., Kawcak T., Li Y.-J., Zhang W., Hu Y., Chuang P.-T.
    Development 129:5753-5765(2002)
    Cited for: NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM 1), FUNCTION, DEVELOPMENTAL STAGE, DISRUPTION PHENOTYPE.
    Strain: Swiss Webster.
    Tissue: Embryo.
  3. "The transcriptional landscape of the mammalian genome."
    Carninci P., Kasukawa T., Katayama S., Gough J., Frith M.C., Maeda N., Oyama R., Ravasi T., Lenhard B., Wells C., Kodzius R., Shimokawa K., Bajic V.B., Brenner S.E., Batalov S., Forrest A.R., Zavolan M., Davis M.J., Wilming L.G., Aidinis V., Allen J.E., Ambesi-Impiombato A., Apweiler R., Aturaliya R.N., Bailey T.L., Bansal M., Baxter L., Beisel K.W., Bersano T., Bono H., Chalk A.M., Chiu K.P., Choudhary V., Christoffels A., Clutterbuck D.R., Crowe M.L., Dalla E., Dalrymple B.P., de Bono B., Della Gatta G., di Bernardo D., Down T., Engstrom P., Fagiolini M., Faulkner G., Fletcher C.F., Fukushima T., Furuno M., Futaki S., Gariboldi M., Georgii-Hemming P., Gingeras T.R., Gojobori T., Green R.E., Gustincich S., Harbers M., Hayashi Y., Hensch T.K., Hirokawa N., Hill D., Huminiecki L., Iacono M., Ikeo K., Iwama A., Ishikawa T., Jakt M., Kanapin A., Katoh M., Kawasawa Y., Kelso J., Kitamura H., Kitano H., Kollias G., Krishnan S.P., Kruger A., Kummerfeld S.K., Kurochkin I.V., Lareau L.F., Lazarevic D., Lipovich L., Liu J., Liuni S., McWilliam S., Madan Babu M., Madera M., Marchionni L., Matsuda H., Matsuzawa S., Miki H., Mignone F., Miyake S., Morris K., Mottagui-Tabar S., Mulder N., Nakano N., Nakauchi H., Ng P., Nilsson R., Nishiguchi S., Nishikawa S., Nori F., Ohara O., Okazaki Y., Orlando V., Pang K.C., Pavan W.J., Pavesi G., Pesole G., Petrovsky N., Piazza S., Reed J., Reid J.F., Ring B.Z., Ringwald M., Rost B., Ruan Y., Salzberg S.L., Sandelin A., Schneider C., Schoenbach C., Sekiguchi K., Semple C.A., Seno S., Sessa L., Sheng Y., Shibata Y., Shimada H., Shimada K., Silva D., Sinclair B., Sperling S., Stupka E., Sugiura K., Sultana R., Takenaka Y., Taki K., Tammoja K., Tan S.L., Tang S., Taylor M.S., Tegner J., Teichmann S.A., Ueda H.R., van Nimwegen E., Verardo R., Wei C.L., Yagi K., Yamanishi H., Zabarovsky E., Zhu S., Zimmer A., Hide W., Bult C., Grimmond S.M., Teasdale R.D., Liu E.T., Brusic V., Quackenbush J., Wahlestedt C., Mattick J.S., Hume D.A., Kai C., Sasaki D., Tomaru Y., Fukuda S., Kanamori-Katayama M., Suzuki M., Aoki J., Arakawa T., Iida J., Imamura K., Itoh M., Kato T., Kawaji H., Kawagashira N., Kawashima T., Kojima M., Kondo S., Konno H., Nakano K., Ninomiya N., Nishio T., Okada M., Plessy C., Shibata K., Shiraki T., Suzuki S., Tagami M., Waki K., Watahiki A., Okamura-Oho Y., Suzuki H., Kawai J., Hayashizaki Y.
    Science 309:1559-1563(2005)
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORMS 1 AND 2).
    Strain: C57BL/6J and NOD.
    Tissue: Dendritic cell, Embryo and Hypothalamus.
  4. "The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)."
    The MGC Project Team
    Genome Res. 14:2121-2127(2004)
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1).
    Strain: C57BL/6J.
    Tissue: Brain.
  5. "Mouse Dispatched homolog1 is required for long-range, but not juxtacrine, Hh signaling."
    Caspary T., Garcia-Garcia M.J., Huangfu D., Eggenschwiler J.T., Wyler M.R., Rakeman A.S., Alcorn H.L., Anderson K.V.
    Curr. Biol. 12:1628-1632(2002)
    Cited for: NUCLEOTIDE SEQUENCE [MRNA] OF 1-1501 (ISOFORM 1), FUNCTION, MUTAGENESIS OF CYS-829, DEVELOPMENTAL STAGE.
    Strain: C57BL/6J.

Entry information

Entry name: DISP1_MOUSE
Accession: Primary (citable) accession number: Q3TDN0
Secondary accession number(s): Q3UUL8, Q80ZZ8, Q8CGS3, Q8CIP6, Q8CIQ9, Q9CT62
Entry history
Integrated into UniProtKB/Swiss-Prot: November 13, 2007
Last sequence update: November 13, 2007
Last modified: November 11, 2015
This is version 75 of the entry and version 2 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Chordata Protein Annotation Program

Miscellaneous

Keywords - Technical term

Complete proteome, Reference proteome

Documents

  1. MGD cross-references
    Mouse Genome Database (MGD) cross-references in UniProtKB/Swiss-Prot
  2. SIMILARITY comments
    Index of protein domains and families

Similar proteins

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%: UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%: UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a. seed sequence).
50%: UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.