Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein

Slit homolog 3 protein

Gene

Slit3

Organism
Mus musculus (Mouse)
Status
Reviewed-Annotation score: Annotation score: 5 out of 5-Experimental evidence at transcript leveli

Functioni

May act as molecular guidance cue in cellular migration, and function may be mediated by interaction with roundabout homolog receptors.

GO - Molecular functioni

  • calcium ion binding Source: InterPro
  • heparin binding Source: GO_Central
  • receptor binding Source: MGI
  • Roundabout binding Source: MGI

GO - Biological processi

  • animal organ morphogenesis Source: MGI
  • apoptotic process involved in luteolysis Source: Ensembl
  • axon extension involved in axon guidance Source: MGI
  • axon guidance Source: MGI
  • cellular response to hormone stimulus Source: Ensembl
  • negative chemotaxis Source: MGI
  • negative regulation of cell growth Source: MGI
  • negative regulation of cell proliferation Source: MGI
  • negative regulation of chemokine-mediated signaling pathway Source: MGI
  • negative regulation of gene expression Source: MGI
  • regulation of axonogenesis Source: GO_Central
  • response to cortisol Source: Ensembl
  • Roundabout signaling pathway Source: MGI
Complete GO annotation...

Keywords - Molecular functioni

Developmental protein

Keywords - Biological processi

Differentiation, Neurogenesis

Names & Taxonomyi

Protein namesi
Recommended name:
Slit homolog 3 protein
Short name:
Slit-3
Short name:
Slit3
Gene namesi
Name:Slit3
OrganismiMus musculus (Mouse)
Taxonomic identifieri10090 [NCBI]
Taxonomic lineageiEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresGliresRodentiaSciurognathiMuroideaMuridaeMurinaeMusMus
Proteomesi
  • UP000000589 Componenti: Chromosome 11

Organism-specific databases

MGIiMGI:1315202. Slit3.

Subcellular locationi

GO - Cellular componenti

  • extracellular space Source: MGI
  • proteinaceous extracellular matrix Source: GO_Central
Complete GO annotation...

Keywords - Cellular componenti

Secreted

PTM / Processingi

Molecule processing

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Signal peptidei1 – 33Sequence analysisAdd BLAST33
ChainiPRO_000000773334 – 1523Slit homolog 3 proteinAdd BLAST1490

Amino acid modifications

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Glycosylationi72N-linked (GlcNAc...)Sequence analysis1
Glycosylationi192N-linked (GlcNAc...)Sequence analysis1
Disulfide bondi284 ↔ 293By similarity
Disulfide bondi441 ↔ 464By similarity
Disulfide bondi443 ↔ 485By similarity
Disulfide bondi505 ↔ 511By similarity
Disulfide bondi509 ↔ 518By similarity
Glycosylationi563N-linked (GlcNAc...)Sequence analysis1
Glycosylationi622N-linked (GlcNAc...)Sequence analysis1
Disulfide bondi667 ↔ 690By similarity
Disulfide bondi669 ↔ 711By similarity
Glycosylationi784N-linked (GlcNAc...)Sequence analysis1
Glycosylationi792N-linked (GlcNAc...)Sequence analysis1
Glycosylationi797N-linked (GlcNAc...)Sequence analysis1
Disulfide bondi920 ↔ 931By similarity
Disulfide bondi925 ↔ 941By similarity
Glycosylationi928N-linked (GlcNAc...)Sequence analysis1
Disulfide bondi943 ↔ 952By similarity
Disulfide bondi959 ↔ 970By similarity
Disulfide bondi964 ↔ 982By similarity
Disulfide bondi984 ↔ 993By similarity
Disulfide bondi1000 ↔ 1011By similarity
Disulfide bondi1005 ↔ 1020By similarity
Disulfide bondi1022 ↔ 1031By similarity
Glycosylationi1025N-linked (GlcNAc...)Sequence analysis1
Disulfide bondi1038 ↔ 1051By similarity
Disulfide bondi1045 ↔ 1060By similarity
Disulfide bondi1062 ↔ 1071By similarity
Disulfide bondi1078 ↔ 1089By similarity
Disulfide bondi1083 ↔ 1098By similarity
Disulfide bondi1100 ↔ 1109By similarity
Disulfide bondi1123 ↔ 1134By similarity
Disulfide bondi1128 ↔ 1143By similarity
Disulfide bondi1145 ↔ 1154By similarity
Glycosylationi1181N-linked (GlcNAc...)Sequence analysis1
Glycosylationi1247N-linked (GlcNAc...)Sequence analysis1
Disulfide bondi1305 ↔ 1332By similarity
Disulfide bondi1355 ↔ 1364By similarity
Disulfide bondi1372 ↔ 1382By similarity
Disulfide bondi1377 ↔ 1391By similarity
Disulfide bondi1393 ↔ 1402By similarity
Glycosylationi1406N-linked (GlcNAc...)Sequence analysis1
Disulfide bondi1412 ↔ 1422By similarity
Disulfide bondi1417 ↔ 1432By similarity
Disulfide bondi1434 ↔ 1443By similarity
Disulfide bondi1449 ↔ 1487By similarity
Disulfide bondi1467 ↔ 1501By similarity
Disulfide bondi1478 ↔ 1517By similarity
Disulfide bondi1482 ↔ 1519By similarity

Keywords - PTMi

Disulfide bond, Glycoprotein

Proteomic databases

MaxQBiQ9WVB4.
PaxDbiQ9WVB4.
PRIDEiQ9WVB4.

PTM databases

PhosphoSitePlusiQ9WVB4.

Expressioni

Developmental stagei

Detected as early as E8.25 in the ventral neural tube. From E9.5 to E11.5 is expressed in the floor plate and motor columns. This pattern of expression continued until at least E17.5 and remained weak. Rostrally, at E8.25 is expressed in the ventral midline of the neural groove and in the neural fold prior to closure of the neural tube. Between E8.5 and E9.5, expression is observed in the ventral side of the mesencephalon and metencephalon and in the commissural plate after closure of the neural tube. After E13.5, the expression of SLIT3 is weak in the developing CNS. At E9.5, SLIT3 is detected in the otic vesicle and in the clefts between the first and the second branchial arches. Between E10.5 and E11.5 is prominently expressed in the otic vesicle but decreased in the branchial clefts. From E13.5 to E17.5, expression is observed in the cochlea, in the pigment layer of the retina, and in the olfactory epithelium. At E13.5 expression is observed in the whisker follicle surrounding the bulb and shaft. In the developing limb is first detected at E10.5 in distal limb bud mesenchyme. At this stage, is also observed in lateral ridge tissue flanking the limb bud. This pattern persisted through E11.5 but unlike with SLIT2, expression is not observed in the inter-limb bud lateral ridge tissue. At E11.5, expression in both the fore- and the hindlimb is most intense in the distal anterior mesenchyme and in the proximal posterior cleft between the limb bud and the lateral ridge. At E13.5 could be detected in the wrist and weakly in palm and proximal part of the digits excluding the tips of the digits.1 Publication

Gene expression databases

BgeeiENSMUSG00000056427.
CleanExiMM_SLIT3.
GenevisibleiQ9WVB4. MM.

Interactioni

GO - Molecular functioni

  • receptor binding Source: MGI
  • Roundabout binding Source: MGI

Protein-protein interaction databases

BioGridi203329. 3 interactors.
STRINGi10090.ENSMUSP00000066857.

Structurei

3D structure databases

ProteinModelPortaliQ9WVB4.
ModBaseiSearch...
MobiDBiSearch...

Family & Domainsi

Domains and Repeats

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Domaini34 – 61LRRNTAdd BLAST28
Repeati62 – 83LRR 1Add BLAST22
Repeati86 – 107LRR 2Add BLAST22
Repeati110 – 131LRR 3Add BLAST22
Repeati134 – 155LRR 4Add BLAST22
Repeati158 – 179LRR 5Add BLAST22
Repeati182 – 203LRR 6Add BLAST22
Domaini215 – 265LRRCT 1Add BLAST51
Domaini271 – 307LRRNT 2Add BLAST37
Repeati308 – 329LRR 7Add BLAST22
Repeati332 – 353LRR 8Add BLAST22
Repeati356 – 377LRR 9Add BLAST22
Repeati380 – 401LRR 10Add BLAST22
Repeati404 – 425LRR 11Add BLAST22
Domaini437 – 487LRRCT 2Add BLAST51
Domaini496 – 532LRRNT 3Add BLAST37
Repeati533 – 554LRR 12Add BLAST22
Repeati558 – 579LRR 13Add BLAST22
Repeati582 – 603LRR 14Add BLAST22
Repeati606 – 627LRR 15Add BLAST22
Repeati630 – 651LRR 16Add BLAST22
Domaini663 – 713LRRCT 3Add BLAST51
Domaini716 – 752LRRNT 4Add BLAST37
Repeati753 – 774LRR 17Add BLAST22
Repeati776 – 797LRR 18Add BLAST22
Repeati800 – 821LRR 19Add BLAST22
Repeati824 – 845LRR 20Add BLAST22
Domaini857 – 907LRRCT 4Add BLAST51
Domaini918 – 953EGF-like 1PROSITE-ProRule annotationAdd BLAST36
Domaini955 – 994EGF-like 2PROSITE-ProRule annotationAdd BLAST40
Domaini996 – 1032EGF-like 3PROSITE-ProRule annotationAdd BLAST37
Domaini1034 – 1072EGF-like 4PROSITE-ProRule annotationAdd BLAST39
Domaini1074 – 1110EGF-like 5PROSITE-ProRule annotationAdd BLAST37
Domaini1119 – 1155EGF-like 6PROSITE-ProRule annotationAdd BLAST37
Domaini1158 – 1332Laminin G-likePROSITE-ProRule annotationAdd BLAST175
Domaini1340 – 1365EGF-like 7PROSITE-ProRule annotationAdd BLAST26
Domaini1368 – 1403EGF-like 8PROSITE-ProRule annotationAdd BLAST36
Domaini1408 – 1444EGF-like 9PROSITE-ProRule annotationAdd BLAST37
Domaini1449 – 1523CTCKPROSITE-ProRule annotationAdd BLAST75

Sequence similaritiesi

Contains 1 CTCK (C-terminal cystine knot-like) domain.PROSITE-ProRule annotation
Contains 9 EGF-like domains.PROSITE-ProRule annotation
Contains 1 laminin G-like domain.PROSITE-ProRule annotation
Contains 20 LRR (leucine-rich) repeats.Curated
Contains 4 LRRCT domains.Curated
Contains 4 LRRNT domains.Curated

Keywords - Domaini

EGF-like domain, Leucine-rich repeat, Repeat, Signal

Phylogenomic databases

eggNOGiKOG4237. Eukaryota.
COG4886. LUCA.
GeneTreeiENSGT00840000129708.
HOGENOMiHOG000116120.
HOVERGENiHBG057959.
InParanoidiQ9WVB4.
KOiK06850.
OMAiEYVCPGP.
OrthoDBiEOG091G0MHP.
TreeFamiTF332887.

Family and domain databases

Gene3Di2.60.120.200. 1 hit.
3.80.10.10. 6 hits.
InterProiIPR013320. ConA-like_dom.
IPR000483. Cys-rich_flank_reg_C.
IPR006207. Cys_knot_C.
IPR001881. EGF-like_Ca-bd_dom.
IPR013032. EGF-like_CS.
IPR000742. EGF-like_dom.
IPR000152. EGF-type_Asp/Asn_hydroxyl_site.
IPR018097. EGF_Ca-bd_CS.
IPR009030. Growth_fac_rcpt_.
IPR032675. L_dom-like.
IPR001791. Laminin_G.
IPR001611. Leu-rich_rpt.
IPR003591. Leu-rich_rpt_typical-subtyp.
IPR000372. LRRNT.
[Graphical view]
PfamiPF00008. EGF. 5 hits.
PF12661. hEGF. 1 hit.
PF02210. Laminin_G_2. 1 hit.
PF13855. LRR_8. 5 hits.
PF01463. LRRCT. 4 hits.
PF01462. LRRNT. 3 hits.
[Graphical view]
SMARTiSM00041. CT. 1 hit.
SM00181. EGF. 9 hits.
SM00179. EGF_CA. 9 hits.
SM00282. LamG. 1 hit.
SM00369. LRR_TYP. 18 hits.
SM00082. LRRCT. 4 hits.
SM00013. LRRNT. 4 hits.
[Graphical view]
SUPFAMiSSF49899. SSF49899. 1 hit.
SSF52058. SSF52058. 4 hits.
SSF57184. SSF57184. 1 hit.
PROSITEiPS01185. CTCK_1. 1 hit.
PS01225. CTCK_2. 1 hit.
PS00022. EGF_1. 9 hits.
PS01186. EGF_2. 7 hits.
PS50026. EGF_3. 9 hits.
PS01187. EGF_CA. 2 hits.
PS50025. LAM_G_DOMAIN. 1 hit.
PS51450. LRR. 20 hits.
[Graphical view]

Sequencei

Sequence statusi: Complete.

Sequence processingi: The displayed sequence is further processed into a mature form.

Q9WVB4-1 [UniParc]FASTAAdd to basket

« Hide

        10         20         30         40         50
MALGRTGAGA AVRARLALGL ALASILSGPP AAACPTKCTC SAASVDCHGL
60 70 80 90 100
GLRAVPRGIP RNAERLDLDR NNITRITKMD FAGLKNLRVL HLEDNQVSII
110 120 130 140 150
ERGAFQDLKQ LERLRLNKNK LQVLPELLFQ STPKLTRLDL SENQIQGIPR
160 170 180 190 200
KAFRGVTGVK NLQLDNNHIS CIEDGAFRAL RDLEILTLNN NNISRILVTS
210 220 230 240 250
FNHMPKIRTL RLHSNHLYCD CHLAWLSDWL RQRRTIGQFT LCMAPVHLRG
260 270 280 290 300
FSVADVQKKE YVCPGPHSEA PACNANSLSC PSACSCSNNI VDCRGKGLTE
310 320 330 340 350
IPANLPEGIV EIRLEQNSIK SIPAGAFTQY KKLKRIDISK NQISDIAPDA
360 370 380 390 400
FQGLKSLTSL VLYGNKITEI PKGLFDGLVS LQLLLLNANK INCLRVNTFQ
410 420 430 440 450
DLQNLNLLSL YDNKLQTISK GLFVPLQSIQ TLHLAQNPFV CDCHLKWLAD
460 470 480 490 500
YLQDNPIETS GARCSSPRRL ANKRISQIKS KKFRCSGSED YRNRFSSECF
510 520 530 540 550
MDLVCPEKCR CEGTIVDCSN QKLARIPSHL PEYTTDLRLN DNDISVLEAT
560 570 580 590 600
GIFKKLPNLR KINLSNNRIK EVREGAFDGA AGVQELMLTG NQLETMHGRM
610 620 630 640 650
FRGLSSLKTL MLRSNLISCV SNDTFAGLSS VRLLSLYDNR ITTITPGAFT
660 670 680 690 700
TLVSLSTINL LSNPFNCNCH MAWLGRWLRK RRIVSGNPRC QKPFFLKEIP
710 720 730 740 750
IQDVAIQDFT CDGNEESSCQ LSPRCPEQCT CVETVVRCSN RGLHALPKGM
760 770 780 790 800
PKDVTELYLE GNHLTAVPKE LSAFRQLTLI DLSNNSISML TNHTFSNMSH
810 820 830 840 850
LSTLILSYNR LRCIPVHAFN GLRSLRVLTL HGNDISSVPE GSFNDLTSLS
860 870 880 890 900
HLALGTNPLH CDCSLRWLSE WVKAGYKEPG IARCSSPESM ADRLLLTTPT
910 920 930 940 950
HRFQCKGPVD INIVAKCNAC LSSPCKNNGT CSQDPVEQYR CTCPYSYKGK
960 970 980 990 1000
DCTVPINTCV QNPCEHGGTC HLSENLRDGF SCSCPLGFEG QRCEINPDDC
1010 1020 1030 1040 1050
EDNDCENSAT CVDGINNYAC LCPPNYTGEL CDEVIDYCVP EMNLCQHEAK
1060 1070 1080 1090 1100
CISLDKGFRC ECVPGYSGKL CETNNDDCVA HKCRHGAQCV DEVNGYTCIC
1110 1120 1130 1140 1150
PQGFSGLFCE HPPPMVLLQT SPCDQYECQN GAQCIVVQQE PTCRCPPGFA
1160 1170 1180 1190 1200
GPRCEKLITV NFVGKDSYVE LASAKVRPQA NISLQVATDK DNGILLYKGD
1210 1220 1230 1240 1250
NDPLALELYQ GHVRLVYDSL SSPPTTVYSV ETVNDGQFHS VELVMLNQTL
1260 1270 1280 1290 1300
NLVVDKGAPK SLGKLQKQPA VGSNSPLYLG GIPTSTGLSA LRQGADRPLG
1310 1320 1330 1340 1350
GFHGCIHEVR INNELQDFKA LPPQSLGVSP GCKSCTVCRH GLCRSVEKDS
1360 1370 1380 1390 1400
VVCECHPGWT GPLCDQEARD PCLGHSCRHG TCMATGDSYV CKCAEGYGGA
1410 1420 1430 1440 1450
LCDQKNDSAS ACSAFKCHHG QCHISDRGEP YCLCQPGFSG HHCEQENPCM
1460 1470 1480 1490 1500
GEIVREAIRR QKDYASCATA SKVPIMECRG GCGSQCCQPI RSKRRKYVFQ
1510 1520
CTDGSSFVEE VERHLECGCR ACS
Length:1,523
Mass (Da):167,727
Last modified:July 27, 2011 - v2
Checksum:i95C4E98E58B9C8EA
GO

Experimental Info

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Sequence conflicti582G → S in AAD44760 (PubMed:10433822).Curated1
Sequence conflicti606S → G in AAD44760 (PubMed:10433822).Curated1
Sequence conflicti729C → F in AAD44760 (PubMed:10433822).Curated1
Sequence conflicti999D → G in AAD44760 (PubMed:10433822).Curated1
Sequence conflicti1242E → K in AAD44760 (PubMed:10433822).Curated1

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
AF144629 mRNA. Translation: AAD44760.1.
AL669839
, AL669856, AL731688, AL732401, AL732630 Genomic DNA. Translation: CAI24241.1.
AL731688
, AL669839, AL669856, AL732401, AL732630 Genomic DNA. Translation: CAI24974.1.
AL732630
, AL669839, AL669856, AL731688, AL732401 Genomic DNA. Translation: CAI25087.1.
AL732401
, AL669839, AL669856, AL731688, AL732630 Genomic DNA. Translation: CAI25608.1.
AL669856
, AL669839, AL731688, AL732401, AL732630 Genomic DNA. Translation: CAI25804.1.
CCDSiCCDS24542.1.
RefSeqiNP_035542.2. NM_011412.3.
UniGeneiMm.478604.

Genome annotation databases

EnsembliENSMUST00000069837; ENSMUSP00000066857; ENSMUSG00000056427.
GeneIDi20564.
KEGGimmu:20564.
UCSCiuc007ild.2. mouse.

Cross-referencesi

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
AF144629 mRNA. Translation: AAD44760.1.
AL669839
, AL669856, AL731688, AL732401, AL732630 Genomic DNA. Translation: CAI24241.1.
AL731688
, AL669839, AL669856, AL732401, AL732630 Genomic DNA. Translation: CAI24974.1.
AL732630
, AL669839, AL669856, AL731688, AL732401 Genomic DNA. Translation: CAI25087.1.
AL732401
, AL669839, AL669856, AL731688, AL732630 Genomic DNA. Translation: CAI25608.1.
AL669856
, AL669839, AL731688, AL732401, AL732630 Genomic DNA. Translation: CAI25804.1.
CCDSiCCDS24542.1.
RefSeqiNP_035542.2. NM_011412.3.
UniGeneiMm.478604.

3D structure databases

ProteinModelPortaliQ9WVB4.
ModBaseiSearch...
MobiDBiSearch...

Protein-protein interaction databases

BioGridi203329. 3 interactors.
STRINGi10090.ENSMUSP00000066857.

PTM databases

PhosphoSitePlusiQ9WVB4.

Proteomic databases

MaxQBiQ9WVB4.
PaxDbiQ9WVB4.
PRIDEiQ9WVB4.

Protocols and materials databases

Structural Biology KnowledgebaseSearch...

Genome annotation databases

EnsembliENSMUST00000069837; ENSMUSP00000066857; ENSMUSG00000056427.
GeneIDi20564.
KEGGimmu:20564.
UCSCiuc007ild.2. mouse.

Organism-specific databases

CTDi6586.
MGIiMGI:1315202. Slit3.

Phylogenomic databases

eggNOGiKOG4237. Eukaryota.
COG4886. LUCA.
GeneTreeiENSGT00840000129708.
HOGENOMiHOG000116120.
HOVERGENiHBG057959.
InParanoidiQ9WVB4.
KOiK06850.
OMAiEYVCPGP.
OrthoDBiEOG091G0MHP.
TreeFamiTF332887.

Miscellaneous databases

ChiTaRSiSlit3. mouse.
PROiQ9WVB4.
SOURCEiSearch...

Gene expression databases

BgeeiENSMUSG00000056427.
CleanExiMM_SLIT3.
GenevisibleiQ9WVB4. MM.

Family and domain databases

Gene3Di2.60.120.200. 1 hit.
3.80.10.10. 6 hits.
InterProiIPR013320. ConA-like_dom.
IPR000483. Cys-rich_flank_reg_C.
IPR006207. Cys_knot_C.
IPR001881. EGF-like_Ca-bd_dom.
IPR013032. EGF-like_CS.
IPR000742. EGF-like_dom.
IPR000152. EGF-type_Asp/Asn_hydroxyl_site.
IPR018097. EGF_Ca-bd_CS.
IPR009030. Growth_fac_rcpt_.
IPR032675. L_dom-like.
IPR001791. Laminin_G.
IPR001611. Leu-rich_rpt.
IPR003591. Leu-rich_rpt_typical-subtyp.
IPR000372. LRRNT.
[Graphical view]
PfamiPF00008. EGF. 5 hits.
PF12661. hEGF. 1 hit.
PF02210. Laminin_G_2. 1 hit.
PF13855. LRR_8. 5 hits.
PF01463. LRRCT. 4 hits.
PF01462. LRRNT. 3 hits.
[Graphical view]
SMARTiSM00041. CT. 1 hit.
SM00181. EGF. 9 hits.
SM00179. EGF_CA. 9 hits.
SM00282. LamG. 1 hit.
SM00369. LRR_TYP. 18 hits.
SM00082. LRRCT. 4 hits.
SM00013. LRRNT. 4 hits.
[Graphical view]
SUPFAMiSSF49899. SSF49899. 1 hit.
SSF52058. SSF52058. 4 hits.
SSF57184. SSF57184. 1 hit.
PROSITEiPS01185. CTCK_1. 1 hit.
PS01225. CTCK_2. 1 hit.
PS00022. EGF_1. 9 hits.
PS01186. EGF_2. 7 hits.
PS50026. EGF_3. 9 hits.
PS01187. EGF_CA. 2 hits.
PS50025. LAM_G_DOMAIN. 1 hit.
PS51450. LRR. 20 hits.
[Graphical view]
ProtoNetiSearch...

Entry informationi

Entry nameiSLIT3_MOUSE
AccessioniPrimary (citable) accession number: Q9WVB4
Secondary accession number(s): B1ATW4
Entry historyi
Integrated into UniProtKB/Swiss-Prot: March 15, 2004
Last sequence update: July 27, 2011
Last modified: November 30, 2016
This is version 139 of the entry and version 2 of the sequence. [Complete history]
Entry statusiReviewed (UniProtKB/Swiss-Prot)
Annotation programChordata Protein Annotation Program

Miscellaneousi

Keywords - Technical termi

Complete proteome, Reference proteome

Documents

  1. MGD cross-references
    Mouse Genome Database (MGD) cross-references in UniProtKB/Swiss-Prot
  2. SIMILARITY comments
    Index of protein domains and families

Similar proteinsi

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a seed sequence).
50%UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.