Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein

Protein strawberry notch homolog 2

Gene

SBNO2

Organism
Homo sapiens (Human)
Status
Reviewed-Annotation score: Annotation score: 5 out of 5-Experimental evidence at transcript leveli

Functioni

Acts as a transcriptional coregulator, that can have both coactivator and corepressor functions. Inhibits the DCSTAMP-repressive activity of TAL1, hence enhancing the access of the transcription factor MITF to the DC-STAMP promoter in osteoclast. Plays a role in bone homeostasis; required as a positive regulator in TNFSF11//RANKL-mediated osteoclast fusion via a DCSTAMP-dependent pathway. May also be required in the regulation of osteoblast differentiation (By similarity). Involved in the transcriptional corepression of NF-kappaB in macrophages (PubMed:18025162). Plays a role as a regulator in the proinflammatory cascade (PubMed:18025162).By similarity1 Publication

GO - Biological processi

Complete GO annotation...

Keywords - Molecular functioni

Activator, Repressor

Keywords - Biological processi

Differentiation, Osteogenesis, Transcription, Transcription regulation

Names & Taxonomyi

Protein namesi
Recommended name:
Protein strawberry notch homolog 2
Gene namesi
Name:SBNO2
Synonyms:KIAA0963
OrganismiHomo sapiens (Human)
Taxonomic identifieri9606 [NCBI]
Taxonomic lineageiEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresPrimatesHaplorrhiniCatarrhiniHominidaeHomo
Proteomesi
  • UP000005640 Componenti: Chromosome 19

Organism-specific databases

HGNCiHGNC:29158. SBNO2.

Pathology & Biotechi

Organism-specific databases

PharmGKBiPA162402390.

Polymorphism and mutation databases

BioMutaiSBNO2.
DMDMi166233537.

PTM / Processingi

Molecule processing

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Chaini1 – 13661366Protein strawberry notch homolog 2PRO_0000314560Add
BLAST

Proteomic databases

EPDiQ9Y2G9.
MaxQBiQ9Y2G9.
PaxDbiQ9Y2G9.
PRIDEiQ9Y2G9.

PTM databases

iPTMnetiQ9Y2G9.
PhosphoSiteiQ9Y2G9.

Expressioni

Tissue specificityi

Detected in macrophages. IL10 regulates expression in a STAT3-dependent way.1 Publication

Inductioni

Up-regulated by interleukin IL6 and soluble interleukin receptor IL6R in astrocytes (PubMed:25903009).1 Publication

Gene expression databases

BgeeiQ9Y2G9.
CleanExiHS_SBNO2.
ExpressionAtlasiQ9Y2G9. baseline and differential.
GenevisibleiQ9Y2G9. HS.

Organism-specific databases

HPAiHPA041867.

Interactioni

Subunit structurei

Interacts with TAL1; this interaction inhibits TAL1 occupancy of the DCSTAMP promoter, leading to the activation of the DCSTAMP promoter by the transcription factor MITF.By similarity

Protein-protein interaction databases

IntActiQ9Y2G9. 2 interactions.
STRINGi9606.ENSP00000354733.

Structurei

3D structure databases

ProteinModelPortaliQ9Y2G9.
ModBaseiSearch...
MobiDBiSearch...

Family & Domainsi

Compositional bias

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Compositional biasi183 – 19513Poly-GluAdd
BLAST
Compositional biasi1234 – 129461Pro-richAdd
BLAST

Sequence similaritiesi

Belongs to the SBNO family.Curated

Phylogenomic databases

eggNOGiKOG1513. Eukaryota.
ENOG410XQ7Q. LUCA.
GeneTreeiENSGT00390000016591.
HOGENOMiHOG000043949.
HOVERGENiHBG108461.
InParanoidiQ9Y2G9.
OMAiKIGKHHP.
OrthoDBiEOG70PBWM.
PhylomeDBiQ9Y2G9.
TreeFamiTF313526.

Family and domain databases

Gene3Di3.40.50.300. 1 hit.
InterProiIPR027417. P-loop_NTPase.
IPR030410. SBNO2.
IPR026937. SBNO_Helicase_C_dom.
IPR026741. SNO.
[Graphical view]
PANTHERiPTHR12706. PTHR12706. 1 hit.
PTHR12706:SF5. PTHR12706:SF5. 1 hit.
PfamiPF13871. Helicase_C_4. 1 hit.
[Graphical view]
SUPFAMiSSF52540. SSF52540. 2 hits.

Sequences (2)i

Sequence statusi: Complete.

This entry describes 2 isoformsi produced by alternative splicing. AlignAdd to basket

Isoform 1 (identifier: Q9Y2G9-1) [UniParc]FASTAAdd to basket

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.

« Hide

        10         20         30         40         50
MLAVGPAMDR DYPQHEPPPA GSLLYSPPPL QSAMLHCPYW NTFSLPPYPA
60 70 80 90 100
FSSDSRPFMS SASFLGSQPC PDTSYAPVAT ASSLPPKTCD FAQDSSYFED
110 120 130 140 150
FSNISIFSSS VDSLSDIVDT PDFLPADSLN QVSTIWDDNP APSTHDKLFQ
160 170 180 190 200
LSRPFAGFED FLPSHSTPLL VSYQEQSVQS QPEEEDEAEE EEAEELGHTE
210 220 230 240 250
TYADYVPSKS KIGKQHPDRV VETSTLSSVP PPDITYTLAL PSDSGALSAL
260 270 280 290 300
QLEAITYACQ QHEVLLPSGQ RAGFLIGDGA GVGKGRTVAG VILENHLRGR
310 320 330 340 350
KKALWFSVSN DLKYDAERDL RDIEATGIAV HALSKIKYGD TTTSEGVLFA
360 370 380 390 400
TYSALIGESQ AGGQHRTRLR QILDWCGEAF EGVIVFDECH KAKNAGSTKM
410 420 430 440 450
GKAVLDLQNK LPLARVVYAS ATGASEPRNM IYMSRLGIWG EGTPFRNFEE
460 470 480 490 500
FLHAIEKRGV GAMEIVAMDM KVSGMYIARQ LSFSGVTFRI EEIPLAPAFE
510 520 530 540 550
CVYNRAALLW AEALNVFQQA ADWIGLESRK SLWGQFWSAH QRFFKYLCIA
560 570 580 590 600
AKVRRLVELA REELARDKCV VIGLQSTGEA RTREVLGEND GHLNCFVSAA
610 620 630 640 650
EGVFLSLIQK HFPSTKRKRD RGAGSKRKRR PRGRGAKAPR LACETAGVIR
660 670 680 690 700
ISDDSSTESD PGLDSDFNSS PESLVDDDVV IVDAVGLPSD DRGPLCLLQR
710 720 730 740 750
DPHGPGVLER VERLKQDLLD KVRRLGRELP VNTLDELIDQ LGGPQRVAEM
760 770 780 790 800
TGRKGRVVSR PDGTVAFESR AEQGLSIDHV NLREKQRFMS GEKLVAIISE
810 820 830 840 850
ASSSGVSLQA DRRVQNQRRR VHMTLELPWS ADRAIQQFGR THRSNQVSAP
860 870 880 890 900
EYVFLISELA GERRFASIVA KRLESLGALT HGDRRATESR DLSKYNFENK
910 920 930 940 950
YGTRALHCVL TTILSQTENK VPVPQGYPGG VPTFFRDMKQ GLLSVGIGGR
960 970 980 990 1000
ESRNGCLDVE KDCSITKFLN RILGLEVHKQ NALFQYFSDT FDHLIEMDKR
1010 1020 1030 1040 1050
EGKYDMGILD LAPGIEEIYE ESQQVFLAPG HPQDGQVVFY KISVDRGLKW
1060 1070 1080 1090 1100
EDAFAKSLAL TGPYDGFYLS YKVRGNKPSC LLAEQNRGQF FTVYKPNIGR
1110 1120 1130 1140 1150
QSQLEALDSL RRKFHRVTAE EAKEPWESGY ALSLTHCSHS AWNRHCRLAQ
1160 1170 1180 1190 1200
EGKDCLQGLR LRHHYMLCGA LLRVWGRIAA VMADVSSSSY LQIVRLKTKD
1210 1220 1230 1240 1250
RKKQVGIKIP EGCVRRVLQE LRLMDADVKR RQAPALGCPA PPAPRPLALP
1260 1270 1280 1290 1300
CGPGEVLDLT YSPPAEAFPP PPHFSFPAPL SLDAGPGVVP LGTPDAQADP
1310 1320 1330 1340 1350
AALAHQGCDI NFKEVLEDML RSLHAGPPSE GALGEGAGAG GAAGGGPERQ
1360
SVIQFSPPFP GAQAPL
Length:1,366
Mass (Da):150,275
Last modified:January 15, 2008 - v3
Checksum:i6AE084F984DDEC2E
GO
Isoform 2 (identifier: Q9Y2G9-3) [UniParc]FASTAAdd to basket

The sequence of this isoform differs from the canonical sequence as follows:
     1-93: MLAVGPAMDR...LPPKTCDFAQ → MREPLPGSAS...LWLQFEALNK

Show »
Length:1,309
Mass (Da):144,166
Checksum:iB2614E372C074B98
GO

Sequence cautioni

The sequence AAC28919.1 differs from that shown. Reason: Erroneous gene model prediction. Curated
The sequence BAA76807.2 differs from that shown. Reason: Erroneous initiation. Translation N-terminally shortened.Curated
The sequence BAB84928.1 differs from that shown. Reason: Frameshift at position 1236. Curated

Experimental Info

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Sequence conflicti545 – 5451K → R in BAF85096 (PubMed:14702039).Curated
Sequence conflicti670 – 6701S → F in BAG54153 (PubMed:14702039).Curated
Sequence conflicti694 – 6941P → S in BAA76807 (PubMed:10231032).Curated
Sequence conflicti694 – 6941P → S in BAB84928 (Ref. 4) Curated
Sequence conflicti724 – 7241R → Q in BAB84928 (Ref. 4) Curated
Sequence conflicti1343 – 13431A → G in BAB84928 (Ref. 4) Curated

Alternative sequence

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Alternative sequencei1 – 9393MLAVG…CDFAQ → MREPLPGSASWGTPGPPSAG TMSQLQLWLQFEALNK in isoform 2. CuratedVSP_041216Add
BLAST

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
AB023180 mRNA. Translation: BAA76807.2. Different initiation.
AK125139 mRNA. Translation: BAG54153.1.
AK292407 mRNA. Translation: BAF85096.1.
AC005390 Genomic DNA. Translation: AAC28919.1. Sequence problems.
AK074102 mRNA. Translation: BAB84928.1. Frameshift.
BC106021 mRNA. Translation: AAI06022.1.
CCDSiCCDS45894.1. [Q9Y2G9-1]
CCDS45895.1. [Q9Y2G9-3]
PIRiT02748.
RefSeqiNP_001093592.1. NM_001100122.1. [Q9Y2G9-3]
NP_055778.2. NM_014963.2. [Q9Y2G9-1]
XP_005259576.1. XM_005259519.3. [Q9Y2G9-1]
XP_005259577.1. XM_005259520.2. [Q9Y2G9-1]
UniGeneiHs.408708.

Genome annotation databases

EnsembliENST00000361757; ENSP00000354733; ENSG00000064932. [Q9Y2G9-1]
ENST00000438103; ENSP00000400762; ENSG00000064932. [Q9Y2G9-3]
ENST00000612198; ENSP00000477651; ENSG00000278788. [Q9Y2G9-1]
ENST00000622719; ENSP00000482802; ENSG00000278788. [Q9Y2G9-3]
ENST00000631948; ENSP00000488808; ENSG00000278788. [Q9Y2G9-1]
GeneIDi22904.
KEGGihsa:22904.
UCSCiuc002lrj.5. human. [Q9Y2G9-1]

Keywords - Coding sequence diversityi

Alternative splicing

Cross-referencesi

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
AB023180 mRNA. Translation: BAA76807.2. Different initiation.
AK125139 mRNA. Translation: BAG54153.1.
AK292407 mRNA. Translation: BAF85096.1.
AC005390 Genomic DNA. Translation: AAC28919.1. Sequence problems.
AK074102 mRNA. Translation: BAB84928.1. Frameshift.
BC106021 mRNA. Translation: AAI06022.1.
CCDSiCCDS45894.1. [Q9Y2G9-1]
CCDS45895.1. [Q9Y2G9-3]
PIRiT02748.
RefSeqiNP_001093592.1. NM_001100122.1. [Q9Y2G9-3]
NP_055778.2. NM_014963.2. [Q9Y2G9-1]
XP_005259576.1. XM_005259519.3. [Q9Y2G9-1]
XP_005259577.1. XM_005259520.2. [Q9Y2G9-1]
UniGeneiHs.408708.

3D structure databases

ProteinModelPortaliQ9Y2G9.
ModBaseiSearch...
MobiDBiSearch...

Protein-protein interaction databases

IntActiQ9Y2G9. 2 interactions.
STRINGi9606.ENSP00000354733.

PTM databases

iPTMnetiQ9Y2G9.
PhosphoSiteiQ9Y2G9.

Polymorphism and mutation databases

BioMutaiSBNO2.
DMDMi166233537.

Proteomic databases

EPDiQ9Y2G9.
MaxQBiQ9Y2G9.
PaxDbiQ9Y2G9.
PRIDEiQ9Y2G9.

Protocols and materials databases

Structural Biology KnowledgebaseSearch...

Genome annotation databases

EnsembliENST00000361757; ENSP00000354733; ENSG00000064932. [Q9Y2G9-1]
ENST00000438103; ENSP00000400762; ENSG00000064932. [Q9Y2G9-3]
ENST00000612198; ENSP00000477651; ENSG00000278788. [Q9Y2G9-1]
ENST00000622719; ENSP00000482802; ENSG00000278788. [Q9Y2G9-3]
ENST00000631948; ENSP00000488808; ENSG00000278788. [Q9Y2G9-1]
GeneIDi22904.
KEGGihsa:22904.
UCSCiuc002lrj.5. human. [Q9Y2G9-1]

Organism-specific databases

CTDi22904.
GeneCardsiSBNO2.
H-InvDBHIX0014572.
HGNCiHGNC:29158. SBNO2.
HPAiHPA041867.
MIMi615729. gene.
neXtProtiNX_Q9Y2G9.
PharmGKBiPA162402390.
HUGEiSearch...
GenAtlasiSearch...

Phylogenomic databases

eggNOGiKOG1513. Eukaryota.
ENOG410XQ7Q. LUCA.
GeneTreeiENSGT00390000016591.
HOGENOMiHOG000043949.
HOVERGENiHBG108461.
InParanoidiQ9Y2G9.
OMAiKIGKHHP.
OrthoDBiEOG70PBWM.
PhylomeDBiQ9Y2G9.
TreeFamiTF313526.

Miscellaneous databases

ChiTaRSiSBNO2. human.
GenomeRNAii22904.
PROiQ9Y2G9.
SOURCEiSearch...

Gene expression databases

BgeeiQ9Y2G9.
CleanExiHS_SBNO2.
ExpressionAtlasiQ9Y2G9. baseline and differential.
GenevisibleiQ9Y2G9. HS.

Family and domain databases

Gene3Di3.40.50.300. 1 hit.
InterProiIPR027417. P-loop_NTPase.
IPR030410. SBNO2.
IPR026937. SBNO_Helicase_C_dom.
IPR026741. SNO.
[Graphical view]
PANTHERiPTHR12706. PTHR12706. 1 hit.
PTHR12706:SF5. PTHR12706:SF5. 1 hit.
PfamiPF13871. Helicase_C_4. 1 hit.
[Graphical view]
SUPFAMiSSF52540. SSF52540. 2 hits.
ProtoNetiSearch...

Publicationsi

« Hide 'large scale' publications
  1. "Prediction of the coding sequences of unidentified human genes. XIII. The complete sequences of 100 new cDNA clones from brain which code for large proteins in vitro."
    Nagase T., Ishikawa K., Suyama M., Kikuno R., Hirosawa M., Miyajima N., Tanaka A., Kotani H., Nomura N., Ohara O.
    DNA Res. 6:63-70(1999) [PubMed] [Europe PMC] [Abstract]
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1).
    Tissue: Brain.
  2. "Complete sequencing and characterization of 21,243 full-length human cDNAs."
    Ota T., Suzuki Y., Nishikawa T., Otsuki T., Sugiyama T., Irie R., Wakamatsu A., Hayashi K., Sato H., Nagai K., Kimura K., Makita H., Sekine M., Obayashi M., Nishi T., Shibahara T., Tanaka T., Ishii S.
    , Yamamoto J., Saito K., Kawai Y., Isono Y., Nakamura Y., Nagahari K., Murakami K., Yasuda T., Iwayanagi T., Wagatsuma M., Shiratori A., Sudo H., Hosoiri T., Kaku Y., Kodaira H., Kondo H., Sugawara M., Takahashi M., Kanda K., Yokoi T., Furuya T., Kikkawa E., Omura Y., Abe K., Kamihara K., Katsuta N., Sato K., Tanikawa M., Yamazaki M., Ninomiya K., Ishibashi T., Yamashita H., Murakawa K., Fujimori K., Tanai H., Kimata M., Watanabe M., Hiraoka S., Chiba Y., Ishida S., Ono Y., Takiguchi S., Watanabe S., Yosida M., Hotuta T., Kusano J., Kanehori K., Takahashi-Fujii A., Hara H., Tanase T.-O., Nomura Y., Togiya S., Komai F., Hara R., Takeuchi K., Arita M., Imose N., Musashino K., Yuuki H., Oshima A., Sasaki N., Aotsuka S., Yoshikawa Y., Matsunawa H., Ichihara T., Shiohata N., Sano S., Moriya S., Momiyama H., Satoh N., Takami S., Terashima Y., Suzuki O., Nakagawa S., Senoh A., Mizoguchi H., Goto Y., Shimizu F., Wakebe H., Hishigaki H., Watanabe T., Sugiyama A., Takemoto M., Kawakami B., Yamazaki M., Watanabe K., Kumagai A., Itakura S., Fukuzumi Y., Fujimori Y., Komiyama M., Tashiro H., Tanigami A., Fujiwara T., Ono T., Yamada K., Fujii Y., Ozaki K., Hirao M., Ohmori Y., Kawabata A., Hikiji T., Kobatake N., Inagaki H., Ikema Y., Okamoto S., Okitani R., Kawakami T., Noguchi S., Itoh T., Shigeta K., Senba T., Matsumura K., Nakajima Y., Mizuno T., Morinaga M., Sasaki M., Togashi T., Oyama M., Hata H., Watanabe M., Komatsu T., Mizushima-Sugano J., Satoh T., Shirai Y., Takahashi Y., Nakagawa K., Okumura K., Nagase T., Nomura N., Kikuchi H., Masuho Y., Yamashita R., Nakai K., Yada T., Nakamura Y., Ohara O., Isogai T., Sugano S.
    Nat. Genet. 36:40-45(2004) [PubMed] [Europe PMC] [Abstract]
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1).
    Tissue: Testis and Tongue.
  3. "The DNA sequence and biology of human chromosome 19."
    Grimwood J., Gordon L.A., Olsen A.S., Terry A., Schmutz J., Lamerdin J.E., Hellsten U., Goodstein D., Couronne O., Tran-Gyamfi M., Aerts A., Altherr M., Ashworth L., Bajorek E., Black S., Branscomb E., Caenepeel S., Carrano A.V.
    , Caoile C., Chan Y.M., Christensen M., Cleland C.A., Copeland A., Dalin E., Dehal P., Denys M., Detter J.C., Escobar J., Flowers D., Fotopulos D., Garcia C., Georgescu A.M., Glavina T., Gomez M., Gonzales E., Groza M., Hammon N., Hawkins T., Haydu L., Ho I., Huang W., Israni S., Jett J., Kadner K., Kimball H., Kobayashi A., Larionov V., Leem S.-H., Lopez F., Lou Y., Lowry S., Malfatti S., Martinez D., McCready P.M., Medina C., Morgan J., Nelson K., Nolan M., Ovcharenko I., Pitluck S., Pollard M., Popkie A.P., Predki P., Quan G., Ramirez L., Rash S., Retterer J., Rodriguez A., Rogers S., Salamov A., Salazar A., She X., Smith D., Slezak T., Solovyev V., Thayer N., Tice H., Tsai M., Ustaszewska A., Vo N., Wagner M., Wheeler J., Wu K., Xie G., Yang J., Dubchak I., Furey T.S., DeJong P., Dickson M., Gordon D., Eichler E.E., Pennacchio L.A., Richardson P., Stubbs L., Rokhsar D.S., Myers R.M., Rubin E.M., Lucas S.M.
    Nature 428:529-535(2004) [PubMed] [Europe PMC] [Abstract]
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
  4. "The nucleotide sequence of a long cDNA clone isolated from human spleen."
    Jikuya H., Takano J., Nomura N., Kikuno R., Nagase T., Ohara O.
    Submitted (JAN-2002) to the EMBL/GenBank/DDBJ databases
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 102-1366 (ISOFORM 1).
    Tissue: Spleen.
  5. "The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)."
    The MGC Project Team
    Genome Res. 14:2121-2127(2004) [PubMed] [Europe PMC] [Abstract]
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 898-1366 (ISOFORM 1).
    Tissue: Skin.
  6. "A transcriptional repressor and corepressor induced by the STAT3-regulated anti-inflammatory signaling pathway."
    El Kasmi K.C., Smith A.M., Williams L., Neale G., Panopolous A., Watowich S.S., Hacker H., Foxwell B.M., Murray P.J.
    J. Immunol. 179:7215-7219(2007) [PubMed] [Europe PMC] [Abstract]
    Cited for: FUNCTION, TISSUE SPECIFICITY.
  7. "Strawberry notch homolog 2 is a novel inflammatory response factor predominantly but not exclusively expressed by astrocytes in the central nervous system."
    Grill M., Syme T.E., Nocon A.L., Lu A.Z., Hancock D., Rose-John S., Campbell I.L.
    Glia 63:1738-1752(2015) [PubMed] [Europe PMC] [Abstract]
    Cited for: INDUCTION.

Entry informationi

Entry nameiSBNO2_HUMAN
AccessioniPrimary (citable) accession number: Q9Y2G9
Secondary accession number(s): A8K8P2
, B3KWJ1, O75257, Q3KQX0, Q8TEM0
Entry historyi
Integrated into UniProtKB/Swiss-Prot: January 15, 2008
Last sequence update: January 15, 2008
Last modified: June 8, 2016
This is version 101 of the entry and version 3 of the sequence. [Complete history]
Entry statusiReviewed (UniProtKB/Swiss-Prot)
Annotation programChordata Protein Annotation Program
DisclaimerAny medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care.

Miscellaneousi

Keywords - Technical termi

Complete proteome, Reference proteome

Documents

  1. Human chromosome 19
    Human chromosome 19: entries, gene names and cross-references to MIM
  2. MIM cross-references
    Online Mendelian Inheritance in Man (MIM) cross-references in UniProtKB/Swiss-Prot
  3. SIMILARITY comments
    Index of protein domains and families

Similar proteinsi

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a seed sequence).
50%UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.