Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein

Oxidation resistance protein 1

Gene

OXR1

Organism
Homo sapiens (Human)
Status
Reviewed-Annotation score: Annotation score: 5 out of 5-Experimental evidence at protein leveli

Functioni

May be involved in protection from oxidative damage.2 Publications

GO - Molecular functioni

GO - Biological processi

Complete GO annotation...

Keywords - Biological processi

Stress response

Names & Taxonomyi

Protein namesi
Recommended name:
Oxidation resistance protein 1
Gene namesi
Name:OXR1
ORF Names:Nbla00307
OrganismiHomo sapiens (Human)
Taxonomic identifieri9606 [NCBI]
Taxonomic lineageiEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresPrimatesHaplorrhiniCatarrhiniHominidaeHomo
Proteomesi
  • UP000005640 Componenti: Chromosome 8

Organism-specific databases

HGNCiHGNC:15822. OXR1.

Subcellular locationi

GO - Cellular componenti

Complete GO annotation...

Keywords - Cellular componenti

Mitochondrion

Pathology & Biotechi

Organism-specific databases

PharmGKBiPA32856.

Polymorphism and mutation databases

BioMutaiOXR1.
DMDMi294862456.

PTM / Processingi

Molecule processing

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Chaini1 – 874874Oxidation resistance protein 1PRO_0000231645Add
BLAST

Amino acid modifications

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Modified residuei91 – 911PhosphoserineCombined sources
Modified residuei119 – 1191PhosphothreonineBy similarity
Modified residuei201 – 2011PhosphoserineCombined sources
Modified residuei202 – 2021PhosphoserineCombined sources
Modified residuei204 – 2041PhosphoserineCombined sources
Modified residuei294 – 2941PhosphoserineCombined sources
Modified residuei334 – 3341PhosphoserineBy similarity
Modified residuei336 – 3361PhosphoserineBy similarity
Modified residuei341 – 3411PhosphothreonineBy similarity
Modified residuei346 – 3461PhosphoserineBy similarity
Modified residuei496 – 4961PhosphoserineBy similarity

Keywords - PTMi

Phosphoprotein

Proteomic databases

EPDiQ8N573.
MaxQBiQ8N573.
PaxDbiQ8N573.
PeptideAtlasiQ8N573.
PRIDEiQ8N573.

PTM databases

iPTMnetiQ8N573.
PhosphoSiteiQ8N573.

Miscellaneous databases

PMAP-CutDBQ8N573.

Expressioni

Inductioni

By heat shock and oxidative stress.1 Publication

Gene expression databases

BgeeiENSG00000164830.
CleanExiHS_OXR1.
ExpressionAtlasiQ8N573. baseline and differential.
GenevisibleiQ8N573. HS.

Organism-specific databases

HPAiHPA027375.
HPA027380.
HPA027395.

Interactioni

Binary interactionsi

WithEntry#Exp.IntActNotes
KEAP1Q141453EBI-10265887,EBI-751001

Protein-protein interaction databases

BioGridi120391. 10 interactions.
IntActiQ8N573. 4 interactions.
STRINGi9606.ENSP00000405424.

Structurei

3D structure databases

ProteinModelPortaliQ8N573.
SMRiQ8N573. Positions 60-147, 709-874.
ModBaseiSearch...
MobiDBiSearch...

Family & Domainsi

Domains and Repeats

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Domaini99 – 14244LysMPROSITE-ProRule annotationAdd
BLAST
Domaini208 – 27568GRAMAdd
BLAST
Domaini711 – 847137TLDAdd
BLAST

Region

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Regioni551 – 57828Mediates oxidative antimutator activityAdd
BLAST

Sequence similaritiesi

Belongs to the OXR1 family.Curated
Contains 1 GRAM domain.Curated
Contains 1 LysM domain.PROSITE-ProRule annotation
Contains 1 TLD domain.Curated

Phylogenomic databases

eggNOGiKOG2372. Eukaryota.
ENOG4111QBZ. LUCA.
GeneTreeiENSGT00410000025363.
HOGENOMiHOG000195803.
HOVERGENiHBG102032.
InParanoidiQ8N573.
OMAiGPKEDST.
OrthoDBiEOG091G0L15.
PhylomeDBiQ8N573.
TreeFamiTF313530.

Family and domain databases

Gene3Di3.10.350.10. 1 hit.
InterProiIPR018392. LysM_dom.
IPR006571. TLDc_dom.
[Graphical view]
PfamiPF01476. LysM. 1 hit.
PF07534. TLD. 1 hit.
[Graphical view]
SMARTiSM00257. LysM. 1 hit.
SM00584. TLDc. 1 hit.
[Graphical view]
SUPFAMiSSF54106. SSF54106. 1 hit.
PROSITEiPS51782. LYSM. 1 hit.
[Graphical view]

Sequences (8)i

Sequence statusi: Complete.

This entry describes 8 isoformsi produced by alternative splicing. AlignAdd to basket

Isoform 1 (identifier: Q8N573-1) [UniParc]FASTAAdd to basket

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.

« Hide

        10         20         30         40         50
MTKDKNSPGL KKKSQSVDIN APGFNPLAGA GKQTPQASKP PAPKTPIIEE
60 70 80 90 100
EQNNAANTQK HPSRRSELKR FYTIDTGQKK TLDKKDGRRM SFQKPKGTIE
110 120 130 140 150
YTVESRDSLN SIALKFDTTP NELVQLNKLF SRAVVTGQVL YVPDPEYVSS
160 170 180 190 200
VESSPSLSPV SPLSPTSSEA EFDKTTNPDV HPTEATPSST FTGIRPARVV
210 220 230 240 250
SSTSEEEEAF TEKFLKINCK YITSGKGTVS GVLLVTPNNI MFDPHKNDPL
260 270 280 290 300
VQENGCEEYG IMCPMEEVMS AAMYKEILDS KIKESLPIDI DQLSGRDFCH
310 320 330 340 350
SKKMTGSNTE EIDSRIRDAG NDSASTAPRS TEESLSEDVF TESELSPIRE
360 370 380 390 400
ELVSSDELRQ DKSSGASSES VQTVNQAEVE SLTVKSESTG TPGHLRSDTE
410 420 430 440 450
HSTNEVGTLC HKTDLNNLEM AIKEDQIADN FQGISGPKED STSIKGNSDQ
460 470 480 490 500
DSFLHENSLH QEESQKENMP CGETAEFKQK QSVNKGKQGK EQNQDSQTEA
510 520 530 540 550
EELRKLWKTH TMQQTKQQRE NIQQVSQKEA KHKITSADGH IESSALLKEK
560 570 580 590 600
QRHRLHKFLC LRVGKPMRKT FVSQASATMQ QYAQRDKKHE YWFAVPQERT
610 620 630 640 650
DHLYAFFIQW SPEIYAEDTG EYTREPGFIV VKKIEESETI EDSSNQAAAR
660 670 680 690 700
EWEVVSVAEY HRRIDALNTE ELRTLCRRLQ ITTREDINSK QVATVKADLE
710 720 730 740 750
SESFRPNLSD PSELLLPDQI EKLTKHLPPR TIGYPWTLVY GTGKHGTSLK
760 770 780 790 800
TLYRTMTGLD TPVLMVIKDS DGQVFGALAS EPLKVSDGFY GTGETFVFTF
810 820 830 840 850
CPEFEVFKWT GDNMFFIKGD MDSLAFGGGG GEFALWLDGD LYHGRSHSCK
860 870
TFGNRTLSKK EDFFIQDIEI WAFE
Length:874
Mass (Da):97,970
Last modified:April 20, 2010 - v2
Checksum:i4231D07369CE2D36
GO
Isoform 2 (identifier: Q8N573-2) [UniParc]FASTAAdd to basket

The sequence of this isoform differs from the canonical sequence as follows:
     1-74: MTKDKNSPGL...RSELKRFYTI → MDYLTTFTEK...PPAPSSPEGP
     654-680: Missing.

Note: No experimental confirmation available.
Show »
Length:839
Mass (Da):93,734
Checksum:i930AB9A7CA7648DC
GO
Isoform 3 (identifier: Q8N573-3) [UniParc]FASTAAdd to basket

The sequence of this isoform differs from the canonical sequence as follows:
     1-68: Missing.
     69-74: KRFYTI → MFCQRK
     544-571: SALLKEKQRHRLHKFLCLRVGKPMRKTF → KCYRVNEVSSSNCMVSPSFLTDSKAAVP
     572-874: Missing.

Show »
Length:503
Mass (Da):55,698
Checksum:i13A23028C9B7AF83
GO
Isoform 4 (identifier: Q8N573-4) [UniParc]FASTAAdd to basket

The sequence of this isoform differs from the canonical sequence as follows:
     1-631: Missing.
     632-653: KKIEESETIEDSSNQAAAREWE → MSRLWYGKKGRRHQPINHKYTL

Note: No experimental confirmation available.
Show »
Length:243
Mass (Da):27,691
Checksum:i530B2F0DF5B0DA8A
GO
Isoform 5 (identifier: Q8N573-5) [UniParc]FASTAAdd to basket

The sequence of this isoform differs from the canonical sequence as follows:
     1-9: MTKDKNSPG → MSVSNLSW
     654-680: Missing.

Note: No experimental confirmation available.
Show »
Length:846
Mass (Da):94,692
Checksum:i3CBCE47C558A548B
GO
Isoform 6 (identifier: Q8N573-6) [UniParc]FASTAAdd to basket

The sequence of this isoform differs from the canonical sequence as follows:
     1-511: Missing.
     654-722: Missing.

Note: No experimental confirmation available.
Show »
Length:294
Mass (Da):33,774
Checksum:iB2635225195EBF96
GO
Isoform 7 (identifier: Q8N573-7) [UniParc]FASTAAdd to basket

The sequence of this isoform differs from the canonical sequence as follows:
     1-22: MTKDKNSPGLKKKSQSVDINAP → MSRLWYGKKGRRHQPINHKYTL
     23-680: Missing.

Show »
Length:216
Mass (Da):24,468
Checksum:i42F3BFA4328550AA
GO
Isoform 8 (identifier: Q8N573-8) [UniParc]FASTAAdd to basket

The sequence of this isoform differs from the canonical sequence as follows:
     1-9: MTKDKNSPG → MSVSNLSW

Show »
Length:873
Mass (Da):97,916
Checksum:i72E8B301BC705FDA
GO

Sequence cautioni

The sequence AAG25715 differs from that shown. Reason: Erroneous initiation. Translation N-terminally extended.Curated
The sequence AAH32710 differs from that shown. Reason: Erroneous initiation. Translation N-terminally extended.Curated
The sequence AAY26396 differs from that shown. Reason: Erroneous gene model prediction. Curated
The sequence BAA91456 differs from that shown. Reason: Erroneous initiation. Translation N-terminally extended.Curated
The sequence BAF85588 differs from that shown. Reason: Erroneous initiation. Translation N-terminally extended.Curated
The sequence CAI46186 differs from that shown. Reason: Erroneous initiation. Translation N-terminally extended.Curated

Experimental Info

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Sequence conflicti154 – 1541S → P in BAC04711 (PubMed:14702039).Curated
Sequence conflicti495 – 4951D → V in AAG25715 (PubMed:11114193).Curated
Sequence conflicti681 – 6811I → N in AAG25715 (PubMed:11114193).Curated
Sequence conflicti727 – 7271L → F in AAG25715 (PubMed:11114193).Curated
Sequence conflicti813 – 8131N → D in BAA91456 (PubMed:14702039).Curated

Natural variant

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Natural varianti100 – 1001E → G.1 Publication
Corresponds to variant rs28921397 [ dbSNP | Ensembl ].
VAR_025861
Natural varianti360 – 3601Q → P.1 Publication
Corresponds to variant rs28921419 [ dbSNP | Ensembl ].
VAR_025862
Natural varianti516 – 5161K → R.1 Publication
Corresponds to variant rs28921420 [ dbSNP | Ensembl ].
VAR_025863

Alternative sequence

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Alternative sequencei1 – 631631Missing in isoform 4. 1 PublicationVSP_039004Add
BLAST
Alternative sequencei1 – 511511Missing in isoform 6. 1 PublicationVSP_039005Add
BLAST
Alternative sequencei1 – 7474MTKDK…RFYTI → MDYLTTFTEKSGRLLRGTAN RLLGFGGGGEARQVRFEDYL REPAQGDLGCGSPPHRPPAP SSPEGP in isoform 2. 3 PublicationsVSP_039006Add
BLAST
Alternative sequencei1 – 6868Missing in isoform 3. 1 PublicationVSP_039007Add
BLAST
Alternative sequencei1 – 2222MTKDK…DINAP → MSRLWYGKKGRRHQPINHKY TL in isoform 7. 2 PublicationsVSP_043632Add
BLAST
Alternative sequencei1 – 99MTKDKNSPG → MSVSNLSW in isoform 5 and isoform 8. 2 PublicationsVSP_039008
Alternative sequencei23 – 680658Missing in isoform 7. 2 PublicationsVSP_043633Add
BLAST
Alternative sequencei69 – 746KRFYTI → MFCQRK in isoform 3. 1 PublicationVSP_039009
Alternative sequencei544 – 57128SALLK…MRKTF → KCYRVNEVSSSNCMVSPSFL TDSKAAVP in isoform 3. 1 PublicationVSP_039010Add
BLAST
Alternative sequencei572 – 874303Missing in isoform 3. 1 PublicationVSP_039011Add
BLAST
Alternative sequencei632 – 65322KKIEE…AREWE → MSRLWYGKKGRRHQPINHKY TL in isoform 4. 1 PublicationVSP_039012Add
BLAST
Alternative sequencei654 – 72269Missing in isoform 6. 1 PublicationVSP_039014Add
BLAST
Alternative sequencei654 – 68027Missing in isoform 2 and isoform 5. 3 PublicationsVSP_039013Add
BLAST

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
FN650108 mRNA. Translation: CBI84064.1.
FN650109 mRNA. Translation: CBI84065.1.
AB075503 mRNA. Translation: BAE45753.1.
AK000987 mRNA. Translation: BAA91456.1. Different initiation.
AK096148 mRNA. Translation: BAC04711.1.
AK124441 mRNA. Translation: BAC85852.1.
AK127563 mRNA. Translation: BAG54523.1.
AK292899 mRNA. Translation: BAF85588.1. Different initiation.
AK296561 mRNA. Translation: BAH12388.1.
AK303695 mRNA. Translation: BAH14021.1.
AK316153 mRNA. Translation: BAH14524.1.
DQ020202 Genomic DNA. Translation: AAY26396.1. Sequence problems.
AC023344 Genomic DNA. No translation available.
AC027031 Genomic DNA. No translation available.
AC090579 Genomic DNA. No translation available.
AP000430 Genomic DNA. No translation available.
AP002090 Genomic DNA. No translation available.
BC032710 mRNA. Translation: AAH32710.1. Different initiation.
AF309387 mRNA. Translation: AAG25715.1. Different initiation.
AL833193 mRNA. Translation: CAI46186.1. Different initiation.
CCDSiCCDS47909.1. [Q8N573-5]
CCDS56547.1. [Q8N573-8]
CCDS56548.1. [Q8N573-1]
CCDS56549.1. [Q8N573-4]
CCDS56550.1. [Q8N573-7]
CCDS6304.2. [Q8N573-2]
RefSeqiNP_001185461.1. NM_001198532.1. [Q8N573-1]
NP_001185462.1. NM_001198533.1. [Q8N573-8]
NP_001185463.1. NM_001198534.1. [Q8N573-4]
NP_001185464.1. NM_001198535.1. [Q8N573-7]
NP_060472.2. NM_018002.3. [Q8N573-5]
NP_851999.2. NM_181354.4. [Q8N573-2]
UniGeneiHs.127286.
Hs.701892.

Genome annotation databases

EnsembliENST00000297447; ENSP00000297447; ENSG00000164830. [Q8N573-4]
ENST00000312046; ENSP00000311026; ENSG00000164830. [Q8N573-2]
ENST00000442977; ENSP00000405424; ENSG00000164830. [Q8N573-1]
ENST00000445937; ENSP00000402918; ENSG00000164830. [Q8N573-5]
ENST00000449762; ENSP00000408659; ENSG00000164830. [Q8N573-7]
ENST00000452423; ENSP00000395032; ENSG00000164830. [Q8N573-8]
ENST00000497705; ENSP00000431014; ENSG00000164830. [Q8N573-3]
ENST00000517566; ENSP00000429205; ENSG00000164830. [Q8N573-8]
ENST00000531443; ENSP00000431966; ENSG00000164830. [Q8N573-5]
GeneIDi55074.
KEGGihsa:55074.
UCSCiuc003ymf.4. human. [Q8N573-1]

Keywords - Coding sequence diversityi

Alternative splicing, Polymorphism

Cross-referencesi

Web resourcesi

NIEHS-SNPs

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
FN650108 mRNA. Translation: CBI84064.1.
FN650109 mRNA. Translation: CBI84065.1.
AB075503 mRNA. Translation: BAE45753.1.
AK000987 mRNA. Translation: BAA91456.1. Different initiation.
AK096148 mRNA. Translation: BAC04711.1.
AK124441 mRNA. Translation: BAC85852.1.
AK127563 mRNA. Translation: BAG54523.1.
AK292899 mRNA. Translation: BAF85588.1. Different initiation.
AK296561 mRNA. Translation: BAH12388.1.
AK303695 mRNA. Translation: BAH14021.1.
AK316153 mRNA. Translation: BAH14524.1.
DQ020202 Genomic DNA. Translation: AAY26396.1. Sequence problems.
AC023344 Genomic DNA. No translation available.
AC027031 Genomic DNA. No translation available.
AC090579 Genomic DNA. No translation available.
AP000430 Genomic DNA. No translation available.
AP002090 Genomic DNA. No translation available.
BC032710 mRNA. Translation: AAH32710.1. Different initiation.
AF309387 mRNA. Translation: AAG25715.1. Different initiation.
AL833193 mRNA. Translation: CAI46186.1. Different initiation.
CCDSiCCDS47909.1. [Q8N573-5]
CCDS56547.1. [Q8N573-8]
CCDS56548.1. [Q8N573-1]
CCDS56549.1. [Q8N573-4]
CCDS56550.1. [Q8N573-7]
CCDS6304.2. [Q8N573-2]
RefSeqiNP_001185461.1. NM_001198532.1. [Q8N573-1]
NP_001185462.1. NM_001198533.1. [Q8N573-8]
NP_001185463.1. NM_001198534.1. [Q8N573-4]
NP_001185464.1. NM_001198535.1. [Q8N573-7]
NP_060472.2. NM_018002.3. [Q8N573-5]
NP_851999.2. NM_181354.4. [Q8N573-2]
UniGeneiHs.127286.
Hs.701892.

3D structure databases

ProteinModelPortaliQ8N573.
SMRiQ8N573. Positions 60-147, 709-874.
ModBaseiSearch...
MobiDBiSearch...

Protein-protein interaction databases

BioGridi120391. 10 interactions.
IntActiQ8N573. 4 interactions.
STRINGi9606.ENSP00000405424.

PTM databases

iPTMnetiQ8N573.
PhosphoSiteiQ8N573.

Polymorphism and mutation databases

BioMutaiOXR1.
DMDMi294862456.

Proteomic databases

EPDiQ8N573.
MaxQBiQ8N573.
PaxDbiQ8N573.
PeptideAtlasiQ8N573.
PRIDEiQ8N573.

Protocols and materials databases

DNASUi55074.
Structural Biology KnowledgebaseSearch...

Genome annotation databases

EnsembliENST00000297447; ENSP00000297447; ENSG00000164830. [Q8N573-4]
ENST00000312046; ENSP00000311026; ENSG00000164830. [Q8N573-2]
ENST00000442977; ENSP00000405424; ENSG00000164830. [Q8N573-1]
ENST00000445937; ENSP00000402918; ENSG00000164830. [Q8N573-5]
ENST00000449762; ENSP00000408659; ENSG00000164830. [Q8N573-7]
ENST00000452423; ENSP00000395032; ENSG00000164830. [Q8N573-8]
ENST00000497705; ENSP00000431014; ENSG00000164830. [Q8N573-3]
ENST00000517566; ENSP00000429205; ENSG00000164830. [Q8N573-8]
ENST00000531443; ENSP00000431966; ENSG00000164830. [Q8N573-5]
GeneIDi55074.
KEGGihsa:55074.
UCSCiuc003ymf.4. human. [Q8N573-1]

Organism-specific databases

CTDi55074.
GeneCardsiOXR1.
HGNCiHGNC:15822. OXR1.
HPAiHPA027375.
HPA027380.
HPA027395.
MIMi605609. gene.
neXtProtiNX_Q8N573.
PharmGKBiPA32856.
GenAtlasiSearch...

Phylogenomic databases

eggNOGiKOG2372. Eukaryota.
ENOG4111QBZ. LUCA.
GeneTreeiENSGT00410000025363.
HOGENOMiHOG000195803.
HOVERGENiHBG102032.
InParanoidiQ8N573.
OMAiGPKEDST.
OrthoDBiEOG091G0L15.
PhylomeDBiQ8N573.
TreeFamiTF313530.

Miscellaneous databases

ChiTaRSiOXR1. human.
GeneWikiiOXR1.
GenomeRNAii55074.
PMAP-CutDBQ8N573.
PROiQ8N573.
SOURCEiSearch...

Gene expression databases

BgeeiENSG00000164830.
CleanExiHS_OXR1.
ExpressionAtlasiQ8N573. baseline and differential.
GenevisibleiQ8N573. HS.

Family and domain databases

Gene3Di3.10.350.10. 1 hit.
InterProiIPR018392. LysM_dom.
IPR006571. TLDc_dom.
[Graphical view]
PfamiPF01476. LysM. 1 hit.
PF07534. TLD. 1 hit.
[Graphical view]
SMARTiSM00257. LysM. 1 hit.
SM00584. TLDc. 1 hit.
[Graphical view]
SUPFAMiSSF54106. SSF54106. 1 hit.
PROSITEiPS51782. LYSM. 1 hit.
[Graphical view]
ProtoNetiSearch...

Entry informationi

Entry nameiOXR1_HUMAN
AccessioniPrimary (citable) accession number: Q8N573
Secondary accession number(s): A6NK11
, A8KA34, B3KXL1, B7Z402, B7Z8N5, D3HIS6, Q3LIB5, Q6ZVK9, Q8N8V0, Q9H266, Q9NWC7
Entry historyi
Integrated into UniProtKB/Swiss-Prot: April 4, 2006
Last sequence update: April 20, 2010
Last modified: September 7, 2016
This is version 129 of the entry and version 2 of the sequence. [Complete history]
Entry statusiReviewed (UniProtKB/Swiss-Prot)
Annotation programChordata Protein Annotation Program
DisclaimerAny medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care.

Miscellaneousi

Keywords - Technical termi

Complete proteome, Reference proteome

Documents

  1. Human chromosome 8
    Human chromosome 8: entries, gene names and cross-references to MIM
  2. Human entries with polymorphisms or disease mutations
    List of human entries with polymorphisms or disease mutations
  3. Human polymorphisms and disease mutations
    Index of human polymorphisms and disease mutations
  4. MIM cross-references
    Online Mendelian Inheritance in Man (MIM) cross-references in UniProtKB/Swiss-Prot
  5. SIMILARITY comments
    Index of protein domains and families

Similar proteinsi

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a seed sequence).
50%UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.