Protein

Poly [ADP-ribose] polymerase 4

Gene

PARP4

Organism
Homo sapiens (Human)
Status
Reviewed; annotation score: 5 out of 5; experimental evidence at protein level

Function

Catalytic activity

NAD+ + (ADP-D-ribosyl)(n)-acceptor = nicotinamide + (ADP-D-ribosyl)(n+1)-acceptor. (PROSITE-ProRule annotation)

GO - Molecular function

  • DNA binding Source: ProtInc
  • enzyme binding Source: MGI
  • NAD+ ADP-ribosyltransferase activity Source: UniProtKB

GO - Biological process

  • cell death Source: UniProtKB
  • cellular protein modification process Source: UniProtKB
  • cellular response to DNA damage stimulus Source: UniProtKB
  • DNA repair Source: UniProtKB
  • inflammatory response Source: UniProtKB
  • protein ADP-ribosylation Source: UniProtKB
  • response to drug Source: UniProtKB
  • transport Source: UniProtKB

Keywords - Molecular function

Glycosyltransferase, Ribonucleoprotein, Transferase

Keywords - Ligand

NAD

Enzyme and pathway databases

BioCyc: ZFISH:HS02404-MONOMER.
BRENDA: 2.4.2.30. 2681.
SignaLink: Q9UKK3.

Names & Taxonomy

Protein names
Recommended name:
Poly [ADP-ribose] polymerase 4 (EC:2.4.2.30)
Short name:
PARP-4
Alternative name(s):
193 kDa vault protein
ADP-ribosyltransferase diphtheria toxin-like 4
Short name:
ARTD4
PARP-related/IalphaI-related H5/proline-rich
Short name:
PH5P
Vault poly(ADP-ribose) polymerase
Short name:
VPARP
Gene names
Name: PARP4
Synonyms: ADPRTL1, KIAA0177, PARPL
Organism: Homo sapiens (Human)
Taxonomic identifier: 9606 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Primates > Haplorrhini > Catarrhini > Hominidae > Homo
Proteomes
  • UP000005640 Component: Chromosome 13

Organism-specific databases

HGNC: HGNC:271. PARP4.

Subcellular location

GO - Cellular component

  • cytoplasm Source: UniProtKB
  • extracellular exosome Source: UniProtKB
  • intracellular ribonucleoprotein complex Source: UniProtKB
  • membrane Source: UniProtKB
  • nucleus Source: UniProtKB
  • spindle microtubule Source: UniProtKB

Keywords - Cellular component

Cytoplasm, Cytoskeleton, Nucleus

Pathology & Biotech

Organism-specific databases

DisGeNET: 143.
OpenTargets: ENSG00000102699.
PharmGKB: PA24591.

Chemistry databases

ChEMBL: CHEMBL6142.

Polymorphism and mutation databases

BioMuta: PARP4.
DMDM: 308153574.

PTM / Processing

Molecule processing

Feature key  Identifier      Position(s)  Description                     Length
Chain        PRO_0000211330  1 – 1724     Poly [ADP-ribose] polymerase 4  1724

Amino acid modifications

Feature key       Position  Description                  Evidence          Length
Modified residue  101       Phosphothreonine             Combined sources  1
Modified residue  333       Phosphothreonine             Combined sources  1
Modified residue  1236      Phosphoserine                Combined sources  1
Modified residue  1335      Phosphoserine                Combined sources  1
Modified residue  1476      Asymmetric dimethylarginine  Combined sources  1
Modified residue  1504      Phosphoserine                Combined sources  1

Keywords - PTM

Methylation, Phosphoprotein

Proteomic databases

EPD: Q9UKK3.
MaxQB: Q9UKK3.
PaxDb: Q9UKK3.
PeptideAtlas: Q9UKK3.
PRIDE: Q9UKK3.

PTM databases

iPTMnet: Q9UKK3.
PhosphoSitePlus: Q9UKK3.

Expression

Tissue specificity

Widely expressed; the highest levels are in the kidney; also detected in heart, placenta, lung, liver, skeletal muscle, spleen, leukocytes and pancreas.

Gene expression databases

Bgee: ENSG00000102699.
CleanEx: HS_PARP4.
Genevisible: Q9UKK3. HS.

Organism-specific databases

HPA: HPA011739.

Interaction

Subunit structure

Component of the vault ribonucleoprotein particle, at least composed of MVP, PARP4 and one or more vault RNAs (vRNAs). Binds to MVP. Associates with TEP1.

Binary interactions

With  Entry   #Exp.  IntAct                   Notes
ORF   Q9Q2G4  3      EBI-2623021,EBI-6248094  From a different organism.

GO - Molecular function

  • enzyme binding Source: MGI

Protein-protein interaction databases

BioGrid: 106653. 10 interactors.
IntAct: Q9UKK3. 18 interactors.
MINT: MINT-6768840.
STRING: 9606.ENSP00000371419.

Chemistry databases

BindingDB: Q9UKK3.

Structure

3D structure databases

ProteinModelPortal: Q9UKK3.
ModBase: Search...
MobiDB: Search...

Family & Domains

Domains and Repeats

Feature key  Position(s)  Description         Evidence                    Length
Domain       1 – 94       BRCT                PROSITE-ProRule annotation  94
Domain       242 – 370    PARP alpha-helical  PROSITE-ProRule annotation  129
Domain       369 – 573    PARP catalytic      PROSITE-ProRule annotation  205
Domain       607 – 735    VIT                 PROSITE-ProRule annotation  129
Domain       876 – 1046   VWFA                PROSITE-ProRule annotation  171
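
As an illustrative sketch (not part of the UniProt entry), the domain coordinates listed above can be held in a small Python structure to recompute each length as end - start + 1 and to look up which domain(s) contain a given residue. The names `DOMAINS`, `domain_length` and `domains_at` are hypothetical helpers introduced here; the coordinates are copied from the table and treated as 1-based and inclusive.

```python
# Domain coordinates copied from the Domains and Repeats table
# (1-based, inclusive). The helper names are illustrative assumptions.
DOMAINS = [
    ("BRCT", 1, 94),
    ("PARP alpha-helical", 242, 370),
    ("PARP catalytic", 369, 573),
    ("VIT", 607, 735),
    ("VWFA", 876, 1046),
]


def domain_length(start, end):
    """Length of an inclusive 1-based interval."""
    return end - start + 1


def domains_at(position):
    """Names of all annotated domains that contain the given residue."""
    return [name for name, start, end in DOMAINS if start <= position <= end]


# Recompute the Length column from the coordinates.
lengths = {name: domain_length(start, end) for name, start, end in DOMAINS}

if __name__ == "__main__":
    for name, length in lengths.items():
        print(f"{name}: {length} residues")
    print(domains_at(369))
```

Under these assumptions, `domains_at(369)` reports both PARP domains because the annotated ranges overlap at residues 369 and 370, while the phosphothreonine at position 101 falls outside every annotated domain.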

Region

Feature key  Position(s)  Description                               Length
Region       1562 – 1724  Interaction with the major vault protein  163

Motif

Feature key  Position(s)  Description                  Evidence           Length
Motif        19 – 25      Nuclear localization signal  Sequence analysis  7
Motif        1237 – 1249  Nuclear localization signal  Sequence analysis  13

Sequence similarities

Contains 1 BRCT domain. (PROSITE-ProRule annotation)
Contains 1 PARP alpha-helical domain. (PROSITE-ProRule annotation)
Contains 1 PARP catalytic domain. (PROSITE-ProRule annotation)
Contains 1 VIT domain. (PROSITE-ProRule annotation)
Contains 1 VWFA domain. (PROSITE-ProRule annotation)

Phylogenomic databases

eggNOG: KOG1037. Eukaryota.
ENOG410XP18. LUCA.
GeneTree: ENSGT00530000063006.
HOGENOM: HOG000139369.
HOVERGEN: HBG053515.
InParanoid: Q9UKK3.
KO: K10798.
OMA: GKCMDLH.
OrthoDB: EOG091G03L8.
PhylomeDB: Q9UKK3.
TreeFam: TF329720.

Family and domain databases

CDD: cd00027. BRCT. 1 hit.
Gene3D: 1.20.142.10. 1 hit.
3.40.50.10190. 1 hit.
3.40.50.410. 1 hit.
3.90.228.10. 1 hit.
InterPro: IPR001357. BRCT_dom.
IPR031273. PARP4.
IPR012317. Poly(ADP-ribose)pol_cat_dom.
IPR004102. Poly(ADP-ribose)pol_reg_dom.
IPR013694. VIT.
IPR002035. VWF_A.
PANTHER: PTHR10338:SF112. PTHR10338:SF112. 2 hits.
Pfam: PF00533. BRCT. 1 hit.
PF00644. PARP. 1 hit.
PF08487. VIT. 1 hit.
PF00092. VWA. 1 hit.
SMART: SM00292. BRCT. 1 hit.
SM00609. VIT. 1 hit.
SM00327. VWA. 1 hit.
SUPFAM: SSF47587. SSF47587. 1 hit.
SSF52113. SSF52113. 1 hit.
SSF53300. SSF53300. 1 hit.
PROSITE: PS50172. BRCT. 1 hit.
PS51060. PARP_ALPHA_HD. 1 hit.
PS51059. PARP_CATALYTIC. 1 hit.
PS51468. VIT. 1 hit.
PS50234. VWFA. 1 hit.

Sequence

Sequence status: Complete.

Q9UKK3-1 [UniParc]

        10         20         30         40         50
MVMGIFANCI FCLKVKYLPQ QQKKKLQTDI KENGGKFSFS LNPQCTHIIL
60 70 80 90 100
DNADVLSQYQ LNSIQKNHVH IANPDFIWKS IREKRLLDVK NYDPYKPLDI
110 120 130 140 150
TPPPDQKASS SEVKTEGLCP DSATEEEDTV ELTEFGMQNV EIPHLPQDFE
160 170 180 190 200
VAKYNTLEKV GMEGGQEAVV VELQCSRDSR DCPFLISSHF LLDDGMETRR
210 220 230 240 250
QFAIKKTSED ASEYFENYIE ELKKQGFLLR EHFTPEATQL ASEQLQALLL
260 270 280 290 300
EEVMNSSTLS QEVSDLVEMI WAEALGHLEH MLLKPVNRIS LNDVSKAEGI
310 320 330 340 350
LLLVKAALKN GETAEQLQKM MTEFYRLIPH KGTMPKEVNL GLLAKKADLC
360 370 380 390 400
QLIRDMVNVC ETNLSKPNPP SLAKYRALRC KIEHVEQNTE EFLRVRKEVL
410 420 430 440 450
QNHHSKSPVD VLQIFRVGRV NETTEFLSKL GNVRPLLHGS PVQNIVGILC
460 470 480 490 500
RGLLLPKVVE DRGVQRTDVG NLGSGIYFSD SLSTSIKYSH PGETDGTRLL
510 520 530 540 550
LICDVALGKC MDLHEKDFSL TEAPPGYDSV HGVSQTASVT TDFEDDEFVV
560 570 580 590 600
YKTNQVKMKY IIKFSMPGDQ IKDFHPSDHT ELEEYRPEFS NFSKVEDYQL
610 620 630 640 650
PDAKTSSSTK AGLQDASGNL VPLEDVHIKG RIIDTVAQVI VFQTYTNKSH
660 670 680 690 700
VPIEAKYIFP LDDKAAVCGF EAFINGKHIV GEIKEKEEAQ QEYLEAVTQG
710 720 730 740 750
HGAYLMSQDA PDVFTVSVGN LPPKAKVLIK ITYITELSIL GTVGVFFMPA
760 770 780 790 800
TVAPWQQDKA LNENLQDTVE KICIKEIGTK QSFSLTMSIE MPYVIEFIFS
810 820 830 840 850
DTHELKQKRT DCKAVISTME GSSLDSSGFS LHIGLSAAYL PRMWVEKHPE
860 870 880 890 900
KESEACMLVF QPDLDVDLPD LASESEVIIC LDCSSSMEGV TFLQAKQIAL
910 920 930 940 950
HALSLVGEKQ KVNIIQFGTG YKELFSYPKH ITSNTMAAEF IMSATPTMGN
960 970 980 990 1000
TDFWKTLRYL SLLYPARGSR NILLVSDGHL QDESLTLQLV KRSRPHTRLF
1010 1020 1030 1040 1050
ACGIGSTANR HVLRILSQCG AGVFEYFNAK SKHSWRKQIE DQMTRLCSPS
1060 1070 1080 1090 1100
CHSVSVKWQQ LNPDVPEALQ APAQVPSLFL NDRLLVYGFI PHCTQATLCA
1110 1120 1130 1140 1150
LIQEKEFRTM VSTTELQKTT GTMIHKLAAR ALIRDYEDGI LHENETSHEM
1160 1170 1180 1190 1200
KKQTLKSLII KLSKENSLIT QFTSFVAVEK RDENESPFPD IPKVSELIAK
1210 1220 1230 1240 1250
EDVDFLPYMS WQGEPQEAVR NQSLLASSEW PELRLSKRKH RKIPFSKRKM
1260 1270 1280 1290 1300
ELSQPEVSED FEEDGLGVLP AFTSNLERGG VEKLLDLSWT ESCKPTATEP
1310 1320 1330 1340 1350
LFKKVSPWET STSSFFPILA PAVGSYLPPT ARAHSPASLS FASYRQVASF
1360 1370 1380 1390 1400
GSAAPPRQFD ASQFSQGPVP GTCADWIPQS ASCPTGPPQN PPSSPYCGIV
1410 1420 1430 1440 1450
FSGSSLSSAQ SAPLQHPGGF TTRPSAGTFP ELDSPQLHFS LPTDPDPIRG
1460 1470 1480 1490 1500
FGSYHPSASS PFHFQPSAAS LTANLRLPMA SALPEALCSQ SRTTPVDLCL
1510 1520 1530 1540 1550
LEESVGSLEG SRCPVFAFQS SDTESDELSE VLQDSCFLQI KCDTKDDSIL
1560 1570 1580 1590 1600
CFLEVKEEDE IVCIQHWQDA VPWTELLSLQ TEDGFWKLTP ELGLILNLNT
1610 1620 1630 1640 1650
NGLHSFLKQK GIQSLGVKGR ECLLDLIATM LVLQFIRTRL EKEGIVFKSL
1660 1670 1680 1690 1700
MKMDDASISR NIPWAFEAIK QASEWVRRTE GQYPSICPRL ELGNDWDSAT
1710 1720
KQLLGLQPIS TVSPLHRVLH YSQG
Length: 1,724
Mass (Da): 192,595
Last modified: October 5, 2010 - v3
Checksum: DCA1DD4C001EA22F

Sequence caution

The sequence BAA11494 differs from that shown. Reason: Erroneous initiation. Translation N-terminally shortened. (Curated)

Experimental Info

Feature key        Position  Description                              Evidence  Length
Sequence conflict  519       S → P in AAD47250 (PubMed:10477748).     Curated   1
Sequence conflict  897       Q → E in AAC62491 (PubMed:10644454).     Curated   1
Sequence conflict  897       Q → E in BAA11494 (PubMed:8724849).      Curated   1
Sequence conflict  936       M → A in AAC62491 (PubMed:10644454).     Curated   1
Sequence conflict  936       M → A in BAA11494 (PubMed:8724849).      Curated   1
Sequence conflict  936       M → T in AAD47250 (PubMed:10477748).     Curated   1
Sequence conflict  1065      V → A in AAD47250 (PubMed:10477748).     Curated   1
Sequence conflict  1065      V → A in AAC62491 (PubMed:10644454).     Curated   1
Sequence conflict  1080      L → R in AAD47250 (PubMed:10477748).     Curated   1
Sequence conflict  1080      L → R in AAC62491 (PubMed:10644454).     Curated   1
Sequence conflict  1080      L → R in BAA11494 (PubMed:8724849).      Curated   1
Sequence conflict  1108      R → C in AAD47250 (PubMed:10477748).     Curated   1
Sequence conflict  1108      R → C in AAC62491 (PubMed:10644454).     Curated   1
Sequence conflict  1108      R → C in BAA11494 (PubMed:8724849).      Curated   1
Sequence conflict  1328      P → T in AAD47250 (PubMed:10477748).     Curated   1
Sequence conflict  1328      P → T in AAC62491 (PubMed:10644454).     Curated   1
Sequence conflict  1328      P → T in BAA11494 (PubMed:8724849).      Curated   1
Sequence conflict  1331      A → T in AAD47250 (PubMed:10477748).     Curated   1
Sequence conflict  1331      A → T in AAC62491 (PubMed:10644454).     Curated   1
Sequence conflict  1331      A → T in BAA11494 (PubMed:8724849).      Curated   1
Sequence conflict  1394      S → A in AAD47250 (PubMed:10477748).     Curated   1
Sequence conflict  1394      S → A in AAC62491 (PubMed:10644454).     Curated   1
Sequence conflict  1394      S → A in BAA11494 (PubMed:8724849).      Curated   1
Sequence conflict  1459      S → Y in AAD47250 (PubMed:10477748).     Curated   1
Sequence conflict  1459      S → Y in AAC62491 (PubMed:10644454).     Curated   1
Sequence conflict  1459      S → Y in BAA11494 (PubMed:8724849).      Curated   1
Sequence conflict  1550      L → P in AAD47250 (PubMed:10477748).     Curated   1
Sequence conflict  1550      L → P in AAC62491 (PubMed:10644454).     Curated   1
Sequence conflict  1550      L → P in BAA11494 (PubMed:8724849).      Curated   1
Sequence conflict  1555      V → L in AAD47250 (PubMed:10477748).     Curated   1
Sequence conflict  1564      I → T in AAD47250 (PubMed:10477748).     Curated   1
Sequence conflict  1564      I → T in AAC62491 (PubMed:10644454).     Curated   1
Sequence conflict  1564      I → T in BAA11494 (PubMed:8724849).      Curated   1
Sequence conflict  1656      A → P in AAD47250 (PubMed:10477748).     Curated   1
Sequence conflict  1656      A → P in AAC62491 (PubMed:10644454).     Curated   1
Sequence conflict  1656      A → P in BAA11494 (PubMed:8724849).      Curated   1

Natural variant

Feature key      Identifier  Position  Description                                                                Length
Natural variant  VAR_056645  81        I → V. Corresponds to variant rs35200240 (dbSNP, Ensembl).                  1
Natural variant  VAR_056646  122       S → N. Corresponds to variant rs9578751 (dbSNP, Ensembl).                   1
Natural variant  VAR_056647  215       F → Y. Corresponds to variant rs9318600 (dbSNP, Ensembl).                   1
Natural variant  VAR_056648  792       P → L. Corresponds to variant rs4986818 (dbSNP, Ensembl).                   1
Natural variant  VAR_056649  873       S → N. 2 Publications. Corresponds to variant rs7140044 (dbSNP, Ensembl).   1
Natural variant  VAR_056650  899       A → T. 1 Publication. Corresponds to variant rs2275660 (dbSNP, Ensembl).    1
Natural variant  VAR_056651  991       K → R. Corresponds to variant rs34689435 (dbSNP, Ensembl).                  1
Natural variant  VAR_056652  1012      V → I. Corresponds to variant rs9581043 (dbSNP, Ensembl).                   1
Natural variant  VAR_056653  1253      S → T. Corresponds to variant rs4986822 (dbSNP, Ensembl).                   1
Natural variant  VAR_016090  1265      G → A. 2 Publications. Corresponds to variant rs1050110 (dbSNP, Ensembl).   1
Natural variant  VAR_016091  1280      G → R. 2 Publications. Corresponds to variant rs13428 (dbSNP, Ensembl).     1
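
The variant notation above (reference residue, 1-based position, substituted residue) can be sanity-checked against the canonical sequence. The following is a minimal sketch, not part of the UniProt entry: `N_TERM` holds only the first 130 residues copied from the Sequence section, and `apply_variant` is a hypothetical helper that verifies the reference residue before substituting.

```python
# First 130 residues of Q9UKK3-1, copied from the Sequence section in
# blocks of ten. Only variants at positions 1-130 can be checked here.
N_TERM = "".join([
    "MVMGIFANCI", "FCLKVKYLPQ", "QQKKKLQTDI", "KENGGKFSFS", "LNPQCTHIIL",
    "DNADVLSQYQ", "LNSIQKNHVH", "IANPDFIWKS", "IREKRLLDVK", "NYDPYKPLDI",
    "TPPPDQKASS", "SEVKTEGLCP", "DSATEEEDTV",
])


def apply_variant(sequence, position, ref, alt):
    """Return the sequence with `ref` replaced by `alt` at a 1-based position.

    Raises ValueError if the residue at that position is not `ref`, which
    would indicate a numbering or transcription mismatch.
    """
    index = position - 1  # convert 1-based position to 0-based index
    if sequence[index] != ref:
        raise ValueError(
            f"expected {ref} at position {position}, found {sequence[index]}"
        )
    return sequence[:index] + alt + sequence[index + 1:]


if __name__ == "__main__":
    # VAR_056645 (I -> V at 81) and VAR_056646 (S -> N at 122)
    variant_81 = apply_variant(N_TERM, 81, "I", "V")
    variant_122 = apply_variant(N_TERM, 122, "S", "N")
    print(variant_81[75:85], variant_122[116:126])
```

The same indexing convention recovers the annotated features in this region: `N_TERM[18:25]` is the seven-residue nuclear localization signal at positions 19 to 25, and `N_TERM[100]` is the threonine annotated as phosphothreonine at position 101.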

Sequence databases

EMBL / GenBank / DDBJ:
AF158255 mRNA. Translation: AAD47250.1.
AF057160 mRNA. Translation: AAC62491.1.
D79999 mRNA. Translation: BAA11494.2. Different initiation.
AL359763 Genomic DNA. Translation: CAI12394.1.
CCDS: CCDS9307.1.
RefSeq: NP_006428.2. NM_006437.3.
XP_011533234.1. XM_011534932.2.
UniGene: Hs.744855.

Genome annotation databases

Ensembl: ENST00000381989; ENSP00000371419; ENSG00000102699.
GeneID: 143.
KEGG: hsa:143.
UCSC: uc001upl.4. human.

Keywords - Coding sequence diversity

Polymorphism

Cross-references

Chemistry databases

BindingDB: Q9UKK3.
ChEMBL: CHEMBL6142.

Protocols and materials databases

DNASU: 143.
Structural Biology Knowledgebase: Search...

Organism-specific databases

CTD: 143.
DisGeNET: 143.
GeneCards: PARP4.
H-InvDB: HIX0019099.
HGNC: HGNC:271. PARP4.
HPA: HPA011739.
MIM: 607519. gene.
neXtProt: NX_Q9UKK3.
OpenTargets: ENSG00000102699.
PharmGKB: PA24591.
HUGE: Search...
GenAtlas: Search...

Miscellaneous databases

ChiTaRS: PARP4. human.
GeneWiki: PARP4.
GenomeRNAi: 143.
PRO: Q9UKK3.
SOURCE: Search...

Family and domain databases

ProtoNet: Search...

Entry information

Entry name: PARP4_HUMAN
Accession: Primary (citable) accession number: Q9UKK3
Secondary accession number(s): O75903, Q14682, Q5QNZ9, Q9H1M6
Entry history
Integrated into UniProtKB/Swiss-Prot: September 26, 2001
Last sequence update: October 5, 2010
Last modified: November 30, 2016
This is version 164 of the entry and version 3 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Chordata Protein Annotation Program
Disclaimer: Any medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care.

Miscellaneous

Keywords - Technical term

Complete proteome, Direct protein sequencing, Reference proteome

Documents

  1. Human chromosome 13
    Human chromosome 13: entries, gene names and cross-references to MIM
  2. Human entries with polymorphisms or disease mutations
    List of human entries with polymorphisms or disease mutations
  3. Human polymorphisms and disease mutations
    Index of human polymorphisms and disease mutations
  4. MIM cross-references
    Online Mendelian Inheritance in Man (MIM) cross-references in UniProtKB/Swiss-Prot
  5. SIMILARITY comments
    Index of protein domains and families

Similar proteins

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%: UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%: UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a. seed sequence).
50%: UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.
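
The UniRef90 and UniRef50 membership criteria described above can be expressed as a simple predicate. This is only a sketch of the stated thresholds, not the UniRef clustering pipeline: it assumes the identity and overlap fractions against the seed (longest) sequence have already been computed by some alignment step, and the names `UNIREF_MIN_IDENTITY`, `MIN_OVERLAP` and `joins_cluster` are hypothetical.

```python
# Thresholds taken from the descriptions above; how identity and overlap
# are measured is outside the scope of this sketch.
UNIREF_MIN_IDENTITY = {"UniRef90": 0.90, "UniRef50": 0.50}
MIN_OVERLAP = 0.80  # required overlap with the seed (longest) sequence


def joins_cluster(level, identity, overlap):
    """True if a sequence meets the stated criteria for the given level.

    `identity` and `overlap` are fractions in [0, 1], both measured
    against the cluster's seed sequence.
    """
    return identity >= UNIREF_MIN_IDENTITY[level] and overlap >= MIN_OVERLAP


if __name__ == "__main__":
    # 93% identical over 85% of the seed: qualifies at both levels.
    print(joins_cluster("UniRef90", 0.93, 0.85))
    # 60% identical: fails UniRef90 but meets the UniRef50 threshold.
    print(joins_cluster("UniRef90", 0.60, 0.85))
    print(joins_cluster("UniRef50", 0.60, 0.85))
```

UniRef100 is left out of the sketch because its rule (identical sequences and sub-fragments of 11 or more residues) is a containment test rather than an identity threshold.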