Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein

PH-interacting protein

Gene

PHIP

Organism
Homo sapiens (Human)
Status
Reviewed-Annotation score: Annotation score: 5 out of 5-Experimental evidence at protein leveli

Functioni

Probable regulator of the insulin and insulin-like growth factor signaling pathways. Stimulates cell proliferation through regulation of cyclin transcription and has an anti-apoptotic activity through AKT1 phosphorylation and activation. Plays a role in the regulation of cell morphology and cytoskeletal organization.2 Publications

GO - Molecular functioni

  • insulin receptor binding Source: UniProtKB
  • lysine-acetylated histone binding Source: UniProtKB

GO - Biological processi

Complete GO annotation...

Enzyme and pathway databases

BioCyciZFISH:ENSG00000146247-MONOMER.
SignaLinkiQ8WWQ0.

Names & Taxonomyi

Protein namesi
Recommended name:
PH-interacting protein
Short name:
PHIP
Alternative name(s):
IRS-1 PH domain-binding protein
WD repeat-containing protein 11
Gene namesi
Name:PHIP
Synonyms:WDR11
OrganismiHomo sapiens (Human)
Taxonomic identifieri9606 [NCBI]
Taxonomic lineageiEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresPrimatesHaplorrhiniCatarrhiniHominidaeHomo
Proteomesi
  • UP000005640 Componenti: Chromosome 6

Organism-specific databases

HGNCiHGNC:15673. PHIP.

Subcellular locationi

GO - Cellular componenti

  • extracellular exosome Source: UniProtKB
  • nucleus Source: UniProtKB
Complete GO annotation...

Keywords - Cellular componenti

Nucleus

Pathology & Biotechi

Organism-specific databases

DisGeNETi55023.
OpenTargetsiENSG00000146247.
PharmGKBiPA33265.

Chemistry databases

ChEMBLiCHEMBL2176773.

Polymorphism and mutation databases

BioMutaiPHIP.
DMDMi308153472.

PTM / Processingi

Molecule processing

Feature keyPosition(s)DescriptionActionsGraphical viewLength
ChainiPRO_00002977571 – 1821PH-interacting proteinAdd BLAST1821

Amino acid modifications

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Modified residuei136PhosphoserineCombined sources1
Cross-linki421Glycyl lysine isopeptide (Lys-Gly) (interchain with G-Cter in SUMO2)Combined sources
Modified residuei641PhosphoserineCombined sources1
Modified residuei659PhosphoserineCombined sources1
Modified residuei674PhosphoserineCombined sources1
Modified residuei677PhosphoserineCombined sources1
Modified residuei683PhosphoserineCombined sources1
Modified residuei692PhosphoserineCombined sources1
Modified residuei879PhosphoserineCombined sources1
Modified residuei880PhosphoserineCombined sources1
Modified residuei881PhosphoserineCombined sources1
Modified residuei911PhosphoserineCombined sources1
Modified residuei1281PhosphoserineCombined sources1
Modified residuei1283PhosphoserineCombined sources1
Modified residuei1296PhosphoserineCombined sources1
Modified residuei1315PhosphoserineCombined sources1
Modified residuei1359PhosphothreonineCombined sources1
Modified residuei1405PhosphoserineCombined sources1
Cross-linki1470Glycyl lysine isopeptide (Lys-Gly) (interchain with G-Cter in SUMO1)Combined sources
Cross-linki1470Glycyl lysine isopeptide (Lys-Gly) (interchain with G-Cter in SUMO2)Combined sources
Modified residuei1479PhosphoserineCombined sources1
Modified residuei1497N6-acetyllysineCombined sources1
Modified residuei1525PhosphoserineCombined sources1
Modified residuei1533N6-acetyllysineCombined sources1
Modified residuei1560PhosphoserineCombined sources1
Modified residuei1651PhosphoserineCombined sources1
Modified residuei1762PhosphoserineCombined sources1
Modified residuei1783PhosphoserineCombined sources1

Keywords - PTMi

Acetylation, Isopeptide bond, Phosphoprotein, Ubl conjugation

Proteomic databases

EPDiQ8WWQ0.
MaxQBiQ8WWQ0.
PaxDbiQ8WWQ0.
PeptideAtlasiQ8WWQ0.
PRIDEiQ8WWQ0.

PTM databases

iPTMnetiQ8WWQ0.
PhosphoSitePlusiQ8WWQ0.

Expressioni

Tissue specificityi

Expressed in myeloma and epidermoid carcinoma cell lines.1 Publication

Gene expression databases

BgeeiENSG00000146247.
CleanExiHS_PHIP.
GenevisibleiQ8WWQ0. HS.

Organism-specific databases

HPAiHPA019140.
HPA019838.

Interactioni

Subunit structurei

Interacts with IRS1 and IRS2 (By similarity). Interacts (via bromo domain) with acetylated lysine residues on histone H1.4, histone H3 and H4 (in vitro).By similarity1 Publication

GO - Molecular functioni

  • insulin receptor binding Source: UniProtKB
  • lysine-acetylated histone binding Source: UniProtKB

Protein-protein interaction databases

BioGridi120353. 29 interactors.
IntActiQ8WWQ0. 21 interactors.
MINTiMINT-1422606.
STRINGi9606.ENSP00000275034.

Structurei

Secondary structure

11821
Legend: HelixTurnBeta strandPDB Structure known for this area
Show more details
Feature keyPosition(s)DescriptionActionsGraphical viewLength
Turni1318 – 1320Combined sources3
Helixi1321 – 1334Combined sources14
Helixi1336 – 1341Combined sources6
Turni1347 – 1349Combined sources3
Helixi1353 – 1356Combined sources4
Helixi1363 – 1371Combined sources9
Helixi1378 – 1395Combined sources18
Helixi1402 – 1434Combined sources33

3D structure databases

Select the link destinations:
PDBei
RCSB PDBi
PDBji
Links Updated
PDB entryMethodResolution (Å)ChainPositionsPDBsum
3MB3X-ray2.25A1302-1434[»]
5ENBX-ray1.73A1315-1440[»]
5ENCX-ray1.59A1315-1440[»]
5ENEX-ray1.49A1315-1440[»]
5ENFX-ray1.37A1315-1440[»]
5ENHX-ray1.95A1315-1440[»]
5ENIX-ray1.69A1315-1440[»]
5ENJX-ray1.63A1315-1440[»]
ProteinModelPortaliQ8WWQ0.
SMRiQ8WWQ0.
ModBaseiSearch...
MobiDBiSearch...

Miscellaneous databases

EvolutionaryTraceiQ8WWQ0.

Family & Domainsi

Domains and Repeats

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Repeati181 – 222WD 1Add BLAST42
Repeati224 – 262WD 2Add BLAST39
Repeati265 – 310WD 3Add BLAST46
Repeati319 – 360WD 4Add BLAST42
Repeati363 – 402WD 5Add BLAST40
Repeati422 – 461WD 6Add BLAST40
Repeati464 – 504WD 7Add BLAST41
Repeati512 – 551WD 8Add BLAST40
Domaini1176 – 1246Bromo 1PROSITE-ProRule annotationAdd BLAST71
Domaini1333 – 1403Bromo 2PROSITE-ProRule annotationAdd BLAST71

Region

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Regioni924 – 1129Mediates interaction with IRS1By similarityAdd BLAST206

Compositional bias

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Compositional biasi867 – 923Lys-richAdd BLAST57
Compositional biasi1752 – 1758Poly-Glu7

Sequence similaritiesi

Contains 2 bromo domains.PROSITE-ProRule annotation
Contains 8 WD repeats.PROSITE-ProRule annotation

Keywords - Domaini

Bromodomain, Repeat, WD repeat

Phylogenomic databases

eggNOGiKOG0644. Eukaryota.
ENOG410YCD8. LUCA.
GeneTreeiENSGT00720000108687.
HOVERGENiHBG108248.
InParanoidiQ8WWQ0.
KOiK11797.
OMAiHSEGDWR.
OrthoDBiEOG091G00HZ.
PhylomeDBiQ8WWQ0.
TreeFamiTF324197.

Family and domain databases

Gene3Di1.20.920.10. 2 hits.
2.130.10.10. 2 hits.
InterProiIPR001487. Bromodomain.
IPR018359. Bromodomain_CS.
IPR028738. PHIP.
IPR015943. WD40/YVTN_repeat-like_dom.
IPR001680. WD40_repeat.
IPR019775. WD40_repeat_CS.
IPR017986. WD40_repeat_dom.
[Graphical view]
PANTHERiPTHR16266:SF4. PTHR16266:SF4. 1 hit.
PfamiPF00439. Bromodomain. 2 hits.
PF00400. WD40. 5 hits.
[Graphical view]
PRINTSiPR00503. BROMODOMAIN.
SMARTiSM00297. BROMO. 2 hits.
SM00320. WD40. 8 hits.
[Graphical view]
SUPFAMiSSF47370. SSF47370. 2 hits.
SSF50978. SSF50978. 1 hit.
PROSITEiPS00633. BROMODOMAIN_1. 1 hit.
PS50014. BROMODOMAIN_2. 2 hits.
PS00678. WD_REPEATS_1. 2 hits.
PS50082. WD_REPEATS_2. 5 hits.
PS50294. WD_REPEATS_REGION. 1 hit.
[Graphical view]

Sequencei

Sequence statusi: Complete.

Q8WWQ0-1 [UniParc]FASTAAdd to basket

« Hide

        10         20         30         40         50
MSCERKGLSE LRSELYFLIA RFLEDGPCQQ AAQVLIREVA EKELLPRRTD
60 70 80 90 100
WTGKEHPRTY QNLVKYYRHL APDHLLQICH RLGPLLEQEI PQSVPGVQTL
110 120 130 140 150
LGAGRQSLLR TNKSCKHVVW KGSALAALHC GRPPESPVNY GSPPSIADTL
160 170 180 190 200
FSRKLNGKYR LERLVPTAVY QHMKMHKRIL GHLSSVYCVT FDRTGRRIFT
210 220 230 240 250
GSDDCLVKIW ATDDGRLLAT LRGHAAEISD MAVNYENTMI AAGSCDKMIR
260 270 280 290 300
VWCLRTCAPL AVLQGHSASI TSLQFSPLCS GSKRYLSSTG ADGTICFWLW
310 320 330 340 350
DAGTLKINPR PAKFTERPRP GVQMICSSFS AGGMFLATGS TDHIIRVYFF
360 370 380 390 400
GSGQPEKISE LEFHTDKVDS IQFSNTSNRF VSGSRDGTAR IWQFKRREWK
410 420 430 440 450
SILLDMATRP AGQNLQGIED KITKMKVTMV AWDRHDNTVI TAVNNMTLKV
460 470 480 490 500
WNSYTGQLIH VLMGHEDEVF VLEPHPFDPR VLFSAGHDGN VIVWDLARGV
510 520 530 540 550
KIRSYFNMIE GQGHGAVFDC KCSPDGQHFA CTDSHGHLLI FGFGSSSKYD
560 570 580 590 600
KIADQMFFHS DYRPLIRDAN NFVLDEQTQQ APHLMPPPFL VDVDGNPHPS
610 620 630 640 650
RYQRLVPGRE NCREEQLIPQ MGVTSSGLNQ VLSQQANQEI SPLDSMIQRL
660 670 680 690 700
QQEQDLRRSG EAVISNTSRL SRGSISSTSE VHSPPNVGLR RSGQIEGVRQ
710 720 730 740 750
MHSNAPRSEI ATERDLVAWS RRVVVPELSA GVASRQEEWR TAKGEEEIKT
760 770 780 790 800
YRSEEKRKHL TVPKENKIPT VSKNHAHEHF LDLGESKKQQ TNQHNYRTRS
810 820 830 840 850
ALEETPRPSE EIENGSSSSD EGEVVAVSGG TSEEEERAWH SDGSSSDYSS
860 870 880 890 900
DYSDWTADAG INLQPPKKVP KNKTKKAESS SDEEEESEKQ KQKQIKKEKK
910 920 930 940 950
KVNEEKDGPI SPKKKKPKER KQKRLAVGEL TENGLTLEEW LPSTWITDTI
960 970 980 990 1000
PRRCPFVPQM GDEVYYFRQG HEAYVEMARK NKIYSINPKK QPWHKMELRE
1010 1020 1030 1040 1050
QELMKIVGIK YEVGLPTLCC LKLAFLDPDT GKLTGGSFTM KYHDMPDVID
1060 1070 1080 1090 1100
FLVLRQQFDD AKYRRWNIGD RFRSVIDDAW WFGTIESQEP LQLEYPDSLF
1110 1120 1130 1140 1150
QCYNVCWDNG DTEKMSPWDM ELIPNNAVFP EELGTSVPLT DGECRSLIYK
1160 1170 1180 1190 1200
PLDGEWGTNP RDEECERIVA GINQLMTLDI ASAFVAPVDL QAYPMYCTVV
1210 1220 1230 1240 1250
AYPTDLSTIK QRLENRFYRR VSSLMWEVRY IEHNTRTFNE PGSPIVKSAK
1260 1270 1280 1290 1300
FVTDLLLHFI KDQTCYNIIP LYNSMKKKVL SDSEDEEKDA DVPGTSTRKR
1310 1320 1330 1340 1350
KDHQPRRRLR NRAQSYDIQA WKKQCEELLN LIFQCEDSEP FRQPVDLLEY
1360 1370 1380 1390 1400
PDYRDIIDTP MDFATVRETL EAGNYESPME LCKDVRLIFS NSKAYTPSKR
1410 1420 1430 1440 1450
SRIYSMSLRL SAFFEEHISS VLSDYKSALR FHKRNTITKR RKKRNRSSSV
1460 1470 1480 1490 1500
SSSAASSPER KKRILKPQLK SESSTSAFST PTRSIPPRHN AAQINGKTES
1510 1520 1530 1540 1550
SSVVRTRSNR VVVDPVVTEQ PSTSSAAKTF ITKANASAIP GKTILENSVK
1560 1570 1580 1590 1600
HSKALNTLSS PGQSSFSHGT RNNSAKENME KEKPVKRKMK SSVLPKASTL
1610 1620 1630 1640 1650
SKSSAVIEQG DCKNNALVPG TIQVNGHGGQ PSKLVKRGPG RKPKVEVNTN
1660 1670 1680 1690 1700
SGEIIHKKRG RKPKKLQYAK PEDLEQNNVH PIRDEVLPSS TCNFLSETNN
1710 1720 1730 1740 1750
VKEDLLQKKN RGGRKPKRKM KTQKLDADLL VPASVKVLRR SNRKKIDDPI
1760 1770 1780 1790 1800
DEEEEFEELK GSEPHMRTRN QGRRTAFYNE DDSEEEQRQL LFEDTSLTFG
1810 1820
TSSRGRVRKL TEKAKANLIG W
Length:1,821
Mass (Da):206,689
Last modified:October 5, 2010 - v2
Checksum:i88688EF8743C980F
GO

Sequence cautioni

The sequence AAH21905 differs from that shown. Contaminating sequence. Potential poly-A sequence.Curated
The sequence BAC11417 differs from that shown. Reason: Frameshift at position 1648.Curated

Experimental Info

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Sequence conflicti2S → P in BAB71353 (PubMed:14702039).Curated1
Sequence conflicti770T → I in BAB71353 (PubMed:14702039).Curated1
Sequence conflicti1648N → I in BAC11417 (PubMed:14702039).Curated1

Natural variant

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Natural variantiVAR_036238469V → I in a colorectal cancer sample; somatic mutation. 1 Publication1
Natural variantiVAR_034683663V → G.Combined sources5 PublicationsCorresponds to variant rs7747479dbSNPEnsembl.1
Natural variantiVAR_034684874T → I.1 PublicationCorresponds to variant rs11547228dbSNPEnsembl.1
Natural variantiVAR_0346851093L → P.2 PublicationsCorresponds to variant rs9350797dbSNPEnsembl.1
Natural variantiVAR_0346861135T → P.Corresponds to variant rs34841569dbSNPEnsembl.1
Natural variantiVAR_0346871445N → T.Corresponds to variant rs36048894dbSNPEnsembl.1
Natural variantiVAR_0362391767R → I in a colorectal cancer sample; somatic mutation. 1 Publication1

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
DQ924532 mRNA. Translation: ABK76299.1.
AJ303102 mRNA. Translation: CAC83118.1.
AL450327, AL356776 Genomic DNA. Translation: CAH70884.1.
AL356776, AL450327 Genomic DNA. Translation: CAI14406.1.
CH471051 Genomic DNA. Translation: EAW48718.1.
BC008909 mRNA. Translation: AAH08909.2.
BC021905 mRNA. Translation: AAH21905.1. Sequence problems.
BC081569 mRNA. Translation: AAH81569.1.
BC137488 mRNA. Translation: AAI37489.1.
AK057039 mRNA. Translation: BAB71353.1.
AK075124 mRNA. Translation: BAC11417.1. Frameshift.
AL161957 mRNA. Translation: CAH10776.1.
AF310250 mRNA. Translation: AAG45145.1.
CCDSiCCDS4987.1.
RefSeqiNP_060404.3. NM_017934.5.
UniGeneiHs.511817.
Hs.606356.

Genome annotation databases

EnsembliENST00000275034; ENSP00000275034; ENSG00000146247.
GeneIDi55023.
KEGGihsa:55023.
UCSCiuc003pir.4. human.

Keywords - Coding sequence diversityi

Polymorphism

Cross-referencesi

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
DQ924532 mRNA. Translation: ABK76299.1.
AJ303102 mRNA. Translation: CAC83118.1.
AL450327, AL356776 Genomic DNA. Translation: CAH70884.1.
AL356776, AL450327 Genomic DNA. Translation: CAI14406.1.
CH471051 Genomic DNA. Translation: EAW48718.1.
BC008909 mRNA. Translation: AAH08909.2.
BC021905 mRNA. Translation: AAH21905.1. Sequence problems.
BC081569 mRNA. Translation: AAH81569.1.
BC137488 mRNA. Translation: AAI37489.1.
AK057039 mRNA. Translation: BAB71353.1.
AK075124 mRNA. Translation: BAC11417.1. Frameshift.
AL161957 mRNA. Translation: CAH10776.1.
AF310250 mRNA. Translation: AAG45145.1.
CCDSiCCDS4987.1.
RefSeqiNP_060404.3. NM_017934.5.
UniGeneiHs.511817.
Hs.606356.

3D structure databases

Select the link destinations:
PDBei
RCSB PDBi
PDBji
Links Updated
PDB entryMethodResolution (Å)ChainPositionsPDBsum
3MB3X-ray2.25A1302-1434[»]
5ENBX-ray1.73A1315-1440[»]
5ENCX-ray1.59A1315-1440[»]
5ENEX-ray1.49A1315-1440[»]
5ENFX-ray1.37A1315-1440[»]
5ENHX-ray1.95A1315-1440[»]
5ENIX-ray1.69A1315-1440[»]
5ENJX-ray1.63A1315-1440[»]
ProteinModelPortaliQ8WWQ0.
SMRiQ8WWQ0.
ModBaseiSearch...
MobiDBiSearch...

Protein-protein interaction databases

BioGridi120353. 29 interactors.
IntActiQ8WWQ0. 21 interactors.
MINTiMINT-1422606.
STRINGi9606.ENSP00000275034.

Chemistry databases

ChEMBLiCHEMBL2176773.

PTM databases

iPTMnetiQ8WWQ0.
PhosphoSitePlusiQ8WWQ0.

Polymorphism and mutation databases

BioMutaiPHIP.
DMDMi308153472.

Proteomic databases

EPDiQ8WWQ0.
MaxQBiQ8WWQ0.
PaxDbiQ8WWQ0.
PeptideAtlasiQ8WWQ0.
PRIDEiQ8WWQ0.

Protocols and materials databases

Structural Biology KnowledgebaseSearch...

Genome annotation databases

EnsembliENST00000275034; ENSP00000275034; ENSG00000146247.
GeneIDi55023.
KEGGihsa:55023.
UCSCiuc003pir.4. human.

Organism-specific databases

CTDi55023.
DisGeNETi55023.
GeneCardsiPHIP.
H-InvDBHIX0032894.
HGNCiHGNC:15673. PHIP.
HPAiHPA019140.
HPA019838.
MIMi612870. gene.
neXtProtiNX_Q8WWQ0.
OpenTargetsiENSG00000146247.
PharmGKBiPA33265.
GenAtlasiSearch...

Phylogenomic databases

eggNOGiKOG0644. Eukaryota.
ENOG410YCD8. LUCA.
GeneTreeiENSGT00720000108687.
HOVERGENiHBG108248.
InParanoidiQ8WWQ0.
KOiK11797.
OMAiHSEGDWR.
OrthoDBiEOG091G00HZ.
PhylomeDBiQ8WWQ0.
TreeFamiTF324197.

Enzyme and pathway databases

BioCyciZFISH:ENSG00000146247-MONOMER.
SignaLinkiQ8WWQ0.

Miscellaneous databases

ChiTaRSiPHIP. human.
EvolutionaryTraceiQ8WWQ0.
GenomeRNAii55023.
PROiQ8WWQ0.
SOURCEiSearch...

Gene expression databases

BgeeiENSG00000146247.
CleanExiHS_PHIP.
GenevisibleiQ8WWQ0. HS.

Family and domain databases

Gene3Di1.20.920.10. 2 hits.
2.130.10.10. 2 hits.
InterProiIPR001487. Bromodomain.
IPR018359. Bromodomain_CS.
IPR028738. PHIP.
IPR015943. WD40/YVTN_repeat-like_dom.
IPR001680. WD40_repeat.
IPR019775. WD40_repeat_CS.
IPR017986. WD40_repeat_dom.
[Graphical view]
PANTHERiPTHR16266:SF4. PTHR16266:SF4. 1 hit.
PfamiPF00439. Bromodomain. 2 hits.
PF00400. WD40. 5 hits.
[Graphical view]
PRINTSiPR00503. BROMODOMAIN.
SMARTiSM00297. BROMO. 2 hits.
SM00320. WD40. 8 hits.
[Graphical view]
SUPFAMiSSF47370. SSF47370. 2 hits.
SSF50978. SSF50978. 1 hit.
PROSITEiPS00633. BROMODOMAIN_1. 1 hit.
PS50014. BROMODOMAIN_2. 2 hits.
PS00678. WD_REPEATS_1. 2 hits.
PS50082. WD_REPEATS_2. 5 hits.
PS50294. WD_REPEATS_REGION. 1 hit.
[Graphical view]
ProtoNetiSearch...

Entry informationi

Entry nameiPHIP_HUMAN
AccessioniPrimary (citable) accession number: Q8WWQ0
Secondary accession number(s): A7J992
, B2RPK4, Q05CQ9, Q5VVH4, Q66I29, Q69YV1, Q8NBZ5, Q96H52, Q96ME2, Q9H261
Entry historyi
Integrated into UniProtKB/Swiss-Prot: August 21, 2007
Last sequence update: October 5, 2010
Last modified: November 2, 2016
This is version 130 of the entry and version 2 of the sequence. [Complete history]
Entry statusiReviewed (UniProtKB/Swiss-Prot)
Annotation programChordata Protein Annotation Program
DisclaimerAny medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care.

Miscellaneousi

Keywords - Technical termi

3D-structure, Complete proteome, Reference proteome

Documents

  1. Human chromosome 6
    Human chromosome 6: entries, gene names and cross-references to MIM
  2. Human entries with polymorphisms or disease mutations
    List of human entries with polymorphisms or disease mutations
  3. Human polymorphisms and disease mutations
    Index of human polymorphisms and disease mutations
  4. MIM cross-references
    Online Mendelian Inheritance in Man (MIM) cross-references in UniProtKB/Swiss-Prot
  5. PDB cross-references
    Index of Protein Data Bank (PDB) cross-references
  6. SIMILARITY comments
    Index of protein domains and families

Similar proteinsi

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a seed sequence).
50%UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.