Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein

Centromere/kinetochore protein zw10 homolog

Gene

ZW10

Organism
Homo sapiens (Human)
Status
Reviewed-Annotation score: Annotation score: 5 out of 5-Experimental evidence at protein leveli

Functioni

Essential component of the mitotic checkpoint, which prevents cells from prematurely exiting mitosis. Required for the assembly of the dynein-dynactin and MAD1-MAD2 complexes onto kinetochores. Its function related to the spindle assembly machinery is proposed to depend on its association in the mitotic RZZ complex (PubMed:11590237, PubMed:15485811, PubMed:15824131). Involved in regulation of membrane traffic between the Golgi and the endoplasmic reticulum (ER); the function is proposed to depend on its association in the interphase NRZ complex which is believed to play a role in SNARE assembly at the ER (PubMed:15029241).Curated5 Publications

GO - Molecular functioni

  • centromeric DNA binding Source: ProtInc

GO - Biological processi

  • cell division Source: UniProtKB-KW
  • ER to Golgi vesicle-mediated transport Source: HGNC
  • establishment of mitotic spindle orientation Source: UniProtKB
  • Golgi organization Source: UniProtKB
  • meiotic nuclear division Source: ProtInc
  • mitotic cell cycle checkpoint Source: UniProtKB
  • mitotic metaphase plate congression Source: UniProtKB
  • mitotic sister chromatid segregation Source: HGNC
  • mitotic spindle assembly checkpoint Source: GO_Central
  • protein complex assembly Source: HGNC
  • protein localization to kinetochore Source: UniProtKB
  • protein transport Source: UniProtKB-KW
  • regulation of exit from mitosis Source: HGNC
  • retrograde vesicle-mediated transport, Golgi to ER Source: Reactome
  • sister chromatid cohesion Source: Reactome
Complete GO annotation...

Keywords - Biological processi

Cell cycle, Cell division, ER-Golgi transport, Mitosis, Protein transport, Transport

Enzyme and pathway databases

ReactomeiR-HSA-2467813. Separation of Sister Chromatids.
R-HSA-2500257. Resolution of Sister Chromatid Cohesion.
R-HSA-5663220. RHO GTPases Activate Formins.
R-HSA-6811434. COPI-dependent Golgi-to-ER retrograde traffic.
R-HSA-68877. Mitotic Prometaphase.

Names & Taxonomyi

Protein namesi
Recommended name:
Centromere/kinetochore protein zw10 homolog
Gene namesi
Name:ZW10
OrganismiHomo sapiens (Human)
Taxonomic identifieri9606 [NCBI]
Taxonomic lineageiEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresPrimatesHaplorrhiniCatarrhiniHominidaeHomo
Proteomesi
  • UP000005640 Componenti: Chromosome 11

Organism-specific databases

HGNCiHGNC:13194. ZW10.

Subcellular locationi

GO - Cellular componenti

  • condensed chromosome kinetochore Source: UniProtKB-SubCell
  • cytoplasm Source: HPA
  • cytosol Source: Reactome
  • Dsl1p complex Source: UniProtKB
  • endoplasmic reticulum Source: HGNC
  • endoplasmic reticulum membrane Source: UniProtKB-SubCell
  • kinetochore Source: UniProtKB
  • kinetochore microtubule Source: UniProtKB
  • membrane Source: UniProtKB
  • nucleus Source: HGNC
  • RZZ complex Source: UniProtKB
  • spindle pole Source: UniProtKB
Complete GO annotation...

Keywords - Cellular componenti

Centromere, Chromosome, Cytoplasm, Cytoskeleton, Endoplasmic reticulum, Kinetochore, Membrane

Pathology & Biotechi

Organism-specific databases

PharmGKBiPA37759.

Polymorphism and mutation databases

BioMutaiZW10.

PTM / Processingi

Molecule processing

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Initiator methionineiRemovedCombined sources2 Publications
Chaini2 – 779778Centromere/kinetochore protein zw10 homologPRO_0000184957Add
BLAST

Amino acid modifications

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Modified residuei2 – 21N-acetylalanineCombined sources1 Publication
Modified residuei3 – 31PhosphoserineCombined sources
Modified residuei12 – 121PhosphoserineCombined sources
Modified residuei777 – 7771N6-acetyllysineCombined sources

Keywords - PTMi

Acetylation, Phosphoprotein

Proteomic databases

EPDiO43264.
MaxQBiO43264.
PaxDbiO43264.
PeptideAtlasiO43264.
PRIDEiO43264.

PTM databases

iPTMnetiO43264.
PhosphoSiteiO43264.

Expressioni

Tissue specificityi

Widely expressed.

Developmental stagei

No significant variation in expression during cell cycle.

Gene expression databases

BgeeiO43264.
CleanExiHS_ZW10.
ExpressionAtlasiO43264. baseline and differential.
GenevisibleiO43264. HS.

Organism-specific databases

HPAiCAB011565.
HPA051253.
HPA055410.

Interactioni

Subunit structurei

Interacts with NBAS and KNTC1/ROD; the interactions are mutually exclusive and indicative for its association in two different vesicle tethering complexes (PubMed:11590237, PubMed:15824131, PubMed:20462495). Component of the RZZ complex composed of KNTC1/ROD, ZW10 and ZWILCH (PubMed:12686595, PubMed:20462495). Component of the NRZ complex composed of NBAS, ZW10 and RINT1/TIP20L; NRZ associates with SNAREs STX18, USE1L, BNIP1/SEC20L and SEC22B (the assembly has been described as syntaxin 18 complex). Interacts directly with RINT1/TIP20L bound to BNIP1/SEC20L (PubMed:15029241, PubMed:15272311, PubMed:20462495, PubMed:19369418). Interacts with C19orf25 and ZWINT (PubMed:15485811, PubMed:15824131., PubMed:16732327).1 Publication7 Publications

Binary interactionsi

WithEntry#Exp.IntActNotes
RINT1Q6NUQ19EBI-1001217,EBI-726876
ZWINTO952296EBI-1001217,EBI-1001132

Protein-protein interaction databases

BioGridi114620. 45 interactions.
DIPiDIP-36471N.
IntActiO43264. 22 interactions.
MINTiMINT-2999296.
STRINGi9606.ENSP00000200135.

Structurei

3D structure databases

ProteinModelPortaliO43264.
ModBaseiSearch...
MobiDBiSearch...

Family & Domainsi

Region

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Regioni2 – 317316Interaction with RINT1Add
BLAST
Regioni2 – 8180Interaction with ZWINTAdd
BLAST

Coiled coil

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Coiled coili14 – 130117Sequence analysisAdd
BLAST

Sequence similaritiesi

Belongs to the ZW10 family.Curated

Keywords - Domaini

Coiled coil

Phylogenomic databases

eggNOGiKOG2163. Eukaryota.
ENOG410XQ7J. LUCA.
GeneTreeiENSGT00390000016427.
HOGENOMiHOG000007982.
HOVERGENiHBG004603.
InParanoidiO43264.
KOiK11578.
OMAiIRFESVM.
OrthoDBiEOG718KBR.
PhylomeDBiO43264.
TreeFamiTF105966.

Family and domain databases

InterProiIPR009361. RZZ-complex_Zw10.
[Graphical view]
PfamiPF06248. Zw10. 1 hit.
[Graphical view]

Sequences (2)i

Sequence statusi: Complete.

Sequence processingi: The displayed sequence is further processed into a mature form.

This entry describes 2 isoformsi produced by alternative splicing. AlignAdd to basket

Isoform 1 (identifier: O43264-1) [UniParc]FASTAAdd to basket

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.

« Hide

        10         20         30         40         50
MASFVTEVLA HSGRLEKEDL GTRISRLTRR VEEIKGEVCN MISKKYSEFL
60 70 80 90 100
PSMQSAQGLI TQVDKLSEDI DLLKSRIESE VRRDLHVSTG EFTDLKQQLE
110 120 130 140 150
RDSVVLSLLK QLQEFSTAIE EYNCALTEKK YVTGAQRLEE AQKCLKLLKS
160 170 180 190 200
RKCFDLKILK SLSMELTIQK QNILYHLGEE WQKLIVWKFP PSKDTSSLES
210 220 230 240 250
YLQTELHLYT EQSHKEEKTP MPPISSVLLA FSVLGELHSK LKSFGQMLLK
260 270 280 290 300
YILRPLASCP SLHAVIESQP NIVIIRFESI MTNLEYPSPS EVFTKIRLVL
310 320 330 340 350
EVLQKQLLDL PLDTDLENEK TSTVPLAEML GDMIWEDLSE CLIKNCLVYS
360 370 380 390 400
IPTNSSKLQQ YEEIIQSTEE FENALKEMRF LKGDTTDLLK YARNINSHFA
410 420 430 440 450
NKKCQDVIVA ARNLMTSEIH NTVKIIPDSK INVPELPTPD EDNKLEVQKV
460 470 480 490 500
SNTQYHEVMN LEPENTLDQH SFSLPTCRIS ESVKKLMELA YQTLLEATTS
510 520 530 540 550
SDQCAVQLFY SVRNIFHLFH DVVPTYHKEN LQKLPQLAAI HHNNCMYIAH
560 570 580 590 600
HLLTLGHQFR LRLAPILCDG TATFVDLVPG FRRLGTECFL AQMRAQKGEL
610 620 630 640 650
LERLSSARNF SNMDDEENYS AASKAVRQVL HQLKRLGIVW QDVLPVNIYC
660 670 680 690 700
KAMGTLLNTA ISEVIGKITA LEDISTEDGD RLYSLCKTVM DEGPQVFAPL
710 720 730 740 750
SEESKNKKYQ EEVPVYVPKW MPFKELMMML QASLQEIGDR WADGKGPLAA
760 770
AFSSSEVKAL IRALFQNTER RAAALAKIK
Length:779
Mass (Da):88,829
Last modified:January 23, 2007 - v3
Checksum:i9C38186153886481
GO
Isoform 2 (identifier: O43264-2) [UniParc]FASTAAdd to basket

The sequence of this isoform differs from the canonical sequence as follows:
     673-779: Missing.

Note: No experimental confirmation available.
Show »
Length:672
Mass (Da):76,807
Checksum:iA65CB8884058CB74
GO

Natural variant

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Natural varianti77 – 771I → M.
Corresponds to variant rs2271796 [ dbSNP | Ensembl ].
VAR_053537

Alternative sequence

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Alternative sequencei673 – 779107Missing in isoform 2. 1 PublicationVSP_056006Add
BLAST

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
U54996 mRNA. Translation: AAB88237.1.
AP002436 Genomic DNA. No translation available.
AP003170 Genomic DNA. No translation available.
BC128264 mRNA. Translation: AAI28265.1.
CCDSiCCDS8363.1. [O43264-1]
RefSeqiNP_004715.1. NM_004724.3. [O43264-1]
UniGeneiHs.503886.

Genome annotation databases

EnsembliENST00000200135; ENSP00000200135; ENSG00000086827. [O43264-1]
ENST00000535142; ENSP00000440879; ENSG00000086827. [O43264-2]
GeneIDi9183.
KEGGihsa:9183.
UCSCiuc001poe.4. human. [O43264-1]

Keywords - Coding sequence diversityi

Alternative splicing, Polymorphism

Cross-referencesi

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
U54996 mRNA. Translation: AAB88237.1.
AP002436 Genomic DNA. No translation available.
AP003170 Genomic DNA. No translation available.
BC128264 mRNA. Translation: AAI28265.1.
CCDSiCCDS8363.1. [O43264-1]
RefSeqiNP_004715.1. NM_004724.3. [O43264-1]
UniGeneiHs.503886.

3D structure databases

ProteinModelPortaliO43264.
ModBaseiSearch...
MobiDBiSearch...

Protein-protein interaction databases

BioGridi114620. 45 interactions.
DIPiDIP-36471N.
IntActiO43264. 22 interactions.
MINTiMINT-2999296.
STRINGi9606.ENSP00000200135.

PTM databases

iPTMnetiO43264.
PhosphoSiteiO43264.

Polymorphism and mutation databases

BioMutaiZW10.

Proteomic databases

EPDiO43264.
MaxQBiO43264.
PaxDbiO43264.
PeptideAtlasiO43264.
PRIDEiO43264.

Protocols and materials databases

Structural Biology KnowledgebaseSearch...

Genome annotation databases

EnsembliENST00000200135; ENSP00000200135; ENSG00000086827. [O43264-1]
ENST00000535142; ENSP00000440879; ENSG00000086827. [O43264-2]
GeneIDi9183.
KEGGihsa:9183.
UCSCiuc001poe.4. human. [O43264-1]

Organism-specific databases

CTDi9183.
GeneCardsiZW10.
HGNCiHGNC:13194. ZW10.
HPAiCAB011565.
HPA051253.
HPA055410.
MIMi603954. gene.
neXtProtiNX_O43264.
PharmGKBiPA37759.
GenAtlasiSearch...

Phylogenomic databases

eggNOGiKOG2163. Eukaryota.
ENOG410XQ7J. LUCA.
GeneTreeiENSGT00390000016427.
HOGENOMiHOG000007982.
HOVERGENiHBG004603.
InParanoidiO43264.
KOiK11578.
OMAiIRFESVM.
OrthoDBiEOG718KBR.
PhylomeDBiO43264.
TreeFamiTF105966.

Enzyme and pathway databases

ReactomeiR-HSA-2467813. Separation of Sister Chromatids.
R-HSA-2500257. Resolution of Sister Chromatid Cohesion.
R-HSA-5663220. RHO GTPases Activate Formins.
R-HSA-6811434. COPI-dependent Golgi-to-ER retrograde traffic.
R-HSA-68877. Mitotic Prometaphase.

Miscellaneous databases

GeneWikiiZW10.
GenomeRNAii9183.
PROiO43264.
SOURCEiSearch...

Gene expression databases

BgeeiO43264.
CleanExiHS_ZW10.
ExpressionAtlasiO43264. baseline and differential.
GenevisibleiO43264. HS.

Family and domain databases

InterProiIPR009361. RZZ-complex_Zw10.
[Graphical view]
PfamiPF06248. Zw10. 1 hit.
[Graphical view]
ProtoNetiSearch...

Publicationsi

« Hide 'large scale' publications
  1. Cited for: NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM 1).
    Tissue: Cervix carcinoma.
  2. Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
  3. "The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)."
    The MGC Project Team
    Genome Res. 14:2121-2127(2004) [PubMed] [Europe PMC] [Abstract]
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 2).
  4. "Exploring proteomes and analyzing protein processing by mass spectrometric identification of sorted N-terminal peptides."
    Gevaert K., Goethals M., Martens L., Van Damme J., Staes A., Thomas G.R., Vandekerckhove J.
    Nat. Biotechnol. 21:566-569(2003) [PubMed] [Europe PMC] [Abstract]
    Cited for: PROTEIN SEQUENCE OF 2-14.
    Tissue: Platelet.
  5. Bienvenut W.V., Boldt K., von Kriegsheim A.F., Kolch W.
    Submitted (JUL-2007) to UniProtKB
    Cited for: PROTEIN SEQUENCE OF 2-14 AND 161-170, CLEAVAGE OF INITIATOR METHIONINE, ACETYLATION AT ALA-2, IDENTIFICATION BY MASS SPECTROMETRY.
    Tissue: Hepatoma.
  6. "The ZW10 and Rough Deal checkpoint proteins function together in a large, evolutionarily conserved complex targeted to the kinetochore."
    Scaeerou F., Starr D.A., Piano F., Papoulas O., Karess R.E., Goldberg M.L.
    J. Cell Sci. 114:3103-3114(2001) [PubMed] [Europe PMC] [Abstract]
    Cited for: FUNCTION, SUBCELLULAR LOCATION, INTERACTION WITH KNTC1.
  7. "Zwilch, a new component of the ZW10/ROD complex required for kinetochore functions."
    Williams B.C., Li Z., Liu S., Williams E.V., Leung G., Yen T.J., Goldberg M.L.
    Mol. Biol. Cell 14:1379-1391(2003) [PubMed] [Europe PMC] [Abstract]
    Cited for: INTERACTION WITH ZWILCH AND ZW10.
  8. "Implication of ZW10 in membrane trafficking between the endoplasmic reticulum and Golgi."
    Hirose H., Arasaki K., Dohmae N., Takio K., Hatsuzawa K., Nagahama M., Tani K., Yamamoto A., Tohyama M., Tagaya M.
    EMBO J. 23:1267-1278(2004) [PubMed] [Europe PMC] [Abstract]
    Cited for: FUNCTION, SUBCELLULAR LOCATION, IDENTIFICATION BY MASS SPECTROMETRY, IDENTIFICATION IN A COMPLEX WITH STX18; USE1L; SEC22B AND RINT1, INTERACTION WITH RINT1.
  9. "Involvement of BNIP1 in apoptosis and endoplasmic reticulum membrane fusion."
    Nakajima K., Hirose H., Taniguchi M., Kurashina H., Arasaki K., Nagahama M., Tani K., Yamamoto A., Tagaya M.
    EMBO J. 23:3216-3226(2004) [PubMed] [Europe PMC] [Abstract]
    Cited for: INTERACTION WITH BNIP1 THROUGH RINT1.
  10. "Recapitulation of the Roberts syndrome cellular phenotype by inhibition of INCENP, ZWINT-1 and ZW10 genes."
    Musio A., Mariani T., Montagna C., Zambroni D., Ascoli C., Ried T., Vezzoni P.
    Gene 331:33-40(2004) [PubMed] [Europe PMC] [Abstract]
    Cited for: FUNCTION.
  11. "Human Zwint-1 specifies localization of Zeste White 10 to kinetochores and is essential for mitotic checkpoint signaling."
    Wang H., Hu X., Ding X., Dou Z., Yang Z., Shaw A.W., Teng M., Cleveland D.W., Goldberg M.L., Niu L., Yao X.
    J. Biol. Chem. 279:54590-54598(2004) [PubMed] [Europe PMC] [Abstract]
    Cited for: FUNCTION, INTERACTION WITH ZWINT, SUBCELLULAR LOCATION.
  12. "ZW10 links mitotic checkpoint signaling to the structural kinetochore."
    Kops G.J.P.L., Kim Y., Weaver B.A.A., Mao Y., McLeod I., Yates J.R. III, Tagaya M., Cleveland D.W.
    J. Cell Biol. 169:49-60(2005) [PubMed] [Europe PMC] [Abstract]
    Cited for: FUNCTION, IDENTIFICATION BY MASS SPECTROMETRY, INTERACTION WITH C19ORF25; KNTC1; RINT1; ZWILCH AND ZWINT, SUBCELLULAR LOCATION.
  13. "Hec1 sequentially recruits Zwint-1 and ZW10 to kinetochores for faithful chromosome segregation and spindle checkpoint control."
    Lin Y.-T., Chen Y., Wu G., Lee W.-H.
    Oncogene 25:6901-6914(2006) [PubMed] [Europe PMC] [Abstract]
    Cited for: INTERACTION WITH ZWINT, SUBCELLULAR LOCATION.
  14. "Lys-N and trypsin cover complementary parts of the phosphoproteome in a refined SCX-based approach."
    Gauci S., Helbig A.O., Slijper M., Krijgsveld J., Heck A.J., Mohammed S.
    Anal. Chem. 81:4493-4501(2009) [PubMed] [Europe PMC] [Abstract]
    Cited for: ACETYLATION [LARGE SCALE ANALYSIS] AT ALA-2, CLEAVAGE OF INITIATOR METHIONINE [LARGE SCALE ANALYSIS], IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
  15. "Identification of the neuroblastoma-amplified gene product as a component of the syntaxin 18 complex implicated in Golgi-to-endoplasmic reticulum retrograde transport."
    Aoki T., Ichimura S., Itoh A., Kuramoto M., Shinkawa T., Isobe T., Tagaya M.
    Mol. Biol. Cell 20:2639-2649(2009) [PubMed] [Europe PMC] [Abstract]
    Cited for: SUBUNIT.
  16. "Lysine acetylation targets protein complexes and co-regulates major cellular functions."
    Choudhary C., Kumar C., Gnad F., Nielsen M.L., Rehman M., Walther T.C., Olsen J.V., Mann M.
    Science 325:834-840(2009) [PubMed] [Europe PMC] [Abstract]
    Cited for: ACETYLATION [LARGE SCALE ANALYSIS] AT LYS-777, IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
  17. "Structural analysis of the RZZ complex reveals common ancestry with multisubunit vesicle tethering machinery."
    Civril F., Wehenkel A., Giorgi F.M., Santaguida S., Di Fonzo A., Grigorean G., Ciccarelli F.D., Musacchio A.
    Structure 18:616-626(2010) [PubMed] [Europe PMC] [Abstract]
    Cited for: INTERACTION WITH NBAS, IDENTIFICATION IN THE RRZ COMPLEX, IDENTIFICATION IN THE NRZ COMPLEX.
  18. Cited for: IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
  19. "Toward a comprehensive characterization of a human cancer cell phosphoproteome."
    Zhou H., Di Palma S., Preisinger C., Peng M., Polat A.N., Heck A.J., Mohammed S.
    J. Proteome Res. 12:260-271(2013) [PubMed] [Europe PMC] [Abstract]
    Cited for: PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-3 AND SER-12, IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
    Tissue: Cervix carcinoma and Erythroleukemia.
  20. Cited for: ACETYLATION [LARGE SCALE ANALYSIS] AT ALA-2, CLEAVAGE OF INITIATOR METHIONINE [LARGE SCALE ANALYSIS], IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].

Entry informationi

Entry nameiZW10_HUMAN
AccessioniPrimary (citable) accession number: O43264
Secondary accession number(s): A1A528
Entry historyi
Integrated into UniProtKB/Swiss-Prot: December 1, 2000
Last sequence update: January 23, 2007
Last modified: July 6, 2016
This is version 143 of the entry and version 3 of the sequence. [Complete history]
Entry statusiReviewed (UniProtKB/Swiss-Prot)
Annotation programChordata Protein Annotation Program
DisclaimerAny medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care.

Miscellaneousi

Miscellaneous

Overexpression as well as silencing of ZW10 disrupts the morphology of the ER-Golgi intermediate compartment as well as the Golgi apparatus and slows down ER-Golgi transport.

Keywords - Technical termi

Complete proteome, Direct protein sequencing, Reference proteome

Documents

  1. Human chromosome 11
    Human chromosome 11: entries, gene names and cross-references to MIM
  2. Human entries with polymorphisms or disease mutations
    List of human entries with polymorphisms or disease mutations
  3. Human polymorphisms and disease mutations
    Index of human polymorphisms and disease mutations
  4. MIM cross-references
    Online Mendelian Inheritance in Man (MIM) cross-references in UniProtKB/Swiss-Prot
  5. SIMILARITY comments
    Index of protein domains and families

Similar proteinsi

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a seed sequence).
50%UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.