Protein

Zinc finger and SCAN domain-containing protein 10

Gene

ZSCAN10

Organism
Homo sapiens (Human)
Status
Reviewed - Annotation score: 4 out of 5 - Experimental evidence at protein level

Function

Embryonic stem (ES) cell-specific transcription factor required to maintain ES cell pluripotency. Can activate and/or repress expression of target genes, depending on the context. Specifically binds the 5'-[GA]CGCNNGCG[CT]-3' DNA consensus sequence. Regulates expression of POU5F1/OCT4, ZSCAN4 and ALYREF/THOC4 (By similarity).
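
The consensus above uses bracketed alternatives plus N for any base, so it translates almost directly into a regular expression. The following is a minimal sketch and not part of the entry itself; the function name `find_sites` and the example sequence are hypothetical.

```python
import re

# Consensus from the entry: 5'-[GA]CGCNNGCG[CT]-3'.
# N (any base) is written as the character class [ACGT].
# The lookahead wrapper lets overlapping sites be reported too.
CONSENSUS = re.compile(r"(?=([GA]CGC[ACGT]{2}GCG[CT]))")


def find_sites(dna: str) -> list[tuple[int, str]]:
    """Return (1-based start, matched 10-mer) for every hit on the given strand."""
    dna = dna.upper()
    return [(m.start() + 1, m.group(1)) for m in CONSENSUS.finditer(dna)]


# Hypothetical sequence containing a single site.
example = "TTACGCAAGCGTGG"
print(find_sites(example))
```

The pattern is its own reverse complement ([GA]CGCNNGCG[CT] read backwards and complemented gives the same pattern), so scanning one strand is enough to locate candidate sites on both.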

Regions

Feature key    Position(s)    Length    Description    Evidence
Zinc finger    292 – 315      24        C2H2-type 1    PROSITE-ProRule annotation
Zinc finger    321 – 343      23        C2H2-type 2    PROSITE-ProRule annotation
Zinc finger    349 – 371      23        C2H2-type 3    PROSITE-ProRule annotation
Zinc finger    377 – 399      23        C2H2-type 4    PROSITE-ProRule annotation
Zinc finger    421 – 443      23        C2H2-type 5    PROSITE-ProRule annotation
Zinc finger    467 – 489      23        C2H2-type 6    PROSITE-ProRule annotation
Zinc finger    495 – 517      23        C2H2-type 7    PROSITE-ProRule annotation
Zinc finger    523 – 545      23        C2H2-type 8    PROSITE-ProRule annotation
Zinc finger    551 – 573      23        C2H2-type 9    PROSITE-ProRule annotation
Zinc finger    579 – 601      23        C2H2-type 10   PROSITE-ProRule annotation
Zinc finger    607 – 629      23        C2H2-type 11   PROSITE-ProRule annotation
Zinc finger    635 – 657      23        C2H2-type 12   PROSITE-ProRule annotation
Zinc finger    669 – 691      23        C2H2-type 13   PROSITE-ProRule annotation
Zinc finger    697 – 719      23        C2H2-type 14   PROSITE-ProRule annotation
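
Feature positions in the zinc finger table above are 1-based and inclusive, so a feature's length is end minus start plus one. The sketch below recomputes the Length column and the spacing between consecutive fingers; the helper names are illustrative and not part of any UniProt tooling.

```python
# (start, end) pairs copied from the zinc finger table, 1-based inclusive.
ZINC_FINGERS = [
    (292, 315), (321, 343), (349, 371), (377, 399), (421, 443),
    (467, 489), (495, 517), (523, 545), (551, 573), (579, 601),
    (607, 629), (635, 657), (669, 691), (697, 719),
]


def feature_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def linker_lengths(features: list[tuple[int, int]]) -> list[int]:
    """Number of residues between each pair of consecutive features."""
    return [nxt[0] - cur[1] - 1 for cur, nxt in zip(features, features[1:])]


lengths = [feature_length(s, e) for s, e in ZINC_FINGERS]
print(lengths)
print(linker_lengths(ZINC_FINGERS))
```

Most fingers are separated by 5 residues; the longer gaps fall after fingers 4, 5 and 12.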

Keywords - Biological process

Transcription, Transcription regulation

Keywords - Ligand

DNA-binding, Metal-binding, Zinc

Enzyme and pathway databases

Reactome: R-HSA-452723. Transcriptional regulation of pluripotent stem cells.

Names & Taxonomy

Protein names
Recommended name: Zinc finger and SCAN domain-containing protein 10
Alternative name(s): Zinc finger protein 206
Gene names
Name: ZSCAN10
Synonyms: ZNF206
Organism: Homo sapiens (Human)
Taxonomic identifier: 9606 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Primates > Haplorrhini > Catarrhini > Hominidae > Homo
Proteomes
  • UP000005640 Component: Chromosome 16

Organism-specific databases

HGNC: HGNC:12997. ZSCAN10.

Subcellular location

  • Nucleus (PROSITE-ProRule annotation)

Keywords - Cellular component

Nucleus

Pathology & Biotech

Organism-specific databases

PharmGKB: PA162410957.

Polymorphism and mutation databases

BioMuta: ZSCAN10.
DMDM: 55976759.

PTM / Processing

Molecule processing

Feature key    Position(s)    Length    Feature identifier    Description
Chain          1 – 725        725       PRO_0000047452        Zinc finger and SCAN domain-containing protein 10

Amino acid modifications

Feature key         Position(s)    Length    Description         Evidence
Modified residue    107            1         Phosphoserine       Combined sources
Modified residue    153            1         Phosphoserine       Combined sources
Modified residue    213            1         Phosphothreonine    Combined sources
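
The three modified residues can be cross-checked against the canonical sequence listed in the Sequences section: positions 107 and 153 should be serine (S) and position 213 threonine (T). The sketch below copies only the three relevant 50-residue rows from that listing; the names `ROWS`, `EXPECTED` and `residue_at` are illustrative.

```python
# Rows of the canonical sequence as displayed in the entry, keyed by the
# 1-based position of the first residue in each 50-residue row.
ROWS = {
    101: "SQVPSHSPKK ELPAEEPSVL GPSDEPPRPQ PRAAQPAEPG QWRLPPSSKQ",
    151: "PLSPGPQKTF QALQESSPQG PSPWPEESSR DQELAAVLEC LTFEDVPENK",
    201: "AWPAHPLGFG SRTPDKEEFK QEEPKGAAWP TPILAESQAD SPGVPGEPCA",
}

# Annotated phosphorylation sites: position -> expected one-letter residue.
EXPECTED = {107: "S", 153: "S", 213: "T"}


def residue_at(position: int) -> str:
    """Return the residue at a 1-based position covered by ROWS."""
    for start, row in ROWS.items():
        seq = row.replace(" ", "")  # drop the spaces between 10-residue blocks
        if start <= position < start + len(seq):
            return seq[position - start]
    raise KeyError(f"position {position} is outside the copied rows")


for pos, aa in EXPECTED.items():
    print(pos, residue_at(pos), residue_at(pos) == aa)
```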

Keywords - PTM

Phosphoprotein

Proteomic databases

PaxDb: Q96SZ4.
PRIDE: Q96SZ4.

PTM databases

iPTMnet: Q96SZ4.
PhosphoSite: Q96SZ4.

Expression

Gene expression databases

Bgee: Q96SZ4.
CleanEx: HS_ZSCAN10.
ExpressionAtlas: Q96SZ4. baseline and differential.
Genevisible: Q96SZ4. HS.

Interaction

Subunit structure

Interacts with POU5F1/OCT4 and SOX2 (By similarity).

Protein-protein interaction databases

STRING: 9606.ENSP00000252463.

Structure

3D structure databases

ProteinModelPortal: Q96SZ4.
SMR: Q96SZ4. Positions 1-63, 289-721.

Family & Domains

Domains and Repeats

Feature key    Position(s)    Length    Description    Evidence
Domain         1 – 71         71        SCAN box       PROSITE-ProRule annotation

Compositional bias

Feature key           Position(s)    Length    Description
Compositional bias    104 – 175      72        Pro-rich

Sequence similarities

Contains 14 C2H2-type zinc fingers (PROSITE-ProRule annotation).
Contains 1 SCAN box domain (PROSITE-ProRule annotation).

Keywords - Domain

Repeat, Zinc-finger

Phylogenomic databases

eggNOG: KOG1721. Eukaryota.
COG5048. LUCA.
GeneTree: ENSGT00730000111047.
HOGENOM: HOG000234619.
HOVERGEN: HBG018163.
InParanoid: Q96SZ4.
KO: K09230.
OMA: ERPHACH.
OrthoDB: EOG7KSX7Q.
PhylomeDB: Q96SZ4.
TreeFam: TF338010.

Family and domain databases

Gene3D: 3.30.160.60. 13 hits.
InterPro: IPR008916. Retrov_capsid_C.
IPR003309. SCAN_dom.
IPR007087. Znf_C2H2.
IPR015880. Znf_C2H2-like.
IPR013087. Znf_C2H2/integrase_DNA-bd.
Pfam: PF02023. SCAN. 1 hit.
PF00096. zf-C2H2. 4 hits.
PF13912. zf-C2H2_6. 1 hit.
SMART: SM00431. SCAN. 1 hit.
SM00355. ZnF_C2H2. 14 hits.
SUPFAM: SSF47353. SSF47353. 1 hit.
PROSITE: PS50804. SCAN_BOX. 1 hit.
PS00028. ZINC_FINGER_C2H2_1. 14 hits.
PS50157. ZINC_FINGER_C2H2_2. 14 hits.

Sequences (3)

Sequence status: Complete.

This entry describes 3 isoforms produced by alternative splicing.

Isoform 1 (identifier: Q96SZ4-1)

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.

        10         20         30         40         50
MGPRASLSRL RELCGHWLRP ALHTKKQILE LLVLEQFLSV LPPHLLGRLQ
        60         70         80         90        100
GQPLRDGEEV VLLLEGIHRE PSHAGPLDFS CNAGKSCPRA DVTLEEKGCA
       110        120        130        140        150
SQVPSHSPKK ELPAEEPSVL GPSDEPPRPQ PRAAQPAEPG QWRLPPSSKQ
       160        170        180        190        200
PLSPGPQKTF QALQESSPQG PSPWPEESSR DQELAAVLEC LTFEDVPENK
       210        220        230        240        250
AWPAHPLGFG SRTPDKEEFK QEEPKGAAWP TPILAESQAD SPGVPGEPCA
       260        270        280        290        300
QSLGRGAAAS GPGEDGSLLG SSEILEVKVA EGVPEPNPEL QFICADCGVS
       310        320        330        340        350
FPQLSRLKAH QLRSHPAGRS FLCLCCGKSF GRSSILKLHM RTHTDERPHA
       360        370        380        390        400
CHLCGHRFRQ SSHLSKHLLT HSSEPAFLCA ECGRGFQRRA SLVQHLLAHA
       410        420        430        440        450
QDQKPPCAPE SKAEAPPLTD VLCSHCGQSF QRRSSLKRHL RIHARDKDRR
       460        470        480        490        500
SSEGSGSRRR DSDRRPFVCS DCGKAFRRSE HLVAHRRVHT GERPFSCQAC
       510        520        530        540        550
GRSFTQSSQL VSHQRVHTGE KPYACPQCGK RFVRRASLAR HLLTHGGPRP
       560        570        580        590        600
HHCTQCGKSF GQTQDLARHQ RSHTGEKPCR CSECGEGFSQ SAHLARHQRI
       610        620        630        640        650
HTGEKPHACD TCGHRFRNSS NLARHRRSHT GERPYSCQTC GRSFRRNAHL
       660        670        680        690        700
RRHLATHAEP GQEQAEPPQE CVECGKSFSR SCNLLRHLLV HTGARPYSCT
       710        720
QCGRSFSRNS HLLRHLRTHA RETLY
Length: 725
Mass (Da): 80,387
Last modified: December 1, 2001 - v1
Checksum: 046163DA13669F12

Isoform 2 (identifier: Q96SZ4-2)

The sequence of this isoform differs from the canonical sequence as follows:
     1-339: Missing.

Note: No experimental confirmation available.
Length: 386
Mass (Da): 43,934
Checksum: 635DE92DB23BD813

Isoform 3 (identifier: Q96SZ4-3)

The sequence of this isoform differs from the canonical sequence as follows:
     1-85: MGPRASLSRL...PLDFSCNAGK → MLPVSGGHGA...PGAQPRGAAG
     86-167: Missing.

Note: No experimental confirmation available.
Length: 643
Mass (Da): 69,665
Checksum: 4CF46847F24FE3D6

Sequence caution

The sequence AAI14453.1 differs from that shown. Reason: Erroneous initiation. Translation N-terminally shortened (Curated).

Alternative sequence

Feature key             Position(s)    Length    Feature identifier    Description
Alternative sequence    1 – 339        339       VSP_039221            Missing in isoform 2. 1 Publication
Alternative sequence    1 – 85         85        VSP_054601            MGPRA…CNAGK → MLPVSGGHGATGVPEPAPGA LRPLAAAGSAHQETDPGAAG AGAVPECAASAPPGPPAGAA AQGWGGGGAAARGHPPGAQP RGAAG in isoform 3. 1 Publication
Alternative sequence    86 – 167       82        VSP_054602            Missing in isoform 3. 1 Publication
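
Each isoform can be derived from the canonical sequence by applying its alternative-sequence features, which also accounts for the stated isoform lengths (386 and 643). The following is a minimal sketch: the `Variant` class and `apply_variants` function are illustrative names, and a placeholder string of the canonical length stands in for the real sequence.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Variant:
    start: int         # 1-based, inclusive
    end: int           # 1-based, inclusive
    replacement: str   # empty string means "Missing"


# Replacement for residues 1-85 in isoform 3 (VSP_054601), copied from the table.
ISO3_INSERT = (
    "MLPVSGGHGATGVPEPAPGA" "LRPLAAAGSAHQETDPGAAG" "AGAVPECAASAPPGPPAGAA"
    "AQGWGGGGAAARGHPPGAQP" "RGAAG"
)

ISOFORM_2 = [Variant(1, 339, "")]
ISOFORM_3 = [Variant(1, 85, ISO3_INSERT), Variant(86, 167, "")]


def apply_variants(canonical: str, variants: list[Variant]) -> str:
    """Apply non-overlapping variants given in canonical coordinates."""
    seq = canonical
    # Work from the C-terminal end so earlier coordinates remain valid.
    for v in sorted(variants, key=lambda v: v.start, reverse=True):
        seq = seq[: v.start - 1] + v.replacement + seq[v.end:]
    return seq


# Placeholder with the canonical length (725); substitute the real sequence.
placeholder = "X" * 725
print(len(apply_variants(placeholder, ISOFORM_2)))
print(len(apply_variants(placeholder, ISOFORM_3)))
```

Because the isoform 3 replacement has the same length as the segment it replaces (85 residues), the whole length difference from the canonical form comes from the 82 missing residues at positions 86-167.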

Sequence databases

EMBL / GenBank / DDBJ:
AK027455 mRNA. Translation: BAB55124.1.
AK074736 mRNA. Translation: BAG51995.1.
AC108134 Genomic DNA. No translation available.
BC114452 mRNA. Translation: AAI14453.1. Different initiation.
CCDS: CCDS61813.1. [Q96SZ4-2]
CCDS61814.1. [Q96SZ4-3]
RefSeq: NP_001269344.1. NM_001282415.1. [Q96SZ4-2]
NP_001269345.1. NM_001282416.1. [Q96SZ4-3]
NP_116194.2. NM_032805.2.
UniGene: Hs.334515.

Genome annotation databases

Ensembl: ENST00000252463; ENSP00000252463; ENSG00000130182. [Q96SZ4-1]
ENST00000538082; ENSP00000440047; ENSG00000130182. [Q96SZ4-3]
ENST00000575108; ENSP00000459520; ENSG00000130182. [Q96SZ4-2]
GeneID: 84891.
KEGG: hsa:84891.
UCSC: uc002ctv.3. human. [Q96SZ4-1]

Keywords - Coding sequence diversity

Alternative splicing

Cross-references

Protocols and materials databases

DNASU: 84891.

Organism-specific databases

CTD: 84891.
GeneCards: ZSCAN10.
HGNC: HGNC:12997. ZSCAN10.
neXtProt: NX_Q96SZ4.
PharmGKB: PA162410957.

Miscellaneous databases

GenomeRNAi: 84891.
NextBio: 35522792.
PRO: Q96SZ4.

Publications

  1. "Complete sequencing and characterization of 21,243 full-length human cDNAs."
    Ota T., Suzuki Y., Nishikawa T., Otsuki T., Sugiyama T., Irie R., Wakamatsu A., Hayashi K., Sato H., Nagai K., Kimura K., Makita H., Sekine M., Obayashi M., Nishi T., Shibahara T., Tanaka T., Ishii S., Yamamoto J., Saito K., Kawai Y., Isono Y., Nakamura Y., Nagahari K., Murakami K., Yasuda T., Iwayanagi T., Wagatsuma M., Shiratori A., Sudo H., Hosoiri T., Kaku Y., Kodaira H., Kondo H., Sugawara M., Takahashi M., Kanda K., Yokoi T., Furuya T., Kikkawa E., Omura Y., Abe K., Kamihara K., Katsuta N., Sato K., Tanikawa M., Yamazaki M., Ninomiya K., Ishibashi T., Yamashita H., Murakawa K., Fujimori K., Tanai H., Kimata M., Watanabe M., Hiraoka S., Chiba Y., Ishida S., Ono Y., Takiguchi S., Watanabe S., Yosida M., Hotuta T., Kusano J., Kanehori K., Takahashi-Fujii A., Hara H., Tanase T.-O., Nomura Y., Togiya S., Komai F., Hara R., Takeuchi K., Arita M., Imose N., Musashino K., Yuuki H., Oshima A., Sasaki N., Aotsuka S., Yoshikawa Y., Matsunawa H., Ichihara T., Shiohata N., Sano S., Moriya S., Momiyama H., Satoh N., Takami S., Terashima Y., Suzuki O., Nakagawa S., Senoh A., Mizoguchi H., Goto Y., Shimizu F., Wakebe H., Hishigaki H., Watanabe T., Sugiyama A., Takemoto M., Kawakami B., Yamazaki M., Watanabe K., Kumagai A., Itakura S., Fukuzumi Y., Fujimori Y., Komiyama M., Tashiro H., Tanigami A., Fujiwara T., Ono T., Yamada K., Fujii Y., Ozaki K., Hirao M., Ohmori Y., Kawabata A., Hikiji T., Kobatake N., Inagaki H., Ikema Y., Okamoto S., Okitani R., Kawakami T., Noguchi S., Itoh T., Shigeta K., Senba T., Matsumura K., Nakajima Y., Mizuno T., Morinaga M., Sasaki M., Togashi T., Oyama M., Hata H., Watanabe M., Komatsu T., Mizushima-Sugano J., Satoh T., Shirai Y., Takahashi Y., Nakagawa K., Okumura K., Nagase T., Nomura N., Kikuchi H., Masuho Y., Yamashita R., Nakai K., Yada T., Nakamura Y., Ohara O., Isogai T., Sugano S.
    Nat. Genet. 36:40-45(2004)
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORMS 1 AND 2).
    Tissue: Teratocarcinoma.
  2. "The sequence and analysis of duplication-rich human chromosome 16."
    Martin J., Han C., Gordon L.A., Terry A., Prabhakar S., She X., Xie G., Hellsten U., Chan Y.M., Altherr M., Couronne O., Aerts A., Bajorek E., Black S., Blumer H., Branscomb E., Brown N.C., Bruno W.J., Buckingham J.M., Callen D.F., Campbell C.S., Campbell M.L., Campbell E.W., Caoile C., Challacombe J.F., Chasteen L.A., Chertkov O., Chi H.C., Christensen M., Clark L.M., Cohn J.D., Denys M., Detter J.C., Dickson M., Dimitrijevic-Bussod M., Escobar J., Fawcett J.J., Flowers D., Fotopulos D., Glavina T., Gomez M., Gonzales E., Goodstein D., Goodwin L.A., Grady D.L., Grigoriev I., Groza M., Hammon N., Hawkins T., Haydu L., Hildebrand C.E., Huang W., Israni S., Jett J., Jewett P.B., Kadner K., Kimball H., Kobayashi A., Krawczyk M.-C., Leyba T., Longmire J.L., Lopez F., Lou Y., Lowry S., Ludeman T., Manohar C.F., Mark G.A., McMurray K.L., Meincke L.J., Morgan J., Moyzis R.K., Mundt M.O., Munk A.C., Nandkeshwar R.D., Pitluck S., Pollard M., Predki P., Parson-Quintana B., Ramirez L., Rash S., Retterer J., Ricke D.O., Robinson D.L., Rodriguez A., Salamov A., Saunders E.H., Scott D., Shough T., Stallings R.L., Stalvey M., Sutherland R.D., Tapia R., Tesmer J.G., Thayer N., Thompson L.S., Tice H., Torney D.C., Tran-Gyamfi M., Tsai M., Ulanovsky L.E., Ustaszewska A., Vo N., White P.S., Williams A.L., Wills P.L., Wu J.-R., Wu K., Yang J., DeJong P., Bruce D., Doggett N.A., Deaven L., Schmutz J., Grimwood J., Richardson P., Rokhsar D.S., Eichler E.E., Gilna P., Lucas S.M., Myers R.M., Rubin E.M., Pennacchio L.A.
    Nature 432:988-994(2004)
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
  3. "The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)."
    The MGC Project Team
    Genome Res. 14:2121-2127(2004)
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 3).
  4. "System-wide temporal characterization of the proteome and phosphoproteome of human embryonic stem cell differentiation."
    Rigbolt K.T., Prokhorova T.A., Akimov V., Henningsen J., Johansen P.T., Kratchmarova I., Kassem M., Mann M., Olsen J.V., Blagoev B.
    Sci. Signal. 4:RS3-RS3(2011)
    Cited for: PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-107; SER-153 AND THR-213, IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].

Entry information

Entry name: ZSC10_HUMAN
Accession: Primary (citable) accession number: Q96SZ4
Secondary accession number(s): B3KQD3, H0YFS6, Q1WWM2
Entry history
Integrated into UniProtKB/Swiss-Prot: November 23, 2004
Last sequence update: December 1, 2001
Last modified: March 16, 2016
This is version 112 of the entry and version 1 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Chordata Protein Annotation Program
Disclaimer: Any medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care.
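
The primary and secondary accession numbers above, and the isoform identifiers used elsewhere in this entry (Q96SZ4-1 to Q96SZ4-3), follow a fixed pattern that can be validated before they are used in scripts. The sketch below assumes the accession format published in the UniProt help pages; the function names are illustrative.

```python
import re

# Assumed accession pattern (6 or 10 characters), per UniProt's help pages.
ACCESSION = re.compile(
    r"(?:[OPQ][0-9][A-Z0-9]{3}[0-9]"
    r"|[A-NR-Z][0-9](?:[A-Z][A-Z0-9]{2}[0-9]){1,2})"
)
# Isoform identifiers append "-<number>" to the accession.
ISOFORM = re.compile(r"(?P<acc>" + ACCESSION.pattern + r")-(?P<num>[0-9]+)")


def is_accession(text: str) -> bool:
    """True if the whole string looks like a UniProtKB accession."""
    return ACCESSION.fullmatch(text) is not None


def split_isoform(identifier: str) -> tuple[str, int]:
    """Split an isoform identifier such as 'Q96SZ4-3' into (accession, number)."""
    m = ISOFORM.fullmatch(identifier)
    if m is None:
        raise ValueError(f"not an isoform identifier: {identifier!r}")
    return m.group("acc"), int(m.group("num"))


for acc in ["Q96SZ4", "B3KQD3", "H0YFS6", "Q1WWM2", "ZSC10_HUMAN"]:
    print(acc, is_accession(acc))
print(split_isoform("Q96SZ4-3"))
```

Note that the entry name (ZSC10_HUMAN) is a separate identifier type and does not match the accession pattern.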

Miscellaneous

Keywords - Technical term

Complete proteome, Reference proteome

Documents

  1. Human chromosome 16
    Human chromosome 16: entries, gene names and cross-references to MIM
  2. SIMILARITY comments
    Index of protein domains and families

Similar proteins

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
  • 100%: UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
  • 90%: UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a. seed sequence).
  • 50%: UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.
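
The UniRef90 and UniRef50 membership rules above combine two thresholds: a minimum identity to the seed sequence and a minimum overlap with it. The sketch below only restates those two conditions as a predicate; it is not the clustering procedure UniRef uses, and the names are illustrative.

```python
# Thresholds as stated above; identity and overlap are fractions in [0, 1],
# both measured against the cluster's seed (longest) sequence.
MIN_IDENTITY = {"UniRef90": 0.90, "UniRef50": 0.50}
MIN_OVERLAP = 0.80


def may_join_cluster(level: str, identity: float, overlap: float) -> bool:
    """True if a sequence satisfies both thresholds for the given level."""
    return identity >= MIN_IDENTITY[level] and overlap >= MIN_OVERLAP


print(may_join_cluster("UniRef90", identity=0.93, overlap=0.85))
print(may_join_cluster("UniRef90", identity=0.93, overlap=0.70))
```

A sequence that is highly identical to the seed but covers too little of it fails the overlap test, as the second call shows.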