Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein

Transport and Golgi organization protein 2 homolog

Gene

TANGO2

Organism
Homo sapiens (Human)
Status
Reviewed-Annotation score: Annotation score: 3 out of 5-Experimental evidence at transcript leveli

Names & Taxonomyi

Protein namesi
Recommended name:
Transport and Golgi organization protein 2 homolog
Gene namesi
Name:TANGO2
Synonyms:C22orf25
OrganismiHomo sapiens (Human)
Taxonomic identifieri9606 [NCBI]
Taxonomic lineageiEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresPrimatesHaplorrhiniCatarrhiniHominidaeHomo
ProteomesiUP000005640: Chromosome 22

Organism-specific databases

HGNCiHGNC:25439. TANGO2.

Pathology & Biotechi

Organism-specific databases

PharmGKBiPA143485406.

PTM / Processingi

Molecule processing

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Chaini1 – 276276Transport and Golgi organization protein 2 homologPRO_0000253891Add
BLAST

Proteomic databases

MaxQBiQ6ICL3.
PaxDbiQ6ICL3.
PRIDEiQ6ICL3.

PTM databases

PhosphoSiteiQ6ICL3.

Expressioni

Gene expression databases

BgeeiQ6ICL3.
CleanExiHS_C22orf25.
ExpressionAtlasiQ6ICL3. baseline and differential.
GenevestigatoriQ6ICL3.

Organism-specific databases

HPAiHPA003080.

Interactioni

Protein-protein interaction databases

BioGridi126178. 4 interactions.
STRINGi9606.ENSP00000332721.

Structurei

3D structure databases

ProteinModelPortaliQ6ICL3.
ModBaseiSearch...
MobiDBiSearch...

Family & Domainsi

Sequence similaritiesi

Belongs to the Tango2 family.Curated

Phylogenomic databases

eggNOGiCOG3332.
GeneTreeiENSGT00390000012733.
HOGENOMiHOG000261311.
HOVERGENiHBG017881.
InParanoidiQ6ICL3.
OMAiLSHWETR.
OrthoDBiEOG7S2208.
PhylomeDBiQ6ICL3.
TreeFamiTF315064.

Family and domain databases

InterProiIPR008551. DUF833.
[Graphical view]
PANTHERiPTHR17985. PTHR17985. 1 hit.
PfamiPF05742. NRDE. 1 hit.
[Graphical view]

Sequences (6)i

Sequence statusi: Complete.

This entry describes 6 isoformsi produced by alternative splicing. Align

Isoform 1 (identifier: Q6ICL3-1) [UniParc]FASTAAdd to Basket

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.

« Hide

        10         20         30         40         50
MCIIFFKFDP RPVSKNAYRL ILAANRDEFY SRPSKLADFW GNNNEILSGL
60 70 80 90 100
DMEEGKEGGT WLGISTRGKL AALTNYLQPQ LDWQARGRGE LVTHFLTTDV
110 120 130 140 150
DSLSYLKKVS MEGHLYNGFN LIAADLSTAK GDVICYYGNR GEPDPIVLTP
160 170 180 190 200
GTYGLSNALL ETPWRKLCFG KQLFLEAVER SQALPKDVLI ASLLDVLNNE
210 220 230 240 250
EAQLPDPAIE DQGGEYVQPM LSKYAAVCVR CPGYGTRTNT IILVDADGHV
260 270
TFTERSMMDK DLSHWETRTY EFTLQS
Length:276
Mass (Da):30,937
Last modified:July 5, 2004 - v1
Checksum:i99D55353FD1B74E4
GO
Isoform 2 (identifier: Q6ICL3-2) [UniParc]FASTAAdd to Basket

The sequence of this isoform differs from the canonical sequence as follows:
     90-151: Missing.

Show »
Length:214
Mass (Da):24,194
Checksum:i6E2207E4F3CA29A2
GO
Isoform 3 (identifier: Q6ICL3-3) [UniParc]FASTAAdd to Basket

The sequence of this isoform differs from the canonical sequence as follows:
     1-48: MCIIFFKFDP...FWGNNNEILS → MPLGAGTPVN...FWGNNNEILS
     190-197: Missing.

Note: No experimental confirmation available.

Show »
Length:273
Mass (Da):30,345
Checksum:i7AAF915CEB57393A
GO
Isoform 4 (identifier: Q6ICL3-4) [UniParc]FASTAAdd to Basket

The sequence of this isoform differs from the canonical sequence as follows:
     1-18: MCIIFFKFDPRPVSKNAY → MPPKLLCAGR...REDSATEGSH

Show »
Length:317
Mass (Da):34,952
Checksum:i49895A717C4830F2
GO
Isoform 5 (identifier: Q6ICL3-5) [UniParc]FASTAAdd to Basket

The sequence of this isoform differs from the canonical sequence as follows:
     1-126: MCIIFFKFDP...NGFNLIAADL → MAGHQHTWQAGSTHQLPAAAAGLAGPRA

Show »
Length:178
Mass (Da):19,467
Checksum:i399C2DF473F24294
GO
Isoform 6 (identifier: Q6ICL3-6) [UniParc]FASTAAdd to Basket

The sequence of this isoform differs from the canonical sequence as follows:
     1-18: MCIIFFKFDPRPVSKNAY → MPPKLLCAGR...REDSATEGSH
     151-157: GTYGLSN → EPTLSSW
     158-276: Missing.

Show »
Length:198
Mass (Da):21,628
Checksum:iC4BF2CDFE87CD2B0
GO

Natural variant

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Natural varianti125 – 1251D → N.1 Publication
Corresponds to variant rs17855650 [ dbSNP | Ensembl ].
VAR_028742
Natural varianti200 – 2001E → K.1 Publication
Corresponds to variant rs17854107 [ dbSNP | Ensembl ].
VAR_028743
Natural varianti245 – 2451D → E.
Corresponds to variant rs16982614 [ dbSNP | Ensembl ].
VAR_028744

Alternative sequence

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Alternative sequencei1 – 126126MCIIF…IAADL → MAGHQHTWQAGSTHQLPAAA AGLAGPRA in isoform 5. 1 PublicationVSP_055604Add
BLAST
Alternative sequencei1 – 4848MCIIF…NEILS → MPLGAGTPVNVQRREDSATE GSHRLILAANRDEFYSRPSK LADFWGNNNEILS in isoform 3. 1 PublicationVSP_021137Add
BLAST
Alternative sequencei1 – 1818MCIIF…SKNAY → MPPKLLCAGRCVGQDGAAQA WHCPPGQGHSVWDAVRMPLG AGTPVNVQRREDSATEGSH in isoform 4 and isoform 6. 1 PublicationVSP_055605Add
BLAST
Alternative sequencei90 – 15162Missing in isoform 2. 1 PublicationVSP_021139Add
BLAST
Alternative sequencei151 – 1577GTYGLSN → EPTLSSW in isoform 6. 1 PublicationVSP_055606
Alternative sequencei158 – 276119Missing in isoform 6. 1 PublicationVSP_055607Add
BLAST
Alternative sequencei190 – 1978Missing in isoform 3. 1 PublicationVSP_021138

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
AK057461 mRNA. Translation: BAB71498.1.
AK092484 mRNA. Translation: BAC03902.1.
AK295210 mRNA. Translation: BAH12013.1.
AK298593 mRNA. Translation: BAH12819.1.
AK301366 mRNA. Translation: BAH13466.1.
AK316056 mRNA. Translation: BAH14427.1.
CR456355 mRNA. Translation: CAG30241.1.
AC005663 Genomic DNA. No translation available.
AC006547 Genomic DNA. No translation available.
CH471176 Genomic DNA. Translation: EAX03001.1.
CH471176 Genomic DNA. Translation: EAX03003.1.
CH471176 Genomic DNA. Translation: EAX03005.1.
BC041339 mRNA. Translation: AAH41339.1.
AL713640 mRNA. Translation: CAD28454.1.
CCDSiCCDS13772.1. [Q6ICL3-1]
CCDS63404.1. [Q6ICL3-2]
CCDS63405.1. [Q6ICL3-4]
CCDS63406.1. [Q6ICL3-6]
CCDS63407.1. [Q6ICL3-5]
RefSeqiNP_001270035.1. NM_001283106.1. [Q6ICL3-1]
NP_001270045.1. NM_001283116.1. [Q6ICL3-1]
NP_001270058.1. NM_001283129.1. [Q6ICL3-4]
NP_001270077.1. NM_001283148.1.
NP_001270083.1. NM_001283154.1.
NP_001270108.1. NM_001283179.1. [Q6ICL3-2]
NP_001270115.1. NM_001283186.1. [Q6ICL3-2]
NP_001270128.1. NM_001283199.1.
NP_001270144.1. NM_001283215.1. [Q6ICL3-6]
NP_001270164.1. NM_001283235.1. [Q6ICL3-5]
NP_001270177.1. NM_001283248.1.
NP_690870.3. NM_152906.5. [Q6ICL3-1]
XP_005261280.1. XM_005261223.2. [Q6ICL3-2]
XP_005261284.1. XM_005261227.2. [Q6ICL3-5]
XP_005261285.1. XM_005261228.1. [Q6ICL3-5]
UniGeneiHs.474233.

Genome annotation databases

EnsembliENST00000327374; ENSP00000332721; ENSG00000183597. [Q6ICL3-1]
ENST00000398042; ENSP00000381122; ENSG00000183597. [Q6ICL3-2]
ENST00000401833; ENSP00000384827; ENSG00000183597. [Q6ICL3-4]
ENST00000401886; ENSP00000385662; ENSG00000183597. [Q6ICL3-2]
ENST00000432883; ENSP00000402926; ENSG00000183597. [Q6ICL3-5]
ENST00000434570; ENSP00000391262; ENSG00000183597. [Q6ICL3-6]
ENST00000456048; ENSP00000403645; ENSG00000183597. [Q6ICL3-4]
GeneIDi128989.
KEGGihsa:128989.
UCSCiuc002zrc.1. human. [Q6ICL3-1]
uc002zrf.2. human. [Q6ICL3-2]

Polymorphism databases

DMDMi74709518.

Keywords - Coding sequence diversityi

Alternative splicing, Polymorphism

Cross-referencesi

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
AK057461 mRNA. Translation: BAB71498.1.
AK092484 mRNA. Translation: BAC03902.1.
AK295210 mRNA. Translation: BAH12013.1.
AK298593 mRNA. Translation: BAH12819.1.
AK301366 mRNA. Translation: BAH13466.1.
AK316056 mRNA. Translation: BAH14427.1.
CR456355 mRNA. Translation: CAG30241.1.
AC005663 Genomic DNA. No translation available.
AC006547 Genomic DNA. No translation available.
CH471176 Genomic DNA. Translation: EAX03001.1.
CH471176 Genomic DNA. Translation: EAX03003.1.
CH471176 Genomic DNA. Translation: EAX03005.1.
BC041339 mRNA. Translation: AAH41339.1.
AL713640 mRNA. Translation: CAD28454.1.
CCDSiCCDS13772.1. [Q6ICL3-1]
CCDS63404.1. [Q6ICL3-2]
CCDS63405.1. [Q6ICL3-4]
CCDS63406.1. [Q6ICL3-6]
CCDS63407.1. [Q6ICL3-5]
RefSeqiNP_001270035.1. NM_001283106.1. [Q6ICL3-1]
NP_001270045.1. NM_001283116.1. [Q6ICL3-1]
NP_001270058.1. NM_001283129.1. [Q6ICL3-4]
NP_001270077.1. NM_001283148.1.
NP_001270083.1. NM_001283154.1.
NP_001270108.1. NM_001283179.1. [Q6ICL3-2]
NP_001270115.1. NM_001283186.1. [Q6ICL3-2]
NP_001270128.1. NM_001283199.1.
NP_001270144.1. NM_001283215.1. [Q6ICL3-6]
NP_001270164.1. NM_001283235.1. [Q6ICL3-5]
NP_001270177.1. NM_001283248.1.
NP_690870.3. NM_152906.5. [Q6ICL3-1]
XP_005261280.1. XM_005261223.2. [Q6ICL3-2]
XP_005261284.1. XM_005261227.2. [Q6ICL3-5]
XP_005261285.1. XM_005261228.1. [Q6ICL3-5]
UniGeneiHs.474233.

3D structure databases

ProteinModelPortaliQ6ICL3.
ModBaseiSearch...
MobiDBiSearch...

Protein-protein interaction databases

BioGridi126178. 4 interactions.
STRINGi9606.ENSP00000332721.

PTM databases

PhosphoSiteiQ6ICL3.

Polymorphism databases

DMDMi74709518.

Proteomic databases

MaxQBiQ6ICL3.
PaxDbiQ6ICL3.
PRIDEiQ6ICL3.

Protocols and materials databases

DNASUi128989.
Structural Biology KnowledgebaseSearch...

Genome annotation databases

EnsembliENST00000327374; ENSP00000332721; ENSG00000183597. [Q6ICL3-1]
ENST00000398042; ENSP00000381122; ENSG00000183597. [Q6ICL3-2]
ENST00000401833; ENSP00000384827; ENSG00000183597. [Q6ICL3-4]
ENST00000401886; ENSP00000385662; ENSG00000183597. [Q6ICL3-2]
ENST00000432883; ENSP00000402926; ENSG00000183597. [Q6ICL3-5]
ENST00000434570; ENSP00000391262; ENSG00000183597. [Q6ICL3-6]
ENST00000456048; ENSP00000403645; ENSG00000183597. [Q6ICL3-4]
GeneIDi128989.
KEGGihsa:128989.
UCSCiuc002zrc.1. human. [Q6ICL3-1]
uc002zrf.2. human. [Q6ICL3-2]

Organism-specific databases

CTDi128989.
GeneCardsiGC22P020005.
H-InvDBHIX0016248.
HGNCiHGNC:25439. TANGO2.
HPAiHPA003080.
neXtProtiNX_Q6ICL3.
PharmGKBiPA143485406.
GenAtlasiSearch...

Phylogenomic databases

eggNOGiCOG3332.
GeneTreeiENSGT00390000012733.
HOGENOMiHOG000261311.
HOVERGENiHBG017881.
InParanoidiQ6ICL3.
OMAiLSHWETR.
OrthoDBiEOG7S2208.
PhylomeDBiQ6ICL3.
TreeFamiTF315064.

Miscellaneous databases

GeneWikiiC22orf25.
GenomeRNAii128989.
NextBioi35478652.
PROiQ6ICL3.

Gene expression databases

BgeeiQ6ICL3.
CleanExiHS_C22orf25.
ExpressionAtlasiQ6ICL3. baseline and differential.
GenevestigatoriQ6ICL3.

Family and domain databases

InterProiIPR008551. DUF833.
[Graphical view]
PANTHERiPTHR17985. PTHR17985. 1 hit.
PfamiPF05742. NRDE. 1 hit.
[Graphical view]
ProtoNetiSearch...

Publicationsi

  1. "Complete sequencing and characterization of 21,243 full-length human cDNAs."
    Ota T., Suzuki Y., Nishikawa T., Otsuki T., Sugiyama T., Irie R., Wakamatsu A., Hayashi K., Sato H., Nagai K., Kimura K., Makita H., Sekine M., Obayashi M., Nishi T., Shibahara T., Tanaka T., Ishii S.
    , Yamamoto J., Saito K., Kawai Y., Isono Y., Nakamura Y., Nagahari K., Murakami K., Yasuda T., Iwayanagi T., Wagatsuma M., Shiratori A., Sudo H., Hosoiri T., Kaku Y., Kodaira H., Kondo H., Sugawara M., Takahashi M., Kanda K., Yokoi T., Furuya T., Kikkawa E., Omura Y., Abe K., Kamihara K., Katsuta N., Sato K., Tanikawa M., Yamazaki M., Ninomiya K., Ishibashi T., Yamashita H., Murakawa K., Fujimori K., Tanai H., Kimata M., Watanabe M., Hiraoka S., Chiba Y., Ishida S., Ono Y., Takiguchi S., Watanabe S., Yosida M., Hotuta T., Kusano J., Kanehori K., Takahashi-Fujii A., Hara H., Tanase T.-O., Nomura Y., Togiya S., Komai F., Hara R., Takeuchi K., Arita M., Imose N., Musashino K., Yuuki H., Oshima A., Sasaki N., Aotsuka S., Yoshikawa Y., Matsunawa H., Ichihara T., Shiohata N., Sano S., Moriya S., Momiyama H., Satoh N., Takami S., Terashima Y., Suzuki O., Nakagawa S., Senoh A., Mizoguchi H., Goto Y., Shimizu F., Wakebe H., Hishigaki H., Watanabe T., Sugiyama A., Takemoto M., Kawakami B., Yamazaki M., Watanabe K., Kumagai A., Itakura S., Fukuzumi Y., Fujimori Y., Komiyama M., Tashiro H., Tanigami A., Fujiwara T., Ono T., Yamada K., Fujii Y., Ozaki K., Hirao M., Ohmori Y., Kawabata A., Hikiji T., Kobatake N., Inagaki H., Ikema Y., Okamoto S., Okitani R., Kawakami T., Noguchi S., Itoh T., Shigeta K., Senba T., Matsumura K., Nakajima Y., Mizuno T., Morinaga M., Sasaki M., Togashi T., Oyama M., Hata H., Watanabe M., Komatsu T., Mizushima-Sugano J., Satoh T., Shirai Y., Takahashi Y., Nakagawa K., Okumura K., Nagase T., Nomura N., Kikuchi H., Masuho Y., Yamashita R., Nakai K., Yada T., Nakamura Y., Ohara O., Isogai T., Sugano S.
    Nat. Genet. 36:40-45(2004) [PubMed] [Europe PMC] [Abstract]
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORMS 2; 3; 4; 5 AND 6).
    Tissue: Caudate nucleus, Hippocampus, Mesangial cell, Placenta, Synovium and Testis.
  2. Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1).
  3. "The DNA sequence of human chromosome 22."
    Dunham I., Hunt A.R., Collins J.E., Bruskiewich R., Beare D.M., Clamp M., Smink L.J., Ainscough R., Almeida J.P., Babbage A.K., Bagguley C., Bailey J., Barlow K.F., Bates K.N., Beasley O.P., Bird C.P., Blakey S.E., Bridgeman A.M.
    , Buck D., Burgess J., Burrill W.D., Burton J., Carder C., Carter N.P., Chen Y., Clark G., Clegg S.M., Cobley V.E., Cole C.G., Collier R.E., Connor R., Conroy D., Corby N.R., Coville G.J., Cox A.V., Davis J., Dawson E., Dhami P.D., Dockree C., Dodsworth S.J., Durbin R.M., Ellington A.G., Evans K.L., Fey J.M., Fleming K., French L., Garner A.A., Gilbert J.G.R., Goward M.E., Grafham D.V., Griffiths M.N.D., Hall C., Hall R.E., Hall-Tamlyn G., Heathcott R.W., Ho S., Holmes S., Hunt S.E., Jones M.C., Kershaw J., Kimberley A.M., King A., Laird G.K., Langford C.F., Leversha M.A., Lloyd C., Lloyd D.M., Martyn I.D., Mashreghi-Mohammadi M., Matthews L.H., Mccann O.T., Mcclay J., Mclaren S., McMurray A.A., Milne S.A., Mortimore B.J., Odell C.N., Pavitt R., Pearce A.V., Pearson D., Phillimore B.J.C.T., Phillips S.H., Plumb R.W., Ramsay H., Ramsey Y., Rogers L., Ross M.T., Scott C.E., Sehra H.K., Skuce C.D., Smalley S., Smith M.L., Soderlund C., Spragon L., Steward C.A., Sulston J.E., Swann R.M., Vaudin M., Wall M., Wallis J.M., Whiteley M.N., Willey D.L., Williams L., Williams S.A., Williamson H., Wilmer T.E., Wilming L., Wright C.L., Hubbard T., Bentley D.R., Beck S., Rogers J., Shimizu N., Minoshima S., Kawasaki K., Sasaki T., Asakawa S., Kudoh J., Shintani A., Shibuya K., Yoshizaki Y., Aoki N., Mitsuyama S., Roe B.A., Chen F., Chu L., Crabtree J., Deschamps S., Do A., Do T., Dorman A., Fang F., Fu Y., Hu P., Hua A., Kenton S., Lai H., Lao H.I., Lewis J., Lewis S., Lin S.-P., Loh P., Malaj E., Nguyen T., Pan H., Phan S., Qi S., Qian Y., Ray L., Ren Q., Shaull S., Sloan D., Song L., Wang Q., Wang Y., Wang Z., White J., Willingham D., Wu H., Yao Z., Zhan M., Zhang G., Chissoe S., Murray J., Miller N., Minx P., Fulton R., Johnson D., Bemis G., Bentley D., Bradshaw H., Bourne S., Cordes M., Du Z., Fulton L., Goela D., Graves T., Hawkins J., Hinds K., Kemp K., Latreille P., Layman D., Ozersky P., Rohlfing T., Scheet P., Walker C., Wamsley A., Wohldmann P., Pepin K., Nelson J., Korf I., Bedell J.A., Hillier L.W., Mardis E., Waterston R., Wilson R., Emanuel B.S., Shaikh T., Kurahashi H., Saitta S., Budarf M.L., McDermid H.E., Johnson A., Wong A.C.C., Morrow B.E., Edelmann L., Kim U.J., Shizuya H., Simon M.I., Dumanski J.P., Peyrard M., Kedra D., Seroussi E., Fransson I., Tapia I., Bruder C.E., O'Brien K.P., Wilkinson P., Bodenteich A., Hartman K., Hu X., Khan A.S., Lane L., Tilahun Y., Wright H.
    Nature 402:489-495(1999) [PubMed] [Europe PMC] [Abstract]
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
  4. Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
  5. "The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)."
    The MGC Project Team
    Genome Res. 14:2121-2127(2004) [PubMed] [Europe PMC] [Abstract]
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1), VARIANTS ASN-125 AND LYS-200.
    Tissue: Brain.
  6. Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 53-276 (ISOFORM 1).
    Tissue: Amygdala.

Entry informationi

Entry nameiTNG2_HUMAN
AccessioniPrimary (citable) accession number: Q6ICL3
Secondary accession number(s): A8MUE9
, B7WNV6, B7Z583, B7Z730, D3DX23, Q8IW05, Q8NAL0, Q8TCS0, Q96M16
Entry historyi
Integrated into UniProtKB/Swiss-Prot: October 17, 2006
Last sequence update: July 5, 2004
Last modified: January 7, 2015
This is version 78 of the entry and version 1 of the sequence. [Complete history]
Entry statusiReviewed (UniProtKB/Swiss-Prot)
Annotation programChordata Protein Annotation Program
DisclaimerAny medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care.

Miscellaneousi

Keywords - Technical termi

Complete proteome, Reference proteome

Documents

  1. Human chromosome 22
    Human chromosome 22: entries, gene names and cross-references to MIM
  2. Human entries with polymorphisms or disease mutations
    List of human entries with polymorphisms or disease mutations
  3. Human polymorphisms and disease mutations
    Index of human polymorphisms and disease mutations
  4. SIMILARITY comments
    Index of protein domains and families

External Data

Dasty 3

Similar proteinsi

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into Uniref entry.
90%UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a seed sequence).
50%UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.