Protein

Solute carrier organic anion transporter family member 6A1

Gene

SLCO6A1

Organism
Homo sapiens (Human)
Status
Reviewed - Annotation score: 4 out of 5 - Experimental evidence at transcript level

Function

GO - Molecular function

GO - Biological process

Keywords - Biological process

Transport

Protein family/group databases

TCDB: 2.A.60.1.17. The organo anion transporter (OAT) family.

Names & Taxonomy

Protein names
Recommended name:
  Solute carrier organic anion transporter family member 6A1
Alternative name(s):
  Cancer/testis antigen 48 (short name: CT48)
  Gonad-specific transporter (short name: GST)
  Organic anion-transporting polypeptide 6A1
  Organic anion-transporting polypeptide I (short name: OATP-I)
  Solute carrier family 21 member 19
Gene names
  Name: SLCO6A1
  Synonyms: OATP6A1, SLC21A19
Organism: Homo sapiens (Human)
Taxonomic identifier: 9606 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Primates > Haplorrhini > Catarrhini > Hominidae > Homo
Proteomes
  • UP000005640 Component: Chromosome 5

Organism-specific databases

HGNC: HGNC:23613. SLCO6A1.

Subcellular location

Topology

Feature key         Position(s)  Length  Description       Evidence
Topological domain  1 – 106      106     Cytoplasmic       Sequence analysis
Transmembrane       107 – 126    20      Helical; Name=1   Sequence analysis
Topological domain  127 – 145    19      Extracellular     Sequence analysis
Transmembrane       146 – 166    21      Helical; Name=2   Sequence analysis
Topological domain  167 – 171    5       Cytoplasmic       Sequence analysis
Transmembrane       172 – 196    25      Helical; Name=3   Sequence analysis
Topological domain  197 – 223    27      Extracellular     Sequence analysis
Transmembrane       224 – 254    31      Helical; Name=4   Sequence analysis
Topological domain  255 – 274    20      Cytoplasmic       Sequence analysis
Transmembrane       275 – 295    21      Helical; Name=5   Sequence analysis
Topological domain  296 – 311    16      Extracellular     Sequence analysis
Transmembrane       312 – 336    25      Helical; Name=6   Sequence analysis
Topological domain  337 – 378    42      Cytoplasmic       Sequence analysis
Transmembrane       379 – 400    22      Helical; Name=7   Sequence analysis
Topological domain  401 – 420    20      Extracellular     Sequence analysis
Transmembrane       421 – 444    24      Helical; Name=8   Sequence analysis
Topological domain  445 – 448    4       Cytoplasmic       Sequence analysis
Transmembrane       449 – 472    24      Helical; Name=9   Sequence analysis
Topological domain  473 – 581    109     Extracellular     Sequence analysis
Transmembrane       582 – 604    23      Helical; Name=10  Sequence analysis
Topological domain  605 – 613    9       Cytoplasmic       Sequence analysis
Transmembrane       614 – 639    26      Helical; Name=11  Sequence analysis
Topological domain  640 – 673    34      Extracellular     Sequence analysis
Transmembrane       674 – 691    18      Helical; Name=12  Sequence analysis
Topological domain  692 – 719    28      Cytoplasmic       Sequence analysis
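
The topology annotation alternates topological domains with transmembrane helices along the canonical sequence. A minimal Python sketch, assuming the coordinates are transcribed exactly as listed in this entry, checks that the segments cover residues 1 to 719 with no gaps or overlaps and that each stated length equals end - start + 1. The names `SEGMENTS` and `check_tiling` are illustrative and not part of any UniProt tooling.

```python
# Topology segments transcribed from the entry:
# (feature key, start, end, stated length); coordinates are 1-based, inclusive.
SEGMENTS = [
    ("Topological domain", 1, 106, 106),
    ("Transmembrane", 107, 126, 20),
    ("Topological domain", 127, 145, 19),
    ("Transmembrane", 146, 166, 21),
    ("Topological domain", 167, 171, 5),
    ("Transmembrane", 172, 196, 25),
    ("Topological domain", 197, 223, 27),
    ("Transmembrane", 224, 254, 31),
    ("Topological domain", 255, 274, 20),
    ("Transmembrane", 275, 295, 21),
    ("Topological domain", 296, 311, 16),
    ("Transmembrane", 312, 336, 25),
    ("Topological domain", 337, 378, 42),
    ("Transmembrane", 379, 400, 22),
    ("Topological domain", 401, 420, 20),
    ("Transmembrane", 421, 444, 24),
    ("Topological domain", 445, 448, 4),
    ("Transmembrane", 449, 472, 24),
    ("Topological domain", 473, 581, 109),
    ("Transmembrane", 582, 604, 23),
    ("Topological domain", 605, 613, 9),
    ("Transmembrane", 614, 639, 26),
    ("Topological domain", 640, 673, 34),
    ("Transmembrane", 674, 691, 18),
    ("Topological domain", 692, 719, 28),
]

CANONICAL_LENGTH = 719  # length of isoform 1 as stated in the entry


def check_tiling(segments, total_length):
    """Return True if segments cover 1..total_length contiguously
    and every stated length matches its coordinates."""
    expected_start = 1
    for _key, start, end, stated_length in segments:
        if start != expected_start:
            return False
        if end - start + 1 != stated_length:
            return False
        expected_start = end + 1
    return expected_start == total_length + 1


# Count the annotated transmembrane helices.
helix_count = sum(1 for key, *_ in SEGMENTS if key == "Transmembrane")
print(check_tiling(SEGMENTS, CANONICAL_LENGTH), helix_count)
```

A check like this is useful mainly when coordinates are re-keyed by hand from a flattened table, since a single digit shifted between the position and length columns breaks the tiling.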

GO - Cellular component

Keywords - Cellular component

Cell membrane, Membrane

Pathology & Biotech

Organism-specific databases

PharmGKB: PA134949852.

Polymorphism and mutation databases

BioMuta: SLCO6A1.
DMDM: 160185610.

PTM / Processing

Molecule processing

Feature key  Position(s)  Length  Description                                                 Feature identifier
Chain        1 – 719      719     Solute carrier organic anion transporter family member 6A1  PRO_0000307642

Amino acid modifications

Feature key     Position(s)  Length  Description           Evidence
Glycosylation   300          1       N-linked (GlcNAc...)  Sequence analysis
Glycosylation   497          1       N-linked (GlcNAc...)  Sequence analysis
Disulfide bond  502 ↔ 528                                  PROSITE-ProRule annotation
Disulfide bond  506 ↔ 517                                  PROSITE-ProRule annotation
Disulfide bond  508 ↔ 532                                  PROSITE-ProRule annotation
Glycosylation   546          1       N-linked (GlcNAc...)  Sequence analysis
Glycosylation   661          1       N-linked (GlcNAc...)  Sequence analysis

Keywords - PTM

Disulfide bond, Glycoprotein

Proteomic databases

PaxDb: Q86UG4.
PRIDE: Q86UG4.

PTM databases

iPTMnet: Q86UG4.
PhosphoSite: Q86UG4.

Expression

Tissue specificity

Strongly expressed in testis. Weakly expressed in spleen, brain, fetal brain and placenta. Detected in lung tumors. (2 publications)

Gene expression databases

Bgee: Q86UG4.
CleanEx: HS_SLCO6A1.
ExpressionAtlas: Q86UG4. baseline and differential.
Genevisible: Q86UG4. HS.

Organism-specific databases

HPA: HPA054126.

Interaction

Protein-protein interaction databases

BioGrid: 126359. 12 interactions.
STRING: 9606.ENSP00000369135.

Structure

3D structure databases

ProteinModelPortal: Q86UG4.
SMR: Q86UG4. Positions 500-532.

Family & Domains

Domains and Repeats

Feature key  Position(s)  Length  Description  Evidence
Domain       496 – 551    56      Kazal-like   PROSITE-ProRule annotation

Compositional bias

Feature key         Position(s)  Length  Description
Compositional bias  89 – 118     30      Cys-rich

Sequence similarities

Contains 1 Kazal-like domain. (PROSITE-ProRule annotation)

Keywords - Domain

Transmembrane, Transmembrane helix

Phylogenomic databases

eggNOG: KOG3626. Eukaryota.
        ENOG410XRSF. LUCA.
GeneTree: ENSGT00760000119014.
HOGENOM: HOG000231270.
HOVERGEN: HBG100565.
InParanoid: Q86UG4.
KO: K14357.
OMA: THSAGIY.
OrthoDB: EOG75TMBK.
PhylomeDB: Q86UG4.
TreeFam: TF317540.

Family and domain databases

InterPro: IPR002350. Kazal_dom.
          IPR020846. MFS_dom.
          IPR004156. OA_transporter.
PANTHER: PTHR11388. PTHR11388. 1 hit.
Pfam: PF07648. Kazal_2. 1 hit.
      PF03137. OATP. 1 hit.
SUPFAM: SSF103473. SSF103473. 3 hits.
TIGRFAMs: TIGR00805. oat. 1 hit.
PROSITE: PS51465. KAZAL_2. 1 hit.

Sequences (3)

Sequence status: Complete.

This entry describes 3 isoforms produced by alternative splicing.

Isoform 1 (identifier: Q86UG4-1) [UniParc]

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.

        10         20         30         40         50
MFVGVARHSG SQDEVSRGVE PLEAARAQPA KDRRAKGTPK SSKPGKKHRY
        60         70         80         90        100
LRLLPEALIR FGGFRKRKKA KSSVSKKPGE VDDSLEQPCG LGCLVSTCCE
       110        120        130        140        150
CCNNIRCFMI FYCILLICQG VVFGLIDVSI GDFQKEYQLK TIEKLALEKS
       160        170        180        190        200
YDISSGLVAI FIAFYGDRKK VIWFVASSFL IGLGSLLCAF PSINEENKQS
       210        220        230        240        250
KVGIEDICEE IKVVSGCQSS GISFQSKYLS FFILGQTVQG IAGMPLYILG
       260        270        280        290        300
ITFIDENVAT HSAGIYLGIA ECTSMIGYAL GYVLGAPLVK VPENTTSATN
       310        320        330        340        350
TTVNNGSPEW LWTWWINFLF AAVVAWCTLI PLSCFPNNMP GSTRIKARKR
       360        370        380        390        400
KQLHFFDSRL KDLKLGTNIK DLCAALWILM KNPVLICLAL SKATEYLVII
       410        420        430        440        450
GASEFLPIYL ENQFILTPTV ATTLAGLVLI PGGALGQLLG GVIVSTLEMS
       460        470        480        490        500
CKALMRFIMV TSVISLILLV FIIFVRCNPV QFAGINEDYD GTGKLGNLTA
       510        520        530        540        550
PCNEKCRCSS SIYSSICGRD DIEYFSPCFA GCTYSKAQNQ KKMYYNCSCI
       560        570        580        590        600
KEGLITADAE GDFIDARPGK CDAKCYKLPL FIAFIFSTLI FSGFSGVPIV
       610        620        630        640        650
LAMTRVVPDK LRSLALGVSY VILRIFGTIP GPSIFKMSGE TSCILRDVNK
       660        670        680        690        700
CGHTGRCWIY NKTKMAFLLV GICFLCKLCT IIFTTIAFFI YKRRLNENTD
       710
FPDVTVKNPK VKKKEETDL
Length: 719
Mass (Da): 79,232
Last modified: October 23, 2007 - v2
Checksum: 56EFD346A8F0569B
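
The annotated modification sites can be cross-checked against the canonical sequence. The sketch below, assuming the 719-residue sequence is transcribed exactly as shown for isoform 1, tests that each N-linked glycosylation position starts an N-X-S/T sequon (X not proline, the commonly used consensus), that each disulfide position is a cysteine, and that the disulfide cysteines lie inside the Kazal-like domain (496 to 551). The helper names are hypothetical; this is a consistency check on the listed data, not a description of how the annotations were derived.

```python
# Canonical sequence (isoform 1, Q86UG4-1), transcribed row by row from the entry.
ROWS = """
MFVGVARHSG SQDEVSRGVE PLEAARAQPA KDRRAKGTPK SSKPGKKHRY
LRLLPEALIR FGGFRKRKKA KSSVSKKPGE VDDSLEQPCG LGCLVSTCCE
CCNNIRCFMI FYCILLICQG VVFGLIDVSI GDFQKEYQLK TIEKLALEKS
YDISSGLVAI FIAFYGDRKK VIWFVASSFL IGLGSLLCAF PSINEENKQS
KVGIEDICEE IKVVSGCQSS GISFQSKYLS FFILGQTVQG IAGMPLYILG
ITFIDENVAT HSAGIYLGIA ECTSMIGYAL GYVLGAPLVK VPENTTSATN
TTVNNGSPEW LWTWWINFLF AAVVAWCTLI PLSCFPNNMP GSTRIKARKR
KQLHFFDSRL KDLKLGTNIK DLCAALWILM KNPVLICLAL SKATEYLVII
GASEFLPIYL ENQFILTPTV ATTLAGLVLI PGGALGQLLG GVIVSTLEMS
CKALMRFIMV TSVISLILLV FIIFVRCNPV QFAGINEDYD GTGKLGNLTA
PCNEKCRCSS SIYSSICGRD DIEYFSPCFA GCTYSKAQNQ KKMYYNCSCI
KEGLITADAE GDFIDARPGK CDAKCYKLPL FIAFIFSTLI FSGFSGVPIV
LAMTRVVPDK LRSLALGVSY VILRIFGTIP GPSIFKMSGE TSCILRDVNK
CGHTGRCWIY NKTKMAFLLV GICFLCKLCT IIFTTIAFFI YKRRLNENTD
FPDVTVKNPK VKKKEETDL
"""
SEQUENCE = "".join(ROWS.split())  # drop the spaces and line breaks

# Annotations as listed in the entry (1-based positions).
GLYCOSYLATION_SITES = [300, 497, 546, 661]
DISULFIDE_BONDS = [(502, 528), (506, 517), (508, 532)]
KAZAL_DOMAIN = (496, 551)


def residue(position):
    """Return the residue at a 1-based position."""
    return SEQUENCE[position - 1]


def is_sequon(position):
    """True if position starts an N-X-S/T motif with X not proline."""
    if position + 2 > len(SEQUENCE):
        return False
    return (
        residue(position) == "N"
        and residue(position + 1) != "P"
        and residue(position + 2) in "ST"
    )


def in_domain(position, domain):
    """True if a 1-based position lies within an inclusive (start, end) range."""
    start, end = domain
    return start <= position <= end


bonded = [p for bond in DISULFIDE_BONDS for p in bond]
print("length:", len(SEQUENCE))
print("sequons ok:", all(is_sequon(p) for p in GLYCOSYLATION_SITES))
print("cysteines ok:", all(residue(p) == "C" for p in bonded))
print("within Kazal-like domain:", all(in_domain(p, KAZAL_DOMAIN) for p in bonded))
```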

Isoform 2 (identifier: Q86UG4-2) [UniParc]

The sequence of this isoform differs from the canonical sequence as follows:
     206-267: Missing.

Note: No experimental confirmation available.
Length: 657
Mass (Da): 72,576
Checksum: 9F9946EB3C04FAAC

Isoform 3 (identifier: Q86UG4-3) [UniParc]

The sequence of this isoform differs from the canonical sequence as follows:
     206-267: Missing.
     301-491: Missing.

Note: No experimental confirmation available.
Length: 466
Mass (Da): 51,398
Checksum: F8E2638DFECD13E4
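
The isoform lengths follow from the canonical length and the missing ranges. A short Python sketch, assuming the ranges are 1-based, inclusive and non-overlapping as listed in this entry, reproduces the arithmetic; `MISSING` and `isoform_length` are illustrative names.

```python
CANONICAL_LENGTH = 719  # isoform 1 (Q86UG4-1)

# Missing ranges per isoform, 1-based inclusive, as listed in the entry.
MISSING = {
    "Q86UG4-1": [],
    "Q86UG4-2": [(206, 267)],
    "Q86UG4-3": [(206, 267), (301, 491)],
}


def isoform_length(canonical_length, missing_ranges):
    """Subtract the sizes of non-overlapping missing ranges from the canonical length."""
    removed = sum(end - start + 1 for start, end in missing_ranges)
    return canonical_length - removed


for identifier, ranges in MISSING.items():
    print(identifier, isoform_length(CANONICAL_LENGTH, ranges))
```

The results (719, 657 and 466) match the lengths stated for isoforms 1, 2 and 3.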

Experimental Info

Feature key        Position(s)  Length  Description
Sequence conflict  23           1       E → D in BAD18590 (PubMed:14702039). Curated
Sequence conflict  77 – 79      3       KPG → NRE in AAP33048 (Ref. 1). Curated
Sequence conflict  300          1       N → K in BAD18590 (PubMed:14702039). Curated

Natural variant

Feature key      Position(s)  Length  Description             dbSNP       Feature identifier
Natural variant  27           1       A → V.                  rs13190449  VAR_036622
Natural variant  381          1       K → R.                  rs17150488  VAR_036623
Natural variant  527          1       P → A.                  rs10073333  VAR_053680
Natural variant  654          1       T → R. (1 publication)  rs10055840  VAR_036624

Alternative sequence

Feature key           Position(s)  Length  Description                                             Feature identifier
Alternative sequence  206 – 267    62      Missing in isoform 2 and isoform 3. (2 publications)    VSP_028753
Alternative sequence  301 – 491    191     Missing in isoform 3. (1 publication)                   VSP_028754

Sequence databases

EMBL / GenBank / DDBJ:
  AY273897 mRNA. Translation: AAP33048.1.
  AF505657 mRNA. Translation: AAP30851.1.
  AK131445 mRNA. Translation: BAD18590.1.
  AC094108 Genomic DNA. No translation available.
  CH471086 Genomic DNA. Translation: EAW49095.1.
  BC034976 mRNA. Translation: AAH34976.1.
CCDS:
  CCDS34206.1. [Q86UG4-1]
  CCDS75282.1. [Q86UG4-2]
  CCDS78042.1. [Q86UG4-3]
RefSeq:
  NP_001275931.1. NM_001289002.1. [Q86UG4-1]
  NP_001275933.1. NM_001289004.1. [Q86UG4-2]
  NP_001294943.1. NM_001308014.1.
  NP_775759.3. NM_173488.4. [Q86UG4-1]
  XP_005271931.1. XM_005271874.2. [Q86UG4-1]
UniGene: Hs.388874.

Genome annotation databases

Ensembl:
  ENST00000379807; ENSP00000369135; ENSG00000205359. [Q86UG4-1]
  ENST00000389019; ENSP00000373671; ENSG00000205359. [Q86UG4-2]
  ENST00000506729; ENSP00000421339; ENSG00000205359. [Q86UG4-1]
GeneID: 133482.
KEGG: hsa:133482.
UCSC: uc003knn.5. human. [Q86UG4-1]

Keywords - Coding sequence diversity

Alternative splicing, Polymorphism

Cross-references

Protocols and materials databases

DNASU: 133482.

Organism-specific databases

CTD: 133482.
GeneCards: SLCO6A1.
H-InvDB: HIX0024807.
HGNC: HGNC:23613. SLCO6A1.
HPA: HPA054126.
MIM: 613365. gene.
neXtProt: NX_Q86UG4.
PharmGKB: PA134949852.

Miscellaneous databases

GenomeRNAi: 133482.
PRO: Q86UG4.

Publications

  1. "Cloning and characterization of two novel OATP genes on human 5q21.1."
    Fu-Zhang W.
    Submitted (APR-2003) to the EMBL/GenBank/DDBJ databases
    Cited for: NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM 1).
  2. "Identification and characterization of novel rat and human gonad-specific organic anion transporters."
    Suzuki T., Onogawa T., Asano N., Mizutamari H., Mikkaichi T., Tanemoto M., Abe M., Satoh F., Unno M., Nunoki K., Suzuki M., Hishinuma T., Goto J., Shimosegawa T., Matsuno S., Ito S., Abe T.
    Mol. Endocrinol. 17:1203-1215(2003)
    Cited for: NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM 1), TISSUE SPECIFICITY.
  3. "Complete sequencing and characterization of 21,243 full-length human cDNAs."
    Ota T., Suzuki Y., Nishikawa T., Otsuki T., Sugiyama T., Irie R., Wakamatsu A., Hayashi K., Sato H., Nagai K., Kimura K., Makita H., Sekine M., Obayashi M., Nishi T., Shibahara T., Tanaka T., Ishii S., Yamamoto J., Saito K., Kawai Y., Isono Y., Nakamura Y., Nagahari K., Murakami K., Yasuda T., Iwayanagi T., Wagatsuma M., Shiratori A., Sudo H., Hosoiri T., Kaku Y., Kodaira H., Kondo H., Sugawara M., Takahashi M., Kanda K., Yokoi T., Furuya T., Kikkawa E., Omura Y., Abe K., Kamihara K., Katsuta N., Sato K., Tanikawa M., Yamazaki M., Ninomiya K., Ishibashi T., Yamashita H., Murakawa K., Fujimori K., Tanai H., Kimata M., Watanabe M., Hiraoka S., Chiba Y., Ishida S., Ono Y., Takiguchi S., Watanabe S., Yosida M., Hotuta T., Kusano J., Kanehori K., Takahashi-Fujii A., Hara H., Tanase T.-O., Nomura Y., Togiya S., Komai F., Hara R., Takeuchi K., Arita M., Imose N., Musashino K., Yuuki H., Oshima A., Sasaki N., Aotsuka S., Yoshikawa Y., Matsunawa H., Ichihara T., Shiohata N., Sano S., Moriya S., Momiyama H., Satoh N., Takami S., Terashima Y., Suzuki O., Nakagawa S., Senoh A., Mizoguchi H., Goto Y., Shimizu F., Wakebe H., Hishigaki H., Watanabe T., Sugiyama A., Takemoto M., Kawakami B., Yamazaki M., Watanabe K., Kumagai A., Itakura S., Fukuzumi Y., Fujimori Y., Komiyama M., Tashiro H., Tanigami A., Fujiwara T., Ono T., Yamada K., Fujii Y., Ozaki K., Hirao M., Ohmori Y., Kawabata A., Hikiji T., Kobatake N., Inagaki H., Ikema Y., Okamoto S., Okitani R., Kawakami T., Noguchi S., Itoh T., Shigeta K., Senba T., Matsumura K., Nakajima Y., Mizuno T., Morinaga M., Sasaki M., Togashi T., Oyama M., Hata H., Watanabe M., Komatsu T., Mizushima-Sugano J., Satoh T., Shirai Y., Takahashi Y., Nakagawa K., Okumura K., Nagase T., Nomura N., Kikuchi H., Masuho Y., Yamashita R., Nakai K., Yada T., Nakamura Y., Ohara O., Isogai T., Sugano S.
    Nat. Genet. 36:40-45(2004)
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 3), VARIANT ARG-654.
    Tissue: Testis.
  4. "The DNA sequence and comparative analysis of human chromosome 5."
    Schmutz J., Martin J., Terry A., Couronne O., Grimwood J., Lowry S., Gordon L.A., Scott D., Xie G., Huang W., Hellsten U., Tran-Gyamfi M., She X., Prabhakar S., Aerts A., Altherr M., Bajorek E., Black S., Branscomb E., Caoile C., Challacombe J.F., Chan Y.M., Denys M., Detter J.C., Escobar J., Flowers D., Fotopulos D., Glavina T., Gomez M., Gonzales E., Goodstein D., Grigoriev I., Groza M., Hammon N., Hawkins T., Haydu L., Israni S., Jett J., Kadner K., Kimball H., Kobayashi A., Lopez F., Lou Y., Martinez D., Medina C., Morgan J., Nandkeshwar R., Noonan J.P., Pitluck S., Pollard M., Predki P., Priest J., Ramirez L., Retterer J., Rodriguez A., Rogers S., Salamov A., Salazar A., Thayer N., Tice H., Tsai M., Ustaszewska A., Vo N., Wheeler J., Wu K., Yang J., Dickson M., Cheng J.-F., Eichler E.E., Olsen A., Pennacchio L.A., Rokhsar D.S., Richardson P., Lucas S.M., Myers R.M., Rubin E.M.
    Nature 431:268-274(2004)
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
  5. Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
  6. "The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)."
    The MGC Project Team
    Genome Res. 14:2121-2127(2004)
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 2).
    Tissue: Brain.
  7. "Identification of the gonad-specific anion transporter SLCO6A1 as a cancer/testis (CT) antigen expressed in human lung cancer."
    Lee S.-Y., Williamson B., Caballero O.L., Chen Y.-T., Scanlan M.J., Ritter G., Jongeneel C.V., Simpson A.J.G., Old L.J.
    Cancer Immun. 4:13-13(2004)
    Cited for: TISSUE SPECIFICITY, IDENTIFICATION AS A CANCER/TESTIS ANTIGEN.

Entry information

Entry name: SO6A1_HUMAN
Accession:
  Primary (citable) accession number: Q86UG4
  Secondary accession number(s): A6NHC1, Q6ZMY5, Q86UV2, Q8IYU5
Entry history:
  Integrated into UniProtKB/Swiss-Prot: October 23, 2007
  Last sequence update: October 23, 2007
  Last modified: June 8, 2016
  This is version 95 of the entry and version 2 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Chordata Protein Annotation Program
Disclaimer: Any medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care.

Miscellaneous

Keywords - Technical term

Complete proteome, Reference proteome

Documents

  1. Human chromosome 5
    Human chromosome 5: entries, gene names and cross-references to MIM
  2. Human entries with polymorphisms or disease mutations
    List of human entries with polymorphisms or disease mutations
  3. Human polymorphisms and disease mutations
    Index of human polymorphisms and disease mutations
  4. MIM cross-references
    Online Mendelian Inheritance in Man (MIM) cross-references in UniProtKB/Swiss-Prot
  5. SIMILARITY comments
    Index of protein domains and families

Similar proteins

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
  • 100%: UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
  • 90%: UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a. seed sequence).
  • 50%: UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.