Protein

Mitoguardin-2

Gene

FAM73B

Organism
Homo sapiens (Human)
Status
Reviewed - Annotation score: 5 out of 5 - Experimental evidence at protein level

Function

Regulator of mitochondrial fusion: acts by forming homo- and heterodimers at the mitochondrial outer membrane and facilitating the formation of PLD6/MitoPLD dimers. May act by regulating phospholipid metabolism via PLD6/MitoPLD. (1 publication)

GO - Molecular function

  • protein heterodimerization activity Source: UniProtKB
  • protein homodimerization activity Source: UniProtKB

GO - Biological process

  • bone development Source: Ensembl
  • mitochondrial fusion Source: UniProtKB

Names & Taxonomy

Protein names
Recommended name: Mitoguardin-2 (1 publication)
Alternative name(s): Protein FAM73B (curated)
Gene names
Name: FAM73B (imported)
Synonyms: C9orf54 (imported), MIGA2 (1 publication)
ORF names: PSEC0112 (1 publication)
Organism: Homo sapiens (Human)
Taxonomic identifier: 9606 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Primates > Haplorrhini > Catarrhini > Hominidae > Homo
Proteomes
  • UP000005640 Component: Chromosome 9

Organism-specific databases

HGNC: HGNC:23621. FAM73B.

Subcellular location

Topology

Feature key      Position(s)   Length   Description   Evidence
Transmembrane    11 – 31       21       Helical       Sequence analysis
Transmembrane    42 – 62       21       Helical       Sequence analysis
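
The positions in the Topology table are 1-based and inclusive, so each segment's length is end minus start plus one. A minimal Python sketch of that convention follows; the helper names `feature_length` and `extract` are illustrative only, and the residues are the first 100 of the canonical sequence given later in this entry.

```python
# First 100 residues of the canonical sequence (isoform 1), copied from this entry.
N_TERMINUS = (
    "MAFRRAEGTSMIQALAMTVAEIPVFLYTTFGQSAFSQLRLTPGLRKVLFA"
    "TALGTVALALAAHQLKRRRRRKKQVGPEMGGEQLGTVPLPILLARKVPSV"
)

# (start, end) pairs from the Topology table: 1-based, inclusive coordinates.
TRANSMEMBRANE = [(11, 31), (42, 62)]


def feature_length(start, end):
    """Length of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def extract(sequence, start, end):
    """Return the residues of a feature; converts to Python's 0-based slicing."""
    return sequence[start - 1:end]


segments = [extract(N_TERMINUS, s, e) for s, e in TRANSMEMBRANE]
for (s, e), seg in zip(TRANSMEMBRANE, segments):
    print(f"{s}-{e} ({feature_length(s, e)} aa): {seg}")
```

The same conversion (subtract one from the start, keep the end) applies to every positional feature in this entry.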

GO - Cellular component

  • integral component of cell outer membrane Source: UniProtKB
  • mitochondrial outer membrane Source: UniProtKB-SubCell

Keywords - Cellular component

Membrane, Mitochondrion, Mitochondrion outer membrane

Pathology & Biotech

Organism-specific databases

PharmGKB: PA134896424.

Polymorphism and mutation databases

BioMuta: FAM73B.
DMDM: 74749888.

PTM / Processing

Molecule processing

Feature key   Position(s)   Length   Description     Feature identifier
Chain         1 – 593       593      Mitoguardin-2   PRO_0000313657

Amino acid modifications

Feature key        Position(s)   Length   Description        Evidence
Modified residue   132           1        Phosphoserine      By similarity
Modified residue   206           1        Phosphothreonine   By similarity
Modified residue   220           1        Phosphoserine      By similarity
Modified residue   224           1        Phosphoserine      By similarity
Modified residue   228           1        Phosphoserine      By similarity
Modified residue   273           1        Phosphothreonine   By similarity
Modified residue   276           1        Phosphoserine      Combined sources
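
Each annotated phosphosite should fall on the matching residue type in the canonical sequence: S for phosphoserine, T for phosphothreonine. The sketch below is one way to cross-check the table against the sequence. It is an illustration rather than part of the UniProt record; `check_sites` and the constant names are hypothetical, and only the first 300 residues are included because the last annotated site is at position 276.

```python
# Residues 1-300 of the canonical sequence (isoform 1), copied from this entry.
CANONICAL_1_300 = (
    "MAFRRAEGTSMIQALAMTVAEIPVFLYTTFGQSAFSQLRLTPGLRKVLFA"
    "TALGTVALALAAHQLKRRRRRKKQVGPEMGGEQLGTVPLPILLARKVPSV"
    "KKGYSSRRVQSPSSKSNDTLSGISSIEPSKHSGSSHSVASMMAVNSSSPT"
    "AACSGLWDARGMEESLTTSDGNAESLYMQGMELFEEALQKWEQALSVGQR"
    "GDSGSTPMPRDGLRNPETASEPLSEPESQRKEFAEKLESLLHRAYHLQEE"
    "FGSTFPADSMLLDLERTLMLPLTEGSLRLRADDEDSLTSEDSFFSATELF"
)

# 1-based positions and descriptions from the "Amino acid modifications" table.
MODIFIED_RESIDUES = {
    132: "Phosphoserine",
    206: "Phosphothreonine",
    220: "Phosphoserine",
    224: "Phosphoserine",
    228: "Phosphoserine",
    273: "Phosphothreonine",
    276: "Phosphoserine",
}

# One-letter code expected for each modification type.
EXPECTED_RESIDUE = {"Phosphoserine": "S", "Phosphothreonine": "T"}


def check_sites(sequence, sites):
    """Return (position, description, found_residue) for every inconsistent site."""
    mismatches = []
    for position, description in sorted(sites.items()):
        found = sequence[position - 1]  # 1-based position -> 0-based index
        if found != EXPECTED_RESIDUE[description]:
            mismatches.append((position, description, found))
    return mismatches


mismatches = check_sites(CANONICAL_1_300, MODIFIED_RESIDUES)
print(mismatches if mismatches else "all annotated sites are consistent")
```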

Keywords - PTM

Phosphoprotein

Proteomic databases

EPD: Q7L4E1.
MaxQB: Q7L4E1.
PaxDb: Q7L4E1.
PRIDE: Q7L4E1.

PTM databases

iPTMnet: Q7L4E1.
PhosphoSite: Q7L4E1.

Expression

Gene expression databases

Bgee: Q7L4E1.
CleanEx: HS_FAM73B.
ExpressionAtlas: Q7L4E1. baseline and differential.
Genevisible: Q7L4E1. HS.

Organism-specific databases

HPA: HPA041363.

Interaction

Subunit structure

Homodimer and heterodimer; forms heterodimers with FAM73A/MIGA1 (PubMed:26711011). Interacts with PLD6/MitoPLD (PubMed:26711011). (1 publication)

GO - Molecular function

  • protein heterodimerization activity Source: UniProtKB
  • protein homodimerization activity Source: UniProtKB

Protein-protein interaction databases

BioGrid: 124335. 10 interactions.
STRING: 9606.ENSP00000351138.

Structure

3D structure databases

ProteinModelPortal: Q7L4E1.

Family & Domains

Compositional bias

Feature key          Position(s)   Length   Description
Compositional bias   67 – 71       5        Poly-Arg
Compositional bias   105 – 148     44       Ser-rich

Sequence similarities

Belongs to the mitoguardin family. (curated)

Keywords - Domain

Transmembrane, Transmembrane helix

Phylogenomic databases

eggNOG: KOG3831. Eukaryota.
        ENOG410Y4ER. LUCA.
GeneTree: ENSGT00390000008565.
HOGENOM: HOG000286032.
HOVERGEN: HBG106606.
InParanoid: Q7L4E1.
OMA: PEMGGEH.
OrthoDB: EOG7QK0BM.
PhylomeDB: Q7L4E1.
TreeFam: TF313896.

Family and domain databases

InterPro: IPR019392. Miga.
PANTHER: PTHR21508. PTHR21508. 1 hit.
Pfam: PF10265. DUF2217. 1 hit.

Sequences (3)

Sequence status: Complete.

This entry describes 3 isoforms produced by alternative splicing.

Isoform 1 (identifier: Q7L4E1-1)

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.

        10         20         30         40         50
MAFRRAEGTS MIQALAMTVA EIPVFLYTTF GQSAFSQLRL TPGLRKVLFA
        60         70         80         90        100
TALGTVALAL AAHQLKRRRR RKKQVGPEMG GEQLGTVPLP ILLARKVPSV
       110        120        130        140        150
KKGYSSRRVQ SPSSKSNDTL SGISSIEPSK HSGSSHSVAS MMAVNSSSPT
       160        170        180        190        200
AACSGLWDAR GMEESLTTSD GNAESLYMQG MELFEEALQK WEQALSVGQR
       210        220        230        240        250
GDSGSTPMPR DGLRNPETAS EPLSEPESQR KEFAEKLESL LHRAYHLQEE
       260        270        280        290        300
FGSTFPADSM LLDLERTLML PLTEGSLRLR ADDEDSLTSE DSFFSATELF
       310        320        330        340        350
ESLQTGDYPI PLSRPAAAYE EALQLVKEGR VPCRTLRTEL LGCYSDQDFL
       360        370        380        390        400
AKLHCVRQAF EGLLEDKSNQ LFFGKVGRQM VTGLMTKAEK SPKGFLESYE
       410        420        430        440        450
EMLSYALRPE TWATTRLELE GRGVVCMSFF DIVLDFILMD AFEDLENPPA
       460        470        480        490        500
SVLAVLRNRW LSDSFKETAL ATACWSVLKA KRRLLMVPDG FISHFYSVSE
       510        520        530        540        550
HVSPVLAFGF LGPKPQLAEV CAFFKHQIVQ YLRDMFDLDN VRYTSLPALA
       560        570        580        590
DDILQLSRRR SEILLGYLGV PAASSAGVNG ALPRENGPLG ELQ
Length: 593
Mass (Da): 65,531
Last modified: December 21, 2004 - v1
Checksum: 128969424149397F

Isoform 2 (identifier: Q7L4E1-2)

The sequence of this isoform differs from the canonical sequence as follows:
     226-323: PESQRKEFAE...RPAAAYEEAL → TESHSVARLE...RGLAAAAGGR
     324-593: Missing.

Note: No experimental confirmation available.
Length: 323
Mass (Da): 34,083
Checksum: B4DBD00107DB4203

Isoform 3 (identifier: Q7L4E1-3)

The sequence of this isoform differs from the canonical sequence as follows:
     338-349: TELLGCYSDQDF → APKASWRATRRC
     350-593: Missing.

Note: No experimental confirmation available.
Length: 349
Mass (Da): 38,248
Checksum: 4226070C236E647B

Sequence caution

The sequence BAB55159.1 differs from that shown. Reason: Erroneous initiation. (curated)
The sequence BAB84953.1 differs from that shown. Reason: Erroneous initiation. (curated)
The sequence EAW87861.1 differs from that shown. Reason: Erroneous initiation. (curated)

Experimental Info

Feature key         Position(s)   Length   Description
Sequence conflict   62            1        A → G in BAB84953 (PubMed:12693554). (curated)

Natural variant

Feature key       Position(s)   Length   Description                                       Feature identifier
Natural variant   78            1        E → K. Corresponds to variant rs6478859.          VAR_037690
Natural variant   100           1        V → A. Corresponds to variant rs16930845.         VAR_037691
Natural variant   212           1        G → S. Corresponds to variant rs17452596.         VAR_037692
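
Each natural variant is a single-residue substitution at a 1-based position of the canonical sequence. The sketch below applies the three substitutions after confirming that the reference residue matches. The compact notation (`E78K`) and the function name `apply_substitution` are assumptions made for illustration, since the entry itself writes these as "78, E → K". Only residues 1-250 are included, which covers all three variants.

```python
import re

# Residues 1-250 of the canonical sequence (isoform 1), copied from this entry.
CANONICAL_1_250 = (
    "MAFRRAEGTSMIQALAMTVAEIPVFLYTTFGQSAFSQLRLTPGLRKVLFA"
    "TALGTVALALAAHQLKRRRRRKKQVGPEMGGEQLGTVPLPILLARKVPSV"
    "KKGYSSRRVQSPSSKSNDTLSGISSIEPSKHSGSSHSVASMMAVNSSSPT"
    "AACSGLWDARGMEESLTTSDGNAESLYMQGMELFEEALQKWEQALSVGQR"
    "GDSGSTPMPRDGLRNPETASEPLSEPESQRKEFAEKLESLLHRAYHLQEE"
)

# Variants from the "Natural variant" table, in compact ref-position-alt form.
VARIANTS = {"VAR_037690": "E78K", "VAR_037691": "V100A", "VAR_037692": "G212S"}


def apply_substitution(sequence, variant):
    """Apply one substitution such as 'E78K'; raise if the reference residue differs."""
    match = re.fullmatch(r"([A-Z])(\d+)([A-Z])", variant)
    if match is None:
        raise ValueError(f"unrecognised variant notation: {variant!r}")
    ref, pos, alt = match.group(1), int(match.group(2)), match.group(3)
    if sequence[pos - 1] != ref:
        raise ValueError(f"position {pos} is {sequence[pos - 1]}, not {ref}")
    return sequence[:pos - 1] + alt + sequence[pos:]


variant_sequences = {
    vid: apply_substitution(CANONICAL_1_250, v) for vid, v in VARIANTS.items()
}
for vid, seq in variant_sequences.items():
    print(vid, VARIANTS[vid], "applied; length", len(seq))
```

Checking the reference residue before substituting guards against off-by-one errors when coordinates are copied by hand.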

Alternative sequence

Feature key            Position(s)   Length   Description                                                                                                                                                    Feature identifier
Alternative sequence   226 – 323     98       PESQR…YEEAL → TESHSVARLECSGAISAQCN LRFLGSRDSPASASQVAGIT ARVTAEGVCREAGVPAAPCL PPAGGVRLHLPRRQHAARPR EDPHAAPDRGLAAAAGGR in isoform 2. (1 publication)   VSP_030087
Alternative sequence   324 – 593     270      Missing in isoform 2. (1 publication)                                                                                                                          VSP_030088
Alternative sequence   338 – 349     12       TELLG…SDQDF → APKASWRATRRC in isoform 3. (1 publication)                                                                                                       VSP_030089
Alternative sequence   350 – 593     244      Missing in isoform 3. (1 publication)                                                                                                                          VSP_030090
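
The isoform lengths stated above (323 and 349 residues) follow from applying the listed replacements and deletions to the 593-residue canonical sequence. A minimal sketch of that bookkeeping is given below. `build_isoform` and `ISOFORM_EDITS` are illustrative names, not UniProt tooling, and edits are applied from the C-terminus backwards so that earlier coordinates stay valid.

```python
# Canonical sequence (isoform 1, 593 residues), copied from this entry.
CANONICAL = (
    "MAFRRAEGTSMIQALAMTVAEIPVFLYTTFGQSAFSQLRLTPGLRKVLFA"
    "TALGTVALALAAHQLKRRRRRKKQVGPEMGGEQLGTVPLPILLARKVPSV"
    "KKGYSSRRVQSPSSKSNDTLSGISSIEPSKHSGSSHSVASMMAVNSSSPT"
    "AACSGLWDARGMEESLTTSDGNAESLYMQGMELFEEALQKWEQALSVGQR"
    "GDSGSTPMPRDGLRNPETASEPLSEPESQRKEFAEKLESLLHRAYHLQEE"
    "FGSTFPADSMLLDLERTLMLPLTEGSLRLRADDEDSLTSEDSFFSATELF"
    "ESLQTGDYPIPLSRPAAAYEEALQLVKEGRVPCRTLRTELLGCYSDQDFL"
    "AKLHCVRQAFEGLLEDKSNQLFFGKVGRQMVTGLMTKAEKSPKGFLESYE"
    "EMLSYALRPETWATTRLELEGRGVVCMSFFDIVLDFILMDAFEDLENPPA"
    "SVLAVLRNRWLSDSFKETALATACWSVLKAKRRLLMVPDGFISHFYSVSE"
    "HVSPVLAFGFLGPKPQLAEVCAFFKHQIVQYLRDMFDLDNVRYTSLPALA"
    "DDILQLSRRRSEILLGYLGVPAASSAGVNGALPRENGPLGELQ"
)

# (start, end, replacement) with 1-based inclusive coordinates; "" means "Missing".
ISOFORM_EDITS = {
    "Q7L4E1-2": [
        (226, 323,
         "TESHSVARLECSGAISAQCNLRFLGSRDSPASASQVAGIT"
         "ARVTAEGVCREAGVPAAPCLPPAGGVRLHLPRRQHAARPR"
         "EDPHAAPDRGLAAAAGGR"),
        (324, 593, ""),
    ],
    "Q7L4E1-3": [
        (338, 349, "APKASWRATRRC"),
        (350, 593, ""),
    ],
}


def build_isoform(canonical, edits):
    """Apply alternative-sequence edits, last region first, and return the isoform."""
    sequence = canonical
    for start, end, replacement in sorted(edits, reverse=True):
        sequence = sequence[:start - 1] + replacement + sequence[end:]
    return sequence


isoforms = {name: build_isoform(CANONICAL, e) for name, e in ISOFORM_EDITS.items()}
for name, seq in isoforms.items():
    print(name, len(seq))
```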

Sequence databases

EMBL / GenBank / DDBJ:
AK074127 mRNA. Translation: BAB84953.1. Different initiation.
AK075421 mRNA. Translation: BAC11611.1.
AL592211 Genomic DNA. Translation: CAI12367.3.
CH471090 Genomic DNA. Translation: EAW87861.1. Different initiation.
CH471090 Genomic DNA. Translation: EAW87862.1.
CH471090 Genomic DNA. Translation: EAW87864.1.
BC009114 mRNA. Translation: AAH09114.2.
AK027502 mRNA. Translation: BAB55159.1. Different initiation.
CCDS: CCDS6917.1. [Q7L4E1-1]
RefSeq: NP_116198.2. NM_032809.2. [Q7L4E1-1]
        XP_005252339.1. XM_005252282.1. [Q7L4E1-1]
UniGene: Hs.632693.

Genome annotation databases

Ensembl: ENST00000358369; ENSP00000351138; ENSG00000148343. [Q7L4E1-1]
         ENST00000439290; ENSP00000391603; ENSG00000148343. [Q7L4E1-2]
         ENST00000445183; ENSP00000396618; ENSG00000148343. [Q7L4E1-3]
GeneID: 84895.
KEGG: hsa:84895.
UCSC: uc004bxa.4. human. [Q7L4E1-1]

Keywords - Coding sequence diversity

Alternative splicing, Polymorphism

Cross-references

Organism-specific databases

CTD: 84895.
GeneCards: FAM73B.
H-InvDB: HIX0008445.
HGNC: HGNC:23621. FAM73B.
HPA: HPA041363.
neXtProt: NX_Q7L4E1.
PharmGKB: PA134896424.

Miscellaneous databases

ChiTaRS: FAM73B. human.
GeneWiki: FAM73B.
GenomeRNAi: 84895.
NextBio: 75240.
PRO: Q7L4E1.

Publications

  1. "Characterization of long cDNA clones from human adult spleen. II. The complete sequences of 81 cDNA clones."
    Jikuya H., Takano J., Kikuno R., Hirosawa M., Nagase T., Nomura N., Ohara O.
    DNA Res. 10:49-57(2003) [PubMed] [Europe PMC] [Abstract]
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 2).
    Tissue: Spleen.
  2. "Signal sequence and keyword trap in silico for selection of full-length human cDNAs encoding secretion or membrane proteins from oligo-capped cDNA libraries."
    Otsuki T., Ota T., Nishikawa T., Hayashi K., Suzuki Y., Yamamoto J., Wakamatsu A., Kimura K., Sakamoto K., Hatano N., Kawai Y., Ishii S., Saito K., Kojima S., Sugiyama T., Ono T., Okano K., Yoshikawa Y., Aotsuka S., Sasaki N., Hattori A., Okumura K., Nagai K., Sugano S., Isogai T.
    DNA Res. 12:117-126(2005) [PubMed] [Europe PMC] [Abstract]
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 3).
    Tissue: Placenta.
  3. "DNA sequence and analysis of human chromosome 9."
    Humphray S.J., Oliver K., Hunt A.R., Plumb R.W., Loveland J.E., Howe K.L., Andrews T.D., Searle S., Hunt S.E., Scott C.E., Jones M.C., Ainscough R., Almeida J.P., Ambrose K.D., Ashwell R.I.S., Babbage A.K., Babbage S., Bagguley C.L., Bailey J., Banerjee R., Barker D.J., Barlow K.F., Bates K., Beasley H., Beasley O., Bird C.P., Bray-Allen S., Brown A.J., Brown J.Y., Burford D., Burrill W., Burton J., Carder C., Carter N.P., Chapman J.C., Chen Y., Clarke G., Clark S.Y., Clee C.M., Clegg S., Collier R.E., Corby N., Crosier M., Cummings A.T., Davies J., Dhami P., Dunn M., Dutta I., Dyer L.W., Earthrowl M.E., Faulkner L., Fleming C.J., Frankish A., Frankland J.A., French L., Fricker D.G., Garner P., Garnett J., Ghori J., Gilbert J.G.R., Glison C., Grafham D.V., Gribble S., Griffiths C., Griffiths-Jones S., Grocock R., Guy J., Hall R.E., Hammond S., Harley J.L., Harrison E.S.I., Hart E.A., Heath P.D., Henderson C.D., Hopkins B.L., Howard P.J., Howden P.J., Huckle E., Johnson C., Johnson D., Joy A.A., Kay M., Keenan S., Kershaw J.K., Kimberley A.M., King A., Knights A., Laird G.K., Langford C., Lawlor S., Leongamornlert D.A., Leversha M., Lloyd C., Lloyd D.M., Lovell J., Martin S., Mashreghi-Mohammadi M., Matthews L., McLaren S., McLay K.E., McMurray A., Milne S., Nickerson T., Nisbett J., Nordsiek G., Pearce A.V., Peck A.I., Porter K.M., Pandian R., Pelan S., Phillimore B., Povey S., Ramsey Y., Rand V., Scharfe M., Sehra H.K., Shownkeen R., Sims S.K., Skuce C.D., Smith M., Steward C.A., Swarbreck D., Sycamore N., Tester J., Thorpe A., Tracey A., Tromans A., Thomas D.W., Wall M., Wallis J.M., West A.P., Whitehead S.L., Willey D.L., Williams S.A., Wilming L., Wray P.W., Young L., Ashurst J.L., Coulson A., Blocker H., Durbin R.M., Sulston J.E., Hubbard T., Jackson M.J., Bentley D.R., Beck S., Rogers J., Dunham I.
    Nature 429:369-374(2004) [PubMed] [Europe PMC] [Abstract]
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
  4. Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
  5. "The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)."
    The MGC Project Team
    Genome Res. 14:2121-2127(2004) [PubMed] [Europe PMC] [Abstract]
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1).
    Tissue: Brain.
  6. "Complete sequencing and characterization of 21,243 full-length human cDNAs."
    Ota T., Suzuki Y., Nishikawa T., Otsuki T., Sugiyama T., Irie R., Wakamatsu A., Hayashi K., Sato H., Nagai K., Kimura K., Makita H., Sekine M., Obayashi M., Nishi T., Shibahara T., Tanaka T., Ishii S., Yamamoto J., Saito K., Kawai Y., Isono Y., Nakamura Y., Nagahari K., Murakami K., Yasuda T., Iwayanagi T., Wagatsuma M., Shiratori A., Sudo H., Hosoiri T., Kaku Y., Kodaira H., Kondo H., Sugawara M., Takahashi M., Kanda K., Yokoi T., Furuya T., Kikkawa E., Omura Y., Abe K., Kamihara K., Katsuta N., Sato K., Tanikawa M., Yamazaki M., Ninomiya K., Ishibashi T., Yamashita H., Murakawa K., Fujimori K., Tanai H., Kimata M., Watanabe M., Hiraoka S., Chiba Y., Ishida S., Ono Y., Takiguchi S., Watanabe S., Yosida M., Hotuta T., Kusano J., Kanehori K., Takahashi-Fujii A., Hara H., Tanase T.-O., Nomura Y., Togiya S., Komai F., Hara R., Takeuchi K., Arita M., Imose N., Musashino K., Yuuki H., Oshima A., Sasaki N., Aotsuka S., Yoshikawa Y., Matsunawa H., Ichihara T., Shiohata N., Sano S., Moriya S., Momiyama H., Satoh N., Takami S., Terashima Y., Suzuki O., Nakagawa S., Senoh A., Mizoguchi H., Goto Y., Shimizu F., Wakebe H., Hishigaki H., Watanabe T., Sugiyama A., Takemoto M., Kawakami B., Yamazaki M., Watanabe K., Kumagai A., Itakura S., Fukuzumi Y., Fujimori Y., Komiyama M., Tashiro H., Tanigami A., Fujiwara T., Ono T., Yamada K., Fujii Y., Ozaki K., Hirao M., Ohmori Y., Kawabata A., Hikiji T., Kobatake N., Inagaki H., Ikema Y., Okamoto S., Okitani R., Kawakami T., Noguchi S., Itoh T., Shigeta K., Senba T., Matsumura K., Nakajima Y., Mizuno T., Morinaga M., Sasaki M., Togashi T., Oyama M., Hata H., Watanabe M., Komatsu T., Mizushima-Sugano J., Satoh T., Shirai Y., Takahashi Y., Nakagawa K., Okumura K., Nagase T., Nomura N., Kikuchi H., Masuho Y., Yamashita R., Nakai K., Yada T., Nakamura Y., Ohara O., Isogai T., Sugano S.
    Nat. Genet. 36:40-45(2004) [PubMed] [Europe PMC] [Abstract]
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 308-593 (ISOFORM 1).
  7. Cited for: PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-276, IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
    Tissue: Cervix carcinoma.
  8. "Quantitative phosphoproteomic analysis of T cell receptor signaling reveals system-wide modulation of protein-protein interactions."
    Mayya V., Lundgren D.H., Hwang S.-I., Rezaul K., Wu L., Eng J.K., Rodionov V., Han D.K.
    Sci. Signal. 2:RA46-RA46(2009) [PubMed] [Europe PMC] [Abstract]
    Cited for: PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-276, IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
    Tissue: Leukemic T-cell.
  9. "Mitoguardin regulates mitochondrial fusion through MitoPLD and is required for neuronal homeostasis."
    Zhang Y., Liu X., Bai J., Tian X., Zhao X., Liu W., Duan X., Shang W., Fan H.Y., Tong C.
    Mol. Cell 61:111-124(2016) [PubMed] [Europe PMC] [Abstract]
    Cited for: FUNCTION, SUBCELLULAR LOCATION, SUBUNIT, INTERACTION WITH FAM73A AND PLD6.

Entry information

Entry name: MIGA2_HUMAN
Accession: Primary (citable) accession number: Q7L4E1
Secondary accession number(s): Q8NBM3, Q8TEJ6, Q969E6
Entry history
Integrated into UniProtKB/Swiss-Prot: January 15, 2008
Last sequence update: December 21, 2004
Last modified: April 13, 2016
This is version 98 of the entry and version 1 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Chordata Protein Annotation Program
Disclaimer: Any medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care.
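
Identifiers in this entry come in two shapes: plain accessions (Q7L4E1, Q8NBM3) and isoform identifiers that append a hyphen and an isoform number (Q7L4E1-2). The sketch below splits either form into its parts. The accession pattern is the regular expression given in UniProt's help pages for accession numbers, reproduced here from memory, so treat it as an assumption to verify; `parse_identifier` is a hypothetical helper name.

```python
import re

# Accession pattern as documented in UniProt's help pages (assumed, please verify).
ACCESSION = (
    r"[OPQ][0-9][A-Z0-9]{3}[0-9]"
    r"|[A-NR-Z][0-9](?:[A-Z][A-Z0-9]{2}[0-9]){1,2}"
)

# An accession optionally followed by "-<isoform number>".
IDENTIFIER = re.compile(rf"(?P<accession>{ACCESSION})(?:-(?P<isoform>[0-9]+))?")


def parse_identifier(identifier):
    """Split 'Q7L4E1-2' into ('Q7L4E1', 2); a plain accession gives (accession, None)."""
    match = IDENTIFIER.fullmatch(identifier)
    if match is None:
        raise ValueError(f"not a UniProtKB accession or isoform ID: {identifier!r}")
    isoform = match.group("isoform")
    return match.group("accession"), int(isoform) if isoform else None


for identifier in ["Q7L4E1", "Q7L4E1-2", "Q8NBM3", "Q8TEJ6", "Q969E6"]:
    print(identifier, "->", parse_identifier(identifier))
```

Entry names such as MIGA2_HUMAN are a different identifier type and are deliberately rejected by this pattern.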

Miscellaneous

Keywords - Technical term

Complete proteome, Reference proteome

Documents

  1. Human chromosome 9
    Human chromosome 9: entries, gene names and cross-references to MIM
  2. Human entries with polymorphisms or disease mutations
    List of human entries with polymorphisms or disease mutations
  3. Human polymorphisms and disease mutations
    Index of human polymorphisms and disease mutations
  4. SIMILARITY comments
    Index of protein domains and families

Similar proteins

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
  • 100%: UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
  • 90%: UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (also known as the seed sequence).
  • 50%: UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.