Protein: Neurotrophin-3
Gene: NTF3
Organism: Homo sapiens (Human)
Status: Reviewed. Annotation score: 5 out of 5. Experimental evidence at protein level.

Function

Seems to promote the survival of visceral and proprioceptive sensory neurons.

GO - Molecular function

  1. chemoattractant activity Source: BHF-UCL
  2. growth factor activity Source: BHF-UCL
  3. neurotrophin p75 receptor binding Source: GO_Central
  4. receptor binding Source: ProtInc

GO - Biological process

  1. activation of GTPase activity Source: BHF-UCL
  2. activation of MAPK activity Source: BHF-UCL
  3. activation of protein kinase B activity Source: BHF-UCL
  4. axon guidance Source: Ensembl
  5. brain development Source: Ensembl
  6. cell-cell signaling Source: ProtInc
  7. enteric nervous system development Source: Ensembl
  8. epidermis development Source: Ensembl
  9. glial cell fate determination Source: Ensembl
  10. induction of positive chemotaxis Source: BHF-UCL
  11. mechanoreceptor differentiation Source: Ensembl
  12. myelination Source: Ensembl
  13. negative regulation of neuron apoptotic process Source: GO_Central
  14. negative regulation of peptidyl-tyrosine phosphorylation Source: BHF-UCL
  15. nerve development Source: Ensembl
  16. nervous system development Source: ProtInc
  17. neuromuscular synaptic transmission Source: Ensembl
  18. neuron projection morphogenesis Source: GO_Central
  19. peripheral nervous system development Source: Ensembl
  20. positive chemotaxis Source: GOC
  21. positive regulation of actin cytoskeleton reorganization Source: BHF-UCL
  22. positive regulation of cell migration Source: BHF-UCL
  23. positive regulation of cell proliferation Source: BHF-UCL
  24. positive regulation of glial cell differentiation Source: Ensembl
  25. positive regulation of peptidyl-serine phosphorylation Source: BHF-UCL
  26. positive regulation of peptidyl-tyrosine phosphorylation Source: BHF-UCL
  27. positive regulation of receptor internalization Source: BHF-UCL
  28. positive regulation of transcription from RNA polymerase II promoter Source: Ensembl
  29. regulation of neuron differentiation Source: GO_Central
  30. regulation of synaptic transmission Source: Ensembl
  31. signal transduction Source: ProtInc
  32. smooth muscle cell differentiation Source: Ensembl
  33. transmembrane receptor protein tyrosine kinase signaling pathway Source: BHF-UCL

Keywords - Molecular function

Growth factor

Enzyme and pathway databases

SignaLink: P20783.

Names & Taxonomy

Protein names
Recommended name: Neurotrophin-3
Short name: NT-3
Alternative name(s):
HDNF
Nerve growth factor 2 (Short name: NGF-2)
Neurotrophic factor
Gene names
Name: NTF3
Organism: Homo sapiens (Human)
Taxonomic identifier: 9606 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Primates > Haplorrhini > Catarrhini > Hominidae > Homo
Proteomes: UP000005640. Component: Chromosome 12

Organism-specific databases

HGNC: HGNC:8023. NTF3.

Subcellular location

GO - Cellular component

  1. cytoplasmic membrane-bounded vesicle Source: GO_Central
  2. extracellular region Source: GO_Central

Keywords - Cellular component

Secreted

Pathology & Biotech

Organism-specific databases

PharmGKB: PA31806.

Polymorphism and mutation databases

BioMuta: NTF3.
DMDM: 128581.

PTM / Processing

Molecule processing

Feature key      Position(s)   Length   Description           Feature identifier
Signal peptide   1 – 18        18       (Sequence Analysis)
Propeptide       19 – 138      120                            PRO_0000019659
Chain            139 – 257     119      Neurotrophin-3        PRO_0000019660
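
The positions in the molecule-processing table are 1-based and inclusive, so each length equals end minus start plus one. The following is a minimal Python sketch of that arithmetic, with the three ranges copied from the table; the helper names `feature_length` and `tiles_without_gaps` are hypothetical and not part of any UniProt tool.

```python
# Molecule-processing features of P20783, copied from the table above.
# Positions are 1-based and inclusive.
FEATURES = [
    ("Signal peptide", 1, 18),
    ("Propeptide", 19, 138),
    ("Chain", 139, 257),
]


def feature_length(start: int, end: int) -> int:
    """Length of a 1-based inclusive range (hypothetical helper)."""
    return end - start + 1


def tiles_without_gaps(features, total_length: int) -> bool:
    """True if the features cover 1..total_length contiguously and in order."""
    expected_start = 1
    for _, start, end in features:
        if start != expected_start or end < start:
            return False
        expected_start = end + 1
    return expected_start == total_length + 1


lengths = {name: feature_length(start, end) for name, start, end in FEATURES}
covers_precursor = tiles_without_gaps(FEATURES, 257)
```

Under these assumptions the signal peptide, propeptide and mature chain account for every residue of the 257-residue precursor.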

Amino acid modifications

Feature key      Position(s)   Length   Description
Glycosylation    131 – 131     1        N-linked (GlcNAc...) (Sequence Analysis)
Disulfide bond   152 ↔ 217
Disulfide bond   195 ↔ 246
Disulfide bond   205 ↔ 248

Keywords - PTM

Cleavage on pair of basic residues, Disulfide bond, Glycoprotein

Proteomic databases

PaxDb: P20783.
PRIDE: P20783.

PTM databases

PhosphoSite: P20783.

Miscellaneous databases

PMAP-CutDB: P20783.

Expression

Tissue specificity

Brain and peripheral tissues.

Gene expression databases

Bgee: P20783.
CleanEx: HS_NTF3.
Genevestigator: P20783.

Organism-specific databases

HPA: HPA032000.
    HPA032001.

Interaction

Binary interactions

With    Entry    #Exp.   IntAct                      Notes
MEOX2   A4D127   3       EBI-1025994, EBI-10172134

Protein-protein interaction databases

BioGrid: 110963. 10 interactions.
DIP: DIP-346N.
IntAct: P20783. 3 interactions.
MINT: MINT-188565.
STRING: 9606.ENSP00000397297.

Structure

Secondary structure

Feature key   Position(s)   Length   Description
Beta strand   148 – 152     5        Combined sources
Beta strand   154 – 159     6        Combined sources
Beta strand   164 – 167     4        Combined sources
Beta strand   172 – 175     4        Combined sources
Beta strand   177 – 179     3        Combined sources
Turn          181 – 183     3        Combined sources
Beta strand   190 – 195     6        Combined sources
Beta strand   200 – 205     6        Combined sources
Turn          210 – 212     3        Combined sources
Beta strand   216 – 233     18       Combined sources
Beta strand   236 – 249     14       Combined sources

3D structure databases

Entry   Method   Resolution (Å)   Chain   Positions
1B8K    X-ray    2.15             A       139-257
1BND    X-ray    2.30             B       139-257
1NT3    X-ray    2.40             A       139-257
3BUK    X-ray    2.60             A/B     139-257

ProteinModelPortal: P20783.
SMR: P20783. Positions 146-253.

Miscellaneous databases

EvolutionaryTrace: P20783.

Family & Domains

Sequence similarities

Belongs to the NGF-beta family. (Curated)

Keywords - Domain

Signal

Phylogenomic databases

eggNOG: NOG39844.
GeneTree: ENSGT00390000007725.
HOGENOM: HOG000231516.
HOVERGEN: HBG006494.
InParanoid: P20783.
KO: K04356.
OMA: YAEHKSH.
OrthoDB: EOG7RBZ8Z.
PhylomeDB: P20783.
TreeFam: TF106463.

Family and domain databases

Gene3D: 2.10.90.10. 1 hit.
InterPro: IPR029034. Cystine-knot_cytokine.
    IPR020408. Nerve_growth_factor-like.
    IPR002072. Nerve_growth_factor-rel.
    IPR019846. Nerve_growth_factor_CS.
    IPR015578. Neurotrophin-3.
PANTHER: PTHR11589. PTHR11589. 1 hit.
    PTHR11589:SF4. PTHR11589:SF4. 1 hit.
Pfam: PF00243. NGF. 1 hit.
PIRSF: PIRSF001789. NGF. 1 hit.
PRINTS: PR01914. NEUROTROPHN3.
    PR00268. NGF.
ProDom: PD002052. Nerve_growth_factor-rel. 1 hit.
SMART: SM00140. NGF. 1 hit.
SUPFAM: SSF57501. SSF57501. 1 hit.
PROSITE: PS00248. NGF_1. 1 hit.
    PS50270. NGF_2. 1 hit.

Sequences (2)

Sequence status: Complete.

Sequence processing: The displayed sequence is further processed into a mature form.

This entry describes 2 isoforms produced by alternative splicing.

Isoform 1 (identifier: P20783-1) [UniParc]

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.

        10         20         30         40         50
MSILFYVIFL AYLRGIQGNN MDQRSLPEDS LNSLIIKLIQ ADILKNKLSK
        60         70         80         90        100
QMVDVKENYQ STLPKAEAPR EPERGGPAKS AFQPVIAMDT ELLRQQRRYN
       110        120        130        140        150
SPRVLLSDST PLEPPPLYLM EDYVGSPVVA NRTSRRKRYA EHKSHRGEYS
       160        170        180        190        200
VCDSESLWVT DKSSAIDIRG HQVTVLGEIK TGNSPVKQYF YETRCKEARP
       210        220        230        240        250
VKNGCRGIDD KHWNSQCKTS QTYVRALTSE NNKLVGWRWI RIDTSCVCAL

SRKIGRT
Length: 257
Mass (Da): 29,355
Last modified: February 1, 1991 - v1
Checksum: 39A5BB3B28E25E03
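
Because all positional information in this entry refers to the canonical sequence, the annotated sites can be cross-checked against it directly. The sketch below is a minimal illustration in Python: the sequence and positions are copied from this entry, and the helper names `residue` and `region` are hypothetical. It assumes the usual N-x-S/T sequon for N-linked glycosylation.

```python
# Canonical isoform 1 of P20783, copied from this entry (257 residues).
SEQ = (
    "MSILFYVIFLAYLRGIQGNNMDQRSLPEDSLNSLIIKLIQADILKNKLSK"
    "QMVDVKENYQSTLPKAEAPREPERGGPAKSAFQPVIAMDTELLRQQRRYN"
    "SPRVLLSDSTPLEPPPLYLMEDYVGSPVVANRTSRRKRYAEHKSHRGEYS"
    "VCDSESLWVTDKSSAIDIRGHQVTVLGEIKTGNSPVKQYFYETRCKEARP"
    "VKNGCRGIDDKHWNSQCKTSQTYVRALTSENNKLVGWRWIRIDTSCVCAL"
    "SRKIGRT"
)

# Disulfide bond positions as listed under "Amino acid modifications".
DISULFIDES = [(152, 217), (195, 246), (205, 248)]


def residue(seq: str, pos: int) -> str:
    """Residue at a 1-based position (hypothetical helper)."""
    return seq[pos - 1]


def region(seq: str, start: int, end: int) -> str:
    """Subsequence for a 1-based inclusive range (hypothetical helper)."""
    return seq[start - 1:end]


# Every disulfide partner should be a cysteine.
all_cysteines = all(
    residue(SEQ, a) == "C" and residue(SEQ, b) == "C" for a, b in DISULFIDES
)

# Glycosylation at 131: the asparagine and the two residues after it.
glyco_sequon = region(SEQ, 131, 133)

# Last four residues of the propeptide, just before the mature chain.
cleavage_context = region(SEQ, 135, 138)

# Mature Neurotrophin-3 chain and the residue affected by the Glu-76 variant.
mature_chain = region(SEQ, 139, 257)
variant_site = residue(SEQ, 76)
```

With the sequence as shown, the three disulfide pairs fall on cysteines, position 131 opens an N-R-T sequon, and the propeptide ends in the basic stretch RRKR, in line with the "Cleavage on pair of basic residues" keyword.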
Isoform 2 (identifier: P20783-2) [UniParc]

The sequence of this isoform differs from the canonical sequence as follows:
     1-1: M → MVTFATILQVNKVM

Note: No experimental confirmation available.

Length: 270
Mass (Da): 30,800
Checksum: 43B3342B32424E97
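
The isoform 2 sequence is not displayed, but it can be derived by applying the stated difference (1-1: M → MVTFATILQVNKVM) to the canonical sequence. The following is a minimal Python sketch; `apply_replacement` is a hypothetical helper, and the sequence is copied from isoform 1 in this entry.

```python
# Canonical isoform 1 of P20783, copied from this entry (257 residues).
CANONICAL = (
    "MSILFYVIFLAYLRGIQGNNMDQRSLPEDSLNSLIIKLIQADILKNKLSK"
    "QMVDVKENYQSTLPKAEAPREPERGGPAKSAFQPVIAMDTELLRQQRRYN"
    "SPRVLLSDSTPLEPPPLYLMEDYVGSPVVANRTSRRKRYAEHKSHRGEYS"
    "VCDSESLWVTDKSSAIDIRGHQVTVLGEIKTGNSPVKQYFYETRCKEARP"
    "VKNGCRGIDDKHWNSQCKTSQTYVRALTSENNKLVGWRWIRIDTSCVCAL"
    "SRKIGRT"
)


def apply_replacement(seq: str, start: int, end: int, new: str) -> str:
    """Replace the 1-based inclusive range start..end of seq with new.

    Hypothetical helper mirroring the 'alternative sequence' notation
    used in this entry (here: 1-1: M -> MVTFATILQVNKVM).
    """
    return seq[:start - 1] + new + seq[end:]


isoform2 = apply_replacement(CANONICAL, 1, 1, "MVTFATILQVNKVM")
```

Replacing one residue with fourteen adds 13 residues, which agrees with the stated isoform 2 length of 270.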

Polymorphism

Variant Glu-76 (frequently reported as Glu-63) was thought to be associated with severe forms of schizophrenia. This does not seem to be the case.

Natural variant

Feature key       Position(s)   Length   Description                                                Feature identifier
Natural variant   76 – 76       1        G → E. Corresponds to variant rs1805149. 2 Publications    VAR_012084

Alternative sequence

Feature key            Position(s)   Length   Description                                       Feature identifier
Alternative sequence   1 – 1         1        M → MVTFATILQVNKVM in isoform 2. 1 Publication    VSP_043353

Sequence databases

EMBL / GenBank / DDBJ:
X53655 mRNA. Translation: CAA37703.1.
M37763 Genomic DNA. Translation: AAA59953.1.
M61180 Genomic DNA. Translation: AAA63231.1.
AK293895 mRNA. Translation: BAH11621.1.
CR541906 mRNA. Translation: CAG46704.1.
CH471116 Genomic DNA. Translation: EAW88824.1.
CH471116 Genomic DNA. Translation: EAW88825.1.
BC069773 mRNA. Translation: AAH69773.1.
BC107075 mRNA. Translation: AAI07076.1.
CCDS: CCDS44806.1. [P20783-2]
    CCDS8538.1. [P20783-1]
PIR: A36208. C40304.
RefSeq: NP_001096124.1. NM_001102654.1. [P20783-2]
    NP_002518.1. NM_002527.4. [P20783-1]
UniGene: Hs.99171.

Genome annotation databases

Ensembl: ENST00000331010; ENSP00000328738; ENSG00000185652. [P20783-1]
    ENST00000423158; ENSP00000397297; ENSG00000185652. [P20783-2]
GeneID: 4908.
KEGG: hsa:4908.
UCSC: uc001qnk.4. human. [P20783-2]
    uc001qnl.4. human. [P20783-1]

Keywords - Coding sequence diversity

Alternative splicing, Polymorphism

Cross-references

Protocols and materials databases

DNASU: 4908.

Organism-specific databases

CTD: 4908.
GeneCards: GC12P005541.
HGNC: HGNC:8023. NTF3.
HPA: HPA032000.
    HPA032001.
MIM: 162660. gene.
neXtProt: NX_P20783.
PharmGKB: PA31806.

Miscellaneous databases

EvolutionaryTrace: P20783.
GeneWiki: Neurotrophin-3.
GenomeRNAi: 4908.
NextBio: 18887.
PMAP-CutDB: P20783.
PRO: P20783.

Publications

  1. "Cloning and expression of a cDNA encoding a novel human neurotrophic factor."
    Kaisho Y., Yoshimura K., Nakahama K.
    FEBS Lett. 266:187-191(1990)
    Cited for: NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM 1).
  2. "Primary structure and biological activity of a novel human neurotrophic factor."
    Rosenthal A., Goeddel D.V., Nguyen T., Lewis M., Shih A., Laramee G.R., Nikolics K., Winslow J.W.
    Neuron 4:767-773(1990)
    Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA / MRNA] (ISOFORM 1).
  3. "Molecular cloning of a human gene that is a member of the nerve growth factor family."
    Jones K.R., Reichardt L.F.
    Proc. Natl. Acad. Sci. U.S.A. 87:8060-8064(1990)
    Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA].
  4. "Human and rat brain-derived neurotrophic factor and neurotrophin-3: gene structures, distributions, and chromosomal localizations."
    Maisonpierre P.C., le Beau M.M., Espinosa R. III, Ip N.Y., Belluscio L., de la Monte S.M., Squinto S., Furth M.E., Yancopoulos G.D.
    Genomics 10:558-568(1991)
    Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA].
  5. "Complete sequencing and characterization of 21,243 full-length human cDNAs."
    Ota T., Suzuki Y., Nishikawa T., Otsuki T., Sugiyama T., Irie R., Wakamatsu A., Hayashi K., Sato H., Nagai K., Kimura K., Makita H., Sekine M., Obayashi M., Nishi T., Shibahara T., Tanaka T., Ishii S., Yamamoto J., Saito K., Kawai Y., Isono Y., Nakamura Y., Nagahari K., Murakami K., Yasuda T., Iwayanagi T., Wagatsuma M., Shiratori A., Sudo H., Hosoiri T., Kaku Y., Kodaira H., Kondo H., Sugawara M., Takahashi M., Kanda K., Yokoi T., Furuya T., Kikkawa E., Omura Y., Abe K., Kamihara K., Katsuta N., Sato K., Tanikawa M., Yamazaki M., Ninomiya K., Ishibashi T., Yamashita H., Murakawa K., Fujimori K., Tanai H., Kimata M., Watanabe M., Hiraoka S., Chiba Y., Ishida S., Ono Y., Takiguchi S., Watanabe S., Yosida M., Hotuta T., Kusano J., Kanehori K., Takahashi-Fujii A., Hara H., Tanase T.-O., Nomura Y., Togiya S., Komai F., Hara R., Takeuchi K., Arita M., Imose N., Musashino K., Yuuki H., Oshima A., Sasaki N., Aotsuka S., Yoshikawa Y., Matsunawa H., Ichihara T., Shiohata N., Sano S., Moriya S., Momiyama H., Satoh N., Takami S., Terashima Y., Suzuki O., Nakagawa S., Senoh A., Mizoguchi H., Goto Y., Shimizu F., Wakebe H., Hishigaki H., Watanabe T., Sugiyama A., Takemoto M., Kawakami B., Yamazaki M., Watanabe K., Kumagai A., Itakura S., Fukuzumi Y., Fujimori Y., Komiyama M., Tashiro H., Tanigami A., Fujiwara T., Ono T., Yamada K., Fujii Y., Ozaki K., Hirao M., Ohmori Y., Kawabata A., Hikiji T., Kobatake N., Inagaki H., Ikema Y., Okamoto S., Okitani R., Kawakami T., Noguchi S., Itoh T., Shigeta K., Senba T., Matsumura K., Nakajima Y., Mizuno T., Morinaga M., Sasaki M., Togashi T., Oyama M., Hata H., Watanabe M., Komatsu T., Mizushima-Sugano J., Satoh T., Shirai Y., Takahashi Y., Nakagawa K., Okumura K., Nagase T., Nomura N., Kikuchi H., Masuho Y., Yamashita R., Nakai K., Yada T., Nakamura Y., Ohara O., Isogai T., Sugano S.
    Nat. Genet. 36:40-45(2004)
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 2).
    Tissue: Cerebellum.
  6. "Cloning of human full open reading frames in Gateway(TM) system entry vector (pDONR201)."
    Halleck A., Ebert L., Mkoundinya M., Schick M., Eisenstein S., Neubert P., Kstrang K., Schatten R., Shen B., Henze S., Mar W., Korn B., Zuo D., Hu Y., LaBaer J.
    Submitted (JUN-2004) to the EMBL/GenBank/DDBJ databases
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1).
  7. Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
  8. "The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)."
    The MGC Project Team
    Genome Res. 14:2121-2127(2004)
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1).
  9. "Evolutionary studies of the nerve growth factor family reveal a novel member abundantly expressed in Xenopus ovary."
    Hallboeoek F., Ibanez C.F., Persson H.
    Neuron 6:845-858(1991)
    Cited for: NUCLEOTIDE SEQUENCE OF 194-236.
    Tissue: Leukocyte.
  10. "Structure of the brain-derived neurotrophic factor/neurotrophin 3 heterodimer."
    Robinson R.C., Radziejewski C., Stuart D.I., Jones E.Y.
    Biochemistry 34:4139-4146(1995)
    Cited for: X-RAY CRYSTALLOGRAPHY (2.3 ANGSTROMS).
  11. "Association of neurotrophin-3 gene variant with severe forms of schizophrenia."
    Hattori M., Nanko S.
    Biochem. Biophys. Res. Commun. 209:513-518(1995)
    Cited for: VARIANT GLU-76.
  12. "Failure to find associations of the CA repeat polymorphism in the first intron and the Gly-63/Glu-63 polymorphism of the neurotrophin-3 gene with schizophrenia."
    Arinami T., Takekoshi K., Itokawa M., Hamaguchi H., Toru M.
    Psychiatr. Genet. 6:13-15(1996)
    Cited for: VARIANT GLU-76.

Entry information

Entry name: NTF3_HUMAN
Accession: Primary (citable) accession number: P20783
Secondary accession number(s): B7Z1T5, Q6FH50
Entry history
Integrated into UniProtKB/Swiss-Prot: February 1, 1991
Last sequence update: February 1, 1991
Last modified: April 29, 2015
This is version 157 of the entry and version 1 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Chordata Protein Annotation Program
Disclaimer: Any medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care.

Miscellaneous

Keywords - Technical term

3D-structure, Complete proteome, Reference proteome

Documents

  1. Human chromosome 12
    Human chromosome 12: entries, gene names and cross-references to MIM
  2. Human entries with polymorphisms or disease mutations
    List of human entries with polymorphisms or disease mutations
  3. Human polymorphisms and disease mutations
    Index of human polymorphisms and disease mutations
  4. MIM cross-references
    Online Mendelian Inheritance in Man (MIM) cross-references in UniProtKB/Swiss-Prot
  5. PDB cross-references
    Index of Protein Data Bank (PDB) cross-references
  6. SIMILARITY comments
    Index of protein domains and families

Similar proteins

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%: UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into a single UniRef entry.
90%: UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a. seed sequence).
50%: UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.