Protein

Regulator of microtubule dynamics protein 2

Gene

RMDN2

Organism
Homo sapiens (Human)
Status
Reviewed. Annotation score: 5 out of 5. Experimental evidence at protein level.

Names & Taxonomy

Protein names
Recommended name:
Regulator of microtubule dynamics protein 2
Short name:
RMD-2
Short name:
hRMD-2
Alternative name(s):
Protein FAM82A1
Gene names
Name: RMDN2
Synonyms: FAM82A, FAM82A1
ORF Names: BLOCK18, UNQ9371/PRO34163
Organism: Homo sapiens (Human)
Taxonomic identifier: 9606 [NCBI]
Taxonomic lineage: Eukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Primates > Haplorrhini > Catarrhini > Hominidae > Homo
Proteomes
  • UP000005640 Component: Chromosome 2

Organism-specific databases

HGNC: HGNC:26567. RMDN2.

Subcellular location

  • Membrane (Curated); Single-pass membrane protein (Curated)
  • Cytoplasm (1 Publication)
  • Cytoplasm > cytoskeleton > spindle (1 Publication)
  • Cytoplasm > cytoskeleton > spindle pole (1 Publication)

  • Note: In interphase localizes in the cytoplasm, and during mitosis localizes to the spindle microtubules and spindle poles. Also detected as large dots in the perinuclear region.

Topology

Feature key | Position(s) | Length | Description
Transmembrane | 9 – 28 | 20 | Helical (Sequence analysis)

Keywords - Cellular component

Cytoplasm, Cytoskeleton, Membrane, Microtubule

Pathology & Biotech

Organism-specific databases

PharmGKB: PA162387925.

Polymorphism and mutation databases

BioMuta: RMDN2.
DMDM: 147643051.

PTM / Processing

Molecule processing

Feature key | Position(s) | Length | Description | Feature identifier
Chain | 1 – 410 | 410 | Regulator of microtubule dynamics protein 2 | PRO_0000287504

Amino acid modifications

Feature key | Position(s) | Length | Description
Modified residue | 51 | 1 | Phosphoserine (By similarity)
Modified residue | 121 | 1 | Phosphoserine (By similarity)
Modified residue | 139 | 1 | Phosphothreonine (Combined sources)
Modified residue | 152 | 1 | Phosphotyrosine (Combined sources)
Modified residue | 154 | 1 | Phosphothreonine (Combined sources)
Modified residue | 157 | 1 | Phosphothreonine (Combined sources)

Keywords - PTM

Phosphoprotein
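
The modified-residue positions listed in this section are given in canonical (isoform 1) coordinates, so they can be sanity-checked against the isoform 1 sequence reported in this entry. The following is a minimal Python sketch of such a check, assuming 1-based positions; the helper names `residue_at` and `find_mismatches` are illustrative only and do not belong to any UniProt tooling.

```python
# Canonical isoform 1 (Q96LZ7-1) sequence as listed in this entry, in blocks of 10.
CANONICAL = "".join("""
MPYSTNKELI LGIMVGTAGI SLLLLWYHKV RKPGIAMKLP EFLSLGNTFN
SITLQDEIHD DQGTTVIFQE RQLQILEKLN ELLTNMEELK EEIRFLKEAI
PKLEEYIQDE LGGKITVHKI SPQHRARKRR LPTIQSSATS NSSEEAESEG
GYITANTDTE EQSFPVPKAF NTRVEELNLD VLLQKVDHLR MSESGKSESF
ELLRDHKEKF RDEIEFMWRF ARAYGDMYEL STNTQEKKHY ANIGKTLSER
AINRAPMNGH CHLWYAVLCG YVSEFEGLQN KINYGHLFKE HLDIAIKLLP
EEPFLYYLKG RYCYTVSKLS WIEKKMAATL FGKIPSSTVQ EALHNFLKAE
ELCPGYSNPN YMYLAKCYTD LEENQNALKF CNLALLLPTV TKEDKEAQKE
MQKIMTSLKR
""".split())

# Annotated phosphosites: 1-based position -> residue implied by the
# modification name (phosphoserine = S, phosphothreonine = T, phosphotyrosine = Y).
PHOSPHOSITES = {51: "S", 121: "S", 139: "T", 152: "Y", 154: "T", 157: "T"}


def residue_at(seq: str, pos: int) -> str:
    """Return the residue at a 1-based position, as used in the feature tables."""
    if not 1 <= pos <= len(seq):
        raise IndexError(f"position {pos} is outside 1..{len(seq)}")
    return seq[pos - 1]


def find_mismatches(seq: str, sites: dict) -> list:
    """List (position, expected, found) for every site whose residue disagrees."""
    return [
        (pos, expected, residue_at(seq, pos))
        for pos, expected in sorted(sites.items())
        if residue_at(seq, pos) != expected
    ]


print(find_mismatches(CANONICAL, PHOSPHOSITES))  # → []
```

An empty list means every annotated position carries the residue type its modification name implies. The same lookup applies to any other positional feature in the entry, such as the natural variant at position 259.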

Proteomic databases

EPD: Q96LZ7.
MaxQB: Q96LZ7.
PaxDb: Q96LZ7.
PRIDE: Q96LZ7.

PTM databases

iPTMnet: Q96LZ7.
PhosphoSite: Q96LZ7.

Expression

Gene expression databases

Bgee: Q96LZ7.
CleanEx: HS_FAM82A1.
ExpressionAtlas: Q96LZ7. baseline and differential.
Genevisible: Q96LZ7. HS.

Organism-specific databases

HPA: HPA034705.
HPA034706.

Interaction

Subunit structure

Interacts with microtubules. (1 Publication)

Binary interactions

With | Entry | #Exp. | IntAct
AGTRAP | Q6RW13 | 3 | EBI-2806908, EBI-741181
CMTM5 | Q96DZ9 | 3 | EBI-2806908, EBI-2548702
VAPB | Q53XM7 | 3 | EBI-2806908, EBI-10178947

Protein-protein interaction databases

BioGrid: 127373. 9 interactions.
IntAct: Q96LZ7. 5 interactions.
STRING: 9606.ENSP00000234195.

Structure

3D structure databases

ProteinModelPortal: Q96LZ7.
SMR: Q96LZ7. Positions 337-396.

Family & Domains

Coiled coil

Feature key | Position(s) | Length | Description
Coiled coil | 68 – 110 | 43 | Sequence analysis

Sequence similarities

Belongs to the RMDN family. (Curated)

Keywords - Domain

Coiled coil, Transmembrane, Transmembrane helix

Phylogenomic databases

eggNOG: ENOG410J8TA. Eukaryota.
ENOG411167M. LUCA.
GeneTree: ENSGT00530000063162.
HOVERGEN: HBG072518.
InParanoid: Q96LZ7.
OMA: VIFQERQ.
OrthoDB: EOG7BP834.
PhylomeDB: Q96LZ7.
TreeFam: TF315854.

Family and domain databases

Gene3D: 1.25.40.10. 1 hit.
InterPro: IPR011990. TPR-like_helical_dom.
SUPFAM: SSF48452. SSF48452. 1 hit.

Sequences (4)

Sequence status: Complete.

This entry describes 4 isoforms produced by alternative splicing.

Isoform 1 (identifier: Q96LZ7-1)

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.

        10         20         30         40         50
MPYSTNKELI LGIMVGTAGI SLLLLWYHKV RKPGIAMKLP EFLSLGNTFN
        60         70         80         90        100
SITLQDEIHD DQGTTVIFQE RQLQILEKLN ELLTNMEELK EEIRFLKEAI
       110        120        130        140        150
PKLEEYIQDE LGGKITVHKI SPQHRARKRR LPTIQSSATS NSSEEAESEG
       160        170        180        190        200
GYITANTDTE EQSFPVPKAF NTRVEELNLD VLLQKVDHLR MSESGKSESF
       210        220        230        240        250
ELLRDHKEKF RDEIEFMWRF ARAYGDMYEL STNTQEKKHY ANIGKTLSER
       260        270        280        290        300
AINRAPMNGH CHLWYAVLCG YVSEFEGLQN KINYGHLFKE HLDIAIKLLP
       310        320        330        340        350
EEPFLYYLKG RYCYTVSKLS WIEKKMAATL FGKIPSSTVQ EALHNFLKAE
       360        370        380        390        400
ELCPGYSNPN YMYLAKCYTD LEENQNALKF CNLALLLPTV TKEDKEAQKE
       410
MQKIMTSLKR
Length:410
Mass (Da):47,399
Last modified:May 15, 2007 - v2
Checksum:iDE5249E5808CA42A
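
The isoform 1 listing interleaves position rulers with blocks of ten residues, so it has to be flattened before positions can be indexed. A small Python sketch of one way to do that follows; the function name `parse_blocked_sequence` is hypothetical, and the only assumption is that ruler lines contain nothing but integers.

```python
# Sequence block as displayed for isoform 1: ruler lines alternate with residue lines.
RAW = """\
        10         20         30         40         50
MPYSTNKELI LGIMVGTAGI SLLLLWYHKV RKPGIAMKLP EFLSLGNTFN
60 70 80 90 100
SITLQDEIHD DQGTTVIFQE RQLQILEKLN ELLTNMEELK EEIRFLKEAI
110 120 130 140 150
PKLEEYIQDE LGGKITVHKI SPQHRARKRR LPTIQSSATS NSSEEAESEG
160 170 180 190 200
GYITANTDTE EQSFPVPKAF NTRVEELNLD VLLQKVDHLR MSESGKSESF
210 220 230 240 250
ELLRDHKEKF RDEIEFMWRF ARAYGDMYEL STNTQEKKHY ANIGKTLSER
260 270 280 290 300
AINRAPMNGH CHLWYAVLCG YVSEFEGLQN KINYGHLFKE HLDIAIKLLP
310 320 330 340 350
EEPFLYYLKG RYCYTVSKLS WIEKKMAATL FGKIPSSTVQ EALHNFLKAE
360 370 380 390 400
ELCPGYSNPN YMYLAKCYTD LEENQNALKF CNLALLLPTV TKEDKEAQKE
410
MQKIMTSLKR
"""


def parse_blocked_sequence(text: str) -> str:
    """Join residue blocks into one string, skipping all-numeric ruler lines."""
    blocks = []
    for line in text.splitlines():
        tokens = line.split()
        # Blank lines and ruler lines (integers only) carry no residues.
        if not tokens or all(tok.isdigit() for tok in tokens):
            continue
        for tok in tokens:
            if not tok.isalpha():
                raise ValueError(f"unexpected token in sequence line: {tok!r}")
            blocks.append(tok.upper())
    return "".join(blocks)


sequence = parse_blocked_sequence(RAW)
print(len(sequence))  # → 410
```

The parsed length agrees with the "Length:410" value reported for isoform 1.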
Isoform 2 (identifier: Q96LZ7-2)

The sequence of this isoform differs from the canonical sequence as follows:
     1-151: MPYSTNKELI...SSEEAESEGG → MGKCLSCCKE...SSPIEIPKIR
     394-410: DKEAQKEMQKIMTSLKR → EL

Length:573
Mass (Da):65,170
Checksum:i249BFF67E72C7F40
Isoform 3 (identifier: Q96LZ7-3)

The sequence of this isoform differs from the canonical sequence as follows:
     1-151: MPYSTNKELI...SSEEAESEGG → MGKCLR
     349-410: AEELCPGYSNPNYMYLAKCYTDLEENQNALKFCNLALLLPTVTKEDKEAQKEMQKIMTSLKR → VHFVYPLF

Note: No experimental confirmation available.
Length:211
Mass (Da):24,806
Checksum:iA3C86418A2644414
Isoform 4 (identifier: Q96LZ7-4)

The sequence of this isoform differs from the canonical sequence as follows:
     1-151: MPYSTNKELI...SSEEAESEGG → MGKCLR

Length:265
Mass (Da):30,961
Checksum:iB2DE94C98C9542EF
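
Each non-canonical isoform is described as a set of replacements in canonical coordinates, so its sequence can be rebuilt from isoform 1. The Python sketch below does this for isoforms 3 and 4, whose replacement strings are given in full in this entry. The function `apply_variations` is an illustrative helper, not a UniProt API; it assumes 1-based inclusive, non-overlapping ranges and applies them from right to left so that earlier coordinates stay valid.

```python
# Canonical isoform 1 (Q96LZ7-1) sequence as listed in this entry, in blocks of 10.
CANONICAL = "".join("""
MPYSTNKELI LGIMVGTAGI SLLLLWYHKV RKPGIAMKLP EFLSLGNTFN
SITLQDEIHD DQGTTVIFQE RQLQILEKLN ELLTNMEELK EEIRFLKEAI
PKLEEYIQDE LGGKITVHKI SPQHRARKRR LPTIQSSATS NSSEEAESEG
GYITANTDTE EQSFPVPKAF NTRVEELNLD VLLQKVDHLR MSESGKSESF
ELLRDHKEKF RDEIEFMWRF ARAYGDMYEL STNTQEKKHY ANIGKTLSER
AINRAPMNGH CHLWYAVLCG YVSEFEGLQN KINYGHLFKE HLDIAIKLLP
EEPFLYYLKG RYCYTVSKLS WIEKKMAATL FGKIPSSTVQ EALHNFLKAE
ELCPGYSNPN YMYLAKCYTD LEENQNALKF CNLALLLPTV TKEDKEAQKE
MQKIMTSLKR
""".split())

# (start, end, replacement) in 1-based inclusive canonical coordinates,
# taken from the "differs from the canonical sequence as follows" lists.
ISOFORM_3 = [(1, 151, "MGKCLR"), (349, 410, "VHFVYPLF")]
ISOFORM_4 = [(1, 151, "MGKCLR")]


def apply_variations(seq: str, variations: list) -> str:
    """Rebuild an isoform by applying replacements from the C-terminus backwards."""
    result = seq
    for start, end, replacement in sorted(variations, reverse=True):
        if not 1 <= start <= end <= len(seq):
            raise ValueError(f"range {start}-{end} is outside 1..{len(seq)}")
        result = result[:start - 1] + replacement + result[end:]
    return result


isoform3 = apply_variations(CANONICAL, ISOFORM_3)
isoform4 = apply_variations(CANONICAL, ISOFORM_4)
print(len(isoform3), len(isoform4))  # → 211 265
```

Both lengths match the values reported for isoforms 3 and 4. Isoform 2 can be rebuilt the same way once its full 1-151 replacement string is supplied.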

Sequence caution

The sequence AAM81211.1 differs from that shown. Reason: Erroneous gene model prediction. Curated

Experimental Info

Feature key | Position(s) | Length | Description
Isoform 2 (identifier: Q96LZ7-2)
Sequence conflict | 259 | 1 | G → D in BAB71517 (PubMed:14702039). (Curated)

Natural variant

Feature key | Position(s) | Length | Description | Feature identifier
Natural variant | 259 | 1 | G → D. (1 Publication) Corresponds to variant rs4670800 (dbSNP, Ensembl). | VAR_032316

Alternative sequence

Feature key | Position(s) | Length | Description | Feature identifier
Alternative sequence | 1 – 151 | 151 | MPYST…ESEGG → MGKCLSCCKEDQSFQRCSPE DQVSTDAQHRGASSISQPSI SLGHKTSYSPVTHKVNAAKA SRRLLSVSSPSFSERRYSLF VGFQKRNASPYWQQSRANFD SEEDTGFTDIKSSSDHCGSF ISRRRRFSSRKLSIVSYYKS AIFFDPQASGQNVFNLNEIE IFSKTSSNTDAKKHITISAP EYNTKNFKNFETNTTSPAFG NTIDTASYQQSTSSFFSLAS DISSPDQQNGIANDIQQRGQ LCKDLKDFLHPRPESYSTGH SPIMIPQHPSQSGTFPFLHK AGFSSSYKNSGCFIPPQSEL TSGLFEDEDFAVLFQDEDRS SPIEIPKIR in isoform 2. (1 Publication) | VSP_025524
Alternative sequence | 1 – 151 | 151 | MPYST…ESEGG → MGKCLR in isoform 3 and isoform 4. (2 Publications) | VSP_025525
Alternative sequence | 349 – 410 | 62 | AEELC…TSLKR → VHFVYPLF in isoform 3. (1 Publication) | VSP_025526
Alternative sequence | 394 – 410 | 17 | DKEAQ…TSLKR → EL in isoform 2. (1 Publication) | VSP_025527

Sequence databases

AK057516 mRNA. Translation: BAB71517.1.
AK095462 mRNA. Translation: BAC04551.1.
AY358269 mRNA. Translation: AAQ88636.1.
AF435956 mRNA. Translation: AAM20907.1.
AC009229 Genomic DNA. No translation available.
AC016689 Genomic DNA. Translation: AAX88865.1.
CH471053 Genomic DNA. Translation: EAX00385.1.
CH471053 Genomic DNA. Translation: EAX00387.1.
BC024243 mRNA. Translation: AAH24243.3.
AH011736 Genomic DNA. Translation: AAM81211.1. Sequence problems.
BR000689 mRNA. Translation: FAA00414.1.
BR000692 mRNA. Translation: FAA00417.1.
CCDS: CCDS1792.1. [Q96LZ7-2]
CCDS54351.1. [Q96LZ7-1]
CCDS54352.1. [Q96LZ7-4]
RefSeq: NP_001164262.1. NM_001170791.2. [Q96LZ7-1]
NP_001164263.1. NM_001170792.2. [Q96LZ7-1]
NP_001164264.1. NM_001170793.2. [Q96LZ7-4]
NP_653314.3. NM_144713.4.
UniGene: Hs.591566.

Genome annotation databases

Ensembl: ENST00000354545; ENSP00000346549; ENSG00000115841. [Q96LZ7-1]
ENST00000406384; ENSP00000386004; ENSG00000115841. [Q96LZ7-1]
ENST00000417700; ENSP00000392977; ENSG00000115841. [Q96LZ7-4]
GeneID: 151393.
KEGG: hsa:151393.
UCSC: uc002rql.4. human. [Q96LZ7-1]

Keywords - Coding sequence diversity

Alternative splicing, Polymorphism

Cross-references

Protocols and materials databases

DNASU: 151393.

Organism-specific databases

CTD: 151393.
GeneCards: RMDN2.
HGNC: HGNC:26567. RMDN2.
HPA: HPA034705.
HPA034706.
MIM: 611872. gene.
neXtProt: NX_Q96LZ7.
PharmGKB: PA162387925.

Miscellaneous databases

GenomeRNAi: 151393.
PRO: Q96LZ7.

Publications

  1. "Complete sequencing and characterization of 21,243 full-length human cDNAs."
    Ota T., Suzuki Y., Nishikawa T., Otsuki T., Sugiyama T., Irie R., Wakamatsu A., Hayashi K., Sato H., Nagai K., Kimura K., Makita H., Sekine M., Obayashi M., Nishi T., Shibahara T., Tanaka T., Ishii S., Yamamoto J., Saito K., Kawai Y., Isono Y., Nakamura Y., Nagahari K., Murakami K., Yasuda T., Iwayanagi T., Wagatsuma M., Shiratori A., Sudo H., Hosoiri T., Kaku Y., Kodaira H., Kondo H., Sugawara M., Takahashi M., Kanda K., Yokoi T., Furuya T., Kikkawa E., Omura Y., Abe K., Kamihara K., Katsuta N., Sato K., Tanikawa M., Yamazaki M., Ninomiya K., Ishibashi T., Yamashita H., Murakawa K., Fujimori K., Tanai H., Kimata M., Watanabe M., Hiraoka S., Chiba Y., Ishida S., Ono Y., Takiguchi S., Watanabe S., Yosida M., Hotuta T., Kusano J., Kanehori K., Takahashi-Fujii A., Hara H., Tanase T.-O., Nomura Y., Togiya S., Komai F., Hara R., Takeuchi K., Arita M., Imose N., Musashino K., Yuuki H., Oshima A., Sasaki N., Aotsuka S., Yoshikawa Y., Matsunawa H., Ichihara T., Shiohata N., Sano S., Moriya S., Momiyama H., Satoh N., Takami S., Terashima Y., Suzuki O., Nakagawa S., Senoh A., Mizoguchi H., Goto Y., Shimizu F., Wakebe H., Hishigaki H., Watanabe T., Sugiyama A., Takemoto M., Kawakami B., Yamazaki M., Watanabe K., Kumagai A., Itakura S., Fukuzumi Y., Fujimori Y., Komiyama M., Tashiro H., Tanigami A., Fujiwara T., Ono T., Yamada K., Fujii Y., Ozaki K., Hirao M., Ohmori Y., Kawabata A., Hikiji T., Kobatake N., Inagaki H., Ikema Y., Okamoto S., Okitani R., Kawakami T., Noguchi S., Itoh T., Shigeta K., Senba T., Matsumura K., Nakajima Y., Mizuno T., Morinaga M., Sasaki M., Togashi T., Oyama M., Hata H., Watanabe M., Komatsu T., Mizushima-Sugano J., Satoh T., Shirai Y., Takahashi Y., Nakagawa K., Okumura K., Nagase T., Nomura N., Kikuchi H., Masuho Y., Yamashita R., Nakai K., Yada T., Nakamura Y., Ohara O., Isogai T., Sugano S.
    Nat. Genet. 36:40-45(2004) [PubMed] [Europe PMC] [Abstract]
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORMS 2 AND 3), VARIANT ASP-259.
    Tissue: Testis.
  2. Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1).
  3. Guo J.H., Yu L.
    Submitted (OCT-2001) to the EMBL/GenBank/DDBJ databases
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 4).
  4. "Generation and annotation of the DNA sequences of human chromosomes 2 and 4."
    Hillier L.W., Graves T.A., Fulton R.S., Fulton L.A., Pepin K.H., Minx P., Wagner-McPherson C., Layman D., Wylie K., Sekhon M., Becker M.C., Fewell G.A., Delehaunty K.D., Miner T.L., Nash W.E., Kremitzki C., Oddy L., Du H., Sun H., Bradshaw-Cordum H., Ali J., Carter J., Cordes M., Harris A., Isak A., van Brunt A., Nguyen C., Du F., Courtney L., Kalicki J., Ozersky P., Abbott S., Armstrong J., Belter E.A., Caruso L., Cedroni M., Cotton M., Davidson T., Desai A., Elliott G., Erb T., Fronick C., Gaige T., Haakenson W., Haglund K., Holmes A., Harkins R., Kim K., Kruchowski S.S., Strong C.M., Grewal N., Goyea E., Hou S., Levy A., Martinka S., Mead K., McLellan M.D., Meyer R., Randall-Maher J., Tomlinson C., Dauphin-Kohlberg S., Kozlowicz-Reilly A., Shah N., Swearengen-Shahid S., Snider J., Strong J.T., Thompson J., Yoakum M., Leonard S., Pearman C., Trani L., Radionenko M., Waligorski J.E., Wang C., Rock S.M., Tin-Wollam A.-M., Maupin R., Latreille P., Wendl M.C., Yang S.-P., Pohl C., Wallis J.W., Spieth J., Bieri T.A., Berkowicz N., Nelson J.O., Osborne J., Ding L., Meyer R., Sabo A., Shotland Y., Sinha P., Wohldmann P.E., Cook L.L., Hickenbotham M.T., Eldred J., Williams D., Jones T.A., She X., Ciccarelli F.D., Izaurralde E., Taylor J., Schmutz J., Myers R.M., Cox D.R., Huang X., McPherson J.D., Mardis E.R., Clifton S.W., Warren W.C., Chinwalla A.T., Eddy S.R., Marra M.A., Ovcharenko I., Furey T.S., Miller W., Eichler E.E., Bork P., Suyama M., Torrents D., Waterston R.H., Wilson R.K.
    Nature 434:724-731(2005) [PubMed] [Europe PMC] [Abstract]
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
  5. Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
  6. "The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC)."
    The MGC Project Team
    Genome Res. 14:2121-2127(2004) [PubMed] [Europe PMC] [Abstract]
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1).
    Tissue: Brain.
  7. "Physical/genetic map of the 2p22-2p21 region on chromosome 2."
    Gorry M.C., Zhang Y., Marks J.J., Suppe B., Hart P.S., Cortelli J.R., Pallos D., Hart T.C.
    Submitted (NOV-2001) to the EMBL/GenBank/DDBJ databases
    Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 152-410.
  8. "RMD-1, a novel microtubule-associated protein, functions in chromosome segregation in Caenorhabditis elegans."
    Oishi K., Okano H., Sawa H.
    J. Cell Biol. 179:1149-1162(2007) [PubMed] [Europe PMC] [Abstract]
    Cited for: IDENTIFICATION (ISOFORM 1), INTERACTION WITH MICROTUBULES, SUBCELLULAR LOCATION.
  9. "An enzyme assisted RP-RPLC approach for in-depth analysis of human liver phosphoproteome."
    Bian Y., Song C., Cheng K., Dong M., Wang F., Huang J., Sun D., Wang L., Ye M., Zou H.
    J. Proteomics 96:253-262(2014) [PubMed] [Europe PMC] [Abstract]
    Cited for: PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT THR-139; TYR-152; THR-154 AND THR-157, IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
    Tissue: Liver.

Entry information

Entry name: RMD2_HUMAN
Accession: Primary (citable) accession number: Q96LZ7
Secondary accession number(s): A9UMZ7, A9UN00, Q4ZG33, Q6UXN4, Q8N657, Q8N9A2, Q8NCV6, Q8NHM0
Entry history
Integrated into UniProtKB/Swiss-Prot: May 15, 2007
Last sequence update: May 15, 2007
Last modified: June 8, 2016
This is version 118 of the entry and version 2 of the sequence.
Entry status: Reviewed (UniProtKB/Swiss-Prot)
Annotation program: Chordata Protein Annotation Program
Disclaimer: Any medical or genetic information present in this entry is provided for research, educational and informational purposes only. It is not in any way intended to be used as a substitute for professional medical advice, diagnosis, treatment or care.

Miscellaneous

Keywords - Technical term

Complete proteome, Reference proteome

Documents

  1. Human chromosome 2
    Human chromosome 2: entries, gene names and cross-references to MIM
  2. Human entries with polymorphisms or disease mutations
    List of human entries with polymorphisms or disease mutations
  3. Human polymorphisms and disease mutations
    Index of human polymorphisms and disease mutations
  4. MIM cross-references
    Online Mendelian Inheritance in Man (MIM) cross-references in UniProtKB/Swiss-Prot
  5. SIMILARITY comments
    Index of protein domains and families

Similar proteins

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%: UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%: UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a. seed sequence).
50%: UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.