Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein

Microtubule-associated protein 1A

Gene

Map1a

Organism
Mus musculus (Mouse)
Status
Reviewed-Annotation score: Annotation score: 5 out of 5-Experimental evidence at protein leveli

Functioni

Structural protein involved in the filamentous cross-bridging between microtubules and other skeletal elements.

GO - Molecular functioni

  • microtubule binding Source: MGI

GO - Biological processi

  • microtubule cytoskeleton organization Source: InterPro
  • photoreceptor cell maintenance Source: MGI
  • sensory perception of sound Source: MGI
Complete GO annotation...

Names & Taxonomyi

Protein namesi
Recommended name:
Microtubule-associated protein 1A
Short name:
MAP-1A
Cleaved into the following 2 chains:
Gene namesi
Name:Map1a
Synonyms:Mtap1, Mtap1a
OrganismiMus musculus (Mouse)
Taxonomic identifieri10090 [NCBI]
Taxonomic lineageiEukaryotaMetazoaChordataCraniataVertebrataEuteleostomiMammaliaEutheriaEuarchontogliresGliresRodentiaSciurognathiMuroideaMuridaeMurinaeMusMus
Proteomesi
  • UP000000589 Componenti: Chromosome 2

Organism-specific databases

MGIiMGI:1306776. Map1a.

Subcellular locationi

GO - Cellular componenti

  • cytosol Source: MGI
  • microtubule Source: UniProtKB-KW
  • photoreceptor outer segment Source: MGI
  • postsynaptic density Source: MGI
Complete GO annotation...

Keywords - Cellular componenti

Cytoplasm, Cytoskeleton, Microtubule

PTM / Processingi

Molecule processing

Feature keyPosition(s)DescriptionActionsGraphical viewLength
ChainiPRO_00000727531 – 2776Microtubule-associated protein 1AAdd BLAST2776
ChainiPRO_00004183771 – 2542MAP1A heavy chainAdd BLAST2542
ChainiPRO_00002697272543 – 2776MAP1 light chain LC2Add BLAST234

Amino acid modifications

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Modified residuei114PhosphoserineCombined sources1
Modified residuei117PhosphoserineCombined sources1
Modified residuei118PhosphoserineCombined sources1
Modified residuei121PhosphoserineCombined sources1
Modified residuei155PhosphoserineCombined sources1
Modified residuei177PhosphotyrosineCombined sources1
Modified residuei319PhosphoserineBy similarity1
Modified residuei322PhosphoserineBy similarity1
Modified residuei384PhosphoserineCombined sources1
Modified residuei504PhosphothreonineBy similarity1
Modified residuei526PhosphoserineCombined sources1
Modified residuei527PhosphoserineCombined sources1
Modified residuei604PhosphoserineBy similarity1
Modified residuei611PhosphoserineBy similarity1
Modified residuei633PhosphothreonineCombined sources1
Modified residuei644PhosphoserineCombined sources1
Modified residuei667PhosphoserineCombined sources1
Modified residuei678PhosphoserineCombined sources1
Modified residuei786PhosphoserineBy similarity1
Modified residuei873PhosphoserineCombined sources1
Modified residuei876PhosphoserineCombined sources1
Modified residuei877PhosphoserineCombined sources1
Modified residuei890PhosphoserineCombined sources1
Modified residuei893PhosphothreonineCombined sources1
Modified residuei895PhosphoserineCombined sources1
Modified residuei899PhosphoserineCombined sources1
Modified residuei908PhosphoserineCombined sources1
Modified residuei981PhosphoserineCombined sources1
Modified residuei991PhosphoserineCombined sources1
Modified residuei999PhosphoserineCombined sources1
Modified residuei1008PhosphoserineCombined sources1
Modified residuei1014PhosphoserineBy similarity1
Modified residuei1023PhosphoserineCombined sources1
Modified residuei1062PhosphoserineCombined sources1
Modified residuei1068PhosphothreonineCombined sources1
Modified residuei1131PhosphoserineCombined sources1
Modified residuei1133PhosphoserineCombined sources1
Modified residuei1147PhosphoserineCombined sources1
Modified residuei1159PhosphoserineBy similarity1
Modified residuei1177PhosphoserineCombined sources1
Modified residuei1187PhosphoserineBy similarity1
Modified residuei1190PhosphoserineBy similarity1
Modified residuei1196PhosphoserineCombined sources1
Modified residuei1205PhosphoserineCombined sources1
Modified residuei1208PhosphoserineBy similarity1
Modified residuei1251PhosphoserineBy similarity1
Modified residuei1289PhosphoserineCombined sources1
Modified residuei1310PhosphoserineCombined sources1
Modified residuei1313PhosphoserineCombined sources1
Modified residuei1316PhosphoserineCombined sources1
Modified residuei1516PhosphoserineCombined sources1
Modified residuei1580PhosphoserineBy similarity1
Modified residuei1606PhosphoserineCombined sources1
Modified residuei1634PhosphoserineBy similarity1
Modified residuei1648PhosphoserineBy similarity1
Modified residuei1720PhosphoserineBy similarity1
Modified residuei1747PhosphoserineCombined sources1
Modified residuei1762PhosphoserineCombined sources1
Modified residuei1768PhosphoserineCombined sources1
Modified residuei1772PhosphoserineCombined sources1
Modified residuei1777PhosphothreonineCombined sources1
Modified residuei1783PhosphoserineCombined sources1
Modified residuei1789PhosphoserineCombined sources1
Modified residuei1902PhosphoserineBy similarity1
Modified residuei1928PhosphothreonineBy similarity1
Modified residuei1993PhosphoserineBy similarity1
Modified residuei2031PhosphothreonineBy similarity1
Modified residuei2048PhosphoserineBy similarity1
Modified residuei2082PhosphoserineCombined sources1
Modified residuei2209PhosphoserineBy similarity1
Modified residuei2226PhosphoserineBy similarity1
Modified residuei2230PhosphoserineBy similarity1
Modified residuei2233PhosphoserineBy similarity1
Modified residuei2234PhosphoserineBy similarity1
Modified residuei2425PhosphoserineBy similarity1
Modified residuei2623PhosphoserineBy similarity1
Modified residuei2637PhosphoserineBy similarity1

Post-translational modificationi

Phosphorylated by CSNK1D.By similarity
LC2 is generated from MAP1A by proteolytic processing. It is free to associate with both MAP1A and MAP1B (By similarity).By similarity

Keywords - PTMi

Phosphoprotein

Proteomic databases

PaxDbiQ9QYR6.
PRIDEiQ9QYR6.

PTM databases

iPTMnetiQ9QYR6.
PhosphoSitePlusiQ9QYR6.
SwissPalmiQ9QYR6.

Expressioni

Tissue specificityi

Both isoforms highly expressed in brain, and to a lesser extent in embryo. Isoform 1 is also expressed at a low level in other tissues including heart and muscle.1 Publication

Gene expression databases

BgeeiENSMUSG00000027254.
CleanExiMM_MTAP1A.
ExpressionAtlasiQ9QYR6. baseline and differential.
GenevisibleiQ9QYR6. MM.

Interactioni

Subunit structurei

3 different light chains, LC1, LC2 and LC3, can associate with MAP1A and MAP1B proteins. Interacts with guanylate kinase-like domain of DLG1, DLG2 and DLG4. Binds to CSNK1D (By similarity). Interacts with TIAM2.By similarity1 Publication

GO - Molecular functioni

  • microtubule binding Source: MGI

Protein-protein interaction databases

BioGridi201583. 5 interactors.
IntActiQ9QYR6. 5 interactors.
MINTiMINT-1203191.
STRINGi10090.ENSMUSP00000092223.

Structurei

3D structure databases

ProteinModelPortaliQ9QYR6.
ModBaseiSearch...
MobiDBiSearch...

Family & Domainsi

Domains and Repeats

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Repeati336 – 33813
Repeati415 – 41723
Repeati420 – 42233
Repeati424 – 42643
Repeati427 – 42953
Repeati431 – 43363
Repeati436 – 43873
Repeati440 – 44283
Repeati444 – 44693
Repeati449 – 451103
Repeati539 – 541113

Region

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Regioni336 – 54111 X 3 AA approximate repeats of K-K-[DE]Add BLAST206

Compositional bias

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Compositional biasi338 – 677Glu-richAdd BLAST340
Compositional biasi1748 – 2303Pro-richAdd BLAST556
Compositional biasi2530 – 2551Pro-richAdd BLAST22
Compositional biasi2747 – 2750Poly-Gln4

Domaini

The basic region containing the repeats may be responsible for the binding of MAP1A to microtubules.

Sequence similaritiesi

Belongs to the MAP1 family.Curated

Keywords - Domaini

Repeat

Phylogenomic databases

eggNOGiKOG3592. Eukaryota.
ENOG410XRYM. LUCA.
GeneTreeiENSGT00550000074593.
HOVERGENiHBG052408.
InParanoidiQ9QYR6.
KOiK10429.

Family and domain databases

Gene3Di3.60.15.10. 2 hits.
InterProiIPR026074. MAP1.
IPR015656. MAP1A.
IPR001279. Metallo-B-lactamas.
[Graphical view]
PANTHERiPTHR13843. PTHR13843. 1 hit.
PTHR13843:SF6. PTHR13843:SF6. 1 hit.
SUPFAMiSSF56281. SSF56281. 1 hit.

Sequences (2)i

Sequence statusi: Complete.

Sequence processingi: The displayed sequence is further processed into a mature form.

This entry describes 2 isoformsi produced by alternative splicing. AlignAdd to basket

Isoform 1 (identifier: Q9QYR6-1) [UniParc]FASTAAdd to basket

This isoform has been chosen as the 'canonical' sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.

« Hide

        10         20         30         40         50
MDGVAEFSEY VSETVDVPSP FDLLEPPTSG GFLKLSKPCC YIFPGGRGDS
60 70 80 90 100
ALFAVNGFNI LVDGGSDRKS CFWKLVRHLD RIDSVLLTHI GADNLPGING
110 120 130 140 150
LLQRKVAELE EEQSQGSSSY SDWVKNLISP ELGVVFFNVP DKLRLPDASR
160 170 180 190 200
KAKRSIEEAC LTLQHLNRLG IQAEPLYRVV SNTIEPLTLF HKMGVGRLDM
210 220 230 240 250
YVLNPVKDSK EMQFLMQKWA GNSKAKTGIV LANGKEAEIS VPYLTSITAL
260 270 280 290 300
VVWLPANPTE KIVRVLFPGN APQNKILEGL EKLRHLDFLR YPVATQKDLA
310 320 330 340 350
AGAVPANLKP SKIKHRADSK ESLKAAPKTA MSKLAKREEV LEEGAKEARS
360 370 380 390 400
ELAKELAKSE KKAKEPSEKP PEKPSKPERV RTESSEALKA EKRKLIKDKV
410 420 430 440 450
GKKHLKEKIS KLEEKRDKEK KEIKKERKEL KKEEGRKEEK KDAKKDEKRK
460 470 480 490 500
DTKPELKKFS KPDLKPFTPE VRKTLYKAKA PGRLKVDKGR AARGEKELSS
510 520 530 540 550
EPRTPPAQKG AAPPPAASGH RELALSSPED LTQDFEELKR EERGLLAEPR
560 570 580 590 600
DTELGEKPLP ADASEQGRPS TAIQVTQPPA SVLEQEQVER EKEVVPDFPE
610 620 630 640 650
DKGSKNRAPD SGAEVEREKE TWEERKPREA ELTPENIAAA REESEPEVKE
660 670 680 690 700
DVIEKAELEE MEEVHPSDEE EEETKAESFY QKHMQEALKV IPKGREALGG
710 720 730 740 750
RELGFQGKAP EKETASFLSS LATPAGAAEH VSYIQDETIP GYSETEQTIS
760 770 780 790 800
DEEIHDEPDE RPAPPRFPTS TYDLSGPEGP GPFEASQSAE SAVPASSSKT
810 820 830 840 850
YGAPETELTY PPNMVAAPLA EEEHVSSATS ITECDKLSSF ATSVAEDQSV
860 870 880 890 900
ASLTAPQTEE TGKSSLLLDT VTSIPSSRTE ATQGLDYVPS AGTISPTSSL
910 920 930 940 950
EEDKGFKSPP CEDFSVTGES EKKGESVGRG LTGEKAVGKE EKNVTTSEKL
960 970 980 990 1000
SSQYAAVFGA PGHALHPGEP ALGEVEERCL SPDDSTVKMA SPPPSGPPSA
1010 1020 1030 1040 1050
AHTPFHQSPV EEKSEPQDFQ EDSWGDTKHA PGVSKEDAEE QTVKPGPEEA
1060 1070 1080 1090 1100
MSEEGKVPLS RSPQAQDTLG SLAGGQTGCT IQLLPEQDKA VVFETGEAGA
1110 1120 1130 1140 1150
ASGAGSLPGE VRTQEPAEPQ KDELLGFTDQ SFSPEDAESL SVLSVVSPDT
1160 1170 1180 1190 1200
AKQEATPRSP CTPKEQQLHK DLWPMVSPED TQSLSFSEES PSKETSLDIS
1210 1220 1230 1240 1250
SKQLSPESLG TLQFGELSLG KEEKGPLVKA EDNSCHLAPV SIPEPHTATV
1260 1270 1280 1290 1300
SPPTDEAAGE AGLTDESPAG NLPGSSFSHS ALSGDRKHSP GEITGPGGHF
1310 1320 1330 1340 1350
MTSDSSLTKS PESLSSPAME DLAMEWGGKA PGSEDRATEQ KEKELERKSE
1360 1370 1380 1390 1400
TLQQKDQILS EKAALVQRDS VMHQKDEALD EENKPGGQQD KTSEQKGRDL
1410 1420 1430 1440 1450
DKKDTAVELG KGPEPKGKDL YLEDQGLAEK DKALEQRGAA LQQTQAPEPR
1460 1470 1480 1490 1500
ARAQEHRDLE QKDEHLELRD KTPEEKDKVL VLEDRAPEHI IPQPTQTDRA
1510 1520 1530 1540 1550
PEHRSKVDKE QKDEASEEKE QVLEQKDWAR EKEGAALDQD NRAAGQKDGT
1560 1570 1580 1590 1600
LKEDKTQGQK SSFLEDKSTT PKEMTLDQKS PEKAKGVEQQ DGAVPEKTRA
1610 1620 1630 1640 1650
LGLEESPEEE GKAREQEEKY WKEQDVVQGW RETSPTRGEP VPAWEGKSPE
1660 1670 1680 1690 1700
QEVRYWRDRD ITLQQDAYWK ELSCERKVWF PHELDGQGAR PRYSEEREST
1710 1720 1730 1740 1750
FLDEGPNEQE ITPLQHTPRS PWASDFKDFQ EPLPQKGLEV ERWLAESPVG
1760 1770 1780 1790 1800
LPPEEEDKLT RSPFEIISPP ASPPEMTGQR VPSAPGQESP VPDTKSTPPT
1810 1820 1830 1840 1850
RNEPTTPSWL AEIPPWVPKD RPLPPAPLSP APAPPTPAPD PHAPAPFSWG
1860 1870 1880 1890 1900
IAEYDSVVAA VQEGAAELEG GPYSPLGKDY RKAEGEREGE GGAGAPDSSS
1910 1920 1930 1940 1950
FSSKVPEVTE SHTTRDAEQT EPEQREPTPY PDERSFQYAD IYEQMMLTGL
1960 1970 1980 1990 2000
GPACPTREPP LGASGDWPPH LSTKEEAAGR NKSAEKELSS AVSPPNLHSD
2010 2020 2030 2040 2050
TPTFSYASLA GPTIPPRQEP EPGPNVEPSF TPPAVPPRAP ISLSQDPSPP
2060 2070 2080 2090 2100
LNGSTTSCGP DRRTPSPKEA GRSHWDDGTN DSDLEKGARE QPEKETQSPS
2110 2120 2130 2140 2150
PHHPMPVGHP SLWPETEAHS SLSSDSHLGP VRPSLDFPAS AFGFSSLQPA
2160 2170 2180 2190 2200
PPQLPSPAEP RSAPCGSLAF SGDRALALVP GTPTRTRHDE YLEVTKAPSL
2210 2220 2230 2240 2250
DSSLPQLPSP SSPGAPLLSN LPRPASPALS EGSSSEATTP VISSVAERFP
2260 2270 2280 2290 2300
PGLEVAEQSS GELGPGNEPA AHSLWDLTPL SPAPLASRDL APAPAPAPAP
2310 2320 2330 2340 2350
SLPGNLGDGT LSCRPECSGE LTKKPSPFLS HSGDHEANGP GETSLNPPGF
2360 2370 2380 2390 2400
ATATAEKEEA EALHAWERGS WPEGAERSSR PDTLLSSEQR PGKSSGGPPC
2410 2420 2430 2440 2450
SLSSEVEAGP QGCATDPRPH CGELSPSFLN PPLPPSTDDS DLSTEEARLA
2460 2470 2480 2490 2500
GKGGRRRAGR PGATGGPCPM ADETPPTSAS DSGSSQSDSD VPPETEECPS
2510 2520 2530 2540 2550
ITAEAALDSD EDGDFLPVDK AGGVSGTHHP RPGHDPPPAP LPDPRPPPPR
2560 2570 2580 2590 2600
PDVCMADPEG LSSESGRVER LREKVQGRPG RKAPGRAKPA SPARRLDIRG
2610 2620 2630 2640 2650
KRSPTPGKGP VDRTSRALPR PRSTPSQVTS EEKDGHSPMS KGLVNGLKAG
2660 2670 2680 2690 2700
STALGSKGSS GPPVYVDLAY IPNHCSGKTA DQDFFRRVRA SYYVVSGNDP
2710 2720 2730 2740 2750
ANGEPSRAVL DALLEGKAQW GENLQVTLIP THDTEVTREW YQQTHEQQQQ
2760 2770
LNVLVLASSS TVVMQDESFP ACKIEF
Length:2,776
Mass (Da):300,140
Last modified:January 9, 2007 - v2
Checksum:i62A8CC584CCAB21A
GO
Isoform 2 (identifier: Q9QYR6-2) [UniParc]FASTAAdd to basket

The sequence of this isoform differs from the canonical sequence as follows:
     1-1: M → METTPELGLQ...LNPEPVLPTM

Show »
Length:3,000
Mass (Da):324,586
Checksum:i58EB524496EB49C9
GO

Experimental Info

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Sequence conflicti349R → C in BAE27940 (PubMed:16141072).Curated1

Alternative sequence

Feature keyPosition(s)DescriptionActionsGraphical viewLength
Alternative sequenceiVSP_0032011M → METTPELGLQSLGAPPAQNP AEPLCEAGAAVAAARWDLRK YSLLIVIGDIGTESQLRAVR AHLEQGILSWNIDLSSFDLN QQLRLFITRHLAHFSSEVKG QRTLCHQSETLETIILVNPT ADSISSEVHHLLSSPSAHKL LILSGQTLEPEGDLILQSGT YSYQNFAQVLHKPEIAQLLS NRDPGIQAFLTVSCLGEGDW SHLGLSSSQETLHLRLNPEP VLPTM in isoform 2. 1 Publication1

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
AK147474 mRNA. Translation: BAE27940.1.
AK160546 mRNA. Translation: BAE35863.1.
AK163468 mRNA. Translation: BAE37354.1.
AL845466 Genomic DNA. Translation: CAM24218.1.
AF182211, AF182208, AF182209 Genomic DNA. Translation: AAF06164.1.
AF182211 Genomic DNA. Translation: AAF06163.1.
AF182213 mRNA. Translation: AAD55790.1.
AF182212 mRNA. Translation: AAD55789.1.
CCDSiCCDS50685.1. [Q9QYR6-1]
RefSeqiNP_001166977.1. NM_001173506.1. [Q9QYR6-1]
NP_115769.1. NM_032393.2.
UniGeneiMm.36501.

Genome annotation databases

EnsembliENSMUST00000110639; ENSMUSP00000106269; ENSMUSG00000027254. [Q9QYR6-1]
GeneIDi17754.
KEGGimmu:17754.
UCSCiuc008lyj.2. mouse. [Q9QYR6-1]

Keywords - Coding sequence diversityi

Alternative splicing

Cross-referencesi

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
AK147474 mRNA. Translation: BAE27940.1.
AK160546 mRNA. Translation: BAE35863.1.
AK163468 mRNA. Translation: BAE37354.1.
AL845466 Genomic DNA. Translation: CAM24218.1.
AF182211, AF182208, AF182209 Genomic DNA. Translation: AAF06164.1.
AF182211 Genomic DNA. Translation: AAF06163.1.
AF182213 mRNA. Translation: AAD55790.1.
AF182212 mRNA. Translation: AAD55789.1.
CCDSiCCDS50685.1. [Q9QYR6-1]
RefSeqiNP_001166977.1. NM_001173506.1. [Q9QYR6-1]
NP_115769.1. NM_032393.2.
UniGeneiMm.36501.

3D structure databases

ProteinModelPortaliQ9QYR6.
ModBaseiSearch...
MobiDBiSearch...

Protein-protein interaction databases

BioGridi201583. 5 interactors.
IntActiQ9QYR6. 5 interactors.
MINTiMINT-1203191.
STRINGi10090.ENSMUSP00000092223.

PTM databases

iPTMnetiQ9QYR6.
PhosphoSitePlusiQ9QYR6.
SwissPalmiQ9QYR6.

Proteomic databases

PaxDbiQ9QYR6.
PRIDEiQ9QYR6.

Protocols and materials databases

Structural Biology KnowledgebaseSearch...

Genome annotation databases

EnsembliENSMUST00000110639; ENSMUSP00000106269; ENSMUSG00000027254. [Q9QYR6-1]
GeneIDi17754.
KEGGimmu:17754.
UCSCiuc008lyj.2. mouse. [Q9QYR6-1]

Organism-specific databases

CTDi4130.
MGIiMGI:1306776. Map1a.

Phylogenomic databases

eggNOGiKOG3592. Eukaryota.
ENOG410XRYM. LUCA.
GeneTreeiENSGT00550000074593.
HOVERGENiHBG052408.
InParanoidiQ9QYR6.
KOiK10429.

Miscellaneous databases

PROiQ9QYR6.
SOURCEiSearch...

Gene expression databases

BgeeiENSMUSG00000027254.
CleanExiMM_MTAP1A.
ExpressionAtlasiQ9QYR6. baseline and differential.
GenevisibleiQ9QYR6. MM.

Family and domain databases

Gene3Di3.60.15.10. 2 hits.
InterProiIPR026074. MAP1.
IPR015656. MAP1A.
IPR001279. Metallo-B-lactamas.
[Graphical view]
PANTHERiPTHR13843. PTHR13843. 1 hit.
PTHR13843:SF6. PTHR13843:SF6. 1 hit.
SUPFAMiSSF56281. SSF56281. 1 hit.
ProtoNetiSearch...

Entry informationi

Entry nameiMAP1A_MOUSE
AccessioniPrimary (citable) accession number: Q9QYR6
Secondary accession number(s): A2ARN9
, Q3TQM8, Q3TUV8, Q3UHB7, Q9QZH9, Q9QZI0, Q9QZI1
Entry historyi
Integrated into UniProtKB/Swiss-Prot: September 26, 2001
Last sequence update: January 9, 2007
Last modified: November 30, 2016
This is version 133 of the entry and version 2 of the sequence. [Complete history]
Entry statusiReviewed (UniProtKB/Swiss-Prot)
Annotation programChordata Protein Annotation Program

Miscellaneousi

Keywords - Technical termi

Complete proteome, Reference proteome

Documents

  1. MGD cross-references
    Mouse Genome Database (MGD) cross-references in UniProtKB/Swiss-Prot
  2. SIMILARITY comments
    Index of protein domains and families

Similar proteinsi

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a seed sequence).
50%UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.