Skip Header

You are using a version of browser that may not display all the features of this website. Please consider upgrading your browser.
Protein

Probable pectate lyase 18

Gene

At4g24780

Organism
Arabidopsis thaliana (Mouse-ear cress)
Status
Reviewed-Annotation score: Annotation score: 3 out of 5-Experimental evidence at transcript leveli

Functioni

Catalytic activityi

Eliminative cleavage of (1->4)-alpha-D-galacturonan to give oligosaccharides with 4-deoxy-alpha-D-galact-4-enuronosyl groups at their non-reducing ends.

Cofactori

Ca2+By similarityNote: Binds 1 Ca2+ ion. Required for its activity.By similarity

Pathwayi: pectin degradation

This protein is involved in step 2 of the subpathway that synthesizes 2-dehydro-3-deoxy-D-gluconate from pectin.
Proteins known to be involved in the 5 steps of the subpathway in this organism are:
  1. Probable pectinesterase/pectinesterase inhibitor 7 (PME7), Probable pectinesterase 48 (PME48), Probable pectinesterase 49 (PME49), Probable pectinesterase 50 (PME50), Pectinesterase, Probable pectinesterase/pectinesterase inhibitor 19 (PME19), Probable pectinesterase/pectinesterase inhibitor 42 (PME42), Probable pectinesterase 15 (PME15), Putative pectinesterase 14 (PME14), Probable pectinesterase/pectinesterase inhibitor 40 (PME40), Pectinesterase (At3g10720), Probable pectinesterase 55 (PME55), Probable pectinesterase/pectinesterase inhibitor 23 (PME23), Pectinesterase 4 (PME4), Probable pectinesterase/pectinesterase inhibitor 39 (PME39), Putative pectinesterase/pectinesterase inhibitor 22 (PME22), Pectinesterase (At3g14310), Pectinesterase/pectinesterase inhibitor 18 (PME18), Putative pectinesterase/pectinesterase inhibitor 43 (PME43), Probable pectinesterase/pectinesterase inhibitor 34 (PME34), Pectinesterase 1 (PME1), Putative pectinesterase 63 (PME63), Putative pectinesterase 10 (PME10), Probable pectinesterase/pectinesterase inhibitor 64 (PME64), Pectinesterase (F14I3.7), Pectinesterase 2 (PME2), Probable pectinesterase 29 (PME29), Pectinesterase, Probable pectinesterase/pectinesterase inhibitor 21 (PME21), Putative pectinesterase/pectinesterase inhibitor 45 (PME45), Probable pectinesterase/pectinesterase inhibitor 12 (PME12), Probable pectinesterase 8 (PME8), Probable pectinesterase/pectinesterase inhibitor 44 (PME44), Pectinesterase, Pectinesterase/pectinesterase inhibitor 3 (PME3), Pectinesterase 31 (PME31), Probable pectinesterase/pectinesterase inhibitor 25 (PME25), Probable pectinesterase/pectinesterase inhibitor 51 (PME51), Probable pectinesterase/pectinesterase inhibitor 58 (PME58), Putative pectinesterase 57 (PME57), Pectinesterase (At3g62170), Probable pectinesterase/pectinesterase inhibitor 20 (PME20), Pectinesterase (T27B3.30), Probable pectinesterase/pectinesterase inhibitor 60 (PME60), Pectinesterase QRT1 (QRT1), Probable pectinesterase/pectinesterase inhibitor 59 (PME59), Putative pectinesterase 11 (PME11), Pectinesterase PPME1 (PPME1), Probable pectinesterase/pectinesterase inhibitor 32 (PME32), Probable pectinesterase/pectinesterase inhibitor 33 (PME33), Probable pectinesterase/pectinesterase inhibitor 36 (PME36), Probable pectinesterase/pectinesterase inhibitor 13 (PME13), Putative pectinesterase 52 (PME52), Pectinesterase, Probable pectinesterase/pectinesterase inhibitor 54 (PME54), Pectinesterase, Pectinesterase (At1g53840), Pectinesterase 5 (PME5), Probable pectinesterase/pectinesterase inhibitor 16 (PME16), Probable pectinesterase 30 (PME30), Probable pectinesterase/pectinesterase inhibitor VGDH2 (VGDH2), Putative pectinesterase/pectinesterase inhibitor 24 (PME24), Putative pectinesterase/pectinesterase inhibitor 26 (PME26), Probable pectinesterase/pectinesterase inhibitor 35 (PME35), Probable pectinesterase 68 (PME68), Pectinesterase, Probable pectinesterase 67 (PME67), Putative pectinesterase/pectinesterase inhibitor 38 (PME38), Probable pectinesterase 56 (PME56), Pectinesterase, Probable pectinesterase/pectinesterase inhibitor 47 (PME47), Putative pectinesterase/pectinesterase inhibitor 28 (PME28), Probable pectinesterase 53 (PME53), Probable pectinesterase/pectinesterase inhibitor 46 (PME46), Probable pectinesterase/pectinesterase inhibitor 17 (PME17), Probable pectinesterase/pectinesterase inhibitor 41 (PME41), Probable pectinesterase/pectinesterase inhibitor 61 (PME61), Probable pectinesterase 66 (PME66), Probable pectinesterase/pectinesterase inhibitor 6 (PME6)
  2. Pectate lyase (T26I12.20), Probable pectate lyase 7 (At3g01270), Probable pectate lyase 18 (At4g24780), Probable pectate lyase 16 (At4g22080), Putative pectate lyase 17 (At4g22090), Putative pectate lyase 14 (At4g13210), Pectate lyase (At3g55140), Probable pectate lyase 20 (At5g48900), Probable pectate lyase 6 (At2g02720), Pectate lyase (At1g14420), Pectate lyase (At3g01270), Pectate lyase (At2g02720), Probable pectate lyase 22 (At5g63180), Pectate lyase (T5E8_80), Pectate lyase (At3g01270), Pectate lyase (At3g55140), Putative pectate lyase 2 (At1g11920), Pectate lyase (At3g07010), Pectate lyase, Probable pectate lyase 8 (At3g07010), Probable pectate lyase 4 (At1g30350), Probable pectate lyase 19 (At5g15110), Pectate lyase (F11F8_12), Probable pectate lyase 13 (PMR6), Probable pectate lyase 3 (AT59), Probable pectate lyase 5 (At1g67750), Putative pectate lyase 21 (At5g55720), Pectate lyase (At4g13210), Probable pectate lyase 12 (At3g53190), Pectate lyase (At4g24780), Pectate lyase (At5g04310), Pectate lyase (At4g13710), Probable pectate lyase 15 (At4g13710), Pectate lyase (At3g53190), Pectate lyase (At3g01270), Putative pectate lyase 11 (At3g27400), Pectate lyase (At5g04310), Pectate lyase (At3g07010), Probable pectate lyase 9 (At3g24230), Probable pectate lyase 1 (At1g04680), Probable pectate lyase 10 (At3g24670)
  3. no protein annotated in this organism
  4. no protein annotated in this organism
  5. no protein annotated in this organism
This subpathway is part of the pathway pectin degradation, which is itself part of Glycan metabolism.
View all proteins of this organism that are known to be involved in the subpathway that synthesizes 2-dehydro-3-deoxy-D-gluconate from pectin, the pathway pectin degradation and in Glycan metabolism.

Sites

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Metal bindingi206 – 2061CalciumBy similarity
Metal bindingi230 – 2301CalciumBy similarity
Metal bindingi234 – 2341CalciumBy similarity
Active sitei286 – 2861Sequence analysis

GO - Molecular functioni

GO - Biological processi

Complete GO annotation...

Keywords - Molecular functioni

Lyase

Keywords - Ligandi

Calcium, Metal-binding

Enzyme and pathway databases

BioCyciARA:AT4G24780-MONOMER.
ARA:GQT-2048-MONOMER.
UniPathwayiUPA00545; UER00824.

Protein family/group databases

CAZyiPL1. Polysaccharide Lyase Family 1.

Names & Taxonomyi

Protein namesi
Recommended name:
Probable pectate lyase 18 (EC:4.2.2.2)
Alternative name(s):
Pectate lyase A10
Gene namesi
Ordered Locus Names:At4g24780
ORF Names:F22K18.20
OrganismiArabidopsis thaliana (Mouse-ear cress)
Taxonomic identifieri3702 [NCBI]
Taxonomic lineageiEukaryotaViridiplantaeStreptophytaEmbryophytaTracheophytaSpermatophytaMagnoliophytaeudicotyledonsGunneridaePentapetalaerosidsmalvidsBrassicalesBrassicaceaeCamelineaeArabidopsis
Proteomesi
  • UP000006548 Componenti: Chromosome 4

Organism-specific databases

TAIRiAT4G24780.

Subcellular locationi

GO - Cellular componenti

  • membrane Source: TAIR
Complete GO annotation...

PTM / Processingi

Molecule processing

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Signal peptidei1 – 2424Sequence analysisAdd
BLAST
Chaini25 – 408384Probable pectate lyase 18PRO_0000024883Add
BLAST

Amino acid modifications

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Glycosylationi42 – 421N-linked (GlcNAc...)Sequence analysis

Keywords - PTMi

Glycoprotein

Proteomic databases

PaxDbiQ9C5M8.
PRIDEiQ9C5M8.

Expressioni

Tissue specificityi

Expressed in flowers, but not in leaves.1 Publication

Gene expression databases

ExpressionAtlasiQ9C5M8. baseline and differential.
GenevisibleiQ9C5M8. AT.

Interactioni

Protein-protein interaction databases

BioGridi13869. 2 interactions.
STRINGi3702.AT4G24780.1.

Structurei

3D structure databases

ProteinModelPortaliQ9C5M8.
SMRiQ9C5M8. Positions 60-400.
ModBaseiSearch...
MobiDBiSearch...

Family & Domainsi

Sequence similaritiesi

Belongs to the polysaccharide lyase 1 family.Curated

Keywords - Domaini

Signal

Phylogenomic databases

eggNOGiENOG410IPFF. Eukaryota.
COG3866. LUCA.
HOGENOMiHOG000237948.
InParanoidiQ9C5M8.
KOiK01728.
OMAiQRDMTIQ.
PhylomeDBiQ9C5M8.

Family and domain databases

Gene3Di2.160.20.10. 1 hit.
InterProiIPR002022. Amb_allergen_dom.
IPR018082. AmbAllergen.
IPR012334. Pectin_lyas_fold.
IPR011050. Pectin_lyase_fold/virulence.
[Graphical view]
PfamiPF00544. Pec_lyase_C. 1 hit.
[Graphical view]
PRINTSiPR00807. AMBALLERGEN.
SMARTiSM00656. Amb_all. 1 hit.
[Graphical view]
SUPFAMiSSF51126. SSF51126. 1 hit.

Sequencei

Sequence statusi: Complete.

Sequence processingi: The displayed sequence is further processed into a mature form.

Q9C5M8-1 [UniParc]FASTAAdd to basket

« Hide

        10         20         30         40         50
MKMQTKKLFI TIVSFLLYAP LFLSSPVPDP ESVVEEVHKS INASVAGRRK
60 70 80 90 100
LGYLSCTTGN PIDDCWRCDP HWEQHRQRLA DCAIGFGKNA IGGRDGRIYV
110 120 130 140 150
VTDSGNDNPV SPKPGTLRHA VVQDEPLWII FQRDMTIQLK EELIMNSFKT
160 170 180 190 200
IDGRGASVHI SGGPCITIQY VTNIIIHGIH IHDCKQGGNA MVRSSPRHFG
210 220 230 240 250
WRTISDGDGV SIFGGSHVWV DHCSFSNCED GLIDAIMGST AITLSNNHMT
260 270 280 290 300
HHDKVMLLGH SDTYSRDKNM QVTIAFNHFG EGLVQRMPRC RHGYFHVVNN
310 320 330 340 350
DYTHWEMYAI GGSANPTINS QGNRFLAPNI RFSKEVTKHE DAPESEWKRW
360 370 380 390 400
NWRSSGDLLL NGAFFTPSGG AASSSYAKAS SLGAKPSSLV GPLTSTSGAL

NCRKGSRC
Length:408
Mass (Da):45,014
Last modified:June 16, 2003 - v2
Checksum:i4117CB52C48E588C
GO

Sequence cautioni

The sequence AAM65103.1 differs from that shown. Reason: Erroneous initiation. Curated
The sequence CAA22985.1 differs from that shown. Reason: Erroneous gene model prediction. Curated
The sequence CAB79388.1 differs from that shown. Reason: Erroneous gene model prediction. Curated

Experimental Info

Feature keyPosition(s)LengthDescriptionGraphical viewFeature identifierActions
Sequence conflicti107 – 1071D → Y in AAM65103 (Ref. 4) Curated
Sequence conflicti252 – 2521H → R in AAB69761 (PubMed:9278171).Curated
Sequence conflicti271 – 2711Q → H in AAK25850 (PubMed:14593172).Curated
Sequence conflicti339 – 3391H → D in AAM65103 (Ref. 4) Curated

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
AL035356 Genomic DNA. Translation: CAA22985.1. Sequence problems.
AL161562 Genomic DNA. Translation: CAB79388.1. Sequence problems.
CP002687 Genomic DNA. Translation: AEE84955.1.
CP002687 Genomic DNA. Translation: AEE84956.1.
AF360140 mRNA. Translation: AAK25850.1.
AY087561 mRNA. Translation: AAM65103.1. Different initiation.
U83621 Genomic DNA. Translation: AAB69761.1.
PIRiT05556.
RefSeqiNP_001190827.1. NM_001203898.1.
NP_567707.1. NM_118611.2.
UniGeneiAt.23543.

Genome annotation databases

EnsemblPlantsiAT4G24780.1; AT4G24780.1; AT4G24780.
AT4G24780.2; AT4G24780.2; AT4G24780.
GeneIDi828580.
GrameneiAT4G24780.1; AT4G24780.1; AT4G24780.
AT4G24780.2; AT4G24780.2; AT4G24780.
KEGGiath:AT4G24780.

Cross-referencesi

Sequence databases

Select the link destinations:
EMBLi
GenBanki
DDBJi
Links Updated
AL035356 Genomic DNA. Translation: CAA22985.1. Sequence problems.
AL161562 Genomic DNA. Translation: CAB79388.1. Sequence problems.
CP002687 Genomic DNA. Translation: AEE84955.1.
CP002687 Genomic DNA. Translation: AEE84956.1.
AF360140 mRNA. Translation: AAK25850.1.
AY087561 mRNA. Translation: AAM65103.1. Different initiation.
U83621 Genomic DNA. Translation: AAB69761.1.
PIRiT05556.
RefSeqiNP_001190827.1. NM_001203898.1.
NP_567707.1. NM_118611.2.
UniGeneiAt.23543.

3D structure databases

ProteinModelPortaliQ9C5M8.
SMRiQ9C5M8. Positions 60-400.
ModBaseiSearch...
MobiDBiSearch...

Protein-protein interaction databases

BioGridi13869. 2 interactions.
STRINGi3702.AT4G24780.1.

Protein family/group databases

CAZyiPL1. Polysaccharide Lyase Family 1.

Proteomic databases

PaxDbiQ9C5M8.
PRIDEiQ9C5M8.

Protocols and materials databases

Structural Biology KnowledgebaseSearch...

Genome annotation databases

EnsemblPlantsiAT4G24780.1; AT4G24780.1; AT4G24780.
AT4G24780.2; AT4G24780.2; AT4G24780.
GeneIDi828580.
GrameneiAT4G24780.1; AT4G24780.1; AT4G24780.
AT4G24780.2; AT4G24780.2; AT4G24780.
KEGGiath:AT4G24780.

Organism-specific databases

TAIRiAT4G24780.

Phylogenomic databases

eggNOGiENOG410IPFF. Eukaryota.
COG3866. LUCA.
HOGENOMiHOG000237948.
InParanoidiQ9C5M8.
KOiK01728.
OMAiQRDMTIQ.
PhylomeDBiQ9C5M8.

Enzyme and pathway databases

UniPathwayiUPA00545; UER00824.
BioCyciARA:AT4G24780-MONOMER.
ARA:GQT-2048-MONOMER.

Miscellaneous databases

PROiQ9C5M8.

Gene expression databases

ExpressionAtlasiQ9C5M8. baseline and differential.
GenevisibleiQ9C5M8. AT.

Family and domain databases

Gene3Di2.160.20.10. 1 hit.
InterProiIPR002022. Amb_allergen_dom.
IPR018082. AmbAllergen.
IPR012334. Pectin_lyas_fold.
IPR011050. Pectin_lyase_fold/virulence.
[Graphical view]
PfamiPF00544. Pec_lyase_C. 1 hit.
[Graphical view]
PRINTSiPR00807. AMBALLERGEN.
SMARTiSM00656. Amb_all. 1 hit.
[Graphical view]
SUPFAMiSSF51126. SSF51126. 1 hit.
ProtoNetiSearch...

Publicationsi

« Hide 'large scale' publications
  1. "Sequence and analysis of chromosome 4 of the plant Arabidopsis thaliana."
    Mayer K.F.X., Schueller C., Wambutt R., Murphy G., Volckaert G., Pohl T., Duesterhoeft A., Stiekema W., Entian K.-D., Terryn N., Harris B., Ansorge W., Brandt P., Grivell L.A., Rieger M., Weichselgartner M., de Simone V., Obermaier B.
    , Mache R., Mueller M., Kreis M., Delseny M., Puigdomenech P., Watson M., Schmidtheini T., Reichert B., Portetelle D., Perez-Alonso M., Boutry M., Bancroft I., Vos P., Hoheisel J., Zimmermann W., Wedler H., Ridley P., Langham S.-A., McCullagh B., Bilham L., Robben J., van der Schueren J., Grymonprez B., Chuang Y.-J., Vandenbussche F., Braeken M., Weltjens I., Voet M., Bastiaens I., Aert R., Defoor E., Weitzenegger T., Bothe G., Ramsperger U., Hilbert H., Braun M., Holzer E., Brandt A., Peters S., van Staveren M., Dirkse W., Mooijman P., Klein Lankhorst R., Rose M., Hauf J., Koetter P., Berneiser S., Hempel S., Feldpausch M., Lamberth S., Van den Daele H., De Keyser A., Buysshaert C., Gielen J., Villarroel R., De Clercq R., van Montagu M., Rogers J., Cronin A., Quail M.A., Bray-Allen S., Clark L., Doggett J., Hall S., Kay M., Lennard N., McLay K., Mayes R., Pettett A., Rajandream M.A., Lyne M., Benes V., Rechmann S., Borkova D., Bloecker H., Scharfe M., Grimm M., Loehnert T.-H., Dose S., de Haan M., Maarse A.C., Schaefer M., Mueller-Auer S., Gabel C., Fuchs M., Fartmann B., Granderath K., Dauner D., Herzl A., Neumann S., Argiriou A., Vitale D., Liguori R., Piravandi E., Massenet O., Quigley F., Clabauld G., Muendlein A., Felber R., Schnabl S., Hiller R., Schmidt W., Lecharny A., Aubourg S., Chefdor F., Cooke R., Berger C., Monfort A., Casacuberta E., Gibbons T., Weber N., Vandenbol M., Bargues M., Terol J., Torres A., Perez-Perez A., Purnelle B., Bent E., Johnson S., Tacon D., Jesse T., Heijnen L., Schwarz S., Scholler P., Heber S., Francs P., Bielke C., Frishman D., Haase D., Lemcke K., Mewes H.-W., Stocker S., Zaccaria P., Bevan M., Wilson R.K., de la Bastide M., Habermann K., Parnell L., Dedhia N., Gnoj L., Schutz K., Huang E., Spiegel L., Sekhon M., Murray J., Sheet P., Cordes M., Abu-Threideh J., Stoneking T., Kalicki J., Graves T., Harmon G., Edwards J., Latreille P., Courtney L., Cloud J., Abbott A., Scott K., Johnson D., Minx P., Bentley D., Fulton B., Miller N., Greco T., Kemp K., Kramer J., Fulton L., Mardis E., Dante M., Pepin K., Hillier L.W., Nelson J., Spieth J., Ryan E., Andrews S., Geisel C., Layman D., Du H., Ali J., Berghoff A., Jones K., Drone K., Cotton M., Joshu C., Antonoiu B., Zidanic M., Strong C., Sun H., Lamar B., Yordan C., Ma P., Zhong J., Preston R., Vil D., Shekher M., Matero A., Shah R., Swaby I.K., O'Shaughnessy A., Rodriguez M., Hoffman J., Till S., Granat S., Shohdy N., Hasegawa A., Hameed A., Lodhi M., Johnson A., Chen E., Marra M.A., Martienssen R., McCombie W.R.
    Nature 402:769-777(1999) [PubMed] [Europe PMC] [Abstract]
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
    Strain: cv. Columbia.
  2. The Arabidopsis Information Resource (TAIR)
    Submitted (APR-2011) to the EMBL/GenBank/DDBJ databases
    Cited for: GENOME REANNOTATION.
    Strain: cv. Columbia.
  3. "Empirical analysis of transcriptional activity in the Arabidopsis genome."
    Yamada K., Lim J., Dale J.M., Chen H., Shinn P., Palm C.J., Southwick A.M., Wu H.C., Kim C.J., Nguyen M., Pham P.K., Cheuk R.F., Karlin-Newmann G., Liu S.X., Lam B., Sakano H., Wu T., Yu G.
    , Miranda M., Quach H.L., Tripp M., Chang C.H., Lee J.M., Toriumi M.J., Chan M.M., Tang C.C., Onodera C.S., Deng J.M., Akiyama K., Ansari Y., Arakawa T., Banh J., Banno F., Bowser L., Brooks S.Y., Carninci P., Chao Q., Choy N., Enju A., Goldsmith A.D., Gurjal M., Hansen N.F., Hayashizaki Y., Johnson-Hopson C., Hsuan V.W., Iida K., Karnes M., Khan S., Koesema E., Ishida J., Jiang P.X., Jones T., Kawai J., Kamiya A., Meyers C., Nakajima M., Narusaka M., Seki M., Sakurai T., Satou M., Tamse R., Vaysberg M., Wallender E.K., Wong C., Yamamura Y., Yuan S., Shinozaki K., Davis R.W., Theologis A., Ecker J.R.
    Science 302:842-846(2003) [PubMed] [Europe PMC] [Abstract]
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
    Strain: cv. Columbia.
  4. "Full-length cDNA from Arabidopsis thaliana."
    Brover V.V., Troukhan M.E., Alexandrov N.A., Lu Y.-P., Flavell R.B., Feldmann K.A.
    Submitted (MAR-2002) to the EMBL/GenBank/DDBJ databases
    Cited for: NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
  5. "Identification of the tobacco and Arabidopsis homologues of the pollen-expressed LAT59 gene of tomato."
    Kulikauskas R., McCormick S.
    Plant Mol. Biol. 34:809-814(1997) [PubMed] [Europe PMC] [Abstract]
    Cited for: NUCLEOTIDE SEQUENCE [GENOMIC DNA] OF 73-298, TISSUE SPECIFICITY.

Entry informationi

Entry nameiPLY18_ARATH
AccessioniPrimary (citable) accession number: Q9C5M8
Secondary accession number(s): O23667, Q8LAW7, Q9SB71
Entry historyi
Integrated into UniProtKB/Swiss-Prot: June 16, 2003
Last sequence update: June 16, 2003
Last modified: May 11, 2016
This is version 115 of the entry and version 2 of the sequence. [Complete history]
Entry statusiReviewed (UniProtKB/Swiss-Prot)
Annotation programPlant Protein Annotation Program

Miscellaneousi

Keywords - Technical termi

Complete proteome, Reference proteome

Documents

  1. Arabidopsis thaliana
    Arabidopsis thaliana: entries and gene names
  2. PATHWAY comments
    Index of metabolic and biosynthesis pathways
  3. SIMILARITY comments
    Index of protein domains and families

Similar proteinsi

Links to similar proteins from the UniProt Reference Clusters (UniRef) at 100%, 90% and 50% sequence identity:
100%UniRef100 combines identical sequences and sub-fragments with 11 or more residues from any organism into one UniRef entry.
90%UniRef90 is built by clustering UniRef100 sequences that have at least 90% sequence identity to, and 80% overlap with, the longest sequence (a.k.a seed sequence).
50%UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the cluster.