Repository logo
 
Publication

Comparing clustering solutions: the use of adjusted paired indices

dc.contributor.authorAmorim, Maria José de Pina da Cruz
dc.contributor.authorCardoso, Margarida G. M. S.
dc.date.accessioned2016-05-20T10:43:19Z
dc.date.available2016-05-20T10:43:19Z
dc.date.issued2015
dc.description.abstractIn the present paper we compare clustering solutions using indices of paired agreement. We propose a new method - IADJUST - to correct indices of paired agreement, excluding agreement by chance. This new method overcomes previous limitations known in the literature as it permits the correction of any index. We illustrate its use in external clustering validation, to measure the accordance between clusters and an a priori known structure. The adjusted indices are intended to provide a realistic measure of clustering performance that excludes agreement by chance with ground truth. We use simulated data sets, under a range of scenarios - considering diverse numbers of clusters, clusters overlaps and balances - to discuss the pertinence and the precision of our proposal. Precision is established based on comparisons with the analytical approach for correction specific indices that can be corrected in this way are used for this purpose. The pertinence of the proposed correction is discussed when making a detailed comparison between the performance of two classical clustering approaches, namely Expectation-Maximization (EM) and K-Means (KM) algorithms. Eight indices of paired agreement are studied and new corrected indices are obtained.pt_PT
dc.identifier.citationAMORIM, Maria josé; CARDOSO, Margarida G. M. S. - Comparing clustering solutions: the use of adjusted paired indices. Intelligent Data Analysis. ISSN 1088-467X. Vol. 19, N.º 6 (2015), pp. 1275-1296pt_PT
dc.identifier.doi10.3233/IDA-150782pt_PT
dc.identifier.issn1088-467X
dc.identifier.issn1571-4128
dc.identifier.urihttp://hdl.handle.net/10400.21/6191
dc.language.isoengpt_PT
dc.peerreviewedyespt_PT
dc.publisherIos Presspt_PT
dc.subjectAdjusted indicespt_PT
dc.subjectIndices of paired agreementpt_PT
dc.subjectClustering evaluationpt_PT
dc.subjectExternal evaluationpt_PT
dc.titleComparing clustering solutions: the use of adjusted paired indicespt_PT
dc.typejournal article
dspace.entity.typePublication
oaire.citation.endPage1296pt_PT
oaire.citation.issue6pt_PT
oaire.citation.startPage1275pt_PT
oaire.citation.titleIntelligent Data Analysispt_PT
oaire.citation.volume19pt_PT
rcaap.rightsclosedAccesspt_PT
rcaap.typearticlept_PT

Files

Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
Comparing clustering solutions - The use of adjusted paired indices.pdf
Size:
1.49 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: