Please use this identifier to cite or link to this item: http://hdl.handle.net/10400.21/9525
Title: Clustering stability and ground truth: numerical experiments
Author: Amorim, Maria José
Cardoso, Maria Margarida
Keywords: Clustering
External validation
Stability
Issue Date: 1-Aug-2016
Publisher: Institute of Electrical and Electronics Engineers
Citation: AMORIM, Maria José; CARDOSO, Margarida G. M. S. – Clustering stability and ground truth: numerical experiments. In 2015 7th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K). Lisbon, Portugal: IEEE, 2015. ISBN 978-1-5090-1967-0. Pp. 259-264
Abstract: Stability has been considered an important property for evaluating clustering solutions. Nevertheless, there are no conclusive studies on the relationship between this property and the capacity to recover clusters inherent to data ("ground truth"). This study focuses on this relationship resorting to synthetic data generated under diverse scenarios (controlling relevant factors). Stability is evaluated using a weighted cross-validation procedure. Indices of agreement (corrected for agreement by chance) are used both to assess stability and external validation. The results obtained reveal a new perspective so far not mentioned in the literature. Despite the clear relationship between stability and external validity when a broad range of scenarios is considered, within-scenarios conclusions deserve our special attention: faced with a specific clustering problem (as we do in practice), there is no significant relationship between stability and the ability to recover data clusters.
URI: http://hdl.handle.net/10400.21/9525
ISBN: 978-9-8975-8164-9
978-1-5090-1967-0
Publisher Version: https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7526928
Appears in Collections:ISEL - Matemática - Comunicações

Files in This Item:
File Description SizeFormat 
MJAmorim.pdf330,19 kBAdobe PDFView/Open    Request a copy


FacebookTwitterDeliciousLinkedInDiggGoogle BookmarksMySpace
Formato BibTex MendeleyEndnote 

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.