Authors
Eamonn Keogh, Jessica Lin
Publication date
2005/8
Journal
Knowledge and Information Systems
Volume
8
Pages
154-177
Publisher
Springer London
Description
Given the recent explosion of interest in streaming data and online algorithms, clustering of time-series subsequences, extracted via a sliding window, has received much attention. In this work, we make a surprising claim. Clustering of time-series subsequences is meaningless. More concretely, clusters extracted from these time series are forced to obey a certain constraint that is pathologically unlikely to be satisfied by any dataset, and because of this, the clusters extracted by any clustering algorithm are essentially random. While this constraint can be intuitively demonstrated with a simple illustration and is simple to prove, it has never appeared in the literature. We can justify calling our claim surprising because it invalidates the contribution of dozens of previously published papers. We will justify our claim with a theorem, illustrative examples, and a comprehensive set of experiments on reimplementations …
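For readers unfamiliar with the setup the abstract refers to, the following is a minimal sketch of extracting subsequences via a sliding window. It is an illustration only, not the authors' code: the function name `sliding_window_subsequences`, the window length `w`, and the synthetic random-walk series are all assumptions made for this example.

```python
import random


def sliding_window_subsequences(series, w):
    """Return every contiguous length-w subsequence of `series`, in order.

    Hypothetical helper for illustration; not taken from the paper.
    """
    if not 1 <= w <= len(series):
        raise ValueError("window length must be between 1 and len(series)")
    # Slide the window one step at a time: len(series) - w + 1 windows in total.
    return [list(series[i:i + w]) for i in range(len(series) - w + 1)]


# Synthetic random-walk series (an assumption; any 1-D sequence would do).
random.seed(0)
series = []
level = 0.0
for _ in range(200):
    level += random.gauss(0.0, 1.0)
    series.append(level)

subsequences = sliding_window_subsequences(series, w=16)

# Adjacent windows share all but one point, so the extracted
# subsequences are heavily overlapping rather than independent samples.
print(len(subsequences))                             # 200 - 16 + 1 windows
print(subsequences[0][1:] == subsequences[1][:-1])   # overlap of w - 1 points
```

The rows of `subsequences` are what a clustering algorithm would receive as input in the setting the abstract describes. The sketch stops before the clustering step and makes no attempt to reproduce the paper's theorem or experiments.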
Total citations
[Bar chart: citations per year, 2004 to 2024]