View article

[PDF] from parisdescartes.fr

Beyond one billion time series: indexing and mining very large time series collections with SAX2+

Authors

Alessandro Camerra, Jin Shieh, Themis Palpanas, Thanawin Rakthanmanon, Eamonn Keogh

Publication date

2014/4

Journal

Knowledge and information systems

Volume

Issue

Pages

123-151

Publisher

Springer London

Description

There is an increasingly pressing need, by several applications in diverse domains, for developing techniques able to index and mine very large collections of time series. Examples of such applications come from astronomy, biology, the web, and other domains. It is not unusual for these applications to involve numbers of time series in the order of hundreds of millions to billions. However, all relevant techniques that have been proposed in the literature so far have not considered any data collections much larger than one-million time series. In this paper, we describe SAX 2.0 and its improvements, SAX 2.0 Clustered and SAX2+, three methods designed for indexing and mining truly massive collections of time series. We show that the main bottleneck in mining such massive datasets is the time taken to build the index, and we thus introduce a novel bulk loading mechanism, the first of this kind …

Total citations

Cited by 149

201420152016201720182019202020212022202320246 9 7 18 22 16 26 11 14 11 5

Scholar articles

Beyond one billion time series: indexing and mining very large time series collections with SAX2+

A Camerra, J Shieh, T Palpanas, T Rakthanmanon… - Knowledge and information systems, 2014