Authors
Thanawin Rakthanmanon, Bilson Campana, Abdullah Mueen, Gustavo Batista, Brandon Westover, Qiang Zhu, Jesin Zakaria, Eamonn Keogh
Publication date
2012/8/12
Book
Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining
Pages
262-270
Description
Most time series data mining algorithms use similarity search as a core subroutine, and thus the time taken for similarity search is the bottleneck for virtually all time series data mining algorithms. The difficulty of scaling search to large datasets largely explains why most academic work on time series data mining has plateaued at considering a few millions of time series objects, while much of industry and science sits on billions of time series objects waiting to be explored. In this work we show that by using a combination of four novel ideas we can search and mine truly massive time series for the first time. We demonstrate the following extremely unintuitive fact; in large datasets we can exactly search under DTW much more quickly than the current state-of-the-art Euclidean distance search algorithms. We demonstrate our work on the largest set of time series experiments ever attempted. In particular, the largest …
Total citations
20122013201420152016201720182019202020212022202320241040769512512114113013713312612043
Scholar articles
T Rakthanmanon, B Campana, A Mueen, G Batista… - Proceedings of the 18th ACM SIGKDD international …, 2012
T Rakthanmanon, B Campana, A Mueen, G Batista… - Proceedings of the 18th ACM SIGKDD international …
T Rakthanmanon, B Campana, A Mueen, G Batista… - Knowledge Discovery and Data Mining, Bejing, China, 2012
T Rakthanmanon, B Campana, A Mueen, G Batista… - Proceedings of the 18th ACM SIGKDD intersnational …
T Rakthanmanon, AM Bilson Campana… - Searching and mining trillions of time series …