Authors
Daichi Amagata, Makoto Onizuka, Takahiro Hara
Publication date
2021/6/9
Book
Proceedings of the 2021 International Conference on Management of Data
Pages
36-48
Description
Distance-based outlier detection is widely adopted in many fields, e.g., data mining and machine learning, because it is unsupervised, can be employed in a generic metric space, and does not have any assumptions of data distributions. Data mining and machine learning applications face a challenge of dealing with large datasets, which requires efficient distance-based outlier detection algorithms. Due to the popularization of computational environments with large memory, it is possible to build a main-memory index and detect outliers based on it, which is a promising solution for fast distance-based outlier detection.
Motivated by this observation, we propose a novel approach that exploits a proximity graph. Our approach can employ an arbitrary proximity graph and obtains a significant speed-up against state-of-the-art. However, designing an effective proximity graph raises a challenge, because existing …
Total citations
202220232024478
Scholar articles
D Amagata, M Onizuka, T Hara - Proceedings of the 2021 International Conference on …, 2021