Authors
Paolo Ciaccia, Marco Patella, Pavel Zezula
Publication date
1997
Conference
Proceedings of the Twenty-third International Conference on Very Large Data Bases, Athens, Greece, 26-29 August 1997
Pages
426-435
Publisher
Morgan Kaufmann
Description
A new access method, called M-tree, is proposed to organize and search large data sets from a generic “metric space”, ie where object proximity is only defined by a distance function satisfying the positivity, symmetry, and triangle inequality postulates. We detail algorithms for insertion of objects and split management, which keep the M-tree always balanced-several heuristic split alternatives are considered and experimentally evaluated. Algorithms for similarity (range and k-nearest neighbors) queries are also described. Results from extensive experimentation with a prototype system are reported, considering as the performance criteria the number of page I/O’s and the number of distance computations. The results demonstrate that the M-tree indeed extends the domain of applicability beyond the traditional vector spaces, performs reasonably well in high-dimensional data spaces, and scales well in case of growing files.
Total citations
1997199819992000200120022003200420052006200720082009201020112012201320142015201620172018201920202021202220232024725496965789412312414813615916216312512013911198979374867758576419
Scholar articles
P Ciaccia, M Patella, P Zezula - Proceedings of the International Conference on Very …, 1997