Authors
Yaoshu Wang, Chuan Xiao, Jianbin Qin, Xin Cao, Yifang Sun, Wei Wang, Makoto Onizuka
Publication date
2020/6/11
Book
Proceedings of the 2020 ACM SIGMOD International Conference on Management of Data
Pages
1197-1212
Description
In this paper, we investigate the possibilities of utilizing deep learning for cardinality estimation of similarity selection. Answering this problem accurately and efficiently is essential to many data management applications, especially for query optimization. Moreover, in some applications the estimated cardinality is supposed to be consistent and interpretable. Hence a monotonic estimation w.r.t. the query threshold is preferred. We propose a novel and generic method that can be applied to any data type and distance function. Our method consists of a feature extraction model and a regression model. The feature extraction model transforms original data and threshold to a Hamming space, in which a deep learning-based regression model is utilized to exploit the incremental property of cardinality w.r.t. the threshold for both accuracy and monotonicity. We develop a training strategy tailored to our model as well as …
Total citations
2020202120222023202446963
Scholar articles
Y Wang, C Xiao, J Qin, X Cao, Y Sun, W Wang… - Proceedings of the 2020 ACM SIGMOD International …, 2020