GPU-based butterfly counting Y Xia, F Zhang, Q Xu, M Zhang, Z Yao, L Lu, X Du, D Deng, B He, S Ma The VLDB Journal, 1-25, 2024 | | 2024 |
G-Learned Index: Enabling Efficient Learned Index on GPU J Liu, F Zhang, L Lu, C Qi, X Guo, D Deng, G Li, H Zhang, J Zhai, H Zhang, ... IEEE Transactions on Parallel and Distributed Systems, 2024 | | 2024 |
SeRF: Segment Graph for Range-Filtering Approximate Nearest Neighbor Search C Zuo, M Qiao, W Zhou, F Li, D Deng Proceedings of the ACM on Management of Data 2 (1), 1-26, 2024 | | 2024 |
Neural locality sensitive hashing for entity blocking R Wang, L Kong, Y Tao, A Borthwick, D Golac, H Johnson, S Hijazi, ... Proceedings of the 2024 SIAM International Conference on Data Mining (SDM …, 2024 | | 2024 |
Near-duplicate sequence search at scale for large language model memorization evaluation Z Peng, Z Wang, D Deng Proceedings of the ACM on Management of Data 1 (2), 1-18, 2023 | 4 | 2023 |
ARKGraph: All-Range Approximate K-Nearest-Neighbor Graph C Zuo, D Deng Proceedings of the VLDB Endowment 16 (10), 2645-2658, 2023 | 1 | 2023 |
The case for learned provenance graph storage systems H Ding, J Zhai, D Deng, S Ma 32nd USENIX Security Symposium (USENIX Security 23), 3277-3294, 2023 | 5 | 2023 |
TxtAlign: efficient near-duplicate text alignment search via bottom-k sketches for plagiarism detection Z Wang, C Zuo, D Deng Proceedings of the 2022 International Conference on Management of Data, 1146 …, 2022 | 10 | 2022 |
Spine: Scaling up programming-by-negative-example for string filtering and transformation C Zuo, S Assadi, D Deng Proceedings of the 2022 International Conference on Management of Data, 521-530, 2022 | 1 | 2022 |
Efficient Load-Balanced Butterfly Counting on GPU Q Xu, F Zhang, Z Yao, L Lu, X Du, D Deng, B He Proceedings of the VLDB Endowment, 2022 | 9 | 2022 |
G-SLIDE: A GPU-based sub-linear deep learning engine via LSH sparsification Z Pan, F Zhang, H Li, C Zhang, X Du, D Deng IEEE Transactions on Parallel and Distributed Systems 33 (11), 3015-3027, 2021 | 5 | 2021 |
Allign: Aligning all-pair near-duplicate passages in long texts W Feng, D Deng Proceedings of the 2021 International Conference on Management of Data, 541-553, 2021 | 9 | 2021 |
Internal and external memory set containment join C Yang, D Deng, S Shang, F Zhu, L Liu, L Shao The VLDB Journal 30 (3), 447-470, 2021 | 3 | 2021 |
DeltaPQ: Lossless Product Quantization Code Compression for High Dimensional Similarity Search R Wang, D Deng Proceedings of the VLDB Endowment 13 (13), 3603-3616, 2020 | 19 | 2020 |
Efficient locality-sensitive hashing over high-dimensional data streams C Yang, D Deng, S Shang, L Shao 2020 IEEE 36th International Conference on Data Engineering (ICDE), 1986-1989, 2020 | 10 | 2020 |
Josie: Overlap set similarity search for finding joinable tables in data lakes E Zhu, D Deng, F Nargesian, RJ Miller Proceedings of the 2019 International Conference on Management of Data, 847-864, 2019 | 176 | 2019 |
Technical report: Optimizing human involvement for entity matching and consolidation J Sun, D Deng, I Ilyas, G Li, S Madden, M Ouzzani, M Stonebraker, ... arXiv preprint arXiv:1906.06574, 2019 | 2 | 2019 |
Balance-aware distributed string similarity-based query processing system J Sun, Z Shang, G Li, D Deng, Z Bao Proceedings of the VLDB Endowment 12 (9), 961-974, 2019 | 15 | 2019 |
Unsupervised string transformation learning for entity consolidation D Deng, W Tao, Z Abedjan, A Elmagarmid, IF Ilyas, G Li, S Madden, ... 2019 IEEE 35th International Conference on Data Engineering (ICDE), 196-207, 2019 | 33* | 2019 |
Lcjoin: Set containment join via list crosscutting D Deng, C Yang, S Shang, F Zhu, L Liu, L Shao 2019 IEEE 35th International Conference on Data Engineering (ICDE), 362-373, 2019 | 18 | 2019 |