Please use this identifier to cite or link to this item:
http://hdl.handle.net/10397/108315
Title: | QSRP : efficient reverse k−ranks query processing on high-dimensional embeddings | Authors: | Bian, Z Yan, X Zhang, J Yiu, ML Tang, B |
Issue Date: | 2024 | Source: | 2024 IEEE 40th International Conference on Data Engineering, 13-17 May 2024, Utrecht, Netherlands, p. 4614-4627 | Abstract: | Embedding models represent users and products as high-dimensional embedding vectors and are widely used for recommendation. In this paper, we study the reverse k-ranks query, which finds the users that are the most interested in a product and has many applications including product promotion, targeted advertising, and market analysis. As reverse k-ranks solutions for low dimensionality (e.g., trees) fail for the high-dimensional embeddings generated by embedding models, we propose the QSRP framework. QSRP precomputes the score table between all user and product embeddings to facilitate pruning and refinement at query time. As the score table is usually large, QSRP samples some of its columns as the index to fit in memory. To tackle the problem that naive uniform sampling results in poor pruning effect, we propose query-aware sampling, which conducts sampling by explicitly maximizing the pruning effect for a set of sample queries. Moreover, we introduce regression-based pruning, which fits cheap linear functions to predict the bounds used for pruning. We also design techniques to build the index with limited memory, reduce index building time, and handle updates. We evaluate QSRP under various configurations and compare with state-of-the-art baselines. The results show that QSRP achieves shorter query time than the baselines in all cases, and the speedup is usually over 100x. | Keywords: | Embedding model High dimensional data Product-oriented recommendation Ranking query Reverse k-ranks |
Publisher: | Institute of Electrical and Electronics Engineers | ISBN: | 979-8-3503-1715-2 | DOI: | 10.1109/ICDE60146.2024.00351 | Rights: | © 2024 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. The following publication Z. Bian, X. Yan, J. Zhang, M. L. Yiu and B. Tang, "QSRP: Efficient Reverse k−Ranks Query Processing on High-Dimensional Embeddings," 2024 IEEE 40th International Conference on Data Engineering (ICDE), Utrecht, Netherlands, 2024, pp. 4614-4627 is available at https://doi.org/10.1109/ICDE60146.2024.00351. |
Appears in Collections: | Conference Paper |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Bian_QSRP_Efficient_Reverse.pdf | Preprint version | 1.51 MB | Adobe PDF | View/Open |
Page views
89
Citations as of Apr 14, 2025
Downloads
94
Citations as of Apr 14, 2025

Google ScholarTM
Check
Altmetric
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.