Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/105467
Title: Accelerating similarity-based mining tasks on high-dimensional data by processing-in-memory
Authors: Wang, F 
Yiu, ML 
Shao, Z 
Issue Date: 2021
Source: 2021 IEEE 37th International Conference on Data Engineering (ICDE), 19-22 April 2021, Chania, Greece, p. 1859-1864
Abstract: Similarity computation is a core subroutine of many mining tasks on multi-dimensional data, which are often massive datasets at high dimensionality. In these mining tasks, the performance bottleneck is caused by the ‘memory wall’ problem, as a substantial amount of data needs to be transferred from memory to processors. Recent advances in non-volatile memory (NVM) enable processing-in-memory (PIM), which reduces data transfer and thus alleviates the performance bottleneck. Nevertheless, NVM PIM supports specific operations only (e.g., dot-product on non-negative integer vectors) but not arbitrary operations. In this paper, we tackle the above challenge and carefully exploit NVM PIM to accelerate similarity-based mining tasks on multi-dimensional data without compromising the accuracy of results. Experimental results on real datasets show that our proposed method achieves up to 10.5x and 8.5x speedup for state-of-the-art kNN classification and k-means clustering algorithms, respectively.
Publisher: Institute of Electrical and Electronics Engineers
ISBN: 978-1-7281-9184-3
DOI: 10.1109/ICDE51399.2021.00167
Rights: ©2021 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
The following publication F. Wang, M. L. Yiu and Z. Shao, "Accelerating Similarity-based Mining Tasks on High-dimensional Data by Processing-in-memory," 2021 IEEE 37th International Conference on Data Engineering (ICDE), Chania, Greece, 2021, pp. 1859-1864 is available at https://doi.org/10.1109/ICDE51399.2021.00167.
Appears in Collections: Conference Paper

Files in This Item:
File: Yiu_Accelerating_Similarity-Based_Mining.pdf
Description: Pre-Published version
Size: 1.01 MB
Format: Adobe PDF
Open Access Information
Status: open access
File Version: Final Accepted Manuscript
