Please use this identifier to cite or link to this item:
http://hdl.handle.net/10397/109210
Title: | BoostER : leveraging large language models for enhancing entity resolution | Authors: | Li, H Li, S Hao, F Zhang, CJ Song, Y Chen, L |
Issue Date: | 2024 | Source: | In WWW '24 Companion: Companion Proceedings of the ACM Web Conference 2024, p. 1043-1046. New York, NY: The Association for Computing Machinery, 2024 | Abstract: | Entity resolution, which involves identifying and merging records that refer to the same real-world entity, is a crucial task in areas like Web data integration. This importance is underscored by the presence of numerous duplicated and multi-version data resources on the Web. However, achieving high-quality entity resolution typically demands significant effort. The advent of Large Language Models (LLMs) like GPT-4 has demonstrated advanced linguistic capabilities, which can be a new paradigm for this task. In this paper, we propose a demonstration system named BoostER that examines the possibility of leveraging LLMs in the entity resolution process, revealing advantages in both easy deployment and low cost. Our approach optimally selects a set of matching questions and poses them to LLMs for verification, then refines the distribution of entity resolution results with the response of LLMs. This offers promising prospects to achieve a high-quality entity resolution result for real-world applications, especially to individuals or small companies without the need for extensive model training or significant financial investment. © 2024 Copyright held by the owner/author(s). | Keywords: | Entity Resolution Large Language Models Web Data Integration |
Publisher: | Association for Computing Machinery | ISBN: | 979-8-4007-0172-6 | DOI: | 10.1145/3589335.3651245 | Description: | WWW '24: The ACM Web Conference 2024, Singapore, Singapore, May 13-17, 2024 | Rights: | © 2024 Copyright held by the owner/author(s). This work is licensed under a Creative Commons Attribution-NonCommercial International 4.0 License (https://creativecommons.org/licenses/by-nc/4.0/). The following publication Li, H., Li, S., Hao, F., Zhang, C. J., Song, Y., & Chen, L. (2024). BoostER: Leveraging Large Language Models for Enhancing Entity Resolution Companion Proceedings of the ACM Web Conference 2024, Singapore, Singapore is available at https://doi.org/10.1145/3589335.3651245. |
Appears in Collections: | Conference Paper |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
3589335.3651245.pdf | 1.79 MB | Adobe PDF | View/Open |
Page views
85
Citations as of Apr 14, 2025
Downloads
14
Citations as of Apr 14, 2025
SCOPUSTM
Citations
1
Citations as of Aug 1, 2025

Google ScholarTM
Check
Altmetric
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.