Please use this identifier to cite or link to this item:
http://hdl.handle.net/10397/109210
Title: | BoostER : leveraging large language models for enhancing entity resolution | Authors: | Li, H Li, S Hao, F Zhang, CJ Song, Y Chen, L |
Issue Date: | 2024 | Source: | In WWW '24 Companion: Companion Proceedings of the ACM Web Conference 2024, p. 1043-1046. New York, NY: The Association for Computing Machinery, 2024 | Abstract: | Entity resolution, which involves identifying and merging records that refer to the same real-world entity, is a crucial task in areas like Web data integration. This importance is underscored by the presence of numerous duplicated and multi-version data resources on the Web. However, achieving high-quality entity resolution typically demands significant effort. The advent of Large Language Models (LLMs) like GPT-4 has demonstrated advanced linguistic capabilities, which can be a new paradigm for this task. In this paper, we propose a demonstration system named BoostER that examines the possibility of leveraging LLMs in the entity resolution process, revealing advantages in both easy deployment and low cost. Our approach optimally selects a set of matching questions and poses them to LLMs for verification, then refines the distribution of entity resolution results with the response of LLMs. This offers promising prospects to achieve a high-quality entity resolution result for real-world applications, especially to individuals or small companies without the need for extensive model training or significant financial investment. © 2024 Copyright held by the owner/author(s). | Keywords: | Entity Resolution Large Language Models Web Data Integration |
Publisher: | Association for Computing Machinery | ISBN: | 979-8-4007-0172-6 | DOI: | 10.1145/3589335.3651245 | Description: | WWW '24: The ACM Web Conference 2024, Singapore, Singapore, May 13-17, 2024 | Rights: | © 2024 Copyright held by the owner/author(s). This work is licensed under a Creative Commons Attribution-NonCommercial International 4.0 License (https://creativecommons.org/licenses/by-nc/4.0/). The following publication Li, H., Li, S., Hao, F., Zhang, C. J., Song, Y., & Chen, L. (2024). BoostER: Leveraging Large Language Models for Enhancing Entity Resolution Companion Proceedings of the ACM Web Conference 2024, Singapore, Singapore is available at https://doi.org/10.1145/3589335.3651245. |
Appears in Collections: | Conference Paper |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
3589335.3651245.pdf | 1.79 MB | Adobe PDF | View/Open |
Page views
85
Citations as of Apr 14, 2025
Downloads
14
Citations as of Apr 14, 2025

Google ScholarTM
Check
Altmetric
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.