Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/109210
PIRA download icon_1.1View/Download Full Text
Title: BoostER : leveraging large language models for enhancing entity resolution
Authors: Li, H 
Li, S
Hao, F 
Zhang, CJ 
Song, Y
Chen, L
Issue Date: 2024
Source: In WWW '24 Companion: Companion Proceedings of the ACM Web Conference 2024, p. 1043-1046. New York, NY: The Association for Computing Machinery, 2024
Abstract: Entity resolution, which involves identifying and merging records that refer to the same real-world entity, is a crucial task in areas like Web data integration. This importance is underscored by the presence of numerous duplicated and multi-version data resources on the Web. However, achieving high-quality entity resolution typically demands significant effort. The advent of Large Language Models (LLMs) like GPT-4 has demonstrated advanced linguistic capabilities, which can be a new paradigm for this task. In this paper, we propose a demonstration system named BoostER that examines the possibility of leveraging LLMs in the entity resolution process, revealing advantages in both easy deployment and low cost. Our approach optimally selects a set of matching questions and poses them to LLMs for verification, then refines the distribution of entity resolution results with the response of LLMs. This offers promising prospects to achieve a high-quality entity resolution result for real-world applications, especially to individuals or small companies without the need for extensive model training or significant financial investment. © 2024 Copyright held by the owner/author(s).
Keywords: Entity Resolution
Large Language Models
Web Data Integration
Publisher: Association for Computing Machinery
ISBN: 979-8-4007-0172-6
DOI: 10.1145/3589335.3651245
Description: WWW '24: The ACM Web Conference 2024, Singapore, Singapore, May 13-17, 2024
Rights: © 2024 Copyright held by the owner/author(s).
This work is licensed under a Creative Commons Attribution-NonCommercial International 4.0 License (https://creativecommons.org/licenses/by-nc/4.0/).
The following publication Li, H., Li, S., Hao, F., Zhang, C. J., Song, Y., & Chen, L. (2024). BoostER: Leveraging Large Language Models for Enhancing Entity Resolution Companion Proceedings of the ACM Web Conference 2024, Singapore, Singapore is available at https://doi.org/10.1145/3589335.3651245.
Appears in Collections:Conference Paper

Files in This Item:
File Description SizeFormat 
3589335.3651245.pdf1.79 MBAdobe PDFView/Open
Open Access Information
Status open access
File Version Version of Record
Access
View full-text via PolyU eLinks SFX Query
Show full item record

Page views

85
Citations as of Apr 14, 2025

Downloads

14
Citations as of Apr 14, 2025

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.