Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/109210
PIRA download icon_1.1View/Download Full Text
Title: BoostER : leveraging large language models for enhancing entity resolution
Authors: Li, H 
Li, S
Hao, F 
Zhang, CJ 
Song, Y
Chen, L
Issue Date: 2024
Source: In WWW '24 Companion: Companion Proceedings of the ACM Web Conference 2024, p. 1043-1046. New York, NY: The Association for Computing Machinery, 2024
Abstract: Entity resolution, which involves identifying and merging records that refer to the same real-world entity, is a crucial task in areas like Web data integration. This importance is underscored by the presence of numerous duplicated and multi-version data resources on the Web. However, achieving high-quality entity resolution typically demands significant effort. The advent of Large Language Models (LLMs) like GPT-4 has demonstrated advanced linguistic capabilities, which can be a new paradigm for this task. In this paper, we propose a demonstration system named BoostER that examines the possibility of leveraging LLMs in the entity resolution process, revealing advantages in both easy deployment and low cost. Our approach optimally selects a set of matching questions and poses them to LLMs for verification, then refines the distribution of entity resolution results with the response of LLMs. This offers promising prospects to achieve a high-quality entity resolution result for real-world applications, especially to individuals or small companies without the need for extensive model training or significant financial investment. © 2024 Copyright held by the owner/author(s).
Keywords: Entity Resolution
Large Language Models
Web Data Integration
Publisher: Association for Computing Machinery
ISBN: 979-8-4007-0172-6
DOI: 10.1145/3589335.3651245
Description: WWW '24: The ACM Web Conference 2024, Singapore, Singapore, May 13-17, 2024
Rights: © 2024 Copyright held by the owner/author(s).
This work is licensed under a Creative Commons Attribution-NonCommercial International 4.0 License (https://creativecommons.org/licenses/by-nc/4.0/).
The following publication Li, H., Li, S., Hao, F., Zhang, C. J., Song, Y., & Chen, L. (2024). BoostER: Leveraging Large Language Models for Enhancing Entity Resolution Companion Proceedings of the ACM Web Conference 2024, Singapore, Singapore is available at https://doi.org/10.1145/3589335.3651245.
Appears in Collections:Conference Paper

Files in This Item:
File Description SizeFormat 
3589335.3651245.pdf1.79 MBAdobe PDFView/Open
Open Access Information
Status open access
File Version Version of Record
Access
View full-text via PolyU eLinks SFX Query
Show full item record

Page views

85
Citations as of Apr 14, 2025

Downloads

14
Citations as of Apr 14, 2025

SCOPUSTM   
Citations

1
Citations as of Aug 1, 2025

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.