Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/109210
PIRA download icon_1.1View/Download Full Text
DC FieldValueLanguage
dc.contributorSchool of Hotel and Tourism Managementen_US
dc.contributorDepartment of Computingen_US
dc.creatorLi, Hen_US
dc.creatorLi, Sen_US
dc.creatorHao, Fen_US
dc.creatorZhang, CJen_US
dc.creatorSong, Yen_US
dc.creatorChen, Len_US
dc.date.accessioned2024-09-24T04:20:53Z-
dc.date.available2024-09-24T04:20:53Z-
dc.identifier.isbn979-8-4007-0172-6en_US
dc.identifier.urihttp://hdl.handle.net/10397/109210-
dc.descriptionWWW '24: The ACM Web Conference 2024, Singapore, Singapore, May 13-17, 2024en_US
dc.language.isoenen_US
dc.publisherAssociation for Computing Machineryen_US
dc.rights© 2024 Copyright held by the owner/author(s).en_US
dc.rightsThis work is licensed under a Creative Commons Attribution-NonCommercial International 4.0 License (https://creativecommons.org/licenses/by-nc/4.0/).en_US
dc.rightsThe following publication Li, H., Li, S., Hao, F., Zhang, C. J., Song, Y., & Chen, L. (2024). BoostER: Leveraging Large Language Models for Enhancing Entity Resolution Companion Proceedings of the ACM Web Conference 2024, Singapore, Singapore is available at https://doi.org/10.1145/3589335.3651245.en_US
dc.subjectEntity Resolutionen_US
dc.subjectLarge Language Modelsen_US
dc.subjectWeb Data Integrationen_US
dc.titleBoostER : leveraging large language models for enhancing entity resolutionen_US
dc.typeConference Paperen_US
dc.identifier.spage1043en_US
dc.identifier.epage1046en_US
dc.identifier.doi10.1145/3589335.3651245en_US
dcterms.abstractEntity resolution, which involves identifying and merging records that refer to the same real-world entity, is a crucial task in areas like Web data integration. This importance is underscored by the presence of numerous duplicated and multi-version data resources on the Web. However, achieving high-quality entity resolution typically demands significant effort. The advent of Large Language Models (LLMs) like GPT-4 has demonstrated advanced linguistic capabilities, which can be a new paradigm for this task. In this paper, we propose a demonstration system named BoostER that examines the possibility of leveraging LLMs in the entity resolution process, revealing advantages in both easy deployment and low cost. Our approach optimally selects a set of matching questions and poses them to LLMs for verification, then refines the distribution of entity resolution results with the response of LLMs. This offers promising prospects to achieve a high-quality entity resolution result for real-world applications, especially to individuals or small companies without the need for extensive model training or significant financial investment. © 2024 Copyright held by the owner/author(s).en_US
dcterms.accessRightsopen accessen_US
dcterms.bibliographicCitationIn WWW '24 Companion: Companion Proceedings of the ACM Web Conference 2024, p. 1043-1046. New York, NY: The Association for Computing Machinery, 2024en_US
dcterms.issued2024-
dc.identifier.scopus2-s2.0-85194465934-
dc.relation.ispartofbookWWW '24 Companion: Companion Proceedings of the ACM Web Conference 2024en_US
dc.relation.conferenceInternational World Wide Web Conference [WWW]en_US
dc.description.validate202409 bcchen_US
dc.description.oaVersion of Recorden_US
dc.identifier.FolderNumberOA_TA-
dc.description.fundingSourceOthersen_US
dc.description.fundingTextMajor Program of National Language Commission; Natural Science Foundation of Guangdong; PolyU(UGC); Innovation and Technology Fund; PolyU-MinshangCT Generative AI Laboratory; Research Matching Grant Scheme; PolyU Start-up Funden_US
dc.description.pubStatusPublisheden_US
dc.description.TAACM (2024)en_US
dc.description.oaCategoryTAen_US
Appears in Collections:Conference Paper
Files in This Item:
File Description SizeFormat 
3589335.3651245.pdf1.79 MBAdobe PDFView/Open
Open Access Information
Status open access
File Version Version of Record
Access
View full-text via PolyU eLinks SFX Query
Show simple item record

Page views

85
Citations as of Apr 14, 2025

Downloads

14
Citations as of Apr 14, 2025

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.