Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/95609
Title: A principled approach using fuzzy set theory for passage-based document retrieval
Authors: Dang, EKF 
Luk, RWP 
Allan, J
Issue Date: Jul-2021
Source: IEEE transactions on fuzzy systems, July 2021, v. 29, no. 7, 9076849, p. 1967-1977
Abstract: In this article, we present a novel principled approach to passage-based (document) retrieval using fuzzy set theory. The approach formulates passage score combination according to general relevance decision principles. By operationalizing these principles using aggregation operators of fuzzy set theory, our approach justifies the common heuristic of taking the maximum constituent passage score as the overall document score. Experiments show that this heuristic is only near-best: some fuzzy set aggregation operators stipulated in our approach perform better. The significance of our principled approach is that it admits many passage score combination methods, potentially bringing further performance enhancement. Experiments on several Text REtrieval Conference (TREC) collections demonstrate that our approach performs significantly better than document-based retrieval. Recent works in the literature mostly employ document-based rather than passage-based retrieval, owing to the common conception that document length normalization solves the problem of varying document lengths. Our results show that document length normalization alone is not sufficient, especially in pseudo-relevance feedback retrieval.
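As a rough illustration of what "passage score combination" can look like, the sketch below aggregates per-passage scores into one document score in three ways: the maximum heuristic, a t-conorm, and a generalized mean. It is not taken from the paper. The function names, the choice of the probabilistic sum as the t-conorm, the exponent values, and the assumption that passage scores are already normalized to [0, 1] are all illustrative assumptions.

```python
from functools import reduce


def max_score(scores):
    """Common heuristic: the document score is the best passage score."""
    return max(scores)


def probabilistic_sum(scores):
    """Fold the t-conorm S(a, b) = a + b - a*b over all passage scores.

    Assumes every score lies in [0, 1]; 0.0 is the identity element.
    """
    return reduce(lambda a, b: a + b - a * b, scores, 0.0)


def generalized_mean(scores, p):
    """Generalized (power) mean with exponent p != 0.

    p = 1 gives the arithmetic mean; as p grows, the value approaches
    the maximum passage score from below.
    """
    n = len(scores)
    return (sum(s ** p for s in scores) / n) ** (1.0 / p)


# Hypothetical passage scores for a single document (illustrative only).
passage_scores = [0.2, 0.5, 0.9]

print(max_score(passage_scores))
print(probabilistic_sum(passage_scores))
print(generalized_mean(passage_scores, 1))
print(generalized_mean(passage_scores, 50))
```

With these operators, the maximum ignores every passage but the best one, whereas the t-conorm and the generalized mean let additional matching passages influence the document score, which is one way such alternatives could differ from the maximum heuristic.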
Keywords: Fuzzy aggregation
Fuzzy information retrieval (IR) system
Generalized mean (GMean)
Performance evaluation
Principled passage-based retrieval
T-conorms
Publisher: Institute of Electrical and Electronics Engineers
Journal: IEEE transactions on fuzzy systems 
ISSN: 1063-6706
EISSN: 1941-0034
DOI: 10.1109/TFUZZ.2020.2990110
Rights: © 2020 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
The following publication E. K. F. Dang, R. W. P. Luk and J. Allan, "A Principled Approach Using Fuzzy Set Theory for Passage-Based Document Retrieval," in IEEE Transactions on Fuzzy Systems, vol. 29, no. 7, pp. 1967-1977, July 2021 is available at https://doi.org/10.1109/TFUZZ.2020.2990110
Appears in Collections: Journal/Magazine Article

Files in This Item:
File: Dang_Principled_Approach_Using.pdf
Description: Pre-Published Version
Size: 552.09 kB
Format: Adobe PDF
Open Access Information
Status: open access
File Version: Final Accepted Manuscript

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.