Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/105583
PIRA download icon_1.1View/Download Full Text
DC FieldValueLanguage
dc.contributorDepartment of Computing-
dc.creatorWu, HC-
dc.creatorLuk, RWP-
dc.creatorWong, KF-
dc.creatorNie, JY-
dc.date.accessioned2024-04-15T07:35:12Z-
dc.date.available2024-04-15T07:35:12Z-
dc.identifier.issn0218-1940-
dc.identifier.urihttp://hdl.handle.net/10397/105583-
dc.language.isoenen_US
dc.publisherWorld Scientific Publishing Co. Pte. Ltd.en_US
dc.rightsElectronic version of an article published as International Journal of Software Engineering and Knowledge Engineering, Vol. 29, No. 06, pp. 873-895, https://doi.org/10.1142/S021819401950030X © World Scientific Publishing Company https://www.worldscientific.com/worldscinet/ijsekeen_US
dc.subjectInformation retrievalen_US
dc.subjectLanguage modelen_US
dc.subjectProximity matchingen_US
dc.titleBinary independence language model in a relevance feedback environmenten_US
dc.typeJournal/Magazine Articleen_US
dc.identifier.spage873-
dc.identifier.epage895-
dc.identifier.volume29-
dc.identifier.issue6-
dc.identifier.doi10.1142/S021819401950030X-
dcterms.abstractModel construction is a kind of knowledge engineering, and building retrieval models is critical to the success of search engines. This article proposes a new (retrieval) language model, called binary independence language model (BILM). It integrates two document-context based language models together into one by the log-odds ratio where these two are language models applied to describe document-contexts of query terms. One model is based on relevance information while the other is based on the non-relevance information. Each model incorporates link dependencies and multiple query term dependencies. The probabilities are interpolated between the relative frequency and the background probabilities. In a simulated relevance feedback environment of top 20 judged documents, our BILM performed statistically significantly better than the other highly effective retrieval models at 95% confidence level across four TREC collections using fixed parameter values for the mean average precision. For the less stable performance measure (i.e. precision at the top 10), no statistical significance is shown between the different models for the individual test collections although numerically our BILM is better than two other models with a confidence level of 95% based on a paired sign test across the test collections of both relevance feedback and retrospective experiments.-
dcterms.accessRightsopen accessen_US
dcterms.bibliographicCitationInternational journal of software engineering and knowledge engineering, June 2019, v. 29, no. 6, p. 873-895-
dcterms.isPartOfInternational journal of software engineering and knowledge engineering-
dcterms.issued2019-06-
dc.identifier.scopus2-s2.0-85068084720-
dc.identifier.eissn1793-6403-
dc.description.validate202402 bcch-
dc.description.oaAccepted Manuscripten_US
dc.identifier.FolderNumberCOMP-0592en_US
dc.description.fundingSourceRGCen_US
dc.description.pubStatusPublisheden_US
dc.identifier.OPUS14229951en_US
dc.description.oaCategoryGreen (AAM)en_US
Appears in Collections:Journal/Magazine Article
Files in This Item:
File Description SizeFormat 
Wu_Binary_Independence_Language.pdfPre-Published version1.24 MBAdobe PDFView/Open
Open Access Information
Status open access
File Version Final Accepted Manuscript
Access
View full-text via PolyU eLinks SFX Query
Show simple item record

Page views

39
Citations as of Jul 7, 2024

Downloads

34
Citations as of Jul 7, 2024

SCOPUSTM   
Citations

1
Citations as of Jul 4, 2024

WEB OF SCIENCETM
Citations

1
Citations as of Jul 4, 2024

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.