Binary independence language model in a relevance feedback environment

Wu, HC; Luk, RWP; Wong, KF; Nie, JY

doi:10.1142/S021819401950030X

Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/105583

DC Field	Value	Language
dc.contributor	Department of Computing	-
dc.creator	Wu, HC	-
dc.creator	Luk, RWP	-
dc.creator	Wong, KF	-
dc.creator	Nie, JY	-
dc.date.accessioned	2024-04-15T07:35:12Z	-
dc.date.available	2024-04-15T07:35:12Z	-
dc.identifier.issn	0218-1940	-
dc.identifier.uri	http://hdl.handle.net/10397/105583	-
dc.language.iso	en	en_US
dc.publisher	World Scientific Publishing Co. Pte. Ltd.	en_US
dc.rights	Electronic version of an article published as International Journal of Software Engineering and Knowledge Engineering, Vol. 29, No. 06, pp. 873-895, https://doi.org/10.1142/S021819401950030X © World Scientific Publishing Company https://www.worldscientific.com/worldscinet/ijseke	en_US
dc.subject	Information retrieval	en_US
dc.subject	Language model	en_US
dc.subject	Proximity matching	en_US
dc.title	Binary independence language model in a relevance feedback environment	en_US
dc.type	Journal/Magazine Article	en_US
dc.identifier.spage	873	-
dc.identifier.epage	895	-
dc.identifier.volume	29	-
dc.identifier.issue	6	-
dc.identifier.doi	10.1142/S021819401950030X	-
dcterms.abstract	Model construction is a kind of knowledge engineering, and building retrieval models is critical to the success of search engines. This article proposes a new (retrieval) language model, called binary independence language model (BILM). It integrates two document-context based language models together into one by the log-odds ratio where these two are language models applied to describe document-contexts of query terms. One model is based on relevance information while the other is based on the non-relevance information. Each model incorporates link dependencies and multiple query term dependencies. The probabilities are interpolated between the relative frequency and the background probabilities. In a simulated relevance feedback environment of top 20 judged documents, our BILM performed statistically significantly better than the other highly effective retrieval models at 95% confidence level across four TREC collections using fixed parameter values for the mean average precision. For the less stable performance measure (i.e. precision at the top 10), no statistical significance is shown between the different models for the individual test collections although numerically our BILM is better than two other models with a confidence level of 95% based on a paired sign test across the test collections of both relevance feedback and retrospective experiments.	-
dcterms.accessRights	open access	en_US
dcterms.bibliographicCitation	International journal of software engineering and knowledge engineering, June 2019, v. 29, no. 6, p. 873-895	-
dcterms.isPartOf	International journal of software engineering and knowledge engineering	-
dcterms.issued	2019-06	-
dc.identifier.scopus	2-s2.0-85068084720	-
dc.identifier.eissn	1793-6403	-
dc.description.validate	202402 bcch	-
dc.description.oa	Accepted Manuscript	en_US
dc.identifier.FolderNumber	COMP-0592	en_US
dc.description.fundingSource	RGC	en_US
dc.description.pubStatus	Published	en_US
dc.identifier.OPUS	14229951	en_US
dc.description.oaCategory	Green (AAM)	en_US
Appears in Collections:	Journal/Magazine Article
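
The abstract describes scoring by a log-odds ratio between a relevance-based and a non-relevance-based language model, with each probability interpolated between a relative frequency and a background probability. The following is a minimal sketch of that idea only, not the authors' implementation: it uses plain unigram probabilities and omits the dependency modelling mentioned in the abstract. The function names, the mixing weight `lam`, and the requirement that `background` assigns a positive probability to every term are assumptions made for illustration.

```python
import math
from collections import Counter


def interpolated_prob(term, counts, total, background, lam=0.5):
    """Mix the relative frequency of `term` with its background probability.

    `lam` is a hypothetical mixing weight; `background` is assumed to map
    every term to a probability greater than zero.
    """
    rel_freq = counts[term] / total if total else 0.0
    return lam * rel_freq + (1.0 - lam) * background[term]


def log_odds_score(context, rel_contexts, nonrel_contexts, background, lam=0.5):
    """Sum per-term log-odds of a context under the two smoothed models."""
    rel_counts = Counter(t for c in rel_contexts for t in c)
    non_counts = Counter(t for c in nonrel_contexts for t in c)
    rel_total = sum(rel_counts.values())
    non_total = sum(non_counts.values())

    score = 0.0
    for term in context:
        p_rel = interpolated_prob(term, rel_counts, rel_total, background, lam)
        p_non = interpolated_prob(term, non_counts, non_total, background, lam)
        score += math.log(p_rel / p_non)
    return score


# Toy data (invented for illustration): contexts seen in judged documents.
rel_contexts = [["language", "model"], ["language", "retrieval"]]
nonrel_contexts = [["football", "model"]]
background = {"language": 0.25, "model": 0.25, "retrieval": 0.25, "football": 0.25}

score_language = log_odds_score(["language"], rel_contexts, nonrel_contexts, background)
score_football = log_odds_score(["football"], rel_contexts, nonrel_contexts, background)
```

In this toy setup a context made of terms seen mostly in relevant documents receives a positive score and one made of terms seen mostly in non-relevant documents receives a negative score. The background term keeps both probabilities non-zero, so the logarithm is always defined.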

Files in This Item:

File	Description	Size	Format
Wu_Binary_Independence_Language.pdf	Pre-Published version	1.24 MB	Adobe PDF

Open Access Information

Status	open access
File Version	Final Accepted Manuscript


Usage and Citation Metrics

Metric	Count	As of
Page views	139	Apr 12, 2026
Page views (last week)	9	Apr 12, 2026
Downloads	110	Apr 12, 2026
Scopus citations	1	May 8, 2026
Web of Science citations	1	Apr 23, 2026