Integrated and enhanced pipeline system to support spoken language analytics for screening neurocognitive disorders

Meng, H; Mak, B; Mak, MW; Fung, H; Gong, X; Kwok, T; Liu, X; Mok, V; Wong, P; Woo, J; Wu, X; Wong, KH; Xu, SS; Zheng, N; Huang, R; Kang, J; Ke, X; Li, J; Li, J; Wang, Y

doi:10.21437/Interspeech.2023-2249

Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/111706

DC Field	Value	Language
dc.contributor	Department of Electrical and Electronic Engineering	-
dc.creator	Meng, H	-
dc.creator	Mak, B	-
dc.creator	Mak, MW	-
dc.creator	Fung, H	-
dc.creator	Gong, X	-
dc.creator	Kwok, T	-
dc.creator	Liu, X	-
dc.creator	Mok, V	-
dc.creator	Wong, P	-
dc.creator	Woo, J	-
dc.creator	Wu, X	-
dc.creator	Wong, KH	-
dc.creator	Xu, SS	-
dc.creator	Zheng, N	-
dc.creator	Huang, R	-
dc.creator	Kang, J	-
dc.creator	Ke, X	-
dc.creator	Li, J	-
dc.creator	Li, J	-
dc.creator	Wang, Y	-
dc.date.accessioned	2025-03-13T02:22:09Z	-
dc.date.available	2025-03-13T02:22:09Z	-
dc.identifier.uri	http://hdl.handle.net/10397/111706	-
dc.description	24th Annual Conference of the International Speech Communication Association, INTERSPEECH 2023, Dublin, Ireland, August 20-24, 2023	en_US
dc.language.iso	en	en_US
dc.publisher	International Speech Communication Association	en_US
dc.rights	Copyright © 2023 ISCA	en_US
dc.rights	The following publication Meng, H., Mak, B., Mak, M.-W., Fung, H., Gong, X., Kwok, T., Liu, X., Mok, V., Wong, P., Woo, J., Wu, X., Wong, K.H., Xu, S., Zheng, N., Huang, R., Kang, J., Ke, X., Li, J., Li, J., Wang, Y. (2023) Integrated and Enhanced Pipeline System to Support Spoken Language Analytics for Screening Neurocognitive Disorders. Proc. Interspeech 2023, 1713-1717 is available at https://doi.org/10.21437/Interspeech.2023-2249.	en_US
dc.title	Integrated and enhanced pipeline system to support spoken language analytics for screening neurocognitive disorders	en_US
dc.type	Conference Paper	en_US
dc.identifier.spage	1713	-
dc.identifier.epage	1717	-
dc.identifier.doi	10.21437/Interspeech.2023-2249	-
dcterms.abstract	This paper presents an enhanced pipeline system for automated screening of neurocognitive disorders, e.g. Alzheimer's Disease (AD), using spoken language technologies. To ensure local relevance, the pipeline is applied to two-way interactions between clinical assessors and older adult participants in spoken Cantonese, the predominant language used in Hong Kong. The pipeline includes: (i) Speaker diarization using speaker-turn-aware scoring to capture the temporal structure of conversations. (ii) ASR using XLS-R wav2vec 2.0 models further pre-trained on Cantonese speech data and fine-tuned. (iii) Language modelling using RoBERTa with further fine-tuning. (iv) AD screening with neural network classification. A reference benchmark is obtained using the ADReSS corpus where no diarization is needed, and the partial pipeline attained a competitive detection accuracy of 87.5%.	-
dcterms.accessRights	open access	en_US
dcterms.bibliographicCitation	Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2023, p. 1713-1717	-
dcterms.issued	2023	-
dc.identifier.scopus	2-s2.0-85171578221	-
dc.relation.conference	Conference of the International Speech Communication Association [INTERSPEECH]	-
dc.description.validate	202503 bcch	-
dc.description.oa	Version of Record	en_US
dc.identifier.FolderNumber	OA_Others	en_US
dc.description.fundingSource	RGC	en_US
dc.description.pubStatus	Published	en_US
dc.description.oaCategory	VoR allowed	en_US
Appears in Collections:	Conference Paper

Files in This Item:

File	Description	Size	Format
meng23d_interspeech.pdf		174.75 kB	Adobe PDF	View/Open

Open Access Information

Status	open access
File Version	Version of Record

Access

View full-text via PolyU eLinks

Show simple item record

Page views

6

Citations as of Apr 14, 2025

Downloads

5

Citations as of Apr 14, 2025

SCOPUS^TM
Citations

8

Citations as of Nov 21, 2025

Google Scholar^TM

Check