Please use this identifier to cite or link to this item:
http://hdl.handle.net/10397/111706
DC Field | Value | Language |
---|---|---|
dc.contributor | Department of Electrical and Electronic Engineering | - |
dc.creator | Meng, H | - |
dc.creator | Mak, B | - |
dc.creator | Mak, MW | - |
dc.creator | Fung, H | - |
dc.creator | Gong, X | - |
dc.creator | Kwok, T | - |
dc.creator | Liu, X | - |
dc.creator | Mok, V | - |
dc.creator | Wong, P | - |
dc.creator | Woo, J | - |
dc.creator | Wu, X | - |
dc.creator | Wong, KH | - |
dc.creator | Xu, SS | - |
dc.creator | Zheng, N | - |
dc.creator | Huang, R | - |
dc.creator | Kang, J | - |
dc.creator | Ke, X | - |
dc.creator | Li, J | - |
dc.creator | Li, J | - |
dc.creator | Wang, Y | - |
dc.date.accessioned | 2025-03-13T02:22:09Z | - |
dc.date.available | 2025-03-13T02:22:09Z | - |
dc.identifier.uri | http://hdl.handle.net/10397/111706 | - |
dc.description | 24th Annual Conference of the International Speech Communication Association, INTERSPEECH 2023, Dublin, Ireland, August 20-24, 2023 | en_US |
dc.language.iso | en | en_US |
dc.publisher | International Speech Communication Association | en_US |
dc.rights | Copyright © 2023 ISCA | en_US |
dc.rights | The following publication Meng, H., Mak, B., Mak, M.-W., Fung, H., Gong, X., Kwok, T., Liu, X., Mok, V., Wong, P., Woo, J., Wu, X., Wong, K.H., Xu, S., Zheng, N., Huang, R., Kang, J., Ke, X., Li, J., Li, J., Wang, Y. (2023) Integrated and Enhanced Pipeline System to Support Spoken Language Analytics for Screening Neurocognitive Disorders. Proc. Interspeech 2023, 1713-1717 is available at https://doi.org/10.21437/Interspeech.2023-2249. | en_US |
dc.title | Integrated and enhanced pipeline system to support spoken language analytics for screening neurocognitive disorders | en_US |
dc.type | Conference Paper | en_US |
dc.identifier.spage | 1713 | - |
dc.identifier.epage | 1717 | - |
dc.identifier.doi | 10.21437/Interspeech.2023-2249 | - |
dcterms.abstract | This paper presents an enhanced pipeline system for automated screening of neurocognitive disorders, e.g. Alzheimer's Disease (AD), using spoken language technologies. To ensure local relevance, the pipeline is applied to two-way interactions between clinical assessors and older adult participants in spoken Cantonese, the predominant language used in Hong Kong. The pipeline includes: (i) Speaker diarization using speaker-turn-aware scoring to capture the temporal structure of conversations. (ii) ASR using XLS-R wav2vec 2.0 models further pre-trained on Cantonese speech data and fine-tuned. (iii) Language modelling using RoBERTa with further fine-tuning. (iv) AD screening with neural network classification. A reference benchmark is obtained using the ADReSS corpus where no diarization is needed, and the partial pipeline attained a competitive detection accuracy of 87.5%. | - |
dcterms.accessRights | open access | en_US |
dcterms.bibliographicCitation | Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2023, p. 1713-1717 | - |
dcterms.issued | 2023 | - |
dc.identifier.scopus | 2-s2.0-85171578221 | - |
dc.relation.conference | Conference of the International Speech Communication Association [INTERSPEECH] | - |
dc.description.validate | 202503 bcch | - |
dc.description.oa | Version of Record | en_US |
dc.identifier.FolderNumber | OA_Others | en_US |
dc.description.fundingSource | RGC | en_US |
dc.description.pubStatus | Published | en_US |
dc.description.oaCategory | VoR allowed | en_US |
Appears in Collections: | Conference Paper |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
meng23d_interspeech.pdf | | 174.75 kB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.