Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/107106
PIRA download icon_1.1View/Download Full Text
DC FieldValueLanguage
dc.contributorDepartment of Electrical and Electronic Engineering-
dc.creatorXu, SS-
dc.creatorMak, MW-
dc.creatorWong, KH-
dc.creatorMeng, H-
dc.creatorKwok, TCY-
dc.date.accessioned2024-06-13T01:03:57Z-
dc.date.available2024-06-13T01:03:57Z-
dc.identifier.isbn978-1-7281-6994-1 (Electronic)-
dc.identifier.isbn978-1-7281-6995-8 (Print on Demand(PoD))-
dc.identifier.urihttp://hdl.handle.net/10397/107106-
dc.description2021 12th International Symposium on Chinese Spoken Language Processing (ISCSLP), 24-27 January 2021, Hong Kongen_US
dc.language.isoenen_US
dc.publisherInstitute of Electrical and Electronics Engineersen_US
dc.rights© 2021 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.en_US
dc.rightsThe following publication S. S. Xu, M. -W. Mak, K. H. Wong, H. Meng and T. C. Y. Kwok, "Age-Invariant Speaker Embedding for Diarization of Cognitive Assessments," 2021 12th International Symposium on Chinese Spoken Language Processing (ISCSLP), Hong Kong, 2021 is available at https://doi.org/10.1109/ISCSLP49672.2021.9362084.en_US
dc.subjectAge-invariant speaker embeddingen_US
dc.subjectDeep neural networksen_US
dc.subjectMontreal cognitive assessmentsen_US
dc.subjectSpeaker diarizationen_US
dc.titleAge-invariant speaker embedding for diarization of cognitive assessmentsen_US
dc.typeConference Paperen_US
dc.identifier.doi10.1109/ISCSLP49672.2021.9362084-
dcterms.abstractThis paper investigates an age-invariant speaker embedding approach to speaker diarization, which is an essential step towards the automatic cognitive assessments from speech. Studies have shown that incorporating speaker traits (e.g., age, gender, etc.) can improve speaker diarization performance. However, we found that age information in the speaker embeddings is detrimental to speaker diarization if there is a severe mismatch between the age distributions in the training data and test data. To minimize the detrimental effect of age mismatch, an adversarial training strategy is introduced to remove age variability from the utterance-level speaker embeddings. Evaluations on an interactive dialog dataset for Montreal cognitive assessments (MoCA) show that the adversarial training strategy can produce age-invariant embeddings and reduce diarization error rate (DER) by 4.33%. The approach also outperforms the conventional method even with less training data.-
dcterms.accessRightsopen accessen_US
dcterms.bibliographicCitationIn Proceedings of 2021 12th International Symposium on Chinese Spoken Language Processing (ISCSLP), 24-27 January 2021, Hong Kong-
dcterms.issued2021-
dc.identifier.scopus2-s2.0-85102583136-
dc.relation.conferenceInternational Symposium on Chinese Spoken Language Processing [ISCSLP]-
dc.description.validate202403 bckw-
dc.description.oaAccepted Manuscripten_US
dc.identifier.FolderNumberEIE-0085en_US
dc.description.fundingSourceRGCen_US
dc.description.pubStatusPublisheden_US
dc.identifier.OPUS53153575en_US
dc.description.oaCategoryGreen (AAM)en_US
Appears in Collections:Conference Paper
Files in This Item:
File Description SizeFormat 
Xu_Age-Invariant_Speaker_Embedding.pdfPre-Published version554.11 kBAdobe PDFView/Open
Open Access Information
Status open access
File Version Final Accepted Manuscript
Access
View full-text via PolyU eLinks SFX Query
Show simple item record

Page views

109
Last Week
4
Last month
Citations as of Nov 9, 2025

Downloads

42
Citations as of Nov 9, 2025

SCOPUSTM   
Citations

6
Citations as of Dec 19, 2025

WEB OF SCIENCETM
Citations

1
Citations as of Dec 18, 2025

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.