Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/113413
Title: Vector quantization-based counterfactual augmentation for speech-based depression detection under data scarcity
Authors: Zuo, L 
Mak, MW 
Issue Date: Oct-2025
Source: IEEE journal of biomedical and health informatics, Oct. 2025, v. 29, no. 10, p. 7559-7567
Abstract: Data scarcity is a common and serious problem in depression detection, often leading to overfitting and bias that degrade the performance of depression detectors. We propose a counterfactual augmentation (CF-aug) framework that generates latent features for speech-based depression detection under data-scarce conditions. The generation method is based on exploring how feature changes affect the outcomes. To this end, we introduce a counterfactual layer to a deep network to transform the representation of the original data to its opposite class, while a group-wise vector quantization module helps the model explore how the changes in vectors (or entries) sampled from codebooks affect the outcome. Experimental results demonstrate that CF-aug can alleviate the overfitting and bias problems caused by data scarcity. Our CF-aug framework achieves competitive performance compared to state-of-the-art methods on two depression datasets. We also demonstrate the potential of CF-aug in other domains and modalities for medical diagnosis under data-scarce settings.
Keywords: Counterfactuals
Data augmentation
Data scarcity
Speech-based depression detection
Vector quantization
Publisher: Institute of Electrical and Electronics Engineers
Journal: IEEE journal of biomedical and health informatics 
ISSN: 2168-2194
EISSN: 2168-2208
DOI: 10.1109/JBHI.2025.3566767
Rights: © 2025 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
The following publication: L. Zuo and M.-W. Mak, "Vector Quantization-Based Counterfactual Augmentation for Speech-Based Depression Detection Under Data Scarcity," in IEEE Journal of Biomedical and Health Informatics, vol. 29, no. 10, pp. 7559-7567, Oct. 2025, is available at https://doi.org/10.1109/JBHI.2025.3566767.
Appears in Collections: Journal/Magazine Article

Files in This Item:
File: Zuo_Vector_Quantization_Based.pdf
Description: Pre-Published version
Size: 1.03 MB
Format: Adobe PDF
Open Access Information
Status: open access
File Version: Final Accepted Manuscript

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.