Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/113413
Title: Vector quantization-based counterfactual augmentation for speech-based depression detection under data scarcity
Authors: Zuo, L 
Mak, MW 
Issue Date: 2025
Source: IEEE journal of biomedical and health informatics, Date of Publication: 02 May 2025, Early Access, https://doi.org/10.1109/JBHI.2025.3566767
Abstract: Data scarcity is a common and serious problem in depression detection, often leading to overfitting and bias that degrade the performance of depression detectors. We propose a counterfactual augmentation (CF aug) framework that generates latent features for speechbased depression detection under data-scarce conditions. The generation method is based on exploring how feature changes affect the outcomes. To this end, we introduce a counterfactual layer to a deep network to transform the representation of the original data to its opposite class, while a group-wise vector quantization module helps the model explore how the changes in vectors (or entries) sampled from codebooks affect the outcome. Experimental results demonstrate that CF-aug can alleviate the overfitting and bias problems caused by data scarcity. Our CF-aug framework achieves competitive performance compared to state-of-the-art methods on two depression datasets. We also demonstrate the potential of CF-aug in other domains and modalities for medical diagnosis under data-scarce settings.
Keywords: Counterfactuals
Data augmentation
Data scarcity
Speech-based depression detection
Vector quantization
Publisher: Institute of Electrical and Electronics Engineers
Journal: IEEE journal of biomedical and health informatics 
ISSN: 2168-2194
EISSN: 2168-2208
DOI: 10.1109/JBHI.2025.3566767
Appears in Collections:Journal/Magazine Article

Open Access Information
Status embaroged access
Embargo End Date 0000-00-00 (to be updated)
Access
View full-text via PolyU eLinks SFX Query
Show full item record

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.