Please use this identifier to cite or link to this item:
Title: Wavelet based speech presence probability estimator for speech enhancement
Authors: Lun, DPK 
Shen, TW
Hsung, TC
Keywords: Multitaper spectrum estimation
Speech enhancement
Speech presence probability
Wavelet denoising
Issue Date: 2012
Publisher: Academic Press Inc Elsevier Science
Source: Digital signal processing : a review journal, 2012, v. 22, no. 6, p. 1161-1173 How to cite?
Journal: Digital Signal Processing: A Review Journal 
Abstract: A reliable speech presence probability (SPP) estimator is important to many frequency domain speech enhancement algorithms. It is known that a good estimate of SPP can be obtained by having a smooth a-posteriori signal to noise ratio (SNR) function, which can be achieved by reducing the noise variance when estimating the speech power spectrum. Recently, the wavelet denoising with multitaper spectrum (MTS) estimation technique was suggested for such purpose. However, traditional approaches directly make use of the wavelet shrinkage denoiser which has not been fully optimized for denoising the MTS of noisy speech signals. In this paper, we firstly propose a two-stage wavelet denoising algorithm for estimating the speech power spectrum. First, we apply the wavelet transform to the periodogram of a noisy speech signal. Using the resulting wavelet coefficients, an oracle is developed to indicate the approximate locations of the noise floor in the periodogram. Second, we make use of the oracle developed in stage 1 to selectively remove the wavelet coefficients of the noise floor in the log MTS of the noisy speech. The wavelet coefficients that remained are then used to reconstruct a denoised MTS and in turn generate a smooth a-posteriori SNR function. To adapt to the enhanced a-posteriori SNR function, we further propose a new method to estimate the generalized likelihood ratio (GLR), which is an essential parameter for SPP estimation. Simulation results show that the new SPP estimator outperforms the traditional approaches and enables an improvement in both the quality and intelligibility of the enhanced speeches.
DOI: 10.1016/j.dsp.2012.06.011
Appears in Collections:Journal/Magazine Article

View full-text via PolyU eLinks SFX Query
Show full item record


Citations as of Feb 23, 2017


Last Week
Last month
Citations as of Feb 24, 2017

Page view(s)

Last Week
Last month
Checked on Feb 26, 2017

Google ScholarTM



Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.