Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/27033
Title: Speaker verification from coded telephone speech using stochastic feature transformation and handset identification
Authors: Mak, MW 
Kung, SY
Keywords: keywords: {Noise
Telephone sets
USA Councils
Issue Date: 2002
Publisher: IEEE
Source: 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 13-17 May 2002, Orlando, FL, USA, p. I701-I704 How to cite?
Journal: 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 13-17 May 2002, Orlando, FL, USA 
Abstract: The performance of telephone-based speaker verification systems can be severely degraded by the acoustic mismatch caused by telephone handsets. This paper proposes to combine a handset selector with stochastic feature transformation to reduce the mismatch. Specifically, a GMM-based handset selector is trained to identify the most likely handset used by the claimants, and then handset-specific stochastic feature transformations are applied to the distorted feature vectors. To overcome the non-linear distortion introduced by telephone handsets, a 2nd-order stochastic feature transformation is proposed. Estimation algorithms based on the stochastic matching technique and the EM algorithm are derived. Experimental results based on 150 speakers of the HTIMIT corpus show that the handset selector is able to identify the handsets accurately (98.3%), and that both linear and non-linear transformation reduce the error rate significantly (from 12.37% to 5.49%).
URI: http://hdl.handle.net/10397/27033
ISBN: 0-7803-7402-9
ISSN: 1520-6149
DOI: 10.1109/ICASSP.2002.5743814
Appears in Collections:Conference Paper

Access
View full-text via PolyU eLinks SFX Query
Show full item record

Page view(s)

14
Last Week
0
Last month
Checked on Mar 19, 2017

Google ScholarTM

Check

Altmetric



Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.