Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/75546
Title: Text-independent voice conversion using deep neural network based phonetic level features
Authors: Zheng, HD 
Cai, WC
Zhou, TY
Zhang, SL
Li, M
Keywords: Gaussian mixture model
Phoneme posterior probability
Voice conversion
Deep neural network
Issue Date: 2016
Publisher: IEEE Computer Society
Source: 23rd International Conference on Pattern Recognition (ICPR), Mexican Assoc Comp Vis Robot & Neural Comp, Mexico, Dec 4-8, 2016, p. 2872-2877 How to cite?
Abstract: This paper presents a phonetically-aware joint density Gaussian mixture model (JD-GMM) framework for voice conversion that no longer requires parallel data from source speaker at the training stage. Considering that the phonetic level features contain text information which should be preserved in the conversion task, we propose a method that only concatenates phonetic discriminant features and spectral features extracted from the same target speakers speech to train a JD-GMM. After the mapping relationship of these two features is trained, we can use phonetic discriminant features from source speaker to estimate target speaker's spectral features at conversion stage. The phonetic discriminant features are extracted using PCA from the output layer of a deep neural network (DNN) in an automatic speaker recognition (ASR) system. It can be seen as a low dimensional representation of the senone posteriors. We compare the proposed phonetically-aware method with conventional JD-GMM method on the Voice Conversion Challenge 2016 training database. The experimental results show that our proposed phonetically-aware feature method can obtain similar performance compared to the conventional JD-GMM in the case of using only target speech as training data.
URI: http://hdl.handle.net/10397/75546
ISBN: 978-1-5090-4847-2
ISSN: 1051-4651
Appears in Collections:Conference Paper

Access
View full-text via PolyU eLinks SFX Query
Show full item record

SCOPUSTM   
Citations

1
Citations as of May 11, 2018

WEB OF SCIENCETM
Citations

1
Last Week
0
Last month
Citations as of May 28, 2018

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.