Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/77325
Title: Investigations on Mandarin aspiratory animations using an airflow model
Authors: Chen, F
Wang, L
Chen, H
Peng, G 
Issue Date: Dec-2017
Source: IEEE/ACM transactions on audio, speech, and language processing, Dec. 2017, v. 25, no. 12, p. 2399-2409
Abstract: Various three-dimensional (3-D) talking heads have been developed lately for language learning, with both external and internal articulatory movements being visualized to guide learning. Mandarin pronunciation animation is challenging due to its confusable stops and affricates with similar places of articulation. Until now, less attention has been paid to the biosignal information of aspiratory airflow, which is essential in distinguishing Mandarin consonants. This study fills a research gap by presenting quantitative analyses of airflow, and then designing an airflow model for a 3-D pronunciation system. The airflow information was collected with the Phonatory Aerodynamic System, so that confusable consonants in Mandarin could be discerned by mean airflow rate, peak airflow rate, airflow duration, and peak time. Based on the airflow parameters, an airflow model using the physical equation of fluid flow was proposed and solved, which was then combined and synchronized with the existing 3-D articulatory model. Therefore, the new multimodal system was implemented to synchronously exhibit the airflow motions and articulatory movements of uttering Mandarin syllables. Both an audio-visual perception test and a pronunciation training study were conducted to assess the effectiveness of our system. Perceptual results indicated that identification accuracy was improved for both native and nonnative groups with the help of airflow motions, while native perceivers exhibited higher accuracy due to long-term language experience. Moreover, our system helped Japanese learners of Mandarin enhance their production skills of Mandarin aspirated consonants, reflected by higher gain values of voice onset time after training.
Publisher: Institute of Electrical and Electronics Engineers
Journal: IEEE/ACM transactions on audio, speech, and language processing 
ISSN: 2329-9290
EISSN: 2329-9304
DOI: 10.1109/TASLP.2017.2755400
Rights: © 2017 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
The following publication F. Chen, L. Wang, H. Chen and G. Peng, "Investigations on Mandarin Aspiratory Animations Using an Airflow Model," in IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 25, no. 12, pp. 2399-2409, Dec. 2017 is available at https://dx.doi.org/10.1109/TASLP.2017.2755400.
Appears in Collections: Journal/Magazine Article

Files in This Item:
File: Chen_et_al_TASLP_2017.pdf
Description: Pre-Published version
Size: 1.83 MB
Format: Adobe PDF
Open Access Information
Status: open access
File Version: Final Accepted Manuscript

Page views: 119 (last week: 1), as of Apr 21, 2024
Downloads: 36, as of Apr 21, 2024
Scopus citations: 3, as of Apr 19, 2024
Web of Science citations: 3 (last week: 0), as of Apr 18, 2024

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.