Please use this identifier to cite or link to this item:
http://hdl.handle.net/10397/88458
Title: | Source microphone recognition aided by a kernel-based projection method |
Authors: | Jiang, Y; Leung, FH |
Issue Date: | Nov-2019 |
Source: | IEEE transactions on information forensics and security, Nov. 2019, v. 14, no. 11, p. 2875-2886 |
Abstract: | Microphone recognition aims at recognizing different microphones based on the recorded speeches. In the literature, Gaussian Supervector (GSV) has been used as the feature vector representing a speech recording, which is obtained by adapting a universal background model (UBM). However, it is not clear how the performance of the GSV will be affected by the number of mixture components in the UBM. Besides, the raw GSV obtained from a speech recording contains both the microphone response information and the speech information, meaning that the raw GSV can be quite noisy as the feature vector for microphone recognition. In this paper, we investigate how GSV will be affected by the UBM and other parameters during the calculation of the GSV. In addition, in order to improve the quality of the raw GSV, we propose a kernel-based projection method to be applied to the raw GSV. This projection method maps the raw GSV onto another dimensional space. It is expected that in the projected feature space, the microphone response information and the speech information can be separated into different dimensions, meaning that the projected GSV should be better as the feature vector for microphone recognition compared to the raw GSV. Two classifiers that have been used in the literature, namely linear support vector machine (SVM) and sparse representation-based classifier (SRC), are employed to compare the performance of the raw GSV and the projected GSV. The experimental results demonstrate that the projected GSV can outperform the raw GSV no matter using linear SVM or SRC as the classifier, which shows the effectiveness of the projection method. |
Keywords: | Kernel-based projection; Linear support vector machine; Microphone recognition; Sparse representation based classifier |
Publisher: | Institute of Electrical and Electronics Engineers |
Journal: | IEEE transactions on information forensics and security |
ISSN: | 1556-6013 |
EISSN: | 1556-6021 |
DOI: | 10.1109/TIFS.2019.2911175 |
Rights: | © 2019 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. The following publication Y. Jiang and F. H. F. Leung, "Source Microphone Recognition Aided by a Kernel-Based Projection Method," in IEEE Transactions on Information Forensics and Security, vol. 14, no. 11, pp. 2875-2886, Nov. 2019 is available at https://dx.doi.org/10.1109/TIFS.2019.2911175 |
Appears in Collections: | Journal/Magazine Article |
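The abstract describes the raw GSV as a feature vector obtained by adapting a UBM to a speech recording. As a rough illustration only, the sketch below uses the conventional mean-only MAP adaptation of a diagonal-covariance GMM and stacks the adapted means into one vector. This is an assumption about the general technique, not the paper's exact recipe: the function names, the `relevance` parameter and its default of 16, and the absence of any supervector normalization are all hypothetical choices, and the kernel-based projection itself is not shown because the abstract does not specify it.

```python
import math


def log_gauss_diag(x, mean, var):
    """Log density of a diagonal-covariance Gaussian evaluated at frame x."""
    return -0.5 * sum(
        math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v
        for xi, m, v in zip(x, mean, var)
    )


def posteriors(x, weights, means, variances):
    """Posterior probability of each UBM component given one frame."""
    logs = [
        math.log(w) + log_gauss_diag(x, m, v)
        for w, m, v in zip(weights, means, variances)
    ]
    top = max(logs)  # subtract the maximum for numerical stability
    exps = [math.exp(value - top) for value in logs]
    total = sum(exps)
    return [e / total for e in exps]


def gaussian_supervector(frames, weights, means, variances, relevance=16.0):
    """Stack MAP-adapted component means into a single supervector.

    Assumed recipe (hypothetical, not taken from the paper): only the means
    are adapted, with a fixed relevance factor controlling how far each
    mean moves from the UBM toward the recording's data.
    """
    n_comp, dim = len(weights), len(means[0])
    zeroth = [0.0] * n_comp                       # soft frame counts
    first = [[0.0] * dim for _ in range(n_comp)]  # posterior-weighted sums

    for x in frames:
        for c, p in enumerate(posteriors(x, weights, means, variances)):
            zeroth[c] += p
            for d in range(dim):
                first[c][d] += p * x[d]

    supervector = []
    for c in range(n_comp):
        denom = zeroth[c] + relevance
        alpha = zeroth[c] / denom if denom > 0 else 0.0
        for d in range(dim):
            data_mean = first[c][d] / zeroth[c] if zeroth[c] > 0 else means[c][d]
            supervector.append(alpha * data_mean + (1.0 - alpha) * means[c][d])
    return supervector
```

The resulting vector has one entry per UBM component per feature dimension, so its length grows with the number of mixture components, which is the parameter the abstract says the paper investigates.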
Files in This Item:
File | Description | Size | Format |
---|---|---|---|
Jiang_Source_Microphone_Recognition.pdf | Pre-Published version | 1.19 MB | Adobe PDF |
Metric | Count | As of |
---|---|---|
Page views | 74 | Oct 13, 2024 |
Page views (last week) | 0 | Oct 13, 2024 |
Page views (last month) | 0 | Oct 13, 2024 |
Downloads | 42 | Oct 13, 2024 |
Scopus citations | 19 | Aug 15, 2024 |
Web of Science citations | 18 | Oct 10, 2024 |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.