Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/88458
PIRA download icon_1.1View/Download Full Text
Title: Source microphone recognition aided by a kernel-based projection method
Authors: Jiang, Y 
Leung, FH 
Issue Date: Nov-2019
Source: IEEE transactions on information forensics and security, Nov. 2019, v. 14, no. 11, p. 2875-2886
Abstract: Microphone recognition aims at recognizing different microphones based on the recorded speeches. In the literature, Gaussian Supervector (GSV) has been used as the feature vector representing a speech recording, which is obtained by adapting a universal background model (UBM). However, it is not clear how the performance of the GSV will be affected by the number of mixture components in the UBM. Besides, the raw GSV obtained from a speech recording contains both the microphone response information and the speech information, meaning that the raw GSV can be quite noisy as the feature vector for microphone recognition. In this paper, we investigate how GSV will be affected by the UBM and other parameters during the calculation of the GSV. In addition, in order to improve the quality of the raw GSV, we propose a kernel-based projection method to be applied to the raw GSV. This projection method maps the raw GSV onto another dimensional space. It is expected that in the projected feature space, the microphone response information and the speech information can be separated into different dimensions, meaning that the projected GSV should be better as the feature vector for microphone recognition compared to the raw GSV. Two classifiers that have been used in the literature, namely linear support vector machine (SVM) and sparse representation-based classifier (SRC), are employed to compare the performance of the raw GSV and the projected GSV. The experimental results demonstrate that the projected GSV can outperform the raw GSV no matter using linear SVM or SRC as the classifier, which shows the effectiveness of the projection method.
Keywords: Kernel-based projection
Linear support vector machine
Microphone recognition
Sparse representation based classifier
Publisher: Institute of Electrical and Electronics Engineers
Journal: IEEE transactions on information forensics and security 
ISSN: 1556-6013
EISSN: 1556-6021
DOI: 10.1109/TIFS.2019.2911175
Rights: © 2019 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
The following publication Y. Jiang and F. H. F. Leung, "Source Microphone Recognition Aided by a Kernel-Based Projection Method," in IEEE Transactions on Information Forensics and Security, vol. 14, no. 11, pp. 2875-2886, Nov. 2019 is available at https://dx.doi.org/10.1109/TIFS.2019.2911175
Appears in Collections:Journal/Magazine Article

Files in This Item:
File Description SizeFormat 
Jiang_Source_Microphone_Recognition.pdfPre-Published version1.19 MBAdobe PDFView/Open
Open Access Information
Status open access
File Version Final Accepted Manuscript
Access
View full-text via PolyU eLinks SFX Query
Show full item record

Page views

73
Last Week
0
Last month
Citations as of Sep 22, 2024

Downloads

40
Citations as of Sep 22, 2024

SCOPUSTM   
Citations

19
Citations as of Aug 15, 2024

WEB OF SCIENCETM
Citations

18
Citations as of Sep 26, 2024

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.