Comparison of supervector and majority voting in acoustic scene identification

Jiang, Y; Leung, FH

doi:10.1109/ICDSP.2018.8631624

Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/88459

DC Field	Value	Language
dc.contributor	Department of Electronic and Information Engineering	en_US
dc.creator	Jiang, Y	en_US
dc.creator	Leung, FH	en_US
dc.date.accessioned	2020-11-26T03:10:24Z	-
dc.date.available	2020-11-26T03:10:24Z	-
dc.identifier.isbn	978-1-5386-6811-5 (Electronic)	en_US
dc.identifier.isbn	978-1-5386-6810-8 (USB)	en_US
dc.identifier.isbn	978-1-5386-6812-2 (Print on Demand(PoD))	en_US
dc.identifier.uri	http://hdl.handle.net/10397/88459	-
dc.language.iso	en	en_US
dc.publisher	Institute of Electrical and Electronics Engineers	en_US
dc.rights	© 2018 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.	en_US
dc.rights	The following publication Y. Jiang and F. H. F. Leung, "Comparison of Supervector and Majority Voting in Acoustic Scene Identification," 2018 IEEE 23rd International Conference on Digital Signal Processing (DSP), Shanghai, China, 2018, pp. 1-5 is available at https://dx.doi.org/10.1109/ICDSP.2018.8631624	en_US
dc.subject	Acoustic scene identification	en_US
dc.subject	Majority voting	en_US
dc.subject	Gaussian supervector	en_US
dc.subject	Factor analysis supervector	en_US
dc.subject	I-vector	en_US
dc.title	Comparison of supervector and majority voting in acoustic scene identification	en_US
dc.type	Conference Paper	en_US
dc.identifier.spage	1	en_US
dc.identifier.epage	5	en_US
dc.identifier.doi	10.1109/ICDSP.2018.8631624	en_US
dcterms.abstract	Acoustic scene identification aims to identify the acoustic environment from the acoustic signal. Usually one first divides a piece of acoustic signal into multiple short-time frames and then calculates frame-level features. A natural question is then how to make use of these frame-level features for identification purposes. In this paper, we compare two feature aggregation methods. One method is Majority Voting (MV), which treats each frame-level feature as an independent feature vector and then perform identification using majority voting strategies. In this way, an acoustic signal is represented by multiple feature vectors. The other method is Supervector, which maps the frame-level features to a single feature vector. In this way, an acoustic signal is represented by one feature vector. Particularly, we consider three types of Supervector, which are Gaussian Supervector, Factor Analysis Supervector, and i-vector. We then compare Supervector with MV in an acoustic identification task. Different classifiers are employed, including Gaussian Mixture Model (GMM), Support Vector Machine (SVM), Multilayer Perceptron (MLP), and Deep Neural Network (DNN). Experimental results indicate that these two feature aggregation methods give very similar performances, nonetheless, each has its own advantages and disadvantages.	en_US
dcterms.accessRights	open access	en_US
dcterms.bibliographicCitation	2018 IEEE 23rd International Conference on Digital Signal Processing (DSP), Shanghai, China, China, 19-21 Nov. 2018, p. 1-5	en_US
dcterms.issued	2018-11	-
dc.relation.conference	IEEE International Conference on Digital Signal Processing [DSP])	en_US
dc.description.validate	202011 bcrc	en_US
dc.description.oa	Accepted Manuscript	en_US
dc.identifier.FolderNumber	a0512-n02	en_US
dc.description.pubStatus	Published	en_US
dc.description.oaCategory	Green (AAM)	en_US
Appears in Collections:	Conference Paper

Files in This Item:

File	Description	Size	Format
Jiang_Comparison_Supervector_Majority.pdf	Pre-Published version	998.2 kB	Adobe PDF	View/Open

Open Access Information

Status	open access
File Version	Final Accepted Manuscript

Access

View full-text via PolyU eLinks

Show simple item record

Page views

124

Last Week
2

Last month

Citations as of Apr 14, 2025

Downloads

35

Citations as of Apr 14, 2025

SCOPUS^TM
Citations

2

Citations as of Jul 4, 2024

Google Scholar^TM

Check