Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/107008
Title: Robust scream sound detection via sound event partitioning
Authors: Lei, B
Mak, MW 
Issue Date: Jun-2016
Source: Multimedia tools and applications, June 2016, v. 75, no. 11, p. 6071-6089
Abstract: This paper proposes a robust scream-sound detection scheme for acoustic surveillance applications. To enhance the discriminability between scream and non-scream sounds, a sound-event partitioning (SEP) method that facilitates the extraction of multiple acoustic vectors from a single sound event is developed. Regularized principal component analysis (PCA) and normalization are applied to the acoustic vectors, which are then classified by support vector machines (SVMs). Experimental results based on 1000 sound events show that the proposed scheme is effective even if there are severe mismatches between the training and testing conditions. The experimental results also show that the proposed scheme can reduce the equal error rate (EER) by up to 60% when compared to a classical approach that uses mel-frequency cepstral coefficients (MFCC) as features. Extensive analyses of different processing stages of the proposed sound detection scheme also suggest that sound partitioning and feature normalization play important roles in boosting the detection performance.
Keywords: Feature normalization
Regularized PCA-whitening
Scream sound detection
Sound event partitioning
Publisher: Springer New York LLC
Journal: Multimedia tools and applications 
ISSN: 1380-7501
EISSN: 1573-7721
DOI: 10.1007/s11042-015-2555-z
Rights: © Springer Science+Business Media New York 2015
This version of the article has been accepted for publication, after peer review (when applicable), and is subject to Springer Nature’s AM terms of use (https://www.springernature.com/gp/open-research/policies/accepted-manuscript-terms), but is not the Version of Record and does not reflect post-acceptance improvements, or any corrections. The Version of Record is available online at: http://dx.doi.org/10.1007/s11042-015-2555-z.
Appears in Collections: Journal/Magazine Article

Files in This Item:
File: Mak_Robust_Scream_Sound.pdf
Description: Pre-Published version
Size: 2.64 MB
Format: Adobe PDF
Open Access Information
Status: open access
File Version: Final Accepted Manuscript

Page views: 3 (as of Jun 30, 2024)
Downloads: 3 (as of Jun 30, 2024)
Scopus citations: 7 (as of Jun 21, 2024)
Web of Science citations: 4 (as of Jun 27, 2024)

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.