Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/111949
PIRA download icon_1.1View/Download Full Text
Title: A study of discriminatory speech classification based on improved SMOTE and SVM-RF
Authors: Wu, C
Hu, H
Zhu, D
Shan, X
Yung, KL 
Ip, AWH
Issue Date: Aug-2024
Source: Applied sciences, Aug. 2024, v. 14, no. 15, 6468
Abstract: The rapid development of the Internet has facilitated expression, sharing, and interaction on social networks, but some speech may contain harmful discrimination. Therefore, it is crucial to classify such speech. In this paper, we collected discriminatory data from Sina Weibo and propose the improved Synthetic Minority Over-sampling Technique (SMOTE) algorithm based on Latent Dirichlet Allocation (LDA) to improve data quality and balance. And we propose a new integration method integrating Support Vector Machine (SVM) and Random Forest (RF). The experimental results demonstrate that the integrated model exhibits enhanced precision, recall, and F1 score by 6.0%, 5.4%, and 5.7%, respectively, in comparison with SVM alone. Moreover, it exhibits the best performance in comparison with other machine learning methods. Furthermore, the positive impact of improved SMOTE and this integrated method on model classification is also confirmed in ablation experiments.
Keywords: Discrimination speech
Integration method
Latent Dirichlet allocation
Random forest
Support vector machine
Publisher: MDPI AG
Journal: Applied sciences 
DOI: 10.3390/app14156468
Rights: © 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
The following publication Wu, C., Hu, H., Zhu, D., Shan, X., Yung, K.-L., & Ip, A. W. H. (2024). A Study of Discriminatory Speech Classification Based on Improved Smote and SVM-RF. Applied Sciences, 14(15), 6468 is available at https://doi.org/10.3390/app14156468.
Appears in Collections:Journal/Magazine Article

Files in This Item:
File Description SizeFormat 
applsci-14-06468-v2.pdf1.1 MBAdobe PDFView/Open
Open Access Information
Status open access
File Version Version of Record
Access
View full-text via PolyU eLinks SFX Query
Show full item record

Page views

4
Citations as of Apr 14, 2025

Downloads

2
Citations as of Apr 14, 2025

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.