Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/111949
PIRA download icon_1.1View/Download Full Text
DC FieldValueLanguage
dc.contributorDepartment of Industrial and Systems Engineering-
dc.creatorWu, C-
dc.creatorHu, H-
dc.creatorZhu, D-
dc.creatorShan, X-
dc.creatorYung, KL-
dc.creatorIp, AWH-
dc.date.accessioned2025-03-19T07:35:19Z-
dc.date.available2025-03-19T07:35:19Z-
dc.identifier.urihttp://hdl.handle.net/10397/111949-
dc.language.isoenen_US
dc.publisherMDPI AGen_US
dc.rights© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).en_US
dc.rightsThe following publication Wu, C., Hu, H., Zhu, D., Shan, X., Yung, K.-L., & Ip, A. W. H. (2024). A Study of Discriminatory Speech Classification Based on Improved Smote and SVM-RF. Applied Sciences, 14(15), 6468 is available at https://doi.org/10.3390/app14156468.en_US
dc.subjectDiscrimination speechen_US
dc.subjectIntegration methoden_US
dc.subjectLatent Dirichlet allocationen_US
dc.subjectRandom foresten_US
dc.subjectSupport vector machineen_US
dc.titleA study of discriminatory speech classification based on improved SMOTE and SVM-RFen_US
dc.typeJournal/Magazine Articleen_US
dc.identifier.volume14-
dc.identifier.issue15-
dc.identifier.doi10.3390/app14156468-
dcterms.abstractThe rapid development of the Internet has facilitated expression, sharing, and interaction on social networks, but some speech may contain harmful discrimination. Therefore, it is crucial to classify such speech. In this paper, we collected discriminatory data from Sina Weibo and propose the improved Synthetic Minority Over-sampling Technique (SMOTE) algorithm based on Latent Dirichlet Allocation (LDA) to improve data quality and balance. And we propose a new integration method integrating Support Vector Machine (SVM) and Random Forest (RF). The experimental results demonstrate that the integrated model exhibits enhanced precision, recall, and F1 score by 6.0%, 5.4%, and 5.7%, respectively, in comparison with SVM alone. Moreover, it exhibits the best performance in comparison with other machine learning methods. Furthermore, the positive impact of improved SMOTE and this integrated method on model classification is also confirmed in ablation experiments.-
dcterms.accessRightsopen accessen_US
dcterms.bibliographicCitationApplied sciences, Aug. 2024, v. 14, no. 15, 6468-
dcterms.isPartOfApplied sciences-
dcterms.issued2024-08-
dc.identifier.scopus2-s2.0-85200861881-
dc.identifier.artn6468-
dc.description.validate202503 bcch-
dc.description.oaVersion of Recorden_US
dc.identifier.FolderNumberOA_Scopus/WOSen_US
dc.description.fundingSourceSelf-fundeden_US
dc.description.pubStatusPublisheden_US
dc.description.oaCategoryCCen_US
Appears in Collections:Journal/Magazine Article
Files in This Item:
File Description SizeFormat 
applsci-14-06468-v2.pdf1.1 MBAdobe PDFView/Open
Open Access Information
Status open access
File Version Version of Record
Access
View full-text via PolyU eLinks SFX Query
Show simple item record

Page views

4
Citations as of Apr 14, 2025

Downloads

2
Citations as of Apr 14, 2025

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.