Please use this identifier to cite or link to this item:
http://hdl.handle.net/10397/111949
DC Field | Value | Language |
---|---|---|
dc.contributor | Department of Industrial and Systems Engineering | - |
dc.creator | Wu, C | - |
dc.creator | Hu, H | - |
dc.creator | Zhu, D | - |
dc.creator | Shan, X | - |
dc.creator | Yung, KL | - |
dc.creator | Ip, AWH | - |
dc.date.accessioned | 2025-03-19T07:35:19Z | - |
dc.date.available | 2025-03-19T07:35:19Z | - |
dc.identifier.uri | http://hdl.handle.net/10397/111949 | - |
dc.language.iso | en | en_US |
dc.publisher | MDPI AG | en_US |
dc.rights | © 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). | en_US |
dc.rights | The following publication Wu, C., Hu, H., Zhu, D., Shan, X., Yung, K.-L., & Ip, A. W. H. (2024). A Study of Discriminatory Speech Classification Based on Improved Smote and SVM-RF. Applied Sciences, 14(15), 6468 is available at https://doi.org/10.3390/app14156468. | en_US |
dc.subject | Discrimination speech | en_US |
dc.subject | Integration method | en_US |
dc.subject | Latent Dirichlet allocation | en_US |
dc.subject | Random forest | en_US |
dc.subject | Support vector machine | en_US |
dc.title | A study of discriminatory speech classification based on improved SMOTE and SVM-RF | en_US |
dc.type | Journal/Magazine Article | en_US |
dc.identifier.volume | 14 | - |
dc.identifier.issue | 15 | - |
dc.identifier.doi | 10.3390/app14156468 | - |
dcterms.abstract | The rapid development of the Internet has facilitated expression, sharing, and interaction on social networks, but some speech may contain harmful discrimination. Therefore, it is crucial to classify such speech. In this paper, we collected discriminatory data from Sina Weibo and propose the improved Synthetic Minority Over-sampling Technique (SMOTE) algorithm based on Latent Dirichlet Allocation (LDA) to improve data quality and balance. And we propose a new integration method integrating Support Vector Machine (SVM) and Random Forest (RF). The experimental results demonstrate that the integrated model exhibits enhanced precision, recall, and F1 score by 6.0%, 5.4%, and 5.7%, respectively, in comparison with SVM alone. Moreover, it exhibits the best performance in comparison with other machine learning methods. Furthermore, the positive impact of improved SMOTE and this integrated method on model classification is also confirmed in ablation experiments. | - |
dcterms.accessRights | open access | en_US |
dcterms.bibliographicCitation | Applied sciences, Aug. 2024, v. 14, no. 15, 6468 | - |
dcterms.isPartOf | Applied sciences | - |
dcterms.issued | 2024-08 | - |
dc.identifier.scopus | 2-s2.0-85200861881 | - |
dc.identifier.artn | 6468 | - |
dc.description.validate | 202503 bcch | - |
dc.description.oa | Version of Record | en_US |
dc.identifier.FolderNumber | OA_Scopus/WOS | en_US |
dc.description.fundingSource | Self-funded | en_US |
dc.description.pubStatus | Published | en_US |
dc.description.oaCategory | CC | en_US |
Appears in Collections: | Journal/Magazine Article |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
applsci-14-06468-v2.pdf | 1.1 MB | Adobe PDF | View/Open |
Page views
4
Citations as of Apr 14, 2025
Downloads
2
Citations as of Apr 14, 2025

Google ScholarTM
Check
Altmetric
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.