Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/82231
Title: Ensemble with estimation : seeking for optimization in class noisy data
Authors: Xu, RF
Wen, ZY
Gui, L
Lu, Q 
Li, BY
Wang, XZ
Issue Date: 2020
Source: International journal of machine learning and cybernetics, 12 June 2020, v. 11, no. 2, p. 231-248
Abstract: Class noise, as know as the mislabeled data in training set, can lead to poor accuracy in classification no matter what machine learning methods are used. A reasonable estimation of class noise has a significant impact on the performance of learning methods. However, the error in existing estimation is inevitable theoretically and infer the performance of optimal classifier trained on noisy data. Instead of seeking a single optimal classifier on noisy data, in this work, we use a set of weak classifiers, which are caused by negative impacts of noisy data, to learn an ensemble strong classifier which is based on the training error and estimation of class noise. By this strategy, the proposed ensemble with estimation method overcomes the gap between the estimation and true distribution of class noise. Our proposed method does not require any a priori knowledge about class noises. We prove that the optimal ensemble classifier on the noisy distribution can approximate the optimal classifier on the clean distribution when the training set grows. Comparisons with existing algorithms show that our methods outperform state-of-the-art approaches on a large number of benchmark datasets in different domains. Both the theoretical analysis and the experimental result reveal that our method can improve the performance, works well on clean data and is robust on the algorithm parameter.
Keywords: Class noise
Ensemble learning
Machine learning
Publisher: Springer
Journal: International journal of machine learning and cybernetics 
ISSN: 1868-8071
EISSN: 1868-808X
DOI: 10.1007/s13042-019-00969-8
Rights: © The Author(s) 2019
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
The following publication Xu, R., Wen, Z., Gui, L. et al. Ensemble with estimation: seeking for optimization in class noisy data. Int. J. Mach. Learn. & Cyber. 11, 231–248 (2020) is available at https://dx.doi.org/10.1007/s13042-019-00969-8
Appears in Collections:Journal/Magazine Article

Access
View full-text via PolyU eLinks SFX Query
Show full item record

Page view(s)

3
Citations as of Jul 12, 2020

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.