Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/88202
DC Field | Value | Language
dc.contributor | Department of Applied Mathematics | en_US
dc.creator | Guo, X | en_US
dc.creator | Hu, T | en_US
dc.creator | Wu, Q | en_US
dc.date.accessioned | 2020-09-23T08:14:08Z | -
dc.date.available | 2020-09-23T08:14:08Z | -
dc.identifier.issn | 1532-4435 | en_US
dc.identifier.uri | http://hdl.handle.net/10397/88202 | -
dc.language.iso | en | en_US
dc.publisher | MIT Press | en_US
dc.rights | ©2020 Xin Guo, Ting Hu and Qiang Wu. | en_US
dc.rights | License: CC-BY 4.0, see https://creativecommons.org/licenses/by/4.0/. Attribution requirements are provided at http://jmlr.org/papers/v21/18-696.html | en_US
dc.rights | The following publication Guo, X., Hu, T., & Wu, Q. (2020). Distributed minimum error entropy algorithms. Journal of Machine Learning Research, 21(126), 1-31 is available at https://jmlr.org/papers/v21/18-696.html | en_US
dc.title | Distributed minimum error entropy algorithms | en_US
dc.type | Journal/Magazine Article | en_US
dc.identifier.spage | 1 | en_US
dc.identifier.epage | 31 | en_US
dc.identifier.volume | 21 | en_US
dc.identifier.issue | 126 | en_US
dcterms.abstract | The Minimum Error Entropy (MEE) principle is an important approach in Information Theoretic Learning (ITL). It is widely applied and studied in various fields for its robustness to noise. In this paper, we study a reproducing kernel-based distributed MEE algorithm, DMEE, which is designed to work with both fully supervised and semi-supervised data. The divide-and-conquer approach is employed, so there is no inter-node communication overhead. Like other distributed algorithms, DMEE significantly reduces the computational complexity and memory requirements on individual computing nodes. With fully supervised data, our proven learning rates equal the minimax optimal learning rates of classical pointwise kernel-based regression. Under semi-supervised learning scenarios, we show that DMEE exploits unlabeled data effectively, in two senses. First, under settings with weak regularity assumptions, additional unlabeled data significantly improves the learning rates of DMEE. Second, with sufficient unlabeled data, labeled data can be distributed to many more computing nodes, such that each node takes only O(1) labels, without spoiling the learning rates in terms of the number of labels. This conclusion overcomes the saturation phenomenon in unlabeled data size. It parallels a recent result for regularized least squares (Lin and Zhou, 2018) and suggests that an inflation of unlabeled data is a solution to MEE learning problems with decentralized data sources where privacy protection is a concern. Our work involves pairwise learning and a non-convex loss. The theoretical analysis is achieved by distributed U-statistics and error decomposition techniques in integral operators. | en_US
dcterms.accessRights | open access | en_US
dcterms.bibliographicCitation | Journal of machine learning research, 2020, v. 21, no. 126, p. 1-31 | en_US
dcterms.isPartOf | Journal of machine learning research | en_US
dcterms.issued | 2020 | -
dc.identifier.eissn | 1533-7928 | en_US
dc.description.validate | 202009 bcrc | en_US
dc.description.oa | Version of Record | en_US
dc.identifier.FolderNumber | a0481-n01 | en_US
dc.description.pubStatus | Published | en_US
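The abstract above describes DMEE's divide-and-conquer scheme: each node fits a kernel-based MEE estimator on its local subsample with no inter-node communication, and the global estimator averages the local ones. The following is a minimal sketch of that idea, not the paper's implementation: it assumes a Gaussian kernel, maximizes a regularized empirical information potential of the residuals by plain gradient ascent, and re-centers with the mean residual (entropy is shift-invariant, so the intercept must be pinned down separately). All names (fit_local_mee, dmee_predict, sigma, lam) and the toy data are illustrative assumptions.

import numpy as np

def gram(X, Z, bandwidth):
    """Gaussian kernel matrix between the rows of X and Z."""
    d2 = ((X[:, None, :] - Z[None, :, :]) ** 2).sum(axis=-1)
    return np.exp(-d2 / (2.0 * bandwidth ** 2))

def fit_local_mee(X, y, bandwidth=0.5, sigma=1.0, lam=1e-3, lr=0.1, n_iter=300):
    """Fit f = sum_k alpha_k K(x_k, .) on one node by gradient ascent on the
    regularized empirical information potential of the errors e_i = y_i - f(x_i):
        V(alpha) = (1/n^2) sum_{i,j} exp(-(e_i - e_j)^2 / (2 sigma^2))
                   - lam * alpha^T K alpha
    Note the pairwise structure: the objective couples every pair of errors.
    """
    n = len(y)
    K = gram(X, X, bandwidth)
    alpha = np.zeros(n)
    for _ in range(n_iter):
        e = y - K @ alpha                         # residuals on this node
        diff = e[:, None] - e[None, :]            # pairwise error differences
        G = np.exp(-diff ** 2 / (2.0 * sigma ** 2))
        # dV/de_i = -(2 / (n^2 sigma^2)) * sum_j G_ij (e_i - e_j)
        grad_e = -(2.0 / (n ** 2 * sigma ** 2)) * (G * diff).sum(axis=1)
        # chain rule through e = y - K alpha, plus the regularizer's gradient
        grad_alpha = -K @ grad_e - 2.0 * lam * (K @ alpha)
        alpha += lr * grad_alpha
    # entropy ignores shifts, so recover the intercept from the mean residual
    b = float(np.mean(y - K @ alpha))
    return X, alpha, b

def dmee_predict(models, Xtest, bandwidth=0.5):
    """Divide-and-conquer synthesis step: average the local estimators."""
    preds = [gram(Xtest, Xk, bandwidth) @ ak + bk for Xk, ak, bk in models]
    return np.mean(preds, axis=0)

# Toy usage: scatter the sample across 4 "nodes", fit locally, average globally.
rng = np.random.default_rng(0)
X = rng.uniform(-1.0, 1.0, size=(200, 1))
y = np.sin(np.pi * X[:, 0]) + 0.1 * rng.standard_normal(200)
models = [fit_local_mee(Xk, yk)
          for Xk, yk in zip(np.array_split(X, 4), np.array_split(y, 4))]
y_hat = dmee_predict(models, X)

Averaging the locally trained estimators is the only global step, which is why the scheme carries no inter-node communication overhead beyond the final synthesis; each node's cost scales with its own subsample size rather than the full sample.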
Appears in Collections: Journal/Magazine Article
Files in This Item:
File | Description | Size | Format
Distributed_Minimum_Error.pdf |  | 470.68 kB | Adobe PDF
Open Access Information
Status: open access
File Version: Version of Record