Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/114287
PIRA download icon_1.1View/Download Full Text
DC FieldValueLanguage
dc.contributorDepartment of Data Science and Artificial Intelligenceen_US
dc.creatorShen, Len_US
dc.creatorPu, Nen_US
dc.creatorZhong, Zen_US
dc.creatorGong, Men_US
dc.creatorYu, Den_US
dc.creatorZhang, Cen_US
dc.creatorHan, Ben_US
dc.date.accessioned2025-07-22T01:34:18Z-
dc.date.available2025-07-22T01:34:18Z-
dc.identifier.urihttp://hdl.handle.net/10397/114287-
dc.language.isoenen_US
dc.publisherTransactions on Machine Learning Researchen_US
dc.rightsCC BY 4.0 (https://creativecommons.org/licenses/by/4.0/)en_US
dc.rightsThe following publication Shen, L., Pu, N., Zhong, Z., Gong, M., Yu, D., Zhang, C., & Han, B. Federated Generalized Novel Category Discovery with Prompts Tuning, Transactions on Machine Learning Research, 2025 is available at https://openreview.net/forum?id=dVMESwnMlo.en_US
dc.titleFederated generalized novel category discovery with prompts tuningen_US
dc.typeJournal/Magazine Articleen_US
dcterms.abstractGeneralized category discovery (GCD) is proposed to handle categories from unseen labels during the inference stage by clustering them. Most works in GCD provide solutions for unseen classes in data-centralized settings. However, unlabeled categories possessed by clients, which are common in real-world federated learning (FL), have been largely ignored and degraded the performance of classic FL algorithms. To demonstrate and mitigate the harmful effect of unseen classes, we dive into a GCD problem setting applicable for FL named FedGCD, analyze overfitting problem in FedGCD in detail, establish a strong baseline constructed with state-of-the-art GCD algorithm simGCD, and design a learning framework with prompt tuning to tackle both the overfitting and communication burden problems in FedGCD. In our methods, clients first separately carry out prompt learning on local data. Then, we aggregate the prompts from all clients as the global prompt to help capture global knowledge and then send the global prompts to local clients to allow access to broader knowledge from other clients. By this method, we significantly reduce the parameters needed to upload in FedGCD, which is a common obstacle in the real application of most FL algorithms. We conduct experiments on both generic and fine-grained datasets like CIFAR-100 and CUB-200, and show that our method is comparable to the FL version of simGCD and surpasses other baselines with significantly fewer parameters to transmit.en_US
dcterms.accessRightsopen accessen_US
dcterms.bibliographicCitationTransactions on machine learning research, Published: 12 Jul 2025, Accepted papers, 4531, https://openreview.net/forum?id=dVMESwnMloen_US
dcterms.isPartOfTransactions on machine learning researchen_US
dcterms.issued2025-
dc.identifier.eissn2835-8856en_US
dc.identifier.artn4531en_US
dc.description.validate202507 bcchen_US
dc.description.oaVersion of Recorden_US
dc.identifier.FolderNumbera3947-
dc.identifier.SubFormID51799.2-
dc.description.fundingSourceOthersen_US
dc.description.fundingTextThe NSFC General Program No. 62376235en_US
dc.description.fundingTextCCF-Baidu Open Fund, HKBU Faculty Niche Research Areas No. RC-FNRA-IG/22-23/SCI/04en_US
dc.description.fundingTextHKBU CSD Departmental Incentive Schemeen_US
dc.description.pubStatusEarly releaseen_US
dc.description.oaCategoryCCen_US
Appears in Collections:Journal/Magazine Article
Files in This Item:
File Description SizeFormat 
TMLR.Journal.2025.4531_Federated_Generalized_Nov.pdf1.27 MBAdobe PDFView/Open
Open Access Information
Status open access
File Version Version of Record
Access
View full-text via PolyU eLinks SFX Query
Show simple item record

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.