Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/116617
Title: VFLAIR-LLM : a comprehensive framework and benchmark for split learning of LLMs
Authors: Gu, Z; Fan, Q; Sun, L; Liu, Y; Ye, X
Issue Date: Aug-2025
Source: Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Aug. 2025, v. 2, p. 5470-5481
Abstract: With the advancement of Large Language Models (LLMs), LLM applications have expanded into a growing number of fields. However, users with data privacy concerns face limitations in directly utilizing LLM APIs, while private deployments incur significant computational demands. This creates a substantial challenge in achieving secure LLM adaptation under constrained local resources. To address this issue, collaborative learning methods, such as Split Learning (SL), offer a resource-efficient and privacy-preserving solution for adapting LLMs to private domains. In this study, we introduce VFLAIR-LLM (available at https://github.com/FLAIR-THU/VFLAIR-LLM), an extensible and lightweight split learning framework for LLMs, enabling privacy-preserving LLM inference and fine-tuning in resource-constrained environments. Our library provides two LLM partition settings, supporting three task types and 18 datasets. In addition, we provide standard modules for implementing and evaluating attacks and defenses. We benchmark 5 attacks and 9 defenses under various Split Learning for LLM (SL-LLM) settings, offering concrete insights and recommendations on the choice of model partition configurations, defense strategies, and relevant hyperparameters for real-world applications.
Keywords: Data privacy; Federated learning; Large language models; Split learning
Publisher: Association for Computing Machinery
Journal: Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining 
ISSN: 2154-817X
DOI: 10.1145/3711896.3737411
Rights: ©2025 Copyright held by the owner/author(s).
This work is licensed under a Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/).
The following publication Gu, Z., Fan, Q., Sun, L., Liu, Y., & Ye, X. (2025, August). VFLAIR-LLM: A Comprehensive Framework and Benchmark for Split Learning of LLMs. In Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2, 5470-5481 is available at https://doi.org/10.1145/3711896.3737411.
Appears in Collections: Conference Paper

Files in This Item:
File: 3711896_3737411.pdf
Size: 1.99 MB
Format: Adobe PDF
Open Access Information
Status: open access
File Version: Version of Record