Please use this identifier to cite or link to this item:
http://hdl.handle.net/10397/115293
| Title: | LAMBDA : a large model based data agent | Authors: | Sun, M Han, R Jiang, B Qi, H Sun, D Yuan, Y Huang, J |
Issue Date: | 2025 | Source: | Journal of the American Statistical Association, Published online: 17 Jul 2025, Latest Articles, https://doi.org/10.1080/01621459.2025.2510000 | Abstract: | We introduce LArge Model Based Data Agent (LAMBDA), a novel open-source, code-free multi-agent data analysis system that leverages the power of large language models. LAMBDA is designed to address data analysis challenges in data-driven applications through innovatively designed data agents using natural language. At the core of LAMBDA are two key agent roles: the programmer and the inspector, which are engineered to work together seamlessly. Specifically, the programmer generates code based on the user’s instructions and domain-specific knowledge, while the inspector debugs the code when necessary. To ensure robustness and handle adverse scenarios, LAMBDA features a user interface that allows direct user intervention. Moreover, LAMBDA can flexibly integrate external models and algorithms through our proposed Knowledge Integration Mechanism, catering to the needs of customized data analysis. LAMBDA has demonstrated strong performance on various data analysis tasks. It has the potential to enhance data analysis paradigms by seamlessly integrating human and artificial intelligence, making it more accessible, effective, and efficient for users from diverse backgrounds. The strong performance of LAMBDA in solving data analysis problems is demonstrated using real-world data examples. The code for LAMBDA is available at https://github.com/AMA-CMFAI/LAMBDA and videos of three case studies can be viewed at https://www.polyu.edu.hk/ama/cmfai/lambda.html. Supplementary materials for this article are available online, including a standardized description of the materials available for reproducing the work. | Keywords: | Code generation via natural language Data analysis Large models Multi-agent collaboration Software system |
Publisher: | Taylor & Francis | Journal: | Journal of the American Statistical Association | ISSN: | 0162-1459 | EISSN: | 1537-274X | DOI: | 10.1080/01621459.2025.2510000 | Rights: | © 2025 The Author(s). Published with license by Taylor & Francis Group, LLC. This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivatives License (http://creativecommons.org/licenses/by-nc-nd/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited, and is not altered, transformed, or built upon in any way. The terms on which this article has been published allow the posting of the Accepted Manuscript in a repository by the author(s) or with their consent. The following publication Sun, M., Han, R., Jiang, B., Qi, H., Sun, D., Yuan, Y., & Huang, J. (2025). Lambda: A large model based data agent. Journal of the American Statistical Association, 1-13 is available at https://doi.org/10.1080/01621459.2025.2510000. |
| Appears in Collections: | Journal/Magazine Article |
Files in This Item:
| File | Description | Size | Format | |
|---|---|---|---|---|
| Sun_LAMBDA_Large_Mode.pdf | 4.34 MB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.



