Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/115293
PIRA download icon_1.1View/Download Full Text
DC FieldValueLanguage
dc.contributorDepartment of Data Science and Artificial Intelligenceen_US
dc.contributorDepartment of Applied Mathematicsen_US
dc.creatorSun, Men_US
dc.creatorHan, Ren_US
dc.creatorJiang, Ben_US
dc.creatorQi, Hen_US
dc.creatorSun, Den_US
dc.creatorYuan, Yen_US
dc.creatorHuang, Jen_US
dc.date.accessioned2025-09-19T03:23:53Z-
dc.date.available2025-09-19T03:23:53Z-
dc.identifier.issn0162-1459en_US
dc.identifier.urihttp://hdl.handle.net/10397/115293-
dc.language.isoenen_US
dc.publisherTaylor & Francisen_US
dc.rights© 2025 The Author(s). Published with license by Taylor & Francis Group, LLC.en_US
dc.rightsThis is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivatives License (http://creativecommons.org/licenses/by-nc-nd/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited, and is not altered, transformed, or built upon in any way. The terms on which this article has been published allow the posting of the Accepted Manuscript in a repository by the author(s) or with their consent.en_US
dc.rightsThe following publication Sun, M., Han, R., Jiang, B., Qi, H., Sun, D., Yuan, Y., & Huang, J. (2025). Lambda: A large model based data agent. Journal of the American Statistical Association, 1-13 is available at https://doi.org/10.1080/01621459.2025.2510000.en_US
dc.subjectCode generation via natural languageen_US
dc.subjectData analysisen_US
dc.subjectLarge modelsen_US
dc.subjectMulti-agent collaborationen_US
dc.subjectSoftware systemen_US
dc.titleLAMBDA : a large model based data agenten_US
dc.typeJournal/Magazine Articleen_US
dc.identifier.doi10.1080/01621459.2025.2510000en_US
dcterms.abstractWe introduce LArge Model Based Data Agent (LAMBDA), a novel open-source, code-free multi-agent data analysis system that leverages the power of large language models. LAMBDA is designed to address data analysis challenges in data-driven applications through innovatively designed data agents using natural language. At the core of LAMBDA are two key agent roles: the programmer and the inspector, which are engineered to work together seamlessly. Specifically, the programmer generates code based on the user’s instructions and domain-specific knowledge, while the inspector debugs the code when necessary. To ensure robustness and handle adverse scenarios, LAMBDA features a user interface that allows direct user intervention. Moreover, LAMBDA can flexibly integrate external models and algorithms through our proposed Knowledge Integration Mechanism, catering to the needs of customized data analysis. LAMBDA has demonstrated strong performance on various data analysis tasks. It has the potential to enhance data analysis paradigms by seamlessly integrating human and artificial intelligence, making it more accessible, effective, and efficient for users from diverse backgrounds. The strong performance of LAMBDA in solving data analysis problems is demonstrated using real-world data examples. The code for LAMBDA is available at https://github.com/AMA-CMFAI/LAMBDA and videos of three case studies can be viewed at https://www.polyu.edu.hk/ama/cmfai/lambda.html. Supplementary materials for this article are available online, including a standardized description of the materials available for reproducing the work.en_US
dcterms.accessRightsopen accessen_US
dcterms.bibliographicCitationJournal of the American Statistical Association, Published online: 17 Jul 2025, Latest Articles, https://doi.org/10.1080/01621459.2025.2510000en_US
dcterms.isPartOfJournal of the American Statistical Associationen_US
dcterms.issued2025-
dc.identifier.scopus2-s2.0-105010848153-
dc.identifier.eissn1537-274Xen_US
dc.description.validate202509 bchyen_US
dc.description.oaVersion of Recorden_US
dc.identifier.FolderNumberCDCF_2024-2025-
dc.description.fundingSourceRGCen_US
dc.description.fundingSourceOthersen_US
dc.description.fundingTextThis work was funded by the Centre for the Mathematical Foundations of Generative AI and the research grants from The Hong Kong Polytechnic University (P0046811). The research of Ruijian Han was partially supported by The Hong Kong Polytechnic University (P0044617, P0045351, P0050935). The research of Houduo Qi was partially supported by the Hong Kong RGC grant (15309223) and The Hong Kong Polytechnic University (P0045347). The research of Defeng Sun and Yancheng Yuan was partially supported by the Research Center for Intelligent Operations Research at The Hong Kong Polytechnic University (P0051214). The research of Jian Huang was partially supported by The Hong Kong Polytechnic University (P0042888, P0045417, P0045931).en_US
dc.description.pubStatusEarly releaseen_US
dc.description.oaCategoryCCen_US
Appears in Collections:Journal/Magazine Article
Files in This Item:
File Description SizeFormat 
Sun_LAMBDA_Large_Mode.pdf4.34 MBAdobe PDFView/Open
Open Access Information
Status open access
File Version Version of Record
Access
View full-text via PolyU eLinks SFX Query
Show simple item record

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.