Prompt4Vis : prompting large language models with example mining for tabular data visualization

Li, S; Chen, X; Song, Y; Song, Y; Zhang, CJ; Hao, F; Chen, L

doi:10.1007/s00778-025-00912-0

Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/113067

Title:	Prompt4Vis : prompting large language models with example mining for tabular data visualization
Authors:	Li, S Chen, X Song, Y Song, Y Zhang, CJ Hao, F Chen, L
Issue Date:	Jul-2025
Source:	VLDB journal, July 2025, v. 34, no. 4, 38
Abstract:	We are currently in the epoch of Large Language Models (LLMs), which have transformed numerous technological domains within the database community. In this paper, we examine the application of LLMs in text-to-visualization (text-to-vis). The advancement of natural language processing technologies has made natural language interfaces more accessible and intuitive for visualizing tabular data. However, despite utilizing advanced neural network architectures, current methods such as Seq2Vis, ncNet, and RGVisNet for transforming natural language queries into DV commands still underperform, indicating significant room for improvement. In this paper, we introduce Prompt4Vis, a novel framework that leverages LLMs and In-context learning to enhance the generation of data visualizations from natural language. Given that In-context learning’s effectiveness is highly dependent on the selection of examples, it is critical to optimize this aspect. Additionally, encoding the full database schema of a query is not only costly but can also lead to inaccuracies. This framework includes two main components: (1) an example mining module that identifies highly effective examples to enhance In-context learning capabilities for text-to-vis applications, and (2) a schema filtering module designed to streamline database schemas. Comprehensive testing on the NVBench dataset has shown that Prompt4Vis significantly outperforms the current state-of-the-art model, RGVisNet, by approximately 35.9% on development sets and 71.3% on test sets. To the best of our knowledge, Prompt4Vis is the first framework to incorporate In-context learning for enhancing text-to-vis, marking a pioneering step in the domain.
Keywords:	In-context learning Large language model NLP for database Prompt engineering Text-to-vis
Publisher:	Springer
Journal:	VLDB journal
ISSN:	1066-8888
EISSN:	0949-877X
DOI:	10.1007/s00778-025-00912-0
Rights:	© The Author(s) 2025. This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The following publication Li, S., Chen, X., Song, Y. et al. prompt4vis: prompting large language models with example mining for tabular data visualization. The VLDB Journal 34, 38 (2025) is available at https://doi.org/10.1007/s00778-025-00912-0.
Appears in Collections:	Journal/Magazine Article

Files in This Item:

File	Description	Size	Format
s00778-025-00912-0.pdf		1.87 MB	Adobe PDF	View/Open

Open Access Information

Status	open access
File Version	Version of Record

Access

View full-text via PolyU eLinks

Show full item record

Google Scholar^TM

Check

Files in This Item:

Open Access Information

Access

Google ScholarTM

Altmetric

Google Scholar^TM