Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/117588
DC Field | Value | Language
dc.contributor | Department of Industrial and Systems Engineering | -
dc.creator | Qiu, J | -
dc.creator | Fang, Q | -
dc.creator | Kang, W | -
dc.date.accessioned | 2026-02-26T03:47:12Z | -
dc.date.available | 2026-02-26T03:47:12Z | -
dc.identifier.uri | http://hdl.handle.net/10397/117588 | -
dc.language.iso | en | en_US
dc.publisher | MDPI AG | en_US
dc.rights | Copyright: © 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). | en_US
dc.rights | The following publication Qiu, J., Fang, Q., & Kang, W. (2025). Towards Controllable and Explainable Text Generation via Causal Intervention in LLMs. Electronics, 14(16), 3279 is available at https://doi.org/10.3390/electronics14163279. | en_US
dc.subject | Counterfactual training | en_US
dc.subject | Hidden-state intervention | en_US
dc.subject | Multi-attribute disentanglement | en_US
dc.subject | Resource-efficient generation | en_US
dc.subject | Structural causal model (SCM) | en_US
dc.title | Towards controllable and explainable text generation via causal intervention in LLMs | en_US
dc.type | Journal/Magazine Article | en_US
dc.identifier.volume | 14 | -
dc.identifier.issue | 16 | -
dc.identifier.doi | 10.3390/electronics14163279 | -
dcterms.abstract | Large Language Models (LLMs) excel in diverse text generation tasks but still face limited controllability, opaque decision processes, and frequent hallucinations. This paper presents a structural causal intervention framework that models input–hidden–output dependencies through a structural causal model and performs targeted interventions on hidden representations. By combining counterfactual sample construction with contrastive training, our method enables precise control of style, sentiment, and factual consistency while providing explicit causal explanations for output changes. Experiments on three representative tasks demonstrate consistent and substantial improvements: style transfer accuracy reaches 92.3% (+7–14 percentage points over strong baselines), sentiment-controlled generation achieves 90.1% accuracy (+1.3–10.9 points), and multi-attribute conflict rates drop to 3.7% (a 40–60% relative reduction). Our method also improves causal attribution scores to 0.83–0.85 and human agreement rates to 87–88%, while reducing training and inference latency by 25–30% through sparse masking that modifies ≤10% of hidden units per attribute. These results confirm that integrating structural causal intervention with counterfactual training advances controllability, interpretability, and efficiency in LLM-based generation, offering a robust foundation for deployment in reliability-critical and resource-constrained applications. | -
dcterms.accessRights | open access | en_US
dcterms.bibliographicCitation | Electronics (Switzerland), Aug. 2025, v. 14, no. 16, 3279 | -
dcterms.isPartOf | Electronics (Switzerland) | -
dcterms.issued | 2025-08 | -
dc.identifier.scopus | 2-s2.0-105014405949 | -
dc.identifier.eissn | 2079-9292 | -
dc.identifier.artn | 3279 | -
dc.description.validate | 202602 bcch | -
dc.description.oa | Version of Record | en_US
dc.identifier.FolderNumber | OA_Scopus/WOS | en_US
dc.description.fundingSource | Self-funded | en_US
dc.description.pubStatus | Published | en_US
dc.description.oaCategory | CC | en_US
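The abstract describes sparse masking that modifies ≤10% of hidden units per attribute. A minimal sketch of what such a hidden-state intervention could look like follows; the function name `sparse_intervention`, the attribute `direction` vector, and all parameters are illustrative assumptions, not taken from the paper.

```python
import numpy as np

def sparse_intervention(hidden, direction, mask_fraction=0.10, strength=1.0):
    """Shift at most a `mask_fraction` share of hidden units along an
    attribute direction, leaving all other units untouched (assumed sketch)."""
    k = max(1, int(mask_fraction * hidden.shape[-1]))
    # select the k units where the attribute direction has the largest magnitude
    idx = np.argsort(np.abs(direction))[-k:]
    out = hidden.copy()
    out[..., idx] += strength * direction[idx]
    return out

rng = np.random.default_rng(0)
hidden = rng.standard_normal(512)      # stand-in for one token's hidden state
direction = rng.standard_normal(512)   # stand-in for a learned attribute direction
steered = sparse_intervention(hidden, direction)
changed = int(np.count_nonzero(steered != hidden))  # units actually modified
```

In this sketch, `changed` stays at or below 10% of the hidden dimension, mirroring the sparsity budget the abstract credits for the reported 25–30% latency reduction.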
Appears in Collections: Journal/Magazine Article

Files in This Item:
File | Description | Size | Format
electronics-14-03279-v2.pdf | | 1.47 MB | Adobe PDF

Open Access Information
Status: open access
File Version: Version of Record

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.