Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/118131
DC Field | Value | Language
dc.contributor | Department of Industrial and Systems Engineering | en_US
dc.creator | Zhang, X | en_US
dc.creator | Wang, T | en_US
dc.creator | Wang, XL | en_US
dc.creator | Fan, FL | en_US
dc.creator | Cheung, YM | en_US
dc.creator | Bose, I | en_US
dc.date.accessioned | 2026-03-18T04:03:03Z | -
dc.date.available | 2026-03-18T04:03:03Z | -
dc.identifier.issn | 2168-2216 | en_US
dc.identifier.uri | http://hdl.handle.net/10397/118131 | -
dc.language.iso | en | en_US
dc.publisher | Institute of Electrical and Electronics Engineers | en_US
dc.rights | © 2026 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. | en_US
dc.rights | The following publication X. Zhang, T. Wang, X.-L. Wang, F.-L. Fan, Y.-M. Cheung and I. Bose, 'Causality-Informed Neural Networks for Regularized Learning in Regression Problems,' in IEEE Transactions on Systems, Man, and Cybernetics: Systems, vol. 56, no. 3, pp. 1895-1910, March 2026 is available at https://doi.org/10.1109/TSMC.2025.3646993. | en_US
dc.subject | Causal inference | en_US
dc.subject | Causality-informed neural network (CINN) | en_US
dc.subject | Deep learning | en_US
dc.subject | Informed learning | en_US
dc.title | Causality-informed neural networks for regularized learning in regression problems | en_US
dc.type | Journal/Magazine Article | en_US
dc.identifier.spage | 1895 | en_US
dc.identifier.epage | 1910 | en_US
dc.identifier.volume | 56 | en_US
dc.identifier.issue | 3 | en_US
dc.identifier.doi | 10.1109/TSMC.2025.3646993 | en_US
dcterms.abstract | Neural networks that overlook the underlying causal relationships among observed variables pose significant risks in high-stakes decision-making contexts due to concerns about the robustness and stability of model performance. To tackle this issue, we present a general approach for embedding hierarchical causal structure among observed variables into a neural network to inform its learning. The proposed methodology, termed causality-informed neural network (CINN), exploits hierarchical causal structure learned from observational data as a structurally informed prior to guide the layer-to-layer architectural design of the neural network while maintaining the orientation of causal relationships in the discovered causal graph. The proposed method involves three steps. First, CINN mines causal relationships from observational data via directed acyclic graph (DAG) learning, where causal discovery is recast as a continuous optimization problem to circumvent the combinatorial nature of DAG learning. Second, we encode the discovered hierarchical causal graph among observed variables into a neural network via a dedicated architecture and loss function. By classifying observed variables in the DAG as root, intermediate, and leaf nodes, we translate the hierarchical causal DAG into CINN by creating a one-to-one correspondence between DAG nodes and certain CINN neurons. For the loss function, both intermediate and leaf nodes in the DAG are treated as target outputs during CINN training, facilitating the co-learning of causal relationships among the observed variables. Finally, as multiple loss components emerge in CINN, we leverage the projection of conflicting gradients (PCGrad) to mitigate the gradient interference among the multiple learning tasks. Computational studies indicate that CINN outperforms several state-of-the-art methods across a broad range of datasets. In addition, an ablation study that incrementally incorporates structural and quantitative causal knowledge into the neural network is conducted to highlight the pivotal role of causal knowledge in enhancing the neural network’s prediction performance. | en_US
dcterms.accessRights | open access | en_US
dcterms.bibliographicCitation | IEEE transactions on systems, man, and cybernetics. Systems, Mar. 2026, v. 56, no. 3, p. 1895-1910 | en_US
dcterms.isPartOf | IEEE transactions on systems, man, and cybernetics. Systems | en_US
dcterms.issued | 2026-03 | -
dc.identifier.scopus | 2-s2.0-105028030320 | -
dc.identifier.eissn | 2168-2232 | en_US
dc.description.validate | 202603 bcjz | en_US
dc.description.oa | Accepted Manuscript | en_US
dc.identifier.SubFormID | G001288/2026-02 | -
dc.description.fundingSource | RGC | en_US
dc.description.fundingSource | Others | en_US
dc.description.fundingText | This work was supported in part by the Research Grants Council of Hong Kong Special Administrative Region, China, under Project PolyU 25206422; in part by the National Natural Science Foundation of China under Grant 62406269; in part by the Research Committee of The Hong Kong Polytechnic University under Project RKB0 and Project G-UARJ; in part by the NSFC/Research Grants Council (RGC) Joint Research Scheme under Project N_HKBU214/21; in part by the Seed Funding for Collaborative Research Grants of Hong Kong Baptist University (HKBU) under Grant RC-SFCRG/23-24/R2/SCI/10; and in part by the Guangdong and Hong Kong Universities “1 + 1 + 1” Cross-Campus Research Collaboration Scheme under Grant 2025A0505000004. | en_US
dc.description.pubStatus | Published | en_US
dc.description.oaCategory | Green (AAM) | en_US
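The abstract describes two standard building blocks of the method: recasting DAG discovery as continuous optimization (which relies on a differentiable acyclicity score over the weighted adjacency matrix, here the polynomial variant) and PCGrad, which projects away the conflicting component of one task gradient along another. Below is a minimal NumPy sketch of both ideas under those assumptions; it is illustrative only, not the paper's implementation, and the function names are hypothetical.

```python
import numpy as np

def acyclicity(W: np.ndarray) -> float:
    """Polynomial acyclicity score over a weighted adjacency matrix W.

    h(W) = tr[(I + (W*W)/d)^d] - d, which equals 0 exactly when the
    graph encoded by W has no directed cycles. (W*W is the entrywise
    square, so edge signs do not matter.)
    """
    d = W.shape[0]
    M = np.eye(d) + (W * W) / d
    return float(np.trace(np.linalg.matrix_power(M, d)) - d)

def pcgrad(g_i: np.ndarray, g_j: np.ndarray) -> np.ndarray:
    """PCGrad projection for a pair of task gradients.

    If g_i and g_j conflict (negative inner product), remove the
    component of g_i along g_j; otherwise leave g_i unchanged.
    """
    dot = g_i @ g_j
    if dot < 0:
        return g_i - (dot / (g_j @ g_j)) * g_j
    return g_i

# A 2-node chain (edge 0 -> 1) is acyclic and scores 0; a 2-cycle does not.
dag = np.array([[0.0, 1.0], [0.0, 0.0]])
cyc = np.array([[0.0, 1.0], [1.0, 0.0]])
print(acyclicity(dag))  # 0.0
print(acyclicity(cyc))  # 0.5

# Conflicting gradients: after projection, g_i is orthogonal to g_j.
g1, g2 = np.array([1.0, 0.0]), np.array([-1.0, 1.0])
print(pcgrad(g1, g2) @ g2)  # 0.0
```

In training, the acyclicity score is typically added to the fitting loss as a penalty (or handled with an augmented Lagrangian), while the PCGrad projection is applied pairwise over the per-task gradients before the optimizer step.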
Appears in Collections:Journal/Magazine Article
Files in This Item:
File | Description | Size | Format
Zhang_Causality-informed_Neural_Networks.pdf | Pre-Published version | 8.38 MB | Adobe PDF
Open Access Information
Status: open access
File Version: Final Accepted Manuscript

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.