Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/98182
PIRA download icon_1.1View/Download Full Text
Title: Weakly-supervised multi-action offline reinforcement learning for intelligent dosing of epilepsy in children
Authors: Li, Z
Shen, Y 
Xu, R
Yang, Y 
Cao, J 
Wu, L
Wu, Q
Issue Date: 2023
Source: Lecture notes in computer science (including subseries Lecture notes in artificial intelligence and lecture notes in bioinformatics), 2023, v. 13946, p. 208-223
Abstract: Epilepsy in childhood is a common neurological disorder in children. Most cases are benign childhood epilepsy, which can be controlled with medication by adaptive adjustment of the dosage of antiepileptic drugs (AEDs). Recently, reinforcement learning-based intelligent dosing has attracted increasing attention. In clinical practice, patients usually take more than one drug at a time, however, conventional reinforcement learning algorithms do not sufficiently address the combination of two or more active drugs. In this paper, we propose the multi-action offline reinforcement learning (MA-ORL) model to solve this problem. Concretely, MA-ORL inherits the basic framework of the actor-critic network. For the patient’s health and safety concerns, MA-ORL abandons the intrusive and high-risk “trial-and-error” interactions with the environment but directly learns from the offline clinical dataset in the initial phase until the model is sufficiently trained and ready to use. Besides, to choose multiple actions simultaneously, we replace the actor’s output in standard reinforcement learning with a 2D matrix indicating the mixed feature representation of all different actions, then multiply it with separate masks to obtain separated actions. In addition to reaching an optimal return (i.e., the reduction of seizure frequency), MA-ORL also emphasizes the accuracy of the recommended dosage. Therefore, we introduce weak supervision to the learning objective to restrict the range of the learning outcome. It guarantees the recommended dosage by MA-ORL is as close as the one prescribed by experienced physicians. We conduct extensive experiments on clinical medical records containing 245 cases of epilepsy in childhood. Experimental results show MA-ORL reaches the highest cumulative return among all baseline models. Moreover, the suggested amount of medication taken a day by MA-ORL is more accurate than any other benchmark.
Keywords: Multi-action reinforcement learning
Offline reinforcement learning
Weakly-supervised learning
Epilepsy in children
Publisher: Springer
Journal: Lecture notes in computer science (including subseries Lecture notes in artificial intelligence and lecture notes in bioinformatics) 
ISSN: 0302-9743
EISSN: 1611-3349
DOI: 10.1007/978-3-031-30678-5_16
Description: 28th International Conference on Database Systems for Advanced Applications, DASFAA 2023, Tianjin, China, April 17–20, 2023
Rights: © The Author(s), under exclusive license to Springer Nature Switzerland AG 2023
This version of the proceeding paper has been accepted for publication, after peer review (when applicable) and is subject to Springer Nature’s AM terms of use (https://www.springernature.com/gp/open-research/policies/accepted-manuscript-terms), but is not the Version of Record and does not reflect post-acceptance improvements, or any corrections. The Version of Record is available online at: http://dx.doi.org/10.1007/978-3-031-30678-5_16.
Appears in Collections:Journal/Magazine Article

Files in This Item:
File Description SizeFormat 
LI_Weakly-Supervised_Multi-Action_Offline.pdfPre-Published version3.45 MBAdobe PDFView/Open
Open Access Information
Status open access
File Version Final Accepted Manuscript
Access
View full-text via PolyU eLinks SFX Query
Show full item record

Page views

148
Last Week
0
Last month
Citations as of Dec 21, 2025

Downloads

80
Citations as of Dec 21, 2025

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.