Safe learning by constraint-aware policy optimization for robotic ultrasound imaging

Duan, A; Yang, C; Zhao, J; Huo, S; Zhou, P; Ma, W; Zheng, Y; NavarroAlarcon, D

doi:10.1109/TASE.2024.3378915

Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/113798

DC Field	Value	Language
dc.contributor	Faculty of Engineering	-
dc.contributor	Research Institute for Smart Ageing	-
dc.creator	Duan, A	-
dc.creator	Yang, C	-
dc.creator	Zhao, J	-
dc.creator	Huo, S	-
dc.creator	Zhou, P	-
dc.creator	Ma, W	-
dc.creator	Zheng, Y	-
dc.creator	NavarroAlarcon, D	-
dc.date.accessioned	2025-06-24T06:37:59Z	-
dc.date.available	2025-06-24T06:37:59Z	-
dc.identifier.issn	1545-5955	-
dc.identifier.uri	http://hdl.handle.net/10397/113798	-
dc.language.iso	en	en_US
dc.publisher	Institute of Electrical and Electronics Engineers	en_US
dc.rights	© 2024 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.	en_US
dc.rights	The following publication A. Duan et al., "Safe Learning by Constraint-Aware Policy Optimization for Robotic Ultrasound Imaging," in IEEE Transactions on Automation Science and Engineering, vol. 22, pp. 2349-2360, 2025 is available at https://doi.org/10.1109/TASE.2024.3378915.	en_US
dc.subject	Imitation learning	en_US
dc.subject	Medical robotics	en_US
dc.subject	Optimization	en_US
dc.subject	Reinforcement learning	en_US
dc.subject	Sequential decision making	en_US
dc.title	Safe learning by constraint-aware policy optimization for robotic ultrasound imaging	en_US
dc.type	Journal/Magazine Article	en_US
dc.identifier.spage	2349	-
dc.identifier.epage	2360	-
dc.identifier.volume	22	-
dc.identifier.doi	10.1109/TASE.2024.3378915	-
dcterms.abstract	Ultrasound-based medical examination usually requires establishing proper contact between an ultrasound probe and a human body that ensures the quality of ultrasound images. The scanning skills are quite challenging for a robot to learn primarily due to the complex coupling between the applied force profile and the resulting ultrasound image quality. While reinforcement learning appears as a powerful tool for learning complex robot skills, the deployment of these algorithms in medical robots demands special attention due to the evident safety concerns that arise from physical probe-tissue interactions. In this paper, we explicitly consider external constraints on the force magnitude when searching for the optimal policy parameters to enhance safety during ultrasound-guided robotic interventions. In particular, we study policy optimization under the framework of a constrained Markov decision process. The resulting gradient-based policy update is then subject to the involved constraints, which can be readily addressed by the primal-dual interior-point technique. In addition, upon the observation that policy update requires consecutive policies to be close to each other to have stable and robust performance with reinforcement learning algorithms, we design the learning rate of policy gradient from an imitation perspective. The performance of the proposed constraint-aware policy optimization method is validated with experiments of robotic ultrasound imaging for spinal diagnosis. Note to Practitioners - This paper was motivated by the problem of safely learning the optimal interaction force strategy to facilitate robotic ultrasound imaging. Existing approaches to robotic ultrasound imaging usually empirically set a constant value for the scanning force, despite the fact the force strategy plays an important role in the quality of the ultrasound images. This paper suggests the usage of reinforcement learning to identify the optimal interaction force due to the complex acoustic coupling between the force and the ultrasound image quality. Specifically, we propose constraint-aware reinforcement learning in view of the safety-critical issues as a result of physical human-probe interaction. We then conduct a theoretical analysis of the proposed safe reinforcement learning, including monotonic improvement and policy value bound under mild assumptions. Preliminary real experiments on ultrasound imaging of the spine of a phantom for scoliosis assessment suggest that the proposed approach can safely learn the optimal scanning force without violating the prescribed force threshold. In the future, we would like to apply our approach to learning the optimal scanning force on different organs of interest of human subjects.	-
dcterms.accessRights	open access	en_US
dcterms.bibliographicCitation	IEEE transactions on automation science and engineering, 2025, v. 22, p. 2349-2360	-
dcterms.isPartOf	IEEE transactions on automation science and engineering	-
dcterms.issued	2025	-
dc.identifier.scopus	2-s2.0-85189322032	-
dc.identifier.eissn	1558-3783	-
dc.description.validate	202506 bcch	-
dc.description.oa	Accepted Manuscript	en_US
dc.identifier.FolderNumber	a3769b	en_US
dc.identifier.SubFormID	51002	en_US
dc.description.fundingSource	Self-funded	en_US
dc.description.pubStatus	Published	en_US
dc.description.oaCategory	Green (AAM)	en_US
Appears in Collections:	Journal/Magazine Article

Files in This Item:

File	Description	Size	Format
Duan_Safe_Learning_Constraint-aware.pdf	Pre-Published version	12.38 MB	Adobe PDF	View/Open

Open Access Information

Status	open access
File Version	Final Accepted Manuscript

Access

View full-text via PolyU eLinks

Show simple item record

SCOPUS^TM
Citations

5

Citations as of Dec 19, 2025

Google Scholar^TM

Check

Files in This Item:

Open Access Information

Access

SCOPUSTM Citations

Google ScholarTM

Altmetric

SCOPUS^TM
Citations

Google Scholar^TM