Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/113798
PIRA download icon_1.1View/Download Full Text
Title: Safe learning by constraint-aware policy optimization for robotic ultrasound imaging
Authors: Duan, A 
Yang, C
Zhao, J
Huo, S 
Zhou, P
Ma, W 
Zheng, Y 
NavarroAlarcon, D 
Issue Date: 2025
Source: IEEE transactions on automation science and engineering, 2025, v. 22, p. 2349-2360
Abstract: Ultrasound-based medical examination usually requires establishing proper contact between an ultrasound probe and a human body that ensures the quality of ultrasound images. The scanning skills are quite challenging for a robot to learn primarily due to the complex coupling between the applied force profile and the resulting ultrasound image quality. While reinforcement learning appears as a powerful tool for learning complex robot skills, the deployment of these algorithms in medical robots demands special attention due to the evident safety concerns that arise from physical probe-tissue interactions. In this paper, we explicitly consider external constraints on the force magnitude when searching for the optimal policy parameters to enhance safety during ultrasound-guided robotic interventions. In particular, we study policy optimization under the framework of a constrained Markov decision process. The resulting gradient-based policy update is then subject to the involved constraints, which can be readily addressed by the primal-dual interior-point technique. In addition, upon the observation that policy update requires consecutive policies to be close to each other to have stable and robust performance with reinforcement learning algorithms, we design the learning rate of policy gradient from an imitation perspective. The performance of the proposed constraint-aware policy optimization method is validated with experiments of robotic ultrasound imaging for spinal diagnosis. Note to Practitioners - This paper was motivated by the problem of safely learning the optimal interaction force strategy to facilitate robotic ultrasound imaging. Existing approaches to robotic ultrasound imaging usually empirically set a constant value for the scanning force, despite the fact the force strategy plays an important role in the quality of the ultrasound images. This paper suggests the usage of reinforcement learning to identify the optimal interaction force due to the complex acoustic coupling between the force and the ultrasound image quality. Specifically, we propose constraint-aware reinforcement learning in view of the safety-critical issues as a result of physical human-probe interaction. We then conduct a theoretical analysis of the proposed safe reinforcement learning, including monotonic improvement and policy value bound under mild assumptions. Preliminary real experiments on ultrasound imaging of the spine of a phantom for scoliosis assessment suggest that the proposed approach can safely learn the optimal scanning force without violating the prescribed force threshold. In the future, we would like to apply our approach to learning the optimal scanning force on different organs of interest of human subjects.
Keywords: Imitation learning
Medical robotics
Optimization
Reinforcement learning
Sequential decision making
Publisher: Institute of Electrical and Electronics Engineers
Journal: IEEE transactions on automation science and engineering 
ISSN: 1545-5955
EISSN: 1558-3783
DOI: 10.1109/TASE.2024.3378915
Rights: © 2024 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
The following publication A. Duan et al., "Safe Learning by Constraint-Aware Policy Optimization for Robotic Ultrasound Imaging," in IEEE Transactions on Automation Science and Engineering, vol. 22, pp. 2349-2360, 2025 is available at https://doi.org/10.1109/TASE.2024.3378915.
Appears in Collections:Journal/Magazine Article

Files in This Item:
File Description SizeFormat 
Duan_Safe_Learning_Constraint-aware.pdfPre-Published version12.38 MBAdobe PDFView/Open
Open Access Information
Status open access
File Version Final Accepted Manuscript
Access
View full-text via PolyU eLinks SFX Query
Show full item record

SCOPUSTM   
Citations

5
Citations as of Dec 19, 2025

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.