Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/113798
PIRA download icon_1.1View/Download Full Text
DC FieldValueLanguage
dc.contributorFaculty of Engineering-
dc.contributorResearch Institute for Smart Ageing-
dc.creatorDuan, A-
dc.creatorYang, C-
dc.creatorZhao, J-
dc.creatorHuo, S-
dc.creatorZhou, P-
dc.creatorMa, W-
dc.creatorZheng, Y-
dc.creatorNavarroAlarcon, D-
dc.date.accessioned2025-06-24T06:37:59Z-
dc.date.available2025-06-24T06:37:59Z-
dc.identifier.issn1545-5955-
dc.identifier.urihttp://hdl.handle.net/10397/113798-
dc.language.isoenen_US
dc.publisherInstitute of Electrical and Electronics Engineersen_US
dc.rights© 2024 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.en_US
dc.rightsThe following publication A. Duan et al., "Safe Learning by Constraint-Aware Policy Optimization for Robotic Ultrasound Imaging," in IEEE Transactions on Automation Science and Engineering, vol. 22, pp. 2349-2360, 2025 is available at https://doi.org/10.1109/TASE.2024.3378915.en_US
dc.subjectImitation learningen_US
dc.subjectMedical roboticsen_US
dc.subjectOptimizationen_US
dc.subjectReinforcement learningen_US
dc.subjectSequential decision makingen_US
dc.titleSafe learning by constraint-aware policy optimization for robotic ultrasound imagingen_US
dc.typeJournal/Magazine Articleen_US
dc.identifier.spage2349-
dc.identifier.epage2360-
dc.identifier.volume22-
dc.identifier.doi10.1109/TASE.2024.3378915-
dcterms.abstractUltrasound-based medical examination usually requires establishing proper contact between an ultrasound probe and a human body that ensures the quality of ultrasound images. The scanning skills are quite challenging for a robot to learn primarily due to the complex coupling between the applied force profile and the resulting ultrasound image quality. While reinforcement learning appears as a powerful tool for learning complex robot skills, the deployment of these algorithms in medical robots demands special attention due to the evident safety concerns that arise from physical probe-tissue interactions. In this paper, we explicitly consider external constraints on the force magnitude when searching for the optimal policy parameters to enhance safety during ultrasound-guided robotic interventions. In particular, we study policy optimization under the framework of a constrained Markov decision process. The resulting gradient-based policy update is then subject to the involved constraints, which can be readily addressed by the primal-dual interior-point technique. In addition, upon the observation that policy update requires consecutive policies to be close to each other to have stable and robust performance with reinforcement learning algorithms, we design the learning rate of policy gradient from an imitation perspective. The performance of the proposed constraint-aware policy optimization method is validated with experiments of robotic ultrasound imaging for spinal diagnosis. Note to Practitioners - This paper was motivated by the problem of safely learning the optimal interaction force strategy to facilitate robotic ultrasound imaging. Existing approaches to robotic ultrasound imaging usually empirically set a constant value for the scanning force, despite the fact the force strategy plays an important role in the quality of the ultrasound images. This paper suggests the usage of reinforcement learning to identify the optimal interaction force due to the complex acoustic coupling between the force and the ultrasound image quality. Specifically, we propose constraint-aware reinforcement learning in view of the safety-critical issues as a result of physical human-probe interaction. We then conduct a theoretical analysis of the proposed safe reinforcement learning, including monotonic improvement and policy value bound under mild assumptions. Preliminary real experiments on ultrasound imaging of the spine of a phantom for scoliosis assessment suggest that the proposed approach can safely learn the optimal scanning force without violating the prescribed force threshold. In the future, we would like to apply our approach to learning the optimal scanning force on different organs of interest of human subjects.-
dcterms.accessRightsopen accessen_US
dcterms.bibliographicCitationIEEE transactions on automation science and engineering, 2025, v. 22, p. 2349-2360-
dcterms.isPartOfIEEE transactions on automation science and engineering-
dcterms.issued2025-
dc.identifier.scopus2-s2.0-85189322032-
dc.identifier.eissn1558-3783-
dc.description.validate202506 bcch-
dc.description.oaAccepted Manuscripten_US
dc.identifier.FolderNumbera3769ben_US
dc.identifier.SubFormID51002en_US
dc.description.fundingSourceSelf-fundeden_US
dc.description.pubStatusPublisheden_US
dc.description.oaCategoryGreen (AAM)en_US
Appears in Collections:Journal/Magazine Article
Files in This Item:
File Description SizeFormat 
Duan_Safe_Learning_Constraint-aware.pdfPre-Published version12.38 MBAdobe PDFView/Open
Open Access Information
Status open access
File Version Final Accepted Manuscript
Access
View full-text via PolyU eLinks SFX Query
Show simple item record

SCOPUSTM   
Citations

5
Citations as of Dec 19, 2025

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.