Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/118121
PIRA download icon_1.1View/Download Full Text
DC FieldValueLanguage
dc.contributorDepartment of Aeronautical and Aviation Engineering-
dc.creatorZhang, Y-
dc.creatorWang, Y-
dc.creatorYan, P-
dc.creatorWen, W-
dc.date.accessioned2026-03-18T02:30:14Z-
dc.date.available2026-03-18T02:30:14Z-
dc.identifier.issn1524-9050-
dc.identifier.urihttp://hdl.handle.net/10397/118121-
dc.language.isoenen_US
dc.publisherInstitute of Electrical and Electronics Engineersen_US
dc.rights© 2025 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.en_US
dc.rightsThe following publication Y. Zhang, Y. Wang, P. Yan and W. Wen, 'Learning Safe, Optimal, and Real-Time Flight Interaction With Deep Confidence-Enhanced Reachability Guarantee,' in IEEE Transactions on Intelligent Transportation Systems, vol. 26, no. 12, pp. 23100-23112, Dec. 2025 is available at https://doi.org/10.1109/TITS.2025.3616580.en_US
dc.subjectDeep confidence-enhanced reachability guaranteesen_US
dc.subjectDeep reinforcement learningen_US
dc.subjectJoint planning and controlen_US
dc.subjectUnmanned aerial vehiclesen_US
dc.titleLearning safe, optimal, and real-time flight interaction with deep confidence-enhanced reachability guaranteeen_US
dc.typeJournal/Magazine Articleen_US
dc.identifier.spage23100-
dc.identifier.epage23112-
dc.identifier.volume26-
dc.identifier.issue12-
dc.identifier.doi10.1109/TITS.2025.3616580-
dcterms.abstractIn the low-altitude economy, ensuring the safe and agile flight of unmanned aerial vehicles (UAVs) in dynamic obstacle environments is essential for expanding interactive applications like parcel delivery. While deep reinforcement learning (DRL) shows promise for UAV motion planning and control, its trial-and-error exploration often struggles to ensure both agility and safety, especially under uncertain observational noise. Therefore, this paper proposes a deep confidence-enhanced reachability policy optimization (DCRPO) framework. By integrating safe DRL with nonlinear model predictive control (NMPC), DCRPO achieves high-level safety decisions, complex real-time joint planning and control for UAVs. Furthermore, we develop a deep confidence-enhanced reachability guarantee that constructs a set of stochastically forward-reachable planned trajectories under uncertainty, enabling robust safety collision probability certifications. This safe reachability mechanism adaptively selects belief space actions from planned actions to interact with the environment, further enhancing safety and reducing training time. In extensive experiments of UAVs traversing a fast-moving rectangular gate, the proposed method outperforms other state-of-the-art baseline methods under varying environments in terms of operational robustness. Furthermore, the proposed method significantly reduces overall collision violations and training time, greatly improving both training safety and efficiency. The demonstration video (https://youtu.be/7xkp9U7FSJg) and the source code (https://github.com/ZyyFLY/DCRPO) are also provided.-
dcterms.accessRightsopen accessen_US
dcterms.bibliographicCitationIEEE transactions on intelligent transportation systems, Dec. 2025, v. 26, no. 12, p. 23100-23112-
dcterms.isPartOfIEEE transactions on intelligent transportation systems-
dcterms.issued2025-12-
dc.identifier.scopus2-s2.0-105019099058-
dc.identifier.eissn1558-0016-
dc.description.validate202603 bcjz-
dc.description.oaAccepted Manuscripten_US
dc.identifier.SubFormIDG001270/2026-02en_US
dc.description.fundingSourceOthersen_US
dc.description.fundingTextThis work was supported in part by Hong Kong Innovation and Technology Fund-Innovation and Technology Support Program (ITF-ITSP) under the Project “Safety-Certified Multi-Source Fusion Positioning for Autonomous Vehicles in Complex Scenarios (ZPE8).”en_US
dc.description.pubStatusPublisheden_US
dc.description.oaCategoryGreen (AAM)en_US
Appears in Collections:Journal/Magazine Article
Files in This Item:
File Description SizeFormat 
Zhang_Learning_Safe_Optimal.pdfPre-Published version20.18 MBAdobe PDFView/Open
Open Access Information
Status open access
File Version Final Accepted Manuscript
Access
View full-text via PolyU eLinks SFX Query
Show simple item record

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.