Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/118072
DC FieldValueLanguage
dc.contributorDepartment of Aeronautical and Aviation Engineeringen_US
dc.creatorZhu, Cen_US
dc.creatorNg, KKHen_US
dc.creatorChan, PWen_US
dc.creatorLiu, Yen_US
dc.creatorLeung, CYYen_US
dc.date.accessioned2026-03-12T01:17:27Z-
dc.date.available2026-03-12T01:17:27Z-
dc.identifier.issn1366-5545en_US
dc.identifier.urihttp://hdl.handle.net/10397/118072-
dc.language.isoenen_US
dc.publisherElsevier Ltden_US
dc.subjectAir traffic controlen_US
dc.subjectDynamic weather hazarden_US
dc.subjectQuickest priority-based conflict resolutionen_US
dc.subjectSelf-attention mechanismen_US
dc.subjectTrajectory optimisationen_US
dc.titlePriority-driven reinforcement learning for multi-aircraft trajectory optimisation under dynamic weather hazardsen_US
dc.typeJournal/Magazine Articleen_US
dc.identifier.volume205en_US
dc.identifier.doi10.1016/j.tre.2025.104496en_US
dcterms.abstractReinforcement Learning (RL) has emerged as a state-of-the-art technique for addressing challenges in air traffic control, and weather hazards and flight procedures can contribute to information biases when applying RL to real-world scenarios. This research focuses on the 3D Multi-Aircraft Trajectory Optimisation (3D-MATO) problem under dynamic weather hazards within the Terminal Manoeuvring Area and addresses the aforementioned concerns. We propose an integrated RL-based algorithm incorporating weather avoidance and quick conflict resolution. Given observed weather radar in Flight Information Regions (FIRs), we introduce the Dynamic Fast Marching Method (DFMM) algorithm to reroute flight paths at smaller time intervals, ensuring safer navigation around hazardous regions. To enhance decision-making quality, we develop a Quickest Priority-based Conflict Resolution (QPCR) strategy, which optimises approach sequences and refines available action choices. The RL agent is trained using a Deep Deterministic Policy Gradient (DDPG) framework, and further enhanced with a self-attention mechanism. A numerical study modelled the real-world approach procedures at Hong Kong International Airport involving varying numbers of approach aircraft under dynamic weather hazards. Results demonstrate the high efficiency and effectiveness of the proposed algorithm under traffic mix and weather conditions, highlighting the contributions of its key strategies and individual components.en_US
dcterms.accessRightsembargoed accessen_US
dcterms.bibliographicCitationTransportation research. Part E, Logistics and transportation review, Jan. 2026, v. 205, 104496en_US
dcterms.isPartOfTransportation research. Part E, Logistics and transportation reviewen_US
dcterms.issued2026-01-
dc.identifier.scopus2-s2.0-105022191268-
dc.identifier.eissn1878-5794en_US
dc.identifier.artn104496en_US
dc.description.validate202603 bchyen_US
dc.description.oaNot applicableen_US
dc.identifier.SubFormIDG001174/2026-01-
dc.description.fundingSourceRGCen_US
dc.description.fundingSourceOthersen_US
dc.description.fundingTextThe work described in this paper was supported by grants from the Research Grants Council, the Hong Kong Government (Grant no. PolyU15201423), Department of Aeronautical and Aviation Engineering, The Hong Kong Polytechnic University, Hong Kong SAR (RMBU, RJ78), the National Natural Science Foundation of China (Grant number: 72301229), and the Research Institute of Sustainable Urban Development (BBG5).en_US
dc.description.pubStatusPublisheden_US
dc.date.embargo2029-01-31en_US
dc.description.oaCategoryGreen (AAM)en_US
Appears in Collections:Journal/Magazine Article
Open Access Information
Status embargoed access
Embargo End Date 2029-01-31
Access
View full-text via PolyU eLinks SFX Query
Show simple item record

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.