Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/116640
DC FieldValueLanguage
dc.contributorDepartment of Electrical and Electronic Engineering-
dc.creatorLong, M-
dc.creatorWang, R-
dc.creatorChen, J-
dc.creatorChung, E-
dc.creatorOguchi, T-
dc.date.accessioned2026-01-08T08:24:51Z-
dc.date.available2026-01-08T08:24:51Z-
dc.identifier.issn2168-0566-
dc.identifier.urihttp://hdl.handle.net/10397/116640-
dc.language.isoenen_US
dc.publisherTaylor & Francisen_US
dc.subjectArterial roaden_US
dc.subjectMulti-agent reinforcement learningen_US
dc.subjectTraffic signal controlen_US
dc.subjectTransit signal priorityen_US
dc.titleMARL-based cooperative transit signal priority for the arterial road to reduce schedule delayen_US
dc.typeJournal/Magazine Articleen_US
dc.identifier.volume13-
dc.identifier.issue1-
dc.identifier.doi10.1080/21680566.2025.2564703-
dcterms.abstractTransit signal priority (TSP) is an effective strategy to reduce transit delays and improve intersection efficiency. This paper introduces a Cooperative TSP strategy of Variable phase (CTSPV) using multi-agent reinforcement learning (MARL) to minimize transit schedule delays on arterial roads. The agents adjust phase sequences and durations based on real-time traffic, balancing transit and non-transit vehicle needs, resolving conflicting bus requests, and ensuring agent cooperation. Invalid action masking ensures compliance with green time and phase-skipping rules. Simulation results show CTSPV reduces person delay, queue lengths, and lateness by 8.7%, 31.6%, and 17.0%, respectively, compared to fixed-time signals. Testing different green time constraints highlights the importance of proper restrictions for efficient learning. Analysis of CTSPV's signal timing reveals agents prioritize phases with high traffic demand and bus priority, skipping phases with lower demand. Evaluation results of generalized rule-based strategies based on those RL-derived patterns demonstrate the good performance of RL-learned knowledge.-
dcterms.accessRightsembargoed accessen_US
dcterms.bibliographicCitationTransportmetrica. B, Transport dynamics, 2025, v. 13, no. 1, 2564703-
dcterms.isPartOfTransportmetrica. B, Transport dynamics-
dcterms.issued2025-
dc.identifier.scopus2-s2.0-105018775843-
dc.identifier.eissn2168-0582-
dc.identifier.artn2564703-
dc.description.validate202601 bcjz-
dc.description.oaNot applicableen_US
dc.identifier.SubFormIDG000664/2025-11en_US
dc.description.fundingSourceOthersen_US
dc.description.fundingTextThis work was supported by the Science and Technology Research Program of Chongqing Municipal Education Commission under [grant number KJQN202500504], the Foundation of Chongqing Normal University under [grant numbers 24XWB040 and 25XLB001], the General Program of Chongqing Natural Science Foundation under [grant CSTB2025NSCQ-GPX1008], and the Mainland-Hong Kong Joint Funding Scheme under [grant MHP/038/23].en_US
dc.description.pubStatusPublisheden_US
dc.date.embargo2026-10-08en_US
dc.description.oaCategoryGreen (AAM)en_US
Appears in Collections:Journal/Magazine Article
Open Access Information
Status embargoed access
Embargo End Date 2026-10-08
Access
View full-text via PolyU eLinks SFX Query
Show simple item record

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.