Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/116640
DC FieldValueLanguage
dc.contributorDepartment of Electrical and Electronic Engineeringen_US
dc.creatorLong, Men_US
dc.creatorWang, Ren_US
dc.creatorChen, Jen_US
dc.creatorChung, Een_US
dc.creatorOguchi, Ten_US
dc.date.accessioned2026-01-08T08:24:51Z-
dc.date.available2026-01-08T08:24:51Z-
dc.identifier.issn2168-0566en_US
dc.identifier.urihttp://hdl.handle.net/10397/116640-
dc.language.isoenen_US
dc.publisherTaylor & Francisen_US
dc.subjectArterial roaden_US
dc.subjectMulti-agent reinforcement learningen_US
dc.subjectTraffic signal controlen_US
dc.subjectTransit signal priorityen_US
dc.titleMARL-based cooperative transit signal priority for the arterial road to reduce schedule delayen_US
dc.typeJournal/Magazine Articleen_US
dc.identifier.volume13en_US
dc.identifier.issue1en_US
dc.identifier.doi10.1080/21680566.2025.2564703en_US
dcterms.abstractTransit signal priority (TSP) is an effective strategy to reduce transit delays and improve intersection efficiency. This paper introduces a Cooperative TSP strategy of Variable phase (CTSPV) using multi-agent reinforcement learning (MARL) to minimize transit schedule delays on arterial roads. The agents adjust phase sequences and durations based on real-time traffic, balancing transit and non-transit vehicle needs, resolving conflicting bus requests, and ensuring agent cooperation. Invalid action masking ensures compliance with green time and phase-skipping rules. Simulation results show CTSPV reduces person delay, queue lengths, and lateness by 8.7%, 31.6%, and 17.0%, respectively, compared to fixed-time signals. Testing different green time constraints highlights the importance of proper restrictions for efficient learning. Analysis of CTSPV's signal timing reveals agents prioritize phases with high traffic demand and bus priority, skipping phases with lower demand. Evaluation results of generalized rule-based strategies based on those RL-derived patterns demonstrate the good performance of RL-learned knowledge.en_US
dcterms.accessRightsembargoed accessen_US
dcterms.bibliographicCitationTransportmetrica. B, Transport dynamics, 2025, v. 13, no. 1, 2564703en_US
dcterms.isPartOfTransportmetrica. B, Transport dynamicsen_US
dcterms.issued2025-
dc.identifier.scopus2-s2.0-105018775843-
dc.identifier.eissn2168-0582en_US
dc.identifier.artn2564703en_US
dc.description.validate202601 bcjzen_US
dc.description.oaNot applicableen_US
dc.identifier.SubFormIDG000664/2025-11-
dc.description.fundingSourceOthersen_US
dc.description.fundingTextThis work was supported by the Science and Technology Research Program of Chongqing Municipal Education Commission under [grant number KJQN202500504], the Foundation of Chongqing Normal University under [grant numbers 24XWB040 and 25XLB001], the General Program of Chongqing Natural Science Foundation under [grant CSTB2025NSCQ-GPX1008], and the Mainland-Hong Kong Joint Funding Scheme under [grant MHP/038/23].en_US
dc.description.pubStatusPublisheden_US
dc.date.embargo2026-10-08en_US
dc.description.oaCategoryGreen (AAM)en_US
Appears in Collections:Journal/Magazine Article
Open Access Information
Status embargoed access
Embargo End Date 2026-10-08
Access
View full-text via PolyU eLinks SFX Query
Show simple item record

SCOPUSTM   
Citations

1
Citations as of Apr 3, 2026

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.