Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/97450
PIRA download icon_1.1View/Download Full Text
DC FieldValueLanguage
dc.contributorDepartment of Civil and Environmental Engineeringen_US
dc.creatorLiang, Een_US
dc.creatorWen, Ken_US
dc.creatorLam, WHKen_US
dc.creatorSumalee, Aen_US
dc.creatorZhong, Ren_US
dc.date.accessioned2023-03-06T01:18:36Z-
dc.date.available2023-03-06T01:18:36Z-
dc.identifier.issn2162-237Xen_US
dc.identifier.urihttp://hdl.handle.net/10397/97450-
dc.language.isoenen_US
dc.publisherInstitute of Electrical and Electronics Engineersen_US
dc.rights© 2021 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.en_US
dc.rightsThe following publication Liang, E., Wen, K., Lam, W. H., Sumalee, A., & Zhong, R. (2021). An integrated reinforcement learning and centralized programming approach for online taxi dispatching. IEEE Transactions on Neural Networks and Learning Systems, 33(9), 4742-4756 is available at https://doi.org/10.1109/TNNLS.2021.3060187.en_US
dc.subjectDeep reinforcement learning (RL)en_US
dc.subjectMultiagent systemen_US
dc.subjectOnline vehicle routingen_US
dc.subjectStochastic network trafficen_US
dc.subjectVehicle dispatchingen_US
dc.titleAn integrated reinforcement learning and centralized programming approach for online taxi dispatchingen_US
dc.typeJournal/Magazine Articleen_US
dc.identifier.spage4742en_US
dc.identifier.epage4756en_US
dc.identifier.volume33en_US
dc.identifier.issue9en_US
dc.identifier.doi10.1109/TNNLS.2021.3060187en_US
dcterms.abstractBalancing the supply and demand for ride-sourcing companies is a challenging issue, especially with real-time requests and stochastic traffic conditions of large-scale congested road networks. To tackle this challenge, this article proposes a robust and scalable approach that integrates reinforcement learning (RL) and a centralized programming (CP) structure to promote real-time taxi operations. Both real-time order matching decisions and vehicle relocation decisions at the microscopic network scale are integrated within a Markov decision process framework. The RL component learns the decomposed state-value function, which represents the taxi drivers' experience, the off-line historical demand pattern, and the traffic network congestion. The CP component plans nonmyopic decisions for drivers collectively under the prescribed system constraints to explicitly realize cooperation. Furthermore, to circumvent sparse reward and sample imbalance problems over the microscopic road network, this article proposed a temporal-difference learning algorithm with prioritized gradient descent and adaptive exploration techniques. A simulator is built and trained with the Manhattan road network and New York City yellow taxi data to simulate the real-time vehicle dispatching environment. Both centralized and decentralized taxi dispatching policies are examined with the simulator. This case study shows that the proposed approach can further improve taxi drivers' profits while reducing customers' waiting times compared to several existing vehicle dispatching algorithms.en_US
dcterms.accessRightsopen accessen_US
dcterms.bibliographicCitationIEEE transactions on neural networks and learning systems, Sept. 2022, v. 33, no. 9, p. 4742-4756en_US
dcterms.isPartOfIEEE transactions on neural networks and learning systemsen_US
dcterms.issued2022-09-
dc.identifier.scopus2-s2.0-85102239222-
dc.identifier.eissn2162-2388en_US
dc.description.validate202203 bcfcen_US
dc.description.oaAccepted Manuscripten_US
dc.identifier.FolderNumberCEE-0506-
dc.description.fundingSourceSelf-fundeden_US
dc.description.pubStatusPublisheden_US
dc.identifier.OPUS46480150-
dc.description.oaCategoryGreen (AAM)en_US
Appears in Collections:Journal/Magazine Article
Files in This Item:
File Description SizeFormat 
Lam_Integrated_Reinforcement_Learning.pdfPre-Published version10.56 MBAdobe PDFView/Open
Open Access Information
Status open access
File Version Final Accepted Manuscript
Access
View full-text via PolyU eLinks SFX Query
Show simple item record

Page views

109
Last Week
3
Last month
Citations as of Nov 9, 2025

Downloads

535
Citations as of Nov 9, 2025

SCOPUSTM   
Citations

49
Citations as of Dec 19, 2025

WEB OF SCIENCETM
Citations

55
Citations as of Dec 18, 2025

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.