An integrated reinforcement learning and centralized programming approach for online taxi dispatching

Liang, E; Wen, K; Lam, WHK; Sumalee, A; Zhong, R

doi:10.1109/TNNLS.2021.3060187

Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/97450

Title:	An integrated reinforcement learning and centralized programming approach for online taxi dispatching
Authors:	Liang, E; Wen, K; Lam, WHK; Sumalee, A; Zhong, R
Issue Date:	Sep-2022
Source:	IEEE transactions on neural networks and learning systems, Sept. 2022, v. 33, no. 9, p. 4742-4756
Abstract:	Balancing the supply and demand for ride-sourcing companies is a challenging issue, especially with real-time requests and stochastic traffic conditions of large-scale congested road networks. To tackle this challenge, this article proposes a robust and scalable approach that integrates reinforcement learning (RL) and a centralized programming (CP) structure to promote real-time taxi operations. Both real-time order matching decisions and vehicle relocation decisions at the microscopic network scale are integrated within a Markov decision process framework. The RL component learns the decomposed state-value function, which represents the taxi drivers' experience, the off-line historical demand pattern, and the traffic network congestion. The CP component plans nonmyopic decisions for drivers collectively under the prescribed system constraints to explicitly realize cooperation. Furthermore, to circumvent sparse reward and sample imbalance problems over the microscopic road network, this article proposes a temporal-difference learning algorithm with prioritized gradient descent and adaptive exploration techniques. A simulator is built and trained with the Manhattan road network and New York City yellow taxi data to simulate the real-time vehicle dispatching environment. Both centralized and decentralized taxi dispatching policies are examined with the simulator. This case study shows that the proposed approach can further improve taxi drivers' profits while reducing customers' waiting times compared to several existing vehicle dispatching algorithms.
Keywords:	Deep reinforcement learning (RL); Multiagent system; Online vehicle routing; Stochastic network traffic; Vehicle dispatching
Publisher:	Institute of Electrical and Electronics Engineers
Journal:	IEEE transactions on neural networks and learning systems
ISSN:	2162-237X
EISSN:	2162-2388
DOI:	10.1109/TNNLS.2021.3060187
Rights:	© 2021 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. The following publication Liang, E., Wen, K., Lam, W. H., Sumalee, A., & Zhong, R. (2021). An integrated reinforcement learning and centralized programming approach for online taxi dispatching. IEEE Transactions on Neural Networks and Learning Systems, 33(9), 4742-4756 is available at https://doi.org/10.1109/TNNLS.2021.3060187.
Appears in Collections:	Journal/Magazine Article

Files in This Item:

File	Description	Size	Format
Lam_Integrated_Reinforcement_Learning.pdf	Pre-Published version	10.56 MB	Adobe PDF

Open Access Information

Status	open access
File Version	Final Accepted Manuscript


Page views	135 (last week: 0), as of Apr 12, 2026
Downloads	1,035, as of Apr 12, 2026
Scopus citations	60, as of May 8, 2026
Web of Science citations	63, as of Apr 23, 2026
