Please use this identifier to cite or link to this item:
http://hdl.handle.net/10397/118033
| DC Field | Value | Language |
|---|---|---|
| dc.contributor | Department of Logistics and Maritime Studies | - |
| dc.contributor | Department of Industrial and Systems Engineering | - |
| dc.creator | Yi, Z | - |
| dc.creator | Xu, M | - |
| dc.creator | Wang, S | - |
| dc.date.accessioned | 2026-03-12T01:03:06Z | - |
| dc.date.available | 2026-03-12T01:03:06Z | - |
| dc.identifier.issn | 0968-090X | - |
| dc.identifier.uri | http://hdl.handle.net/10397/118033 | - |
| dc.language.iso | en | en_US |
| dc.publisher | Elsevier Ltd | en_US |
| dc.rights | © 2026 The Author(s). Published by Elsevier Ltd. This is an open access article under the CC BY license ( http://creativecommons.org/licenses/by/4.0/ ). | en_US |
| dc.rights | The following publication Yi, Z., Xu, M., & Wang, S. (2026). An integrated deep reinforcement learning-linear control strategy for longitudinal control of connected and automated vehicles. Transportation Research Part C: Emerging Technologies, 184, 105541 is available at https://doi.org/10.1016/j.trc.2026.105541. | en_US |
| dc.subject | Connected and automated vehicle | en_US |
| dc.subject | Deep reinforcement learning | en_US |
| dc.subject | Linear controller | en_US |
| dc.subject | String stability | en_US |
| dc.subject | Twin delayed deep deterministic policy gradient algorithm | en_US |
| dc.title | An integrated deep reinforcement learning-linear control strategy for longitudinal control of connected and automated vehicles | en_US |
| dc.type | Journal/Magazine Article | en_US |
| dc.identifier.volume | 184 | - |
| dc.identifier.doi | 10.1016/j.trc.2026.105541 | - |
| dcterms.abstract | String stability is important to maintain the longitudinal control of connected and automated vehicles (CAVs). It prevents the amplification of the perturbations as they propagate through the platoon. A variety of methods based on the deep reinforcement learning (DRL) approach have been proposed for longitudinal control of CAVs, which show excellent performance. However, none of those methods consider string stability on theoretical grounds due to the lack of explicit mathematical models in the DRL approach. To address this problem, we integrate a novel linear controller in a DRL framework for longitudinal control of CAVs, referred to integrated DRL-linear control (IDL) strategy. It can guarantee string stability while striking a good balance among various benefits, including vehicle safety, comfort, and efficiency. We employ the twin delay depth deterministic policy gradient (TD3) algorithm, a promosing DRL, in the proposed framework for decision. Numerical simulation results demonstrate that the proposed approach ensures theoretical string stability while significantly enhancing vehicle safety, comfort, and efficiency compared to human-driven vehicles (HDVs) and a model-based cooperative adaptive cruise control (CACC) strategy. It also outperforms the deep deterministic policy gradient (DDPG) and pure TD3 strategies in terms of safety, comfort, and string stability. These results indicate that the proposed IDL strategy not only benefits from the advantages of the linear controller in analyzing theoretical string stability conditions but also retains the advantage of the DRL approach in terms of optimizing the trade-off between multiple benefits. | - |
| dcterms.accessRights | open access | en_US |
| dcterms.bibliographicCitation | Transportation research. Part C, Emerging technologies, Mar. 2026, v. 184, 105541 | - |
| dcterms.isPartOf | Transportation research. Part C, Emerging technologies | - |
| dcterms.issued | 2026-03 | - |
| dc.identifier.scopus | 2-s2.0-105030332023 | - |
| dc.identifier.eissn | 1879-2359 | - |
| dc.identifier.artn | 105541 | - |
| dc.description.validate | 202603 bcch | - |
| dc.description.oa | Version of Record | en_US |
| dc.identifier.FolderNumber | OA_TA | en_US |
| dc.description.fundingSource | RGC | en_US |
| dc.description.fundingSource | Others | en_US |
| dc.description.fundingText | This work was supported by the National Natural Science Foundation of China [Grant Nos. 72371221, 72361137006], the Research Grants Council of the Hong Kong Special Administrative Region, China [Project number HKSAR RGC TRS T32-707/22-N]. | en_US |
| dc.description.pubStatus | Published | en_US |
| dc.description.TA | Elsevier (2026) | en_US |
| dc.description.oaCategory | TA | en_US |
| Appears in Collections: | Journal/Magazine Article | |
Files in This Item:
| File | Description | Size | Format | |
|---|---|---|---|---|
| 1-s2.0-S0968090X2600029X-main.pdf | 7.47 MB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.



