Risk-aware multi-task hybrid-attention multi-agent reinforcement learning for decision-making at unsignalized intersections

Wu, S; Duan, X; Zhou, J; Qu, K; Ho, IWH; Tian, D

doi:10.1109/TVT.2026.3670026

Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/117977

DC Field	Value	Language
dc.contributor	Department of Electrical and Electronic Engineering	en_US
dc.creator	Wu, S	en_US
dc.creator	Duan, X	en_US
dc.creator	Zhou, J	en_US
dc.creator	Qu, K	en_US
dc.creator	Ho, IWH	en_US
dc.creator	Tian, D	en_US
dc.date.accessioned	2026-03-10T03:56:41Z	-
dc.date.available	2026-03-10T03:56:41Z	-
dc.identifier.issn	0018-9545	en_US
dc.identifier.uri	http://hdl.handle.net/10397/117977	-
dc.language.iso	en	en_US
dc.publisher	Institute of Electrical and Electronics Engineers	en_US
dc.subject	Decision-making	en_US
dc.subject	Hybrid-attention mechanism	en_US
dc.subject	Miniature intelligent vehicle	en_US
dc.subject	Multi-agent reinforcement learning	en_US
dc.subject	Unsignalized intersection	en_US
dc.title	Risk-aware multi-task hybrid-attention multi-agent reinforcement learning for decision-making at unsignalized intersections	en_US
dc.type	Journal/Magazine Article	en_US
dc.identifier.doi	10.1109/TVT.2026.3670026	en_US
dcterms.abstract	Multi-Agent Reinforcement Learning (MARL) has demonstrated significant potential for cooperative decision-making in connected and autonomous vehicles (CAVs). However, existing approaches often fail to address the task-specific characteristics and varying requirements of vehicles in high-dynamic, unsignalized intersection scenarios. In these environments, vehicles are frequently exposed to conflict zones where risks, such as collisions, are difficult to perceive, particularly in the absence of traffic signals. Additionally, current methods lack effective mechanisms for balancing learning performance across multiple tasks. To overcome these challenges, we propose a novel multi-task MARL framework tailored for unsignalized intersections. The framework incorporates a hybrid-attention network that captures the influence of surrounding vehicles on different driving tasks, improving multi-agent decision-making. A multi-task diversity priority sampling mechanism is introduced to prioritize high-quality episodes from more complex tasks, enhancing performance in dynamic intersection settings. Furthermore, a risk-aware local decision corrector optimizes decision-making in high-risk conflict zones by enabling vehicles to predict and adapt to surrounding traffic behaviors. The proposed framework is evaluated through simulations, demonstrating superior performance compared to state-of-the-art baselines. A miniature intelligent vehicle testbed further validates its effectiveness and potential for real-world deployment.	en_US
dcterms.accessRights	embargoed access	en_US
dcterms.bibliographicCitation	IEEE transactions on vehicular technology, Date of Publication: 03 March 2026, Early Access, https://doi.org/10.1109/TVT.2026.3670026	en_US
dcterms.isPartOf	IEEE transactions on vehicular technology	en_US
dcterms.issued	2026	-
dc.identifier.eissn	1939-9359	en_US
dc.description.validate	202603 bcch	en_US
dc.description.oa	Not applicable	en_US
dc.identifier.FolderNumber	a4328	-
dc.identifier.SubFormID	52595	-
dc.description.fundingSource	Others	en_US
dc.description.fundingText	This work was supported by the National Natural Science Foundation of China under Grant 62173012, Grant 62432002, Grant U2433202 and Grant U22A2046, the Fundamental Research Funds for the Central Universities (Beihang Ganwei Action Plan Key Program) under Grant JK2024-19.	en_US
dc.description.pubStatus	Early release	en_US
dc.date.embargo	0000-00-00 (to be updated)	en_US
dc.description.oaCategory	Green (AAM)	en_US
Appears in Collections:	Journal/Magazine Article

Open Access Information

Status	embargoed access
Embargo End Date	0000-00-00 (to be updated)

Access

View full-text via PolyU eLinks

Show simple item record

Google Scholar^TM

Check

Open Access Information

Access

Google ScholarTM

Altmetric

Google Scholar^TM