Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/117977
DC FieldValueLanguage
dc.contributorDepartment of Electrical and Electronic Engineeringen_US
dc.creatorWu, Sen_US
dc.creatorDuan, Xen_US
dc.creatorZhou, Jen_US
dc.creatorQu, Ken_US
dc.creatorHo, IWHen_US
dc.creatorTian, Den_US
dc.date.accessioned2026-03-10T03:56:41Z-
dc.date.available2026-03-10T03:56:41Z-
dc.identifier.issn0018-9545en_US
dc.identifier.urihttp://hdl.handle.net/10397/117977-
dc.language.isoenen_US
dc.publisherInstitute of Electrical and Electronics Engineersen_US
dc.subjectDecision-makingen_US
dc.subjectHybrid-attention mechanismen_US
dc.subjectMiniature intelligent vehicleen_US
dc.subjectMulti-agent reinforcement learningen_US
dc.subjectUnsignalized intersectionen_US
dc.titleRisk-aware multi-task hybrid-attention multi-agent reinforcement learning for decision-making at unsignalized intersectionsen_US
dc.typeJournal/Magazine Articleen_US
dc.identifier.doi10.1109/TVT.2026.3670026en_US
dcterms.abstractMulti-Agent Reinforcement Learning (MARL) has demonstrated significant potential for cooperative decision-making in connected and autonomous vehicles (CAVs). However, existing approaches often fail to address the task-specific characteristics and varying requirements of vehicles in high-dynamic, unsignalized intersection scenarios. In these environments, vehicles are frequently exposed to conflict zones where risks, such as collisions, are difficult to perceive, particularly in the absence of traffic signals. Additionally, current methods lack effective mechanisms for balancing learning performance across multiple tasks. To overcome these challenges, we propose a novel multi-task MARL framework tailored for unsignalized intersections. The framework incorporates a hybrid-attention network that captures the influence of surrounding vehicles on different driving tasks, improving multi-agent decision-making. A multi-task diversity priority sampling mechanism is introduced to prioritize high-quality episodes from more complex tasks, enhancing performance in dynamic intersection settings. Furthermore, a risk-aware local decision corrector optimizes decision-making in high-risk conflict zones by enabling vehicles to predict and adapt to surrounding traffic behaviors. The proposed framework is evaluated through simulations, demonstrating superior performance compared to state-of-the-art baselines. A miniature intelligent vehicle testbed further validates its effectiveness and potential for real-world deployment.en_US
dcterms.accessRightsembargoed accessen_US
dcterms.bibliographicCitationIEEE transactions on vehicular technology, Date of Publication: 03 March 2026, Early Access, https://doi.org/10.1109/TVT.2026.3670026en_US
dcterms.isPartOfIEEE transactions on vehicular technologyen_US
dcterms.issued2026-
dc.identifier.eissn1939-9359en_US
dc.description.validate202603 bcchen_US
dc.description.oaNot applicableen_US
dc.identifier.FolderNumbera4328-
dc.identifier.SubFormID52595-
dc.description.fundingSourceOthersen_US
dc.description.fundingTextThis work was supported by the National Natural Science Foundation of China under Grant 62173012, Grant 62432002, Grant U2433202 and Grant U22A2046, the Fundamental Research Funds for the Central Universities (Beihang Ganwei Action Plan Key Program) under Grant JK2024-19.en_US
dc.description.pubStatusEarly releaseen_US
dc.date.embargo0000-00-00 (to be updated)en_US
dc.description.oaCategoryGreen (AAM)en_US
Appears in Collections:Journal/Magazine Article
Open Access Information
Status embargoed access
Embargo End Date 0000-00-00 (to be updated)
Access
View full-text via PolyU eLinks SFX Query
Show simple item record

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.