Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/99698
PIRA download icon_1.1View/Download Full Text
DC FieldValueLanguage
dc.contributorDepartment of Computingen_US
dc.creatorChen, Jen_US
dc.creatorCao, Jen_US
dc.creatorCheng, Zen_US
dc.creatorLi, Wen_US
dc.date.accessioned2023-07-18T07:31:56Z-
dc.date.available2023-07-18T07:31:56Z-
dc.identifier.urihttp://hdl.handle.net/10397/99698-
dc.language.isoenen_US
dc.rights© 2023 International Foundation for Autonomous Agents and Multiagent Systems (www.ifaamas.org). All rights reserved.en_US
dc.rightsPosted with permission of the International Foundation for Autonomous Agents and Multiagent Systems (IFAAMAS).en_US
dc.rightsThe Version of Record is available online at the ACM Digital Library: https://dl.acm.org/doi/10.5555/3545946.3598797en_US
dc.rightsThe Version of Record is available online at the IFAAMAS repository: https://www.southampton.ac.uk/~eg/AAMAS2023/forms/contents.htmen_US
dc.subjectCollision avoidanceen_US
dc.subjectMulti-robot navigationen_US
dc.subjectReward shapingen_US
dc.subjectReinforcement learningen_US
dc.titleMitigating imminent collision for multi-robot navigation : a TTC-force reward shaping approachen_US
dc.typeConference Paperen_US
dc.identifier.spage1448en_US
dc.identifier.epage1456en_US
dc.identifier.doi10.5555/3545946.3598797en_US
dcterms.abstractWe study the distributed multi-robot navigation problem, which refers to a group of mobile robots avoiding collision with each other while navigating from their start positions to the goal positions. Existing works still suffer from two limitations: 1) accurately quantify the risk of collisions for heterogeneous robots and 2) effectively capture the state representation under dynamic environments. These limitations make the heterogeneous robots prone to collisions in high-density and dynamic environments. This work proposes a new time-to-collision force (TTC-force) reward shaping approach, termed Tfresh, incorporating reinforcement learning to learn a policy that adaptively chooses the optimal actions to mitigate the imminent collision. Specifically, we use TTC-force to quantify the risk of each robot exerted by its neighbors and shape the reward signal with TTC-force in applying the reinforcement learning scheme. Meanwhile, we design the spatial attention mechanism involving the dynamic adjacent matrix to capture the state representation effectively. We evaluate the learned policy in numerous simulated scenarios in which groups of mobile robots perform navigation tasks. The experimental results demonstrate that our approach outperforms the state-of-the-art methods regarding success rate, travel distance, and travel time.en_US
dcterms.accessRightsopen accessen_US
dcterms.bibliographicCitationIn Proceedings of the 22nd International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2023), London, United Kingdom, May 29 – June 2, 2023, IFAAMAS, p. 1448–1456en_US
dcterms.issued2023-
dc.relation.ispartofbookProceedings of the 22nd International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2023)en_US
dc.relation.conferenceInternational Conference on Autonomous Agents and Multiagent Systems [AAMAS]en_US
dc.description.validate202307 bcwhen_US
dc.description.oaVersion of Recorden_US
dc.identifier.FolderNumbera2265-
dc.identifier.SubFormID47267-
dc.description.fundingSourceRGCen_US
dc.description.fundingSourceOthersen_US
dc.description.fundingTextResearch Institute for Artificial Intelligence of Things, The Hong Kong Polytechnic University; National Key Research and Development Program of Chinaen_US
dc.description.pubStatusPublisheden_US
dc.description.oaCategoryPublisher permissionen_US
Appears in Collections:Conference Paper
Files in This Item:
File Description SizeFormat 
Chen_Mitigating_Imminent_Collision.pdf1 MBAdobe PDFView/Open
Open Access Information
Status open access
File Version Version of Record
Access
View full-text via PolyU eLinks SFX Query
Show simple item record

Page views

144
Citations as of Apr 13, 2025

Downloads

50
Citations as of Apr 13, 2025

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.