Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/111321
DC FieldValueLanguage
dc.contributorDepartment of Industrial and Systems Engineeringen_US
dc.creatorTsang, YPen_US
dc.creatorMo, DYen_US
dc.creatorChung, KTen_US
dc.creatorLee, CKMen_US
dc.date.accessioned2025-02-17T08:29:37Z-
dc.date.available2025-02-17T08:29:37Z-
dc.identifier.issn0166-3615en_US
dc.identifier.urihttp://hdl.handle.net/10397/111321-
dc.language.isoenen_US
dc.publisherElsevier BVen_US
dc.subject3D bin packing problemen_US
dc.subjectDeep reinforcement learningen_US
dc.subjectDual bin strategyen_US
dc.subjectOnline optimisationen_US
dc.subjectRobotic warehouseen_US
dc.titleA deep reinforcement learning approach for online and concurrent 3D bin packing optimisation with bin replacement strategiesen_US
dc.typeJournal/Magazine Articleen_US
dc.identifier.volume164en_US
dcterms.abstractIn the realm of robotic palletisation, the quest for optimal space utilization remains vital but also presents a critical challenge, particularly due to the constraints of decision complexity and the need for real-time decision-making without complete prior information. The widely adopted rule-based heuristics approaches were ease to use, but failed to adapt dynamically to the complex and changing landscape of online 3D bin packing. This study is motivated by the need for a system that is both more agile and intelligent, capable of managing the intricacies of dual-bin scenarios and the variable inflow of items. This study introduces a novel deep reinforcement learning (DRL) optimiser, employing a double deep Q-network (DDQN) to obtain optimal packing policies in an online environment with two proposed bin replacement strategies. This approach surpasses the limitations of previous methods by facilitating the simultaneous management of multiple bins and enabling on-the-fly adjustments to decisions based on limited prior knowledge. In a case study involving a logistics company, the proposed optimizer demonstrated a significant improvement in average space utilization across various lookahead scenarios, outperforming traditional heuristics in simulation experiments. The proposed optimiser contributes significantly to the economic and environmental sustainability of robotic warehouses, positioning itself as a cornerstone for the future of smart logistics.en_US
dcterms.accessRightsembargoed accessen_US
dcterms.bibliographicCitationComputers in industry, Jan. 2025, v. 164, 104202en_US
dcterms.isPartOfComputers in industryen_US
dcterms.issued2025-01-
dc.identifier.eissn1872-6194en_US
dc.identifier.artn104202en_US
dc.description.validate202502 bcchen_US
dc.description.oaNot applicableen_US
dc.identifier.FolderNumbera3407-
dc.identifier.SubFormID50066-
dc.description.fundingSourceRGCen_US
dc.description.fundingSourceOthersen_US
dc.description.fundingTextThe Hong Kong Polytechnic Universityen_US
dc.description.pubStatusPublisheden_US
dc.date.embargo2027-01-31en_US
dc.description.oaCategoryGreen (AAM)en_US
Appears in Collections:Journal/Magazine Article
Open Access Information
Status embargoed access
Embargo End Date 2027-01-31
Access
View full-text via PolyU eLinks SFX Query
Show simple item record

Page views

35
Citations as of Apr 14, 2025

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.