Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/115393
DC Field | Value | Language
dc.contributor | Department of Industrial and Systems Engineering | -
dc.contributor | Research Institute for Advanced Manufacturing | -
dc.creator | Sun, M | -
dc.creator | Ding, J | -
dc.creator | Zhao, Z | -
dc.creator | Chen, J | -
dc.creator | Huang, GQ | -
dc.creator | Wang, L | -
dc.date.accessioned | 2025-09-23T03:16:44Z | -
dc.date.available | 2025-09-23T03:16:44Z | -
dc.identifier.issn | 0736-5845 | -
dc.identifier.uri | http://hdl.handle.net/10397/115393 | -
dc.language.iso | en | en_US
dc.publisher | Pergamon Press | en_US
dc.subject | Out-of-order | en_US
dc.subject | Dynamic scheduling | en_US
dc.subject | Additive manufacturing | en_US
dc.subject | Dynamic order arrival | en_US
dc.subject | Dueling DQN | en_US
dc.title | Out-of-order execution enabled deep reinforcement learning for dynamic additive manufacturing scheduling | en_US
dc.type | Journal/Magazine Article | en_US
dc.identifier.volume | 91 | -
dc.identifier.doi | 10.1016/j.rcim.2024.102841 | -
dcterms.abstract | Additive Manufacturing (AM) has revolutionized the production landscape by enabling on-demand customized manufacturing. However, the efficient management of dynamic AM orders poses significant challenges for production planning and scheduling. This paper addresses the dynamic scheduling problem considering batch processing, random order arrivals, and machine eligibility constraints, aiming to minimize total tardiness in a parallel non-identical AM machine environment. To tackle this problem, we propose the out-of-order enabled dueling deep Q-network (O3-DDQN) approach. In the proposed approach, the problem is formulated as a Markov decision process (MDP). Three-dimensional features, encompassing dynamic orders, AM machines, and delays, are extracted using a ‘look around’ method to represent the production status at a rescheduling point. Additionally, five novel composite scheduling rules based on the out-of-order principle are introduced for selection when an AM machine completes processing or a new order arrives. Moreover, we design a reward function that is strongly correlated with the objective to evaluate the agent's chosen action. Experimental results demonstrate the superiority of the O3-DDQN approach over single scheduling rules, randomly selected rules, and the classic DQN method. The average improvement rate in performance reaches 13.09% compared to composite scheduling rules and random rules, and the O3-DDQN outperforms the classic DQN agent with a 6.54% improvement rate. The O3-DDQN algorithm improves scheduling in dynamic AM environments, enhancing productivity and on-time delivery. This research contributes to advancing AM production and offers insights into efficient resource allocation. | -
dcterms.accessRights | embargoed access | en_US
dcterms.bibliographicCitation | Robotics and Computer-Integrated Manufacturing, Feb. 2025, v. 91, 102841 | -
dcterms.isPartOf | Robotics and Computer-Integrated Manufacturing | -
dcterms.issued | 2025-02 | -
dc.identifier.scopus | 2-s2.0-85199899989 | -
dc.identifier.artn | 102841 | -
dc.description.validate | 202509 bcrc | -
dc.description.oa | Not applicable | en_US
dc.identifier.FolderNumber | a4084b | en_US
dc.identifier.SubFormID | 52057 | en_US
dc.description.fundingSource | RGC | en_US
dc.description.fundingText | National Natural Science Foundation of China (No. 52305557); Guangdong Basic and Applied Basic Research Foundation (No. 2024A1515011930); Innovation and Technology Fund (No. PRP/038/24LI); Open Fund of State Key Laboratory of Intelligent Manufacturing Equipment and Technology (No. IMETKF2024022) | en_US
dc.description.pubStatus | Published | en_US
dc.date.embargo | 2027-02-28 | en_US
dc.description.oaCategory | Green (AAM) | en_US
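Editor's note: the abstract above describes a dueling deep Q-network agent that, at each rescheduling point, scores the five composite scheduling rules and applies the best one. As a rough, hypothetical sketch of the dueling architecture the abstract names (not the paper's implementation; the state dimension and layer sizes below are assumptions made for illustration):

import torch
import torch.nn as nn

class DuelingQNetwork(nn.Module):
    """Dueling Q-network: a shared encoder feeding value and advantage streams."""
    def __init__(self, state_dim: int, n_actions: int, hidden: int = 64):
        super().__init__()
        # Shared encoder over the production-state features
        # (order, machine, and delay features in the paper's formulation).
        self.features = nn.Sequential(
            nn.Linear(state_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, hidden), nn.ReLU(),
        )
        self.value = nn.Linear(hidden, 1)              # V(s): scalar state value
        self.advantage = nn.Linear(hidden, n_actions)  # A(s, a): per-rule advantage

    def forward(self, state: torch.Tensor) -> torch.Tensor:
        h = self.features(state)
        v = self.value(h)
        a = self.advantage(h)
        # Dueling aggregation: Q(s, a) = V(s) + A(s, a) - mean over a' of A(s, a').
        return v + a - a.mean(dim=1, keepdim=True)

# Example: score the five composite rules from a hypothetical 12-feature state.
net = DuelingQNetwork(state_dim=12, n_actions=5)
q = net(torch.randn(1, 12))
action = int(q.argmax(dim=1))  # index of the rule a greedy policy would apply

Subtracting the mean advantage is the standard identifiability constraint in dueling architectures: it stops V and A from drifting by arbitrary offsets while leaving the resulting Q-values unchanged.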
Appears in Collections: Journal/Magazine Article
Open Access Information
Status: embargoed access
Embargo End Date: 2027-02-28
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.