Please use this identifier to cite or link to this item: http://hdl.handle.net/10125/79538

Deep Reinforcement Learning Techniques For Solving Hybrid Flow Shop Scheduling Problems: Proximal Policy Optimization (PPO) and Asynchronous Advantage Actor-Critic (A3C)

File:0164.pdf
Size:535.25 kB
Format:Adobe PDF

Item Summary

Title:Deep Reinforcement Learning Techniques For Solving Hybrid Flow Shop Scheduling Problems: Proximal Policy Optimization (PPO) and Asynchronous Advantage Actor-Critic (A3C)
Authors:Nahhas, Abdulrahman
Kharitonov, Andrey
Turowski, Klaus
Keywords:Intelligent Decision Support for Logistics and Supply Chain Management
asynchronous advantage actor-critic (a3c)
deep reinforcement learning
hybrid flow shop scheduling
proximal policy optimization (ppo)
simulation
Date Issued:04 Jan 2022
Abstract:Well-studied scheduling practices are fundamental to supporting core business processes in any manufacturing environment. In particular, Hybrid Flow Shop (HFS) scheduling problems arise in many manufacturing environments. Recent advances in Deep Reinforcement Learning (DRL) have attracted the attention of both practitioners and academics to its adoption beyond synthetic, game-like applications. We therefore present an approach based on DRL techniques in conjunction with a discrete event simulation model to solve a real-world four-stage HFS scheduling problem. The main idea behind the presented concepts is to expose a DRL agent to a game-like environment using an indirect encoding. Two DRL techniques, namely Proximal Policy Optimization (PPO) and Asynchronous Advantage Actor-Critic (A3C), are evaluated on problems of different complexity. The computational results suggest that the DRL agents successfully learn appropriate policies for solving the investigated problem. In addition, the investigation shows that the agents can adjust their policies when exposed to a different problem. We further evaluate the approach on problem instances published in the literature to establish a comparison.
Pages/Duration:10 pages
URI:http://hdl.handle.net/10125/79538
ISBN:978-0-9981331-5-7
DOI:10.24251/HICSS.2022.206
Rights:Attribution-NonCommercial-NoDerivatives 4.0 International
https://creativecommons.org/licenses/by-nc-nd/4.0/
Appears in Collections: Intelligent Decision Support for Logistics and Supply Chain Management



This item is licensed under a Creative Commons License.