Deep Multi-Agent Reinforcement Learning using DNN-Weight Evolution to Optimize Supply Chain Performance

Fuji, TaikiIto, KiyotoMatsumoto, KohseiYano, Kazuo2017-12-282017-12-282018-01-03978-0-9981331-1-9http://hdl.handle.net/10125/50044To develop a supply chain management (SCM) system that performs optimally for both each entity in the chain and the entire chain, a multi-agent reinforcement learning (MARL) technique has been developed. To solve two problems of the MARL for SCM (building a Markov decision processes for a supply chain and avoiding learning stagnation in a way similar to the "prisoner's dilemma"), a learning management method with deep-neural-network (DNN)-weight evolution (LM-DWE) has been developed. By using a beer distribution game (BDG) as an example of a supply chain, experiments with a four-agent system were performed. Consequently, the LM-DWE successfully solved the above two problems and achieved 80.0% lower total cost than expert players of the BDG.10 pagesengAttribution-NonCommercial-NoDerivatives 4.0 InternationalIntelligent Decision Support for Logistics and Supply Chain ManagementDeep learning, Evolutionary computation, Multi-agent reinforcement learning, Multi-agent system, Supply chain managementDeep Multi-Agent Reinforcement Learning using DNN-Weight Evolution to Optimize Supply Chain PerformanceConference Paper10.24251/HICSS.2018.157