This paper evaluates the use of Deep Reinforcement Learning to control special-purpose automated production systems, which are characterized by multiple end-effectors that are each actuated in only one or two axes. Because these systems have a large number of actuators, of which only a few affect the processing of a workpiece at any given time, they are challenging to learn. In this paper, Deep Q-Learning is applied to a small use case: sorting workpieces by color in a simulation of such a production system. The baseline algorithm is compared to four commonly used extensions: Double Q-Learning, Dueling Networks, Prioritized Experience Replay, and Hindsight Experience Replay. For the scope of this paper, simplifications are applied to the state and action spaces. While the baseline implementation of Deep Q-Learning correctly sorts the 30 workpiece combinations seen during training, it does not reliably generalize to unseen combinations within 35,000 training episodes. In contrast, the algorithm using all four considered extensions generalizes to 80 of the 81 possible workpiece combinations.