TY - JOUR
T1 - Approximate Q-Learning for Stacking Problems with Continuous Production and Retrieval
AU - Scagnetti, Judith
AU - Beham, Andreas
AU - Wagner, Stefan
AU - Affenzeller, Michael
N1 - Publisher Copyright:
© 2018, © 2018 Taylor & Francis.
Copyright:
Copyright 2019 Elsevier B.V., All rights reserved.
PY - 2019/1/2
Y1 - 2019/1/2
N2 - This paper presents for the first time a reinforcement learning algorithm with function approximation for stacking problems with continuous production and retrieval. The stacking problem is a hard combinatorial optimization problem. It deals with the arrangement of items in a localized area, where they are organized into stacks to allow a delivery in a required order. Due to the characteristics of stacking problems, for example, the high number of states, reinforcement learning is an appropriate method since it allows learning in an unknown environment. We apply a Sarsa (λ) algorithm to real-world problem instances arising in steel industry. We use linear function approximation and elaborate promising characteristics of instances for this method. Further, we propose features that do not require specific knowledge about the environment and hence are applicable to any stacking problem with similar characteristics. In our experiments we show fast learning of the applied method and it’s suitability for real-world instances.
AB - This paper presents for the first time a reinforcement learning algorithm with function approximation for stacking problems with continuous production and retrieval. The stacking problem is a hard combinatorial optimization problem. It deals with the arrangement of items in a localized area, where they are organized into stacks to allow a delivery in a required order. Due to the characteristics of stacking problems, for example, the high number of states, reinforcement learning is an appropriate method since it allows learning in an unknown environment. We apply a Sarsa (λ) algorithm to real-world problem instances arising in steel industry. We use linear function approximation and elaborate promising characteristics of instances for this method. Further, we propose features that do not require specific knowledge about the environment and hence are applicable to any stacking problem with similar characteristics. In our experiments we show fast learning of the applied method and it’s suitability for real-world instances.
UR - http://www.scopus.com/inward/record.url?scp=85056145380&partnerID=8YFLogxK
U2 - 10.1080/08839514.2018.1525852
DO - 10.1080/08839514.2018.1525852
M3 - Article
SN - 0883-9514
VL - 33
SP - 68
EP - 86
JO - Applied Artificial Intelligence
JF - Applied Artificial Intelligence
IS - 1
ER -