User: Guest  Login
Document type:
Masterarbeit
Author(s):
Philipp Scholl
Title:
Evaluation of Safe Policy Improvement with Soft Baseline Bootstrapping
Abstract:
Due to the high computing power of modern computers and the increasing availability of data, reinforcement learning is gaining importance in many areas of industry. In safety critical areas, however, reinforcement learning poses dangers, which is why algorithms that promise a safe improvement of the behavior policy are of high interest. Therefore, this master's thesis examines the Soft-SPIBB algorithms introduced in "Safe Policy Improvement with Soft Baseline Bootstrapping" by Nadjahi et al. Som...     »
Supervisor:
Hans-Joachim Bungartz
Advisor:
Felix Dietrich; Clemens Otte; Steffen Udluft
Year:
2021
Quarter:
2. Quartal
Year / month:
2021-04
Month:
Apr
Language:
en
University:
Technical University of Munich
Faculty:
Fakultät für Mathematik
 BibTeX