Benutzer: Gast  Login
Dokumenttyp:
Konferenzbeitrag
Art des Konferenzbeitrags:
Vortrag / Präsentation
Autor(en):
Gottwald, Martin; Shen, Hao; Diepold, Klaus
Titel:
A Critical Point Analysis of Actor-Critic Algorithms with Neural Networks
Abstract:
We investigate Actor-Critic algorithms from the non-convex optimisation perspective. For the past years, powerful Deep Reinforcement Learning algorithms, such as Deep Deterministic Policy Gradients, have been observed to struggle even in tiny toy problems. Yet, only the critic training has been subject to intensive research. To close this gap, we conduct a critical point analysis for the actor training. First, we find that the reward function must satisfy additional conditions next to those for...     »
Stichworte:
Dynamic Programming, Markov Decision Process, Function Approximation, Critical Points, Optimisation
Dewey-Dezimalklassifikation:
620 Ingenieurwissenschaften
Kongress- / Buchtitel:
6th IFAC Conference on Intelligent Control and Automation Sciences
Kongress / Zusatzinformationen:
Cluj-Napoca, Romania
Datum der Konferenz:
13-15 July 2022
Publikationsdatum:
13.07.2022
Jahr:
2022
Quartal:
2. Quartal
Jahr / Monat:
2022-07
Monat:
Jul
Seiten:
6
Reviewed:
ja
Sprache:
en
Erscheinungsform:
Print
TUM Einrichtung:
Lehrstuhl für Datenverarbeitung
Eingabe:
02.08.2022
Letzte Änderung:
02.08.2022
 BibTeX