User: Guest  Login
Document type:
Konferenzbeitrag
Contribution type:
Vortrag / Präsentation
Author(s):
Gottwald, Martin; Shen, Hao; Diepold, Klaus
Title:
A Critical Point Analysis of Actor-Critic Algorithms with Neural Networks
Abstract:
We investigate Actor-Critic algorithms from the non-convex optimisation perspective. For the past years, powerful Deep Reinforcement Learning algorithms, such as Deep Deterministic Policy Gradients, have been observed to struggle even in tiny toy problems. Yet, only the critic training has been subject to intensive research. To close this gap, we conduct a critical point analysis for the actor training. First, we find that the reward function must satisfy additional conditions next to those for...     »
Keywords:
Dynamic Programming, Markov Decision Process, Function Approximation, Critical Points, Optimisation
Dewey Decimal Classification:
620 Ingenieurwissenschaften
Book / Congress title:
6th IFAC Conference on Intelligent Control and Automation Sciences
Congress (additional information):
Cluj-Napoca, Romania
Date of congress:
13-15 July 2022
Date of publication:
13.07.2022
Year:
2022
Quarter:
2. Quartal
Year / month:
2022-07
Month:
Jul
Pages:
6
Reviewed:
ja
Language:
en
Publication format:
Print
TUM Institution:
Lehrstuhl für Datenverarbeitung
Ingested:
02.08.2022
Last change:
02.08.2022
 BibTeX