User: Guest  Login
Title:

On the Compatibility of Multistep Lookahead and Hessian Approximation for Neural Residual Gradient

Document type:
Konferenzbeitrag
Contribution type:
Poster
Author(s):
Gottwald, Martin; Shen, Hao
Abstract:
In this work, we investigate, how multistep lookahead affects critical points of Residual Gradient algorithms. We set up a compound Bellman Operator for k consecutive transitions similar to TD(λ) methods and analyse the critical points of the associated Mean Squared Bellman Error (MSBE). By collecting per state multiple successors at once, one can create a more informative objective without increasing the requirements for function approximation architectures. In an empirical analysis, we observe...     »
Keywords:
Critical Point Analysis, Gauss Newton Algorithm, Mean Squared Bellman Error, Multistep Lookahead, Residual Gradient
Dewey Decimal Classification:
620 Ingenieurwissenschaften
Book / Congress title:
The Multi-disciplinary Conference on Reinforcement Learning and Decision Making
Congress (additional information):
Providence, USA
Date of publication:
08.06.2022
Year:
2022
Quarter:
2. Quartal
Year / month:
2022-06
Month:
Jun
Pages:
4
Reviewed:
ja
Language:
en
Publication format:
WWW
WWW:
RLDM 2022 Abstract Booklet (Final Version)
TUM Institution:
Lehrstuhl für Datenverarbeitung
Format:
Text
Ingested:
02.08.2022
Last change:
02.08.2022
CC license:
by-nc-sa, http://creativecommons.org/licenses/by-nc-sa/3.0/de
 BibTeX