The family of Gradient Temporal-Difference (GTD) learning algorithms shares the appealing property of being stable under both linear function approximation and off-policy training. The success of the GTD family, however, requires a suitable set of features, which is unfortunately not always available in practice. To overcome this difficulty, regularization is often employed as an effective method for feature selection in reinforcement learning. In the present work, we propose and investigate a family of L1-regularized GTD learning algorithms.
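To make the idea concrete, the sketch below shows one plausible way to combine a GTD2-style update with L1 regularization via a proximal (soft-thresholding) step. This is a minimal illustration under assumed update rules and illustrative parameter names (`alpha`, `beta`, `lam`, `rho`), not the specific algorithm proposed in the paper.

```python
import numpy as np

def soft_threshold(x, tau):
    """Proximal operator of the L1 norm: shrink each coordinate toward zero."""
    return np.sign(x) * np.maximum(np.abs(x) - tau, 0.0)

def gtd2_l1_step(theta, w, phi, phi_next, reward, rho,
                 gamma=0.99, alpha=0.01, beta=0.05, lam=0.001):
    """One GTD2-style update followed by an L1 proximal step (illustrative sketch).

    theta    : primary weights of the linear value-function approximation
    w        : auxiliary weights maintained by GTD2
    phi      : feature vector of the current state
    phi_next : feature vector of the next state
    rho      : importance-sampling ratio for off-policy correction
    lam      : L1 regularization strength (assumed hyperparameter)
    """
    # Off-policy TD error under linear function approximation.
    delta = reward + gamma * theta @ phi_next - theta @ phi

    # Auxiliary-weight update (estimates the projection of the expected TD error).
    w = w + beta * rho * (delta - phi @ w) * phi

    # Primary-weight update along the gradient-correction direction.
    theta = theta + alpha * rho * (phi - gamma * phi_next) * (phi @ w)

    # L1 proximal step: soft-thresholding encourages sparse feature weights.
    theta = soft_threshold(theta, alpha * lam)
    return theta, w
```

The soft-thresholding step zeroes out weights whose magnitude falls below `alpha * lam`, which is how L1 regularization performs implicit feature selection on top of the stable GTD updates.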