Benutzer: Gast  Login
Titel:

Safe Reinforcement Learning with Probabilistic Guarantees Satisfying Temporal Logic Specifications in Continuous Action Spaces

Dokumenttyp:
Konferenzbeitrag
Autor(en):
Hanna Krasowski; Prithvi Akella; Aaron D. Ames; Matthias Althoff
Abstract:
Vanilla Reinforcement Learning (RL) can efficiently solve complex tasks but does not provide any guarantees on system behavior. To bridge this gap, we propose a three-step safe RL procedure for continuous action spaces that provides probabilistic guarantees with respect to temporal logic specifications. First, our approach probabilistically verifies a candidate controller with respect to a temporal logic specification while randomizing the control inputs to the system within a bounded set. Secon...     »
Kongress- / Buchtitel:
IEEE Conference on Decision and Control (CDC)
Jahr:
2023
 BibTeX