Simon Ramstedt

Alumni

Website

Google Scholar

Publications

Reinforcement Learning with Random Delays

Christopher Pal

Action and observation delays commonly occur in many Reinforcement Learning applications, such as remote control scenarios. We study the anatomy of randomly delayed environments, and show that partially resampling trajectory fragments in hindsight allows for off-policy multi-step value estimation. We apply this principle to derive Delay-Correcting Actor-Critic (DCAC), an algorithm based on Soft Actor-Critic with significantly better performance in environments with delays. This is shown theoretically and also demonstrated practically on a delay-augmented version of the MuJoCo continuous control benchmark.

2021-05-02

International Conference on Learning Representations (Poster)

doi.org

openreview.net

Real-Time Reinforcement Learning

Simon Ramstedt

Christopher Pal

2018-12-31

Advances in Neural Information Processing Systems 32 (NeurIPS 2019) (published)

arxiv.org
