Portrait de Yann Bouteiller

Yann Bouteiller

Collaborateur·rice de recherche - Polytechnique Montreal
Superviseur⋅e principal⋅e
Co-supervisor
Sujets de recherche
Apprentissage par renforcement
Apprentissage profond
Neurosciences computationnelles
Systèmes dynamiques
Théorie de l'apprentissage automatique
Vision par ordinateur

Publications

From the Lab to the Theater: An Unconventional Field Robotics Journey
Ali Imran
Vivek Shankar Vardharajan
Rafael Gomes Braga
Abdalwhab Abdalwhab
Matthis Di-Giacomo
Alexandra Mercader
David St-Onge
Reinforcement Learning with Random Delays
Simon Ramstedt
Jonathan Binas
Action and observation delays commonly occur in many Reinforcement Learning applications, such as remote control scenarios. We study the ana… (voir plus)tomy of randomly delayed environments, and show that partially resampling trajectory fragments in hindsight allows for off-policy multi-step value estimation. We apply this principle to derive Delay-Correcting Actor-Critic (DCAC), an algorithm based on Soft Actor-Critic with significantly better performance in environments with delays. This is shown theoretically and also demonstrated practically on a delay-augmented version of the MuJoCo continuous control benchmark.