Pierre-Yves Oudeyer

Using Confounded Data in Latent Model-Based Reinforcement Learning

Maxime Gasse

Damien Grasset

Guillaume Gaudron

Pierre-Yves Oudeyer

2022-12-31

Trans. Mach. Learn. Res. (published)

openreview.net

Learning to Guide and to Be Guided in the Architect-Builder Problem

Paul Barde

Tristan Karch

Derek Nowrouzezahrai

Clément Moulin-Frier

Christopher Pal

Pierre-Yves Oudeyer

We are interested in interactive agents that learn to coordinate, namely, a …

2022-01-27

ICLR.cc/2022/Conference (poster)

doi.org

openreview.net

Sim-to-Real Transfer with Neural-Augmented Robot Simulation

Despite the recent successes of deep reinforcement learning, teaching complex motor skills to a physical robot remains a hard problem. While learning directly on a real system is usually impractical, doing so in simulation has proven to be fast and safe. Nevertheless, because of the "reality gap," policies trained in simulation often perform poorly when deployed on a real system. In this work, we introduce a method for training a recurrent neural network on the differences between simulated and real robot trajectories and then using this model to augment the simulator. This Neural-Augmented Simulation (NAS) can be used to learn control policies that transfer significantly better to real environments than policies learned on existing simulators. We demonstrate the potential of our approach through a set of experiments on the MuJoCo simulator with added backlash and the Poppy Ergo Jr robot. NAS allows us to learn policies that are competitive with ones that would have been learned directly on the real robot.

2018-10-22

Proceedings of The 2nd Conference on Robot Learning (published)

proceedings.mlr.press
