Florian Golemo

David Vázquez

Pedro O. Pinheiro

2021-04-01

ArXiv (preprint)

Active Domain Randomization

Bhairav Mehta

Manfred Diaz

Chris Pal

Liam Paull

Domain randomization is a popular technique for improving domain transfer, often used in a zero-shot setting when the target domain is unkno… (see more)wn or cannot easily be used for training. In this work, we empirically examine the effects of domain randomization on agent generalization. Our experiments show that domain randomization may lead to suboptimal, high-variance policies, which we attribute to the uniform sampling of environment parameters. We propose Active Domain Randomization, a novel algorithm that learns a parameter sampling strategy. Our method looks for the most informative environment variations within the given randomization ranges by leveraging the discrepancies of policy rollouts in randomized and reference environment instances. We find that training more frequently on these instances leads to better overall agent generalization. In addition, when domain randomization and policy transfer fail, Active Domain Randomization offers more insight into the deficiencies of both the chosen parameter ranges and the learned policy, allowing for more focused debugging. Our experiments across various physics-based simulated and a real-robot task show that this enhancement leads to more robust, consistent policies.

2020-05-12

Proceedings of the Conference on Robot Learning (published)

proceedings.mlr.press

Pix2Shape: Towards Unsupervised Learning of 3D Scenes from Images Using a View-Based Representation

Sai Rajeswar

Fahim Mannan

Jérôme Parent-Lévesque

David Vázquez

Derek Nowrouzezahrai

2020-03-20

International Journal of Computer Vision (published)

doi.org

Unsupervised Learning of Dense Visual Representations

Pedro O. Pinheiro

Amjad Almahairi

Ryan Y. Benmalek

Sim-to-Real Transfer with Neural-Augmented Robot Simulation

Adrien Ali Taiga

Pierre-Yves Oudeyer

Despite the recent successes of deep reinforcement learning, teaching complex motor skills to a physical robot remains a hard problem. While… (see more) learning directly on a real system is usually impractical, doing so in simulation has proven to be fast and safe. Nevertheless, because of the "reality gap," policies trained in simulation often perform poorly when deployed on a real system. In this work, we introduce a method for training a recurrent neural network on the differences between simulated and real robot trajectories and then using this model to augment the simulator. This Neural-Augmented Simulation (NAS) can be used to learn control policies that transfer significantly better to real environments than policies learned on existing simulators. We demonstrate the potential of our approach through a set of experiments on the Mujoco simulator with added backlash and the Poppy Ergo Jr robot. NAS allows us to learn policies that are competitive with ones that would have been learned directly on the real robot.

2018-10-23

Proceedings of The 2nd Conference on Robot Learning (published)

proceedings.mlr.press

HoME: a Household Multimodal Environment

Simon Brodeur

Ethan Perez

Ankesh Anand

Luca Celotti

Florian Strub

Jean Rouat

Hugo Larochelle