Publications

DEFactor: Differentiable Edge Factorization-based Probabilistic Graph Generation
Mohamed Ahmed
Marwin Segler
Amir Saffari
Generating novel molecules with optimal properties is a crucial step in many industries such as drug discovery. Recently, deep generative mo… (see more)dels have shown a promising way of performing de-novo molecular design. Although graph generative models are currently available they either have a graph size dependency in their number of parameters, limiting their use to only very small graphs or are formulated as a sequence of discrete actions needed to construct a graph, making the output graph non-differentiable w.r.t the model parameters, therefore preventing them to be used in scenarios such as conditional graph generation. In this work we propose a model for conditional graph generation that is computationally efficient and enables direct optimisation of the graph. We demonstrate favourable performance of our model on prototype-based molecular graph conditional generation tasks.
On Difficulties of Probability Distillation
Dopamine: A Research Framework for Deep Reinforcement Learning
Subhodeep Moitra
Carles Gelada
Saurabh Kumar
Bellemare Marc-Emmanuel
Deep reinforcement learning (deep RL) research has grown significantly in recent years. A number of software offerings now exist that provid… (see more)e stable, comprehensive implementations for benchmarking. At the same time, recent deep RL research has become more diverse in its goals. In this paper we introduce Dopamine, a new research framework for deep RL that aims to support some of that diversity. Dopamine is open-source, TensorFlow-based, and provides compact and reliable implementations of some state-of-the-art deep RL agents. We complement this offering with a taxonomy of the different research objectives in deep RL research. While by no means exhaustive, our analysis highlights the heterogeneity of research in the field, and the value of frameworks such as ours.
EnGAN: Latent Space MCMC and Maximum Entropy Generators for Energy-based Models
Knowledge Representation for Reinforcement Learning using General Value Functions
Gheorghe Comanici
Andre Barreto
Daniel Toyama
Eser Aygün
Sasha Vezhnevets
Shaobo Hou
Shibl Mourad
Learning powerful policies and better dynamics models by encouraging consistency
Learning of Sophisticated Curriculums by viewing them as Graphs over Tasks
Modeling the Long Term Future in Model-Based Reinforcement Learning
Nan Rosemary Ke
Amanpreet Singh
Devi Parikh
Dhruv Batra
Pix2Scene: Learning Implicit 3D Representations from Images
Probabilistic Planning with Sequential Monte Carlo Methods
Valentin Thomas
Cyril Ibrahim
Reinforced Imitation Learning from Observations
Shaping representations through communication
Olivier Tieleman
Angeliki Lazaridou
Shibl Mourad
Charles Blundell