Pierre-Luc Bacon

Core Academic Member
Assistant Professor, Université de Montréal

Pierre-Luc Bacon specializes in reinforcement learning. He is especially interested in the process of understanding and synthesizing disparate concepts into a coherent form, and he likes to establish connections to other disciplines to build a richer and more complete toolset. He will join the Department of Computer Science and Operations Research (DIRO) of the Université de Montréal in December 2019.

Publications

2021-05

XLVIN: eXecuted Latent Value Iteration Nets
Andreea Deac, Petar Veličković, Ognjen Milinkovic, Pierre-Luc Bacon, Jian Tang and Mladen Nikolic
arXiv e-prints
(2021-05-04)
ui.adsabs.harvard.edu (PDF)

2021-03

An Information-Theoretic Perspective on Credit Assignment in Reinforcement Learning
Dilip Arumugam, Peter Henderson and Pierre-Luc Bacon
arXiv preprint arXiv:2103.06224
(2021-03-10)
dblp.uni-trier.de (PDF)

2020-09

Graph neural induction of value iteration
Andreea Deac, Pierre-Luc Bacon and Jian Tang
arXiv preprint arXiv:2009.12604
(2020-09-26)
ui.adsabs.harvard.edu (PDF)

2020-07

Understanding the Curse of Horizon in Off-Policy Evaluation via Conditional Importance Sampling
Yao Liu, Pierre-Luc Bacon and Emma Brunskill
ICML 2020
(2020-07-12)
proceedings.mlr.press
TDprop: Does Jacobi Preconditioning Help Temporal Difference Learning?
Joshua Romoff, Peter Henderson, David Kanaa, Emmanuel Bengio, Ahmed Touati, Pierre-Luc Bacon and Joelle Pineau
arXiv preprint arXiv:2007.02786
(2020-07-06)
ui.adsabs.harvard.edu (PDF)

2020-04

Options of Interest: Temporal Abstraction with Interest Functions
Khimya Khetarpal, Martin Klissarov, Maxime Chevalier-Boisvert, Pierre-Luc Bacon and Doina Precup

2020-02

Policy Evaluation Networks
Jean Harb, Tom Schaul, Doina Precup and Pierre-Luc Bacon
arXiv preprint arXiv:2002.11833
(2020-02-26)
ui.adsabs.harvard.edu (PDF)

2019-12

Entropy Regularization with Discounted Future State Distribution in Policy Gradient Methods
Riashat Islam, Raihan Seraj, Pierre-Luc Bacon and Doina Precup
arXiv preprint arXiv:1912.05104
(2019-12-11)
ui.adsabs.harvard.edu (PDF)

2018-11

The Barbados 2018 List of Open Issues in Continual Learning
Tom Schaul, Hado van Hasselt, Joseph Modayil, Martha White, Adam White, Pierre-Luc Bacon, Jean Harb, Shibl Mourad, Marc G. Bellemare and Doina Precup
arXiv preprint arXiv:1811.07004
(2018-11-16)
dblp.uni-trier.de (PDF)

2018-03

Constructing Temporal Abstractions Autonomously in Reinforcement Learning
AI Magazine
(2018-03-27)
dblp.uni-trier.de

2018-02

Learning with Options that Terminate Off-Policy
Anna Harutyunyan, Peter Vrancx, Pierre-Luc Bacon, Doina Precup and Ann Nowe
AAAI 2018
(2018-02-07)
ui.adsabs.harvard.edu (PDF)
When Waiting is not an Option: Learning Options with a Deliberation Cost
Jean Harb, Pierre-Luc Bacon, Martin Klissarov and Doina Precup
AAAI 2018
(2018-02-07)
dblp.uni-trier.de (PDF)
OptionGAN: Learning Joint Reward-Policy Options using Generative Adversarial Inverse Reinforcement Learning
Peter Henderson, Wei-Di Chang, Pierre-Luc Bacon, David Meger, Joelle Pineau and Doina Precup
AAAI 2018
(2018-02-07)
ui.adsabs.harvard.edu (PDF)

2018-01

Learning Robust Options
Daniel J. Mankowitz, Timothy A. Mann, Pierre-Luc Bacon, Doina Precup and Shie Mannor

Publications collected and formatted using Paperoni