Pierre-Luc Bacon

Membre Académique Principal
Pierre-Luc Bacon
Professeur adjoint, Université de Montréal
Pierre-Luc Bacon

Pierre-Luc Bacon est spécialisé en apprentissage par renforcement. Il s’intéresse plus particulièrement au problème d’apprentissage de représentations pour la prise de décisions séquentielles ayant des conséquences à long terme ainsi que ses ramifications en optimisation hiérarchique. Il se joindra au DIRO en décembre 2019.

Publications

2020-10

XLVIN: eXecuted Latent Value Iteration Nets
Andreea Deac, Petar Veličković, Ognjen Milinković, Pierre-Luc Bacon, Jian Tang and Mladen Nikolić
arXiv preprint arXiv:2010.13146
(2020-10-25)
arxiv.orgPDF

2020-09

Graph neural induction of value iteration.
Andreea Deac, Pierre-Luc Bacon and Jian Tang
arXiv preprint arXiv:2009.12604
(2020-09-26)
arxiv.orgPDF

2020-07

Understanding the Curse of Horizon in Off-Policy Evaluation via Conditional Importance Sampling
Yao Liu, Pierre-Luc Bacon and Emma Brunskill
ICML 2020
(2020-07-12)
icml.cc
TDprop: Does Jacobi Preconditioning Help Temporal Difference Learning?
Joshua Romoff, Peter Henderson, David Kanaa, Emmanuel Bengio, Ahmed Touati, Pierre-Luc Bacon and Joelle Pineau
arXiv preprint arXiv:2007.02786
(2020-07-06)
dblp.uni-trier.dePDF

2020-02

Policy Evaluation Networks.
Jean Harb, Tom Schaul, Doina Precup and Pierre-Luc Bacon
arXiv preprint arXiv:2002.11833
(2020-02-26)
dblp.uni-trier.dePDF
Options of Interest: Temporal Abstraction with Interest Functions
Khimya Khetarpal, Martin Klissarov, Maxime Chevalier-Boisvert, Pierre-Luc Bacon and Doina Precup

2019-12

Entropy Regularization with Discounted Future State Distribution in Policy Gradient Methods
Riashat Islam, Raihan Seraj, Pierre-Luc Bacon and Doina Precup
arXiv preprint arXiv:1912.05104
(2019-12-11)
ui.adsabs.harvard.eduPDF

2018-11

The Barbados 2018 List of Open Issues in Continual Learning.
Tom Schaul, Hado van Hasselt, Joseph Modayil, Martha White, Adam White, Pierre-Luc Bacon, Jean Harb, Shibl Mourad, Marc G. Bellemare and Doina Precup
arXiv preprint arXiv:1811.07004
(2018-11-16)
dblp.uni-trier.dePDF

2018-07

Convergent Tree-Backup and Retrace with Function Approximation
ICML 2018
(2018-07-10)
proceedings.mlr.pressPDF

2018-03

Constructing Temporal Abstractions Autonomously in Reinforcement Learning
Ai Magazine
(2018-03-27)
dblp.uni-trier.de

2018-02

Learning Robust Options
Daniel J. Mankowitz, Timothy A. Mann, Pierre-Luc Bacon, Doina Precup and Shie Mannor
arXiv preprint arXiv:1802.03236
(2018-02-09)
arxiv.orgPDF
Learning with Options that Terminate Off-Policy
Anna Harutyunyan, Peter Vrancx, Pierre-luc Bacon, Doina Precup and Ann Nowe
AAAI 2018
(2018-02-07)
ui.adsabs.harvard.eduPDF
When Waiting is not an Option : Learning Options with a Deliberation Cost
Jean Harb, Pierre-luc Bacon, Martin Klissarov and Doina Precup
AAAI 2018
(2018-02-07)
dblp.uni-trier.dePDF
Learning Robust Options
Daniel Mankowitz, Timothy Mann, Shie Mannor, Doina Precup and Pierre-luc Bacon
AAAI 2018
(2018-02-07)
dblp.uni-trier.dePDF
OptionGAN: Learning Joint Reward-Policy Options using Generative Adversarial Inverse Reinforcement Learning
Peter Henderson, Wei-Di Chang, Pierre-luc Bacon, David Meger, Joelle Pineau and Doina Precup
AAAI 2018
(2018-02-07)
ui.adsabs.harvard.eduPDF

Publications collected and formatted using Paperoni

array(1) { ["wp-wpml_current_language"]=> string(2) "fr" }