Joelle Pineau

Membre Académique Principal
Joelle Pineau
Professeure agrégée, McGill University, Facebook
Joelle Pineau

Joelle Pineau est professeure agrégée et boursière William Dawson à l’Université McGill où elle codirige le laboratoire de raisonnement et d’apprentissage. Membre du corp professoral de Mila, elle dirige également le laboratoire de recherche sur l’IA de Facebook à Montréal, au Canada. Elle détient un baccalauréat ès sciences en génie de l’Université de Waterloo et une maîtrise et un doctorat en robotique de l’Université Carnegie Mellon. La recherche de M. Pineau est axée sur le développement de nouveaux modèles et algorithmes pour la planification et l’apprentissage dans des domaines complexes partiellement observables. Elle travaille également sur l’application de ces algorithmes à des problèmes complexes en robotique, soins de santé, jeux et agents conversationnels. Elle est membre du comité de rédaction du Journal of Artificial Intelligence Research et du Journal of Machine Learning Research et est actuellement présidente de l’International Machine Learning Society. Elle est récipiendaire de la Bourse commémorative E.W.R. Steacie du CRSNG (2018), membre de l’Association pour l’avancement de l’intelligence artificielle (AAAI) et de l’Institut canadien de recherches avancées (CIFAR) et, en 2016, membre du Collège des nouveaux chercheurs, artistes et scientifiques de la Société royale du Canada.

Publications

2020-12

Intervention Design for Effective Sim2Real Transfer
Melissa Mozifian, Amy Zhang, Joelle Pineau and David Meger
arXiv preprint arXiv:2012.02055
(2020-12-03)
arxiv.orgPDF

2020-10

The Bottleneck Simulator: A Model-Based Deep Reinforcement Learning Approach
Iulian Vlad Serban, Chinnadhurai Sankar, Michael Pieper, Joelle Pineau and Yoshua Bengio
Journal of Artificial Intelligence Research
(2020-10-27)
dblp.uni-trier.dePDF
Representation of Reinforcement Learning Policies in Reproducing Kernel Hilbert Spaces.
Bogdan Mazoure, Thang Doan, Tianyu Li, Vladimir Makarenkov, Joelle Pineau, Doina Precup and Guillaume Rabusseau
arXiv: Learning
(2020-10-15)
arxiv.orgPDF
Transparency and reproducibility in artificial intelligence.
Benjamin Haibe-Kains, George Alexandru Adam, Ahmed Hosny, Farnoosh Khodakarami, Levi Waldron, Bo Wang, Chris McIntosh, Anna Goldenberg, Anshul Kundaje, Casey S Greene, Tamara Broderick, Michael M Hoffman, Jeffrey T Leek, Keegan Korthauer, Wolfgang Huber, Alvis Brazma, Joelle Pineau, Robert Tibshirani, Trevor Hastie, John P A Ioannidis... (2 more)
Nature
(2020-10-14)
www.ncbi.nlm.nih.gov
Regularized Inverse Reinforcement Learning
Wonseok Jeon, Chen-Yang Su, Paul Barde, Thang Doan, Derek Nowrouzezahrai and Joelle Pineau
arXiv preprint arXiv:2010.03691
(2020-10-07)
ui.adsabs.harvard.eduPDF

2020-08

How To Evaluate Your Dialogue System: Probe Tasks as an Alternative for Token-level Evaluation Metrics
Prasanna Parthasarathi, Joelle Pineau and Sarath Chandar
arXiv preprint arXiv:2008.10427
(2020-08-24)
arxiv.orgPDF

2020-07

Multi-Task Reinforcement Learning as a Hidden-Parameter Block MDP
Amy Zhang, Shagun Sodhani, Khimya Khetarpal and Joelle Pineau
arXiv preprint arXiv:2007.07206
(2020-07-14)
dblp.uni-trier.dePDF
Building reproducible, reusable, and robust machine learning software.
DEBS 2020
(2020-07-13)
dblp.uni-trier.de
Constrained Markov Decision Processes via Backward Value Functions
Harsh Satija, Philip Amortila and Joelle Pineau
Invariant Causal Prediction for Block MDPs
Clare Lyle, Amy Zhang, Angelos Filos, Shagun Sodhani, Marta Kwiatkowska, Yarin Gal, Doina Precup and Joelle Pineau
ICML 2020
(2020-07-12)
icml.cc
Online Learned Continual Compression with Adaptive Quantization Modules
Lucas Caccia, Eugene Belilovsky, Massimo Caccia and Joelle Pineau
Interference and Generalization in Temporal Difference Learning
Emmanuel Bengio, Joelle Pineau and Doina Precup
Handling Black Swan Events in Deep Learning with Diversely Extrapolated Neural Networks
Maxime Wabartha, Audrey Durand, Vincent François-Lavet and Joelle Pineau
IJCAI 2020
(2020-07-11)
static.ijcai.org
Automated Personalized Feedback Improves Learning Gains in An Intelligent Tutoring System.
Ekaterina Kochmar, Dung Do Vu, Robert Belfer, Varun Gupta, Iulian Vlad Serban and Joelle Pineau
TDprop: Does Jacobi Preconditioning Help Temporal Difference Learning?
Joshua Romoff, Peter Henderson, David Kanaa, Emmanuel Bengio, Ahmed Touati, Pierre-Luc Bacon and Joelle Pineau
arXiv preprint arXiv:2007.02786
(2020-07-06)
dblp.uni-trier.dePDF
A Large-Scale, Open-Domain, Mixed-Interface Dialogue-Based ITS for STEM
Iulian Vlad Serban, Varun Gupta, Ekaterina Kochmar, Dung Do Vu, Robert Belfer, Joelle Pineau, Aaron C. Courville, Laurent Charlin and Yoshua Bengio
Learning an Unreferenced Metric for Online Dialogue Evaluation
Koustuv Sinha, Prasanna Parthasarathi, Jasmine Wang, Ryan Lowe, William L. Hamilton and Joelle Pineau
Deep interpretability for GWAS
Deepak Sharma, Audrey Durand, Marc-André Legault, Louis-Philippe Lemieux Perreault, Audrey Lemaçon, Marie-Pierre Dubé and Joelle Pineau
arXiv preprint arXiv:2007.01516
(2020-07-03)
ui.adsabs.harvard.eduPDF
Development of a polygenic risk score to improve screening for fracture risk: A genetic risk prediction study
Vincenzo Forgetta, Julyan Keller-Baruch, Marie Forest, Audrey Durand, Sahir Bhatnagar, John P Kemp, Maria Nethander, Daniel Evans, John A Morris, Douglas P Kiel, Fernando Rivadeneira, Helena Johansson, Nicholas C Harvey, Dan Mellström, Magnus Karlsson, Cyrus Cooper, David M Evans, Robert Clarke, John A Kanis, Eric Orwoll... (6 more)
PLOS Medicine
(2020-07-02)
doaj.org
On Overfitting and Asymptotic Bias in Batch Reinforcement Learning with Partial Observability (Extended Abstract).
Vincent Francois-Lavet, Guillaume Rabusseau, Joelle Pineau, Damien Ernst and Raphael Fonteneau
IJCAI 2020
(2020-07-01)
dblp.uni-trier.de

2020-06

Adversarial Soft Advantage Fitting: Imitation Learning without Policy Optimization
Paul Barde, Julien Roy, Wonseok Jeon, Joelle Pineau, Christopher J. Pal and Derek Nowrouzezahrai
arXiv preprint arXiv:2006.13258
(2020-06-23)
ui.adsabs.harvard.eduPDF
Adversarial Soft Advantage Fitting: Imitation Learning without Policy Optimization
Paul Barde, Julien Roy, Wonseok Jeon, Joelle Pineau, Christopher J. Pal and Derek Nowrouzezahrai
NEURIPS 2020
(2020-06-01)
papers.nips.cc
Machine Learning for COVID-19 needs global collaboration and data-sharing
Nathan Peiffer-Smadja, Redwan Maatoug, François Xavier Lescure, Eric D’Ortenzio, Joëlle Pineau and Jean Rémi King
Nature Machine Intelligence
(2020-06-01)
www.nature.com

2020-05

RapidBrachyDL: Rapid Radiation Dose Calculations in Brachytherapy via Deep Learning.
Ximeng Mao, Joelle Pineau, Roy Keyes and Shirin A. Enger
International Journal of Radiation Oncology Biology Physics
(2020-05-12)
www.sciencedirect.com

2020-04

On the interaction between supervision and self-play in emergent communication
Ryan Lowe, Abhinav Gupta, Jakob Foerster, Douwe Kiela and Joelle Pineau
Language GANs Falling Short
Massimo Caccia, Lucas Caccia, William Fedus, Hugo Larochelle, Joelle Pineau and Laurent Charlin
Literature Mining for Incorporating Inductive Bias in Biomedical Prediction Tasks (Student Abstract)
Qizhen Zhang, Audrey Durand and Joelle Pineau
AAAI 2020
(2020-04-03)
aiide.org

2020-03

Improving Reproducibility in Machine Learning Research (A Report from the NeurIPS 2019 Reproducibility Program)
Joelle Pineau, Philippe Vincent-Lamarre, Koustuv Sinha, Vincent Larivière, Alina Beygelzimer, Florence d'Alché-Buc, Emily B. Fox and Hugo Larochelle
arXiv preprint arXiv:2003.12206
(2020-03-27)
dblp.uni-trier.dePDF
Evaluating Logical Generalization in Graph Neural Networks.
Koustuv Sinha, Shagun Sodhani, Joelle Pineau and William L. Hamilton
arXiv preprint arXiv:2003.06560
(2020-03-14)
dblp.uni-trier.dePDF
Invariant Causal Prediction for Block MDPs.
Amy Zhang, Clare Lyle, Shagun Sodhani, Angelos Filos, Marta Kwiatkowska, Joelle Pineau, Yarin Gal and Doina Precup
arXiv preprint arXiv:2003.06016
(2020-03-12)
arxiv.orgPDF
Stable Policy Optimization via Off-Policy Divergence Regularization.
Ahmed Touati, Amy Zhang, Joelle Pineau and Pascal Vincent
arXiv preprint arXiv:2003.04108
(2020-03-09)
ui.adsabs.harvard.eduPDF

2020-02

The importance of transparency and reproducibility in artificial intelligence research
Benjamin Haibe-Kains, George Alexandru Adam, Ahmed Hosny, Farnoosh Khodakarami, Levi Waldron, Bo Wang, Chris McIntosh, Anshul Kundaje, Casey S. Greene, Michael M. Hoffman, Jeffrey T. Leek, Wolfgang Huber, Alvis Brazma, Joelle Pineau, Robert Tibshirani, Trevor Hastie, John P.A. Ioannidis, John Quackenbush and Hugo J.W.L. Aerts
arXiv preprint arXiv:2003.00898
(2020-02-28)
arxiv.orgPDF
Scalable Multi-Agent Inverse Reinforcement Learning via Actor-Attention-Critic
Wonseok Jeon, Paul Barde, Derek Nowrouzezahrai and Joelle Pineau
arXiv preprint arXiv:2002.10525
(2020-02-24)
ui.adsabs.harvard.eduPDF
Exploiting Spatial Invariance for Scalable Unsupervised Object Tracking
Eric Crawford and Joelle Pineau
Provably efficient reconstruction of policy networks.
Bogdan Mazoure, Thang Doan, Tianyu Li, Vladimir Makarenkov, Joelle Pineau, Doina Precup and Guillaume Rabusseau
arXiv preprint arXiv:2002.02863
(2020-02-07)
ui.adsabs.harvard.eduPDF

2020-01

Towards the Systematic Reporting of the Energy and Carbon Footprints of Machine Learning.
Peter Henderson, Jieru Hu, Joshua Romoff, Emma Brunskill, Dan Jurafsky and Joelle Pineau
arXiv preprint arXiv:2002.05651
(2020-01-31)
dblp.uni-trier.dePDF
Plan2Vec: Unsupervised Representation Learning by Latent Plans
Ge Yang, Amy Zhang, Ari S. Morcos, Joelle Pineau, Pieter Abbeel and Roberto Calandra
L4DC
(2020-01-01)
dblp.uni-trier.dePDF
Novelty Search in Representational Space for Sample Efficient Exploration
Ruo Yu Tao, Vincent Francois-Lavet and Joelle Pineau
NEURIPS 2020
(2020-01-01)
papers.nips.ccPDF
Stable Policy Optimization via Off-Policy Divergence Regularization.
Ahmed Touati, Amy Zhang, Joelle Pineau and Pascal Vincent
UAI 2020
(2020-01-01)
dblp.uni-trier.dePDF

2019-12

Gossip-based Actor-Learner Architectures for Deep Reinforcement Learning
Mahmoud Assran, Joshua Romoff, Nicolas Ballas, Joelle Pineau and Mike Rabbat
No-Press Diplomacy: Modeling Multi-Agent Gameplay
Philip Paquette, Yuchen Lu, Seton Steven Bocco, Max Smith, Satya O.-G., Jonathan K. Kummerfeld, Joelle Pineau, Satinder Singh and Aaron Courville
NEURIPS 2019
(2019-12-08)
papers.nips.ccPDF

2019-11

Online Learned Continual Compression with Stacked Quantization Module.
Lucas Caccia, Eugene Belilovsky, Massimo Caccia and Joelle Pineau
arXiv preprint arXiv:1911.08019
(2019-11-19)
dblp.uni-trier.dePDF
CLUTRR: A Diagnostic Benchmark for Inductive Reasoning from Text
Koustuv Sinha, Shagun Sodhani, Jin Dong, Joelle Pineau and William L. Hamilton
Deep Generative Modeling of LiDAR Data
Lucas Caccia, Herke van Hoof, Aaron Courville and Joelle Pineau

2019-10

MVFST-RL: An Asynchronous RL Framework for Congestion Control with Delayed Actions.
Viswanath Sivakumar, Tim Rocktäschel, Alexander H. Miller, Heinrich Küttler, Nantas Nardelli, Mike Rabbat, Joelle Pineau and Sebastian Riedel
arXiv preprint arXiv:1910.04054
(2019-10-09)
dblp.uni-trier.dePDF
Benchmarking Batch Deep Reinforcement Learning Algorithms.
Scott Fujimoto, Edoardo Conti, Mohammad Ghavamzadeh and Joelle Pineau
arXiv preprint arXiv:1910.01708
(2019-10-03)
dblp.uni-trier.dePDF

2019-09

Assessing Generalization in TD methods for Deep Reinforcement Learning
Emmanuel Bengio, Doina Precup and Joelle Pineau
(venue unknown)
(2019-09-25)
openreview.netPDF
Novelty Search in representational space for sample efficient exploration
Ruo Yu Tao, Vincent François-Lavet and Joelle Pineau
(venue unknown)
(2019-09-25)
openreview.netPDF
Attraction-Repulsion Actor-Critic for Continuous Control Reinforcement Learning
Thang Doan, Bogdan Mazoure, Audrey Durand, Joelle Pineau and R. Devon Hjelm
arXiv: Learning
(2019-09-25)
dblp.uni-trier.dePDF
Plan2Vec: Unsupervised Representation Learning by Latent Plans
Ge Yang, Amy Zhang, Ari S. Morcos, Joelle Pineau, Pieter Abbeel and Roberto Calandra
arXiv preprint arXiv:2005.03648
(2019-09-25)
dblp.uni-trier.dePDF
Improving Sample Efficiency in Model-Free Reinforcement Learning from Images
Denis Yarats, Amy Zhang, Ilya Kostrikov, Brandon Amos, Joelle Pineau and Rob Fergus
arXiv preprint arXiv:1910.01741
(2019-09-25)
dblp.uni-trier.dePDF
No Press Diplomacy: Modeling Multi-Agent Gameplay
Philip Paquette, Yuchen Lu, Steven Bocco, Max O. Smith, Satya Ortiz-Gagne, Jonathan K. Kummerfeld, Satinder Singh, Joelle Pineau and Aaron Courville
arXiv preprint arXiv:1909.02128
(2019-09-04)
arxiv.orgPDF

2019-06

Learning Causal State Representations of Partially Observable Environments
Amy Zhang, Zachary C. Lipton, Luis Pineda, Kamyar Azizzadenesheli, Anima Anandkumar, Laurent Itti, Joelle Pineau and Tommaso Furlanello
arXiv preprint arXiv:1906.10437
(2019-06-25)
dblp.uni-trier.dePDF
Separable value functions across time-scales
Joshua Romoff, Peter Henderson, Ahmed Touati, Yann Ollivier, Joelle Pineau and Emma Brunskill
ICML 2019
(2019-06-09)
proceedings.mlr.pressPDF
TarMAC: Targeted Multi-Agent Communication
Abhishek Das, Theophile Gervet, Joshua Romoff, Dhruv Batra, Devi Parikh, Michael Rabbat and Joelle Pineau

2019-05

Recurrent Value Functions.
Pierre Thodoroff, Nishanth Anand, Lucas Caccia, Doina Precup and Joelle Pineau
arXiv preprint arXiv:1905.09562
(2019-05-23)
dblp.uni-trier.dePDF
On the Pitfalls of Measuring Emergent Communication
Ryan Lowe, Jakob Foerster, Y-Lan Boureau, Joelle Pineau and Yann Dauphin
Task-Agnostic Reinforcement Learning (TARL)
Danijar Hafner, Deepak Pathak, Frederik Ebert, Marc G Bellemare, Raia Hadsell, Rowan McAllister, Amy Zhang, Joelle Pineau, Ahmed Touati and Roberto Calandra
ICLR 2019
(2019-05-06)
iclr.cc
On overfitting and asymptotic bias in batch reinforcement learning with partial observability
Vincent Francois-Lavet, Guillaume Rabusseau, Joelle Pineau, Damien Ernst and Raphael Fonteneau
Journal of Artificial Intelligence Research
(2019-05-05)
doi.orgPDF

2019-04

Multitask Metric Learning: Theory and Algorithm
Boyu Wang, Hejia Zhang, Peng Liu, Zebang Shen and Joelle Pineau
AISTATS 2019
(2019-04-11)
proceedings.mlr.pressPDF

2019-03

An Introduction to Deep Reinforcement Learning
Vincent François-Lavet, Peter Henderson, Riashat Islam, Marc G. Bellemare and Joelle Pineau
(venue unknown)
(2019-03-31)
nowpublishers.comPDF

2019-02

When AIs Outperform Doctors: Confronting the Challenges of a Tort-Induced Over-Reliance on Machine Learning
A. Michael Froomkin, Ian R. Kerr and Joelle Pineau
Ariz. L. Rev.
(2019-02-20)
repository.law.miami.eduPDF
Separating value functions across time-scales
Joshua Romoff, Peter Henderson, Ahmed Touati, Emma Brunskill, Joelle Pineau and Yann Ollivier
arXiv preprint arXiv:1902.01883
(2019-02-05)
arxiv.orgPDF

2019-01

The Second Conversational Intelligence Challenge (ConvAI2)
Emily Dinan, Varvara Logacheva, Valentin Malykh, Alexander H. Miller, Kurt Shuster, Jack Urbanek, Douwe Kiela, Arthur Szlam, Iulian Serban, Ryan Lowe, Shrimai Prabhumoye, Alan W. Black, Alexander I. Rudnicky, Jason Williams, Joelle Pineau, Mikhail Burtsev and Jason Weston
arXiv preprint arXiv:1902.00098
(2019-01-31)
link.springer.comPDF
Spatially Invariant Unsupervised Object Detection with Convolutional Neural Networks
Eric Crawford and Joelle Pineau
AAAI 2019
(2019-01-27)
www.aaai.orgPDF
On-line Adaptative Curriculum Learning for GANs
Thang Doan, Joao B Monteiro, Isabela Albuquerque, Bogdan Mazoure, Audrey Durand, Joelle Pineau and Devon Hjelm
Combined Reinforcement Learning via Abstract Representations
vincent francois-lavet, Yoshua Bengio, Doina Precup and Joelle Pineau
Seeded self-play for language learning.
Abhinav Gupta, Ryan Lowe, Jakob N. Foerster, Douwe Kiela and Joelle Pineau
EMNLP 2019
(2019-01-01)
dblp.uni-trier.de
Leveraging exploration in off-policy algorithms via normalizing flows.
Bogdan Mazoure, Thang Doan, Audrey Durand, R. Devon Hjelm and Joelle Pineau

2018-12

Ethical Challenges in Data-Driven Dialogue Systems
Peter Henderson, Koustuv Sinha, Nicolas Angelard-Gontier, Nan Rosemary Ke, Genevieve Fried, Ryan Lowe and Joelle Pineau
AAAI 2018
(2018-12-27)
dblp.uni-trier.de
An Introduction to Deep Reinforcement Learning
Vincent François-Lavet, Peter Henderson, Riashat Islam, Marc G. Bellemare and Joelle Pineau
(venue unknown)
(2018-12-20)
nowpublishers.comPDF
Temporal Regularization for Markov Decision Process
NEURIPS 2018
(2018-12-03)
papers.nips.ccPDF

2018-11

Contextual Bandits for Adapting Treatment in a Mouse Model of de Novo Carcinogenesis
Audrey Durand, Charis Achilleos, Demetris Iacovides, Katerina Strati, Georgios D. Mitsis and Joelle Pineau
Machine Learning for Healthcare Conference
(2018-11-29)
proceedings.mlr.pressPDF
Natural Environment Benchmarks for Reinforcement Learning.
Amy Zhang, Yuxin Wu and Joelle Pineau
arXiv preprint arXiv:1811.06032
(2018-11-14)
dblp.uni-trier.dePDF
Compositional Language Understanding with Text-based Relational Reasoning.
Koustuv Sinha, Shagun Sodhani, William L. Hamilton and Joelle Pineau
arXiv preprint arXiv:1811.02959
(2018-11-07)
dblp.uni-trier.dePDF
The RLLChatbot: a solution to the ConvAI challenge.
Nicolas Gontier, Koustuv Sinha, Peter Henderson, Iulian Serban, Michael Noseworthy, Prasanna Parthasarathi and Joelle Pineau
arXiv preprint arXiv:1811.02714
(2018-11-07)
dblp.uni-trier.dePDF
Adversarial Gain
Peter Henderson, Koustuv Sinha, Rosemary Nan Ke and Joelle Pineau
arXiv: Learning
(2018-11-04)
arxiv.orgPDF
Temporal Regularization in Markov Decision Process
arXiv preprint arXiv:1811.00429
(2018-11-01)
ui.adsabs.harvard.eduPDF

2018-10

Extending Neural Generative Conversational Model using External Knowledge Sources
Prasanna Parthasarathi and Joelle Pineau
Where Did My Optimum Go?: An Empirical Analysis of Gradient Descent Optimization in Policy Gradient Methods.
Peter Henderson, Joshua Romoff and Joelle Pineau
arXiv preprint arXiv:1810.02525
(2018-10-05)
dblp.uni-trier.dePDF

2018-09

Machine Learning to Predict Osteoporotic Fracture Risk from Genotypes
Vincenzo Forgetta, Julyan Keller-Baruch, Marie Forest, Audrey Durand, Sahir Bhatnagar, John Kemp, John A Morris, John A Kanis, Douglas P Kiel, Eugene V McCloskey, Fernando Rivadeneira, Helena Johannson, Nicholas Harvey, Cyrus Cooper, David M Evans, Joelle Pineau, William D Leslie, Celia Mt Greenwood and J Brent Richards
bioRxiv
(2018-09-11)
www.biorxiv.orgPDF

2018-07

The Bottleneck Simulator: A Model-based Deep Reinforcement Learning Approach.
Iulian Vlad Serban, Chinnadhurai Sankar, Michael Pieper, Joelle Pineau and Yoshua Bengio
arXiv preprint arXiv:1807.04723
(2018-07-12)
ui.adsabs.harvard.eduPDF
Focused Hierarchical RNNs for Conditional Sequence Processing
Nan Ke, Konrad Zolna, Alessandro Sordoni, Mila Zhouhan Lin, Yoshua Bengio, Joelle Pineau, Laurent Charlin and Christopher Pal
ICML 2018
(2018-07-10)
proceedings.mlr.pressPDF

2018-06

A Dissection of Overfitting and Generalization in Continuous Reinforcement Learning
Amy Zhang, Nicolas Ballas and Joelle Pineau
arXiv preprint arXiv:1806.07937
(2018-06-20)
dblp.uni-trier.dePDF
Focused Hierarchical RNNs for Conditional Sequence Processing
Nan Rosemary Ke, Konrad Zolna, Alessandro Sordoni, Zhouhan Lin, Adam Trischler, Yoshua Bengio, Joelle Pineau, Laurent Charlin and Chris Pal
arXiv preprint arXiv:1806.04342
(2018-06-12)
aps.arxiv.orgPDF
RE-EVALUATE: Reproducibility in Evaluating Reinforcement Learning Algorithms
Khimya Khetarpal, Zafarali Ahmed, Andre Cianflone, Riashat Islam and Joelle Pineau
(venue unknown)
(2018-06-10)
openreview.netPDF
A Survey of Available Corpora For Building Data-Driven Dialogue Systems: The Journal Version
Iulian Vlad Serban, Ryan Lowe, Peter Henderson, Laurent Charlin and Joelle Pineau
Dialogue & Discourse
(2018-06-01)
dad.uni-bielefeld.dePDF
Randomized Value Functions via Multiplicative Normalizing Flows
Ahmed Touati, Harsh Satija, Joshua Romoff, Joelle Pineau and Pascal Vincent

2018-02

Disentangling the independently controllable factors of variation by interacting with the world
Valentin Thomas, Emmanuel Bengio, William Fedus, Jules Pondard, Philippe Beaudoin, Hugo Larochelle, Joelle Pineau, Doina Precup and Yoshua Bengio
arXiv preprint arXiv:1802.09484
(2018-02-26)
ui.adsabs.harvard.eduPDF
Sequential Coordination of Deep Models for Learning Visual Arithmetic
arXiv preprint arXiv:1809.04988
(2018-02-15)
dblp.uni-trier.dePDF
An inference-based policy gradient method for learning options
Matthew J. A. Smith, Herke van Hoof and Joelle Pineau
ICML 2018
(2018-02-15)
proceedings.mlr.pressPDF
Reward Estimation for Variance Reduction in Deep Reinforcement Learning
Joshua Romoff, Peter Henderson, Alexandre Piché, Vincent François-Lavet and Joelle Pineau
Conference on Robot Learning
(2018-02-12)
proceedings.mlr.pressPDF
Decoupling Dynamics and Reward for Transfer Learning.
Amy Zhang, Harsh Satija and Joelle Pineau
OptionGAN: Learning Joint Reward-Policy Options using Generative Adversarial Inverse Reinforcement Learning
Peter Henderson, Wei-Di Chang, Pierre-luc Bacon, David Meger, Joelle Pineau and Doina Precup
AAAI 2018
(2018-02-07)
ui.adsabs.harvard.eduPDF
Deep Reinforcement Learning that Matters
Peter Henderson, Riashat Islam, Joelle Pineau, David Meger, Doina Precup and Philip Bachman
AAAI 2018
(2018-02-07)
ui.adsabs.harvard.eduPDF

2018-01

A Deep Reinforcement Learning Chatbot (Short Version)
Iulian Vlad Serban, Chinnadhurai Sankar, Mathieu Germain, Saizheng Zhang, Zhouhan Lin, Sandeep Subramanian, Taesup Kim, Michael Pieper, Sarath Chandar, Nan Rosemary Ke, Sai Rajeswar, Alexandre de Brébisson, Jose M. R. Sotelo, Dendi Suhubdy, Vincent Michalski, Alexandre Nguyen, Joelle Pineau and Yoshua Bengio
arXiv preprint arXiv:1801.06700
(2018-01-20)
ui.adsabs.harvard.eduPDF
Genomic Prediction of Osteoporosis Using 426,000 Individuals from UK Biobank
Vincenzo Forgetta, Julyan Keller-Baruch, Marie Forest, Audrey Durand, Sahir Bhatnagar, John Kemp, John Morris, John Kanis, Douglas Kiel, Eugene Mccloskey, Helena Johansson, Nicholas Harvey, Dave Evans, Joelle Pineau, William Leslie, Celia M. T. Greenwood and J. Brent Richards
Journal of Bone and Mineral Research
(2018-01-01)
espace.library.uq.edu.au
A Decision-Theoretic Approach for the Collaborative Control of a Smart Wheelchair
Mahmoud Ghorbel, Joelle Pineau, Richard Gourdeau, Shervin Javdani and Siddhartha S. Srinivasa
International Journal of Social Robotics
(2018-01-01)
link.springer.com
Streaming kernel regression with provably adaptive mean, variance, and regularization
Audrey Durand, Odalric-Ambrym Maillard and Joelle Pineau
Journal of Machine Learning Research
(2018-01-01)
dl.acm.orgPDF
Reward Estimation for Variance Reduction in Deep Reinforcement Learning.
Joshua Romoff, Alexandre Piché, Peter Henderson, Vincent François-Lavet and Joelle Pineau

Publications collected and formatted using Paperoni

array(1) { ["wp-wpml_current_language"]=> string(2) "fr" }