Mila > Team > Joelle Pineau

Joelle Pineau

Core Academic Member
Associate Professor, McGill University, Facebook, Canada CIFAR AI Chair

Joelle Pineau is an Associate Professor and William Dawson Scholar at McGill University where she co-directs the Reasoning and Learning Lab. Member of Mila’s faculty corp, she also leads the Facebook AI Research lab in Montreal, Canada. She holds a BASc in Engineering from the University of Waterloo, and an MSc and PhD in Robotics from Carnegie Mellon University. Dr. Pineau’s research focuses on developing new models and algorithms for planning and learning in complex partially-observable domains. She also works on applying these algorithms to complex problems in robotics, health care, games and conversational agents. She serves on the editorial board of the Journal of Artificial Intelligence Research and the Journal of Machine Learning Research and is currently President of the International Machine Learning Society. She is a recipient of NSERC’s E.W.R. Steacie Memorial Fellowship (2018), a Fellow of the Association for the Advancement of Artificial Intelligence (AAAI), a Senior Fellow of the Canadian Institute for Advanced Research (CIFAR) and in 2016 was named a member of the College of New Scholars, Artists and Scientists by the Royal Society of Canada.

Publications

2021-12

Multi-Objective SPIBB: Seldonian Offline Policy Improvement with Safety Constraints in Finite MDPs
harsh satija, Philip S. Thomas, Joelle Pineau and Romain Laroche

2021-11

Sometimes We Want Ungrammatical Translations
Prasanna Parthasarathi, Koustuv Sinha, Joelle Pineau and Adina Williams
EMNLP 2021
(2021-11-01)
aclanthology.org

2021-10

Block Contextual MDPs for Continual Learning.
Shagun Sodhani, Franziska Meier, Joelle Pineau and Amy Zhang
arXiv preprint arXiv:2110.06972
(2021-10-13)
dblp.uni-trier.dePDF
Circulating proteins to predict adverse COVID-19 outcomes
Chen-Yang Su Mr., Sirui Zhou, Edgar Gonzalez-Kozlova, Guillaume Butler-Laporte, Elsa Brunet-Ratnasingham, Tomoko Nakanishi, Wonseok Jeon, David Morrison, Laetitia Laurent, Joanthan Afilalo, Marc Afilalo, Danielle Henry, Yiheng Chen, Julia Carrasco-Zanini, Yossi Farjoun, Maik Pietzner, Nofar Kimchi, Zaman Afrasiabi, Nardin Rezk, Meriem Bouab... (46 more)
medRxiv
(2021-10-05)
search.bvsalud.org

2021-08

UnNatural Language Inference
Koustuv Sinha, Prasanna Parthasarathi, Joelle Pineau and Adina Williams
Improving reproducibility in machine learning research : a report from the NeurIPS 2019 reproducibility program
Joelle Pineau, Philippe Vincent-Lamarre, Koustuv Sinha, Vincent Larivière, Alina Beygelzimer, Florence d'Alché-Buc, Emily B. Fox and Hugo Larochelle

2021-07

A BRIEF STUDY ON THE EFFECTS OF TRAINING GENERATIVE DIALOGUE MODELS WITH A SEMANTIC LOSS
Prasanna Parthasarathi, Mohamed Abdelsalam, Sarath Chandar and Joelle Pineau
SIGDIAL 2021
(2021-07-29)
aclanthology.org
DO ENCODER REPRESENTATIONS OF GENERATIVE DIALOGUE MODELS HAVE SUFFICIENT SUMMARY OF THE INFORMATION ABOUT THE TASK
Prasanna Parthasarathi, Joelle Pineau and Sarath Chandar
SIGDIAL 2021
(2021-07-29)
aclanthology.org
Automated Data-Driven Generation of Personalized Pedagogical Interventions in Intelligent Tutoring Systems
Ekaterina Kochmar, Dung Do Vu, Robert Belfer, Varun Gupta, Iulian Vlad Serban and Joelle Pineau
International Journal of Artificial Intelligence in Education
(2021-07-27)
link.springer.comPDF
OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation
Jongmin Lee, Wonseok Jeon, Byung-Jun Lee, Joelle Pineau and Kee-Eung Kim
Multi-Task Reinforcement Learning with Context-based Representations
Shagun Sodhani, Amy Zhang and Joelle Pineau

2021-06

Do Encoder Representations of Generative Dialogue Models Encode Sufficient Information about the Task
Prasanna Parthasarathi, Joelle Pineau and Sarath Chandar
arXiv preprint arXiv:2106.10622
(2021-06-20)
dblp.uni-trier.dePDF
A Brief Study on the Effects of Training Generative Dialogue Models with a Semantic loss
Prasanna Parthasarathi, Mohamed Abdelsalam, Joelle Pineau and Sarath Chandar
arXiv preprint arXiv:2106.10619
(2021-06-20)
arxiv.orgPDF
SPeCiaL: Self-Supervised Pretraining for Continual Learning.
Lucas Caccia and Joelle Pineau
arXiv preprint arXiv:2106.09065
(2021-06-16)
ui.adsabs.harvard.eduPDF

2021-05

Exploring Zero-Shot Emergent Communication in Embodied Multi-Agent Populations
Kalesha Bullard, Franziska Meier, Douwe Kiela, Joelle Pineau and Jakob Nicolaus Foerster
arXiv e-prints
(2021-05-04)
ui.adsabs.harvard.eduPDF
Correcting Momentum in Temporal Difference Learning
Emmanuel Bengio, Joelle Pineau and Doina Precup
arXiv e-prints
(2021-05-04)
ui.adsabs.harvard.eduPDF
GraphLog: A Benchmark for Measuring Logical Generalization in Graph Neural Networks
Koustuv Sinha, Shagun Sodhani, Joelle Pineau and William L. Hamilton
(venue unknown)
(2021-05-04)
openreview.netPDF
Learning Robust State Abstractions for Hidden-Parameter Block MDPs
Amy Zhang, Shagun Sodhani, Khimya Khetarpal and Joelle Pineau
Regularized Inverse Reinforcement Learning
Wonseok Jeon, Chen-Yang Su, Paul Barde, Thang Doan, Derek Nowrouzezahrai and Joelle Pineau

2021-04

Sometimes We Want Translationese.
Prasanna Parthasarathi, Koustuv Sinha, Joelle Pineau and Adina Williams
arXiv preprint arXiv:2104.07623
(2021-04-15)
ui.adsabs.harvard.eduPDF
Masked Language Modeling and the Distributional Hypothesis: Order Word Matters Pre-training for Little
Koustuv Sinha, Robin Jia, Dieuwke Hupkes, Joelle Pineau, Adina Williams and Douwe Kiela
Reducing Representation Drift in Online Continual Learning.
Lucas Caccia, Rahaf Aljundi, Tinne Tuytelaars, Joelle Pineau and Eugene Belilovsky
arXiv preprint arXiv:2104.05025
(2021-04-11)
ui.adsabs.harvard.eduPDF
Exploring the Limits of Few-Shot Link Prediction in Knowledge Graphs.
Dora Jambor, Komal K. Teru, Joelle Pineau and William L. Hamilton

2021-03

Quasi-Equivalence Discovery for Zero-Shot Emergent Communication.
Kalesha Bullard, Douwe Kiela, Joelle Pineau and Jakob N. Foerster
arXiv preprint arXiv:2103.08067
(2021-03-14)
ui.adsabs.harvard.eduPDF
Model-Invariant State Abstractions for Model-Based Reinforcement Learning
Manan Tomar, Amy Zhang, Roberto Calandra, Matthew E. Taylor and Joelle Pineau
Resolving Causal Confusion in Reinforcement Learning via Robust Exploration
Clare Lyle, Amy Zhang, Minqi Jiang, Joelle Pineau and Yarin Gal
ICLR 2021
(2021-03-09)
openreview.netPDF

2021-02

Domain Adversarial Reinforcement Learning.
Bonnie Li, Vincent François-Lavet, Thang Doan and Joelle Pineau
arXiv preprint arXiv:2102.07097
(2021-02-14)
ui.adsabs.harvard.eduPDF

2021-01

COVID-19 Prognosis via Self-Supervised Representation Learning and Multi-Image Prediction
Anuroop Sriram, Matthew J. Muckley, Koustuv Sinha, Farah Shamout, Joelle Pineau, Krzysztof J. Geras, Lea Azour, Yindalon Aphinyanaphongs, Nafissa Yakubova and William Moore
arXiv preprint arXiv:2101.04909
(2021-01-24)
europepmc.orgPDF
COVID-19 Deterioration Prediction via Self-Supervised Representation Learning and Multi-Image Prediction
Anuroop Sriram, Matthew Muckley, Koustuv Sinha, Farah Shamout, Joelle Pineau, Krzysztof J Geras, Lea Azour, Yindalon Aphinyanaphongs, Nafissa Yakubova and William Moore
arXiv: Computer Vision and Pattern Recognition
(2021-01-13)
europepmc.org
Democratising the Digital Revolution: The Role of Data Governance
Sylvie Delacroix, Joelle Pineau and Jessica Montgomery
Reflections on Artificial Intelligence for Humanity
(2021-01-01)
doi.org[Also on Social Science Research Network (2020-06-30)]
TDprop: Does Adaptive Optimization With Jacobi Preconditioning Help Temporal Difference Learning?
Joshua Romoff, Peter Henderson, David Kanaa, Emmanuel Bengio, Ahmed Touati, Pierre-Luc Bacon and Joelle Pineau
Autonomous Agents and Multi-Agent Systems
(2021-01-01)
dl.acm.org

2020-12

Intervention Design for Effective Sim2Real Transfer.
Melissa Mozifian, Amy Zhang, Joelle Pineau and David Meger
arXiv preprint arXiv:2012.02055
(2020-12-03)
ui.adsabs.harvard.eduPDF

2020-11

RapidBrachyDL: Rapid Radiation Dose Calculations in Brachytherapy Via Deep Learning.
Ximeng Mao, Joelle Pineau, Roy Keyes and Shirin A. Enger
International Journal of Radiation Oncology Biology Physics
(2020-11-01)
www.sciencedirect.com

2020-10

The Bottleneck Simulator: A Model-Based Deep Reinforcement Learning Approach
Iulian Vlad Serban, Chinnadhurai Sankar, Michael Pieper, Joelle Pineau and Yoshua Bengio
Journal of Artificial Intelligence Research
(2020-10-27)
dblp.uni-trier.dePDF[Also on arXiv preprint arXiv:1807.04723 (2018-07-12)]
Transparency and reproducibility in artificial intelligence.
Benjamin Haibe-Kains, George Alexandru Adam, Ahmed Hosny, Farnoosh Khodakarami, Levi Waldron, Bo Wang, Chris McIntosh, Anna Goldenberg, Anshul Kundaje, Casey S Greene, Tamara Broderick, Michael M Hoffman, Jeffrey T Leek, Keegan Korthauer, Wolfgang Huber, Alvis Brazma, Joelle Pineau, Robert Tibshirani, Trevor Hastie, John P A Ioannidis... (2 more)
Nature
(2020-10-14)
europepmc.org

2020-08

Stable Policy Optimization via Off-Policy Divergence Regularization.
Ahmed Touati, Amy Zhang, Joelle Pineau and Pascal Vincent
How To Evaluate Your Dialogue System: Probe Tasks as an Alternative for Token-level Evaluation Metrics
Prasanna Parthasarathi, Joelle Pineau and Sarath Chandar
arXiv preprint arXiv:2008.10427
(2020-08-24)
ui.adsabs.harvard.eduPDF

2020-07

Plan2Vec: Unsupervised Representation Learning by Latent Plans
Ge Yang, Amy Zhang, Ari S. Morcos, Joelle Pineau, Pieter Abbeel and Roberto Calandra
Multi-Task Reinforcement Learning as a Hidden-Parameter Block MDP
Amy Zhang, Shagun Sodhani, Khimya Khetarpal and Joelle Pineau
(venue unknown)
(2020-07-14)
dblp.uni-trier.de
Building reproducible, reusable, and robust machine learning software
DEBS 2020
(2020-07-13)
dblp.uni-trier.de
Constrained Markov Decision Processes via Backward Value Functions
Harsh Satija, Philip Amortila and Joelle Pineau
Invariant Causal Prediction for Block MDPs
Clare Lyle, Amy Zhang, Angelos Filos, Shagun Sodhani, Marta Kwiatkowska, Yarin Gal, Doina Precup and Joelle Pineau
ICML 2020
(2020-07-12)
proceedings.mlr.pressPDF
Online Learned Continual Compression with Adaptive Quantization Modules
Lucas Caccia, Eugene Belilovsky, Massimo Caccia and Joelle Pineau
Interference and Generalization in Temporal Difference Learning
Emmanuel Bengio, Joelle Pineau and Doina Precup
Handling Black Swan Events in Deep Learning with Diversely Extrapolated Neural Networks
Maxime Wabartha, Audrey Durand, Vincent François-Lavet and Joelle Pineau
IJCAI 2020
(2020-07-09)
www.ijcai.orgPDF
On Overfitting and Asymptotic Bias in Batch Reinforcement Learning with Partial Observability (Extended Abstract).
Vincent Francois-Lavet, Guillaume Rabusseau, Joelle Pineau, Damien Ernst and Raphael Fonteneau
IJCAI 2020
(2020-07-09)
www.ijcai.orgPDF
Automated Personalized Feedback Improves Learning Gains in An Intelligent Tutoring System
Ekaterina Kochmar, Dung Do Vu, Robert Belfer, Varun Gupta, Iulian Vlad Serban and Joelle Pineau
TDprop: Does Jacobi Preconditioning Help Temporal Difference Learning?
Joshua Romoff, Peter Henderson, David Kanaa, Emmanuel Bengio, Ahmed Touati, Pierre-Luc Bacon and Joelle Pineau
arXiv preprint arXiv:2007.02786
(2020-07-06)
ui.adsabs.harvard.eduPDF
A Large-Scale, Open-Domain, Mixed-Interface Dialogue-Based ITS for STEM
Iulian Vlad Serban, Varun Gupta, Ekaterina Kochmar, Dung Do Vu, Robert Belfer, Joelle Pineau, Aaron C. Courville, Laurent Charlin and Yoshua Bengio
Deep interpretability for GWAS.
Deepak Sharma, Audrey Durand, Marc-André Legault, Louis-Philippe Lemieux Perreault, Audrey Lemaçon, Marie-Pierre Dubé and Joelle Pineau
arXiv preprint arXiv:2007.01516
(2020-07-03)
ui.adsabs.harvard.eduPDF
Development of a polygenic risk score to improve screening for fracture risk: A genetic risk prediction study
Vincenzo Forgetta, Julyan Keller-Baruch, Marie Forest, Audrey Durand, Sahir Bhatnagar, John P Kemp, Maria Nethander, Daniel Evans, John A Morris, Douglas P Kiel, Fernando Rivadeneira, Helena Johansson, Nicholas C Harvey, Dan Mellström, Magnus Karlsson, Cyrus Cooper, David M Evans, Robert Clarke, John A Kanis, Eric Orwoll... (6 more)
PLOS Medicine
(2020-07-02)
doaj.orgPDF
Learning an Unreferenced Metric for Online Dialogue Evaluation.
Koustuv Sinha, Prasanna Parthasarathi, Jasmine Wang, Ryan Lowe, William L. Hamilton and Joelle Pineau

2020-06

Adversarial Soft Advantage Fitting: Imitation Learning without Policy Optimization
Paul Barde, Julien Roy, Wonseok Jeon, Joelle Pineau, Christopher J. Pal and Derek Nowrouzezahrai
Evaluating Logical Generalization in Graph Neural Networks.
Koustuv Sinha, Shagun Sodhani, Joelle Pineau and William L. Hamilton
arXiv: Learning
(2020-06-12)
ui.adsabs.harvard.eduPDF
Machine Learning for COVID-19 needs global collaboration and data-sharing
Nathan Peiffer-Smadja, Redwan Maatoug, François Xavier Lescure, Eric D’Ortenzio, Joëlle Pineau and Jean Rémi King
Nature Machine Intelligence
(2020-06-01)
www.nature.com

2020-04

On the interaction between supervision and self-play in emergent communication
Ryan Lowe, Abhinav Gupta, Jakob Foerster, Douwe Kiela and Joelle Pineau
Language GANs Falling Short
Massimo Caccia, Lucas Caccia, William Fedus, Hugo Larochelle, Joelle Pineau and Laurent Charlin
Literature Mining for Incorporating Inductive Bias in Biomedical Prediction Tasks (Student Abstract)
Qizhen Zhang, Audrey Durand and Joelle Pineau
AAAI 2020
(2020-04-03)
aiide.org
Exploiting Spatial Invariance for Scalable Unsupervised Object Tracking
Eric Crawford and Joelle Pineau

2020-03

Invariant Causal Prediction for Block MDPs.
Amy Zhang, Clare Lyle, Shagun Sodhani, Angelos Filos, Marta Kwiatkowska, Joelle Pineau, Yarin Gal and Doina Precup
arXiv preprint arXiv:2003.06016
(2020-03-12)
ui.adsabs.harvard.eduPDF

2020-02

The importance of transparency and reproducibility in artificial intelligence research
Benjamin Haibe-Kains, George Alexandru Adam, Ahmed Hosny, Farnoosh Khodakarami, Levi Waldron, Bo Wang, Chris McIntosh, Anshul Kundaje, Casey S. Greene, Michael M. Hoffman, Jeffrey T. Leek, Wolfgang Huber, Alvis Brazma, Joelle Pineau, Robert Tibshirani, Trevor Hastie, John P.A. Ioannidis, John Quackenbush and Hugo J.W.L. Aerts
arXiv preprint arXiv:2003.00898
(2020-02-28)
ui.adsabs.harvard.eduPDF
Scalable Multi-Agent Inverse Reinforcement Learning via Actor-Attention-Critic.
Wonseok Jeon, Paul Barde, Derek Nowrouzezahrai and Joelle Pineau
arXiv preprint arXiv:2002.10525
(2020-02-24)
ui.adsabs.harvard.eduPDF
Representation of Reinforcement Learning Policies in Reproducing Kernel Hilbert Spaces.
Bogdan Mazoure, Thang Doan, Tianyu Li, Vladimir Makarenkov, Joelle Pineau, Doina Precup and Guillaume Rabusseau
arXiv preprint arXiv:2002.02863
(2020-02-07)
ui.adsabs.harvard.eduPDF
Provably efficient reconstruction of policy networks.
Bogdan Mazoure, Thang Doan, Tianyu Li, Vladimir Makarenkov, Joelle Pineau, Doina Precup and Guillaume Rabusseau
arXiv: Learning
(2020-02-07)
dblp.uni-trier.dePDF

2020-01

Towards the Systematic Reporting of the Energy and Carbon Footprints of Machine Learning
Peter Henderson, Jieru Hu, Joshua Romoff, Emma Brunskill, Dan Jurafsky and Joelle Pineau
Novelty Search in Representational Space for Sample Efficient Exploration
Ruo Yu Tao, Vincent Francois-Lavet and Joelle Pineau
Machine learning to predict osteoporotic fracture risk from genotypes
Vincenzo Forgetta, Julyan Keller-Baruch, Marie Forest, Audrey Durand, Sahir Bhatnagar, John Kemp, John A. Morris, John A. Kanis, Douglas P. Kiel, Eugene V. McCloskey, Fernando Rivadeneira, Helena Johannson, Nicholas Harvey, Cyrus Cooper, David M. Evans, Joelle Pineau, William D. Leslie, Celia M. T. Greenwood and J. Brent Richards

2019-12

Off-Policy Policy Gradient Algorithms by Constraining the State Distribution Shift. (arXiv:1911.06970v2 [cs.LG] UPDATED)
Riashat Islam, Komal K. Teru, Deepak Sharma and Joelle Pineau
arXiv Computer Science
(2019-12-03)

2019-11

Online Learned Continual Compression with Stacked Quantization Module.
Lucas Caccia, Eugene Belilovsky, Massimo Caccia and Joelle Pineau
(venue unknown)
(2019-11-19)
dblp.uni-trier.de
Seeded self-play for language learning
Abhinav Gupta, Ryan Lowe, Jakob N. Foerster, Douwe Kiela and Joelle Pineau
EMNLP 2019
(2019-11-01)
www.aclweb.org
CLUTRR: A Diagnostic Benchmark for Inductive Reasoning from Text
Koustuv Sinha, Shagun Sodhani, Jin Dong, Joelle Pineau and William L. Hamilton
Deep Generative Modeling of LiDAR Data
Lucas Caccia, Herke van Hoof, Aaron Courville and Joelle Pineau

2019-10

MVFST-RL: An Asynchronous RL Framework for Congestion Control with Delayed Actions.
Viswanath Sivakumar, Tim Rocktäschel, Alexander H. Miller, Heinrich Küttler, Nantas Nardelli, Mike Rabbat, Joelle Pineau and Sebastian Riedel
arXiv preprint arXiv:1910.04054
(2019-10-09)
ui.adsabs.harvard.eduPDF
Recurrent Value Functions. (arXiv:1905.09562v1 [cs.LG])
Pierre Thodoroff, Nishanth Anand, Lucas Caccia, Doina Precup and Joelle Pineau
arXiv Computer Science
(2019-10-07)
Benchmarking Batch Deep Reinforcement Learning Algorithms.
Scott Fujimoto, Edoardo Conti, Mohammad Ghavamzadeh and Joelle Pineau
arXiv preprint arXiv:1910.01708
(2019-10-03)
ui.adsabs.harvard.eduPDF

2019-09

Assessing Generalization in TD methods for Deep Reinforcement Learning
Emmanuel Bengio, Doina Precup and Joelle Pineau
(venue unknown)
(2019-09-25)
openreview.netPDF
Attraction-Repulsion Actor-Critic for Continuous Control Reinforcement Learning
Thang Doan, Bogdan Mazoure, Audrey Durand, Joelle Pineau and R Devon Hjelm
arXiv: Learning
(2019-09-25)
ui.adsabs.harvard.eduPDF
Improving Sample Efficiency in Model-Free Reinforcement Learning from Images
Denis Yarats, Amy Zhang, Ilya Kostrikov, Brandon Amos, Joelle Pineau and Rob Fergus
No Press Diplomacy: Modeling Multi-Agent Gameplay
Philip Paquette, Yuchen Lu, Steven Bocco, Max O. Smith, Satya Ortiz-Gagne, Jonathan K. Kummerfeld, Satinder Singh, Joelle Pineau and Aaron Courville
arXiv preprint arXiv:1909.02128
(2019-09-04)
ui.adsabs.harvard.eduPDF
No-Press Diplomacy: Modeling Multi-Agent Gameplay
Philip Paquette, Yuchen Lu, Seton Steven Bocco, Max Smith, Satya O.-G., Jonathan K. Kummerfeld, Joelle Pineau, Satinder Singh and Aaron C. Courville
NEURIPS 2019
(2019-09-04)
papers.nips.ccPDF

2019-07

Spatially Invariant Unsupervised Object Detection with Convolutional Neural Networks
Eric Crawford and Joelle Pineau
AAAI 2019
(2019-07-17)
aaai.org
On-line Adaptative Curriculum Learning for GANs
Thang Doan, João Monteiro, Isabela Albuquerque, Bogdan Mazoure, Audrey Durand, Joelle Pineau and R. Devon Hjelm
Combined Reinforcement Learning via Abstract Representations
Vincent Francois-Lavet, Yoshua Bengio, Doina Precup and Joelle Pineau

2019-06

Learning Causal State Representations of Partially Observable Environments
Amy Zhang, Zachary C. Lipton, Luis Pineda, Kamyar Azizzadenesheli, Anima Anandkumar, Laurent Itti, Joelle Pineau and Tommaso Furlanello
arXiv preprint arXiv:1906.10437
(2019-06-25)
ui.adsabs.harvard.eduPDF
Gossip-based Actor-Learner Architectures for Deep Reinforcement Learning
Mahmoud Assran, Joshua Romoff, Nicolas Ballas, Joelle Pineau and Michael Rabbat

2019-05

Separating value functions across time-scales
Joshua Romoff, Peter Henderson, Ahmed Touati, Emma Brunskill, Joelle Pineau and Yann Ollivier
TarMAC: Targeted Multi-Agent Communication
Abhishek Das, Théophile Gervet, Joshua Romoff, Dhruv Batra, Devi Parikh, Michael G. Rabbat and Joelle Pineau
Recurrent Value Functions.
Pierre Thodoroff, Nishanth Anand, Lucas Caccia, Doina Precup and Joelle Pineau
arXiv preprint arXiv:1905.09562
(2019-05-23)
ui.adsabs.harvard.eduPDF
ICLR Reproducibility Challenge 2019
Joelle Pineau, Koustuv Sinha, Genevieve Fried, Rosemary Nan Ke and Hugo Larochelle
(venue unknown)
(2019-05-22)
zenodo.orgPDF
Leveraging exploration in off-policy algorithms via normalizing flows
Bogdan Mazoure, Thang Doan, Audrey Durand, R Devon Hjelm and Joelle Pineau
arXiv preprint arXiv:1905.06893
(2019-05-16)
ui.adsabs.harvard.eduPDF
On the Pitfalls of Measuring Emergent Communication
Ryan Lowe, Jakob Foerster, Y-Lan Boureau, Joelle Pineau and Yann Dauphin
Task-Agnostic Reinforcement Learning (TARL)
Danijar Hafner, Deepak Pathak, Frederik Ebert, Marc G Bellemare, Raia Hadsell, Rowan McAllister, Amy Zhang, Joelle Pineau, Ahmed Touati and Roberto Calandra
ICLR 2019
(2019-05-06)
iclr.cc
On overfitting and asymptotic bias in batch reinforcement learning with partial observability
Vincent Francois-Lavet, Guillaume Rabusseau, Joelle Pineau, Damien Ernst and Raphael Fonteneau
Journal of Artificial Intelligence Research
(2019-05-05)
orbi.ulg.ac.bePDF

2019-04

Multitask Metric Learning: Theory and Algorithm
Boyu Wang, Hejia Zhang, Peng Liu, Zebang Shen and Joelle Pineau
AISTATS 2019
(2019-04-11)
proceedings.mlr.pressPDF

2019-03

An Introduction to Deep Reinforcement Learning
Vincent François-Lavet, Peter Henderson, Riashat Islam, Marc G. Bellemare and Joelle Pineau

2019-01

The Second Conversational Intelligence Challenge (ConvAI2)
Emily Dinan, Varvara Logacheva, Valentin Malykh, Alexander H. Miller, Kurt Shuster, Jack Urbanek, Douwe Kiela, Arthur Szlam, Iulian Serban, Ryan Lowe, Shrimai Prabhumoye, Alan W. Black, Alexander I. Rudnicky, Jason Williams, Joelle Pineau, Mikhail S. Burtsev and Jason Weston
arXiv preprint arXiv:1902.00098
(2019-01-31)
ui.adsabs.harvard.eduPDF
Leveraging exploration in off-policy algorithms via normalizing flows.
Bogdan Mazoure, Thang Doan, Audrey Durand, Joelle Pineau and R. Devon Hjelm
Conference on Robot Learning
(2019-01-01)
proceedings.mlr.pressPDF
When AIs Outperform Doctors: Confronting the Challenges of a Tort-Induced Over-Reliance on Machine Learning
A. Michael Froomkin, Ian Kerr and Joelle Pineau

2018-12

Ethical Challenges in Data-Driven Dialogue Systems
Peter Henderson, Koustuv Sinha, Nicolas Angelard-Gontier, Nan Rosemary Ke, Genevieve Fried, Ryan Lowe and Joelle Pineau
AAAI 2018
(2018-12-27)
dl.acm.org
An Introduction to Deep Reinforcement Learning
Vincent François-Lavet, Peter Henderson, Riashat Islam, Marc G. Bellemare and Joelle Pineau
(venue unknown)
(2018-12-20)
nowpublishers.comPDF

2018-11

Contextual Bandits for Adapting Treatment in a Mouse Model of de Novo Carcinogenesis
Audrey Durand, Charis Achilleos, Demetris Iacovides, Katerina Strati, Georgios D. Mitsis and Joelle Pineau
Machine Learning for Healthcare Conference
(2018-11-29)
proceedings.mlr.pressPDF
Natural Environment Benchmarks for Reinforcement Learning
Amy Zhang, Yuxin Wu and Joelle Pineau
arXiv preprint arXiv:1811.06032
(2018-11-14)
dblp.uni-trier.dePDF
Compositional Language Understanding with Text-based Relational Reasoning.
Koustuv Sinha, Shagun Sodhani, William L. Hamilton and Joelle Pineau
arXiv preprint arXiv:1811.02959
(2018-11-07)
ui.adsabs.harvard.eduPDF
The RLLChatbot: a solution to the ConvAI challenge.
Nicolas Gontier, Koustuv Sinha, Peter Henderson, Iulian Serban, Michael Noseworthy, Prasanna Parthasarathi and Joelle Pineau
arXiv preprint arXiv:1811.02714
(2018-11-07)
dblp.uni-trier.dePDF
Adversarial Gain
Peter Henderson, Koustuv Sinha, Rosemary Nan Ke and Joelle Pineau
arXiv: Learning
(2018-11-04)
arxiv.orgPDF
Temporal Regularization in Markov Decision Process
arXiv: Learning
(2018-11-01)
ui.adsabs.harvard.eduPDF

2018-10

Where Did My Optimum Go?: An Empirical Analysis of Gradient Descent Optimization in Policy Gradient Methods.
Peter Henderson, Joshua Romoff and Joelle Pineau
arXiv preprint arXiv:1810.02525
(2018-10-05)
ui.adsabs.harvard.eduPDF

2018-07

Focused Hierarchical RNNs for Conditional Sequence Processing
Nan Rosemary Ke, Konrad Zolna, Alessandro Sordoni, Zhouhan Lin, Adam Trischler, Yoshua Bengio, Joelle Pineau, Laurent Charlin and Christopher J. Pal

2018-06

A Dissection of Overfitting and Generalization in Continuous Reinforcement Learning
Amy Zhang, Nicolas Ballas and Joelle Pineau
arXiv preprint arXiv:1806.07937
(2018-06-20)
ui.adsabs.harvard.eduPDF
RE-EVALUATE: Reproducibility in Evaluating Reinforcement Learning Algorithms
Khimya Khetarpal, Zafarali Ahmed, Andre Cianflone, Riashat Islam and Joelle Pineau
(venue unknown)
(2018-06-10)
openreview.netPDF
Randomized Value Functions via Multiplicative Normalizing Flows
Ahmed Touati, Harsh Satija, Joshua Romoff, Joelle Pineau and Pascal Vincent
A Survey of Available Corpora For Building Data-Driven Dialogue Systems: The Journal Version
Iulian Vlad Serban, Ryan Lowe, Peter Henderson, Laurent Charlin and Joelle Pineau
Dialogue & Discourse
(2018-06-01)
dad.uni-bielefeld.de

2018-02

Disentangling the independently controllable factors of variation by interacting with the world
Valentin Thomas, Emmanuel Bengio, William Fedus, Jules Pondard, Philippe Beaudoin, Hugo Larochelle, Joelle Pineau, Doina Precup and Yoshua Bengio
arXiv preprint arXiv:1802.09484
(2018-02-26)
ui.adsabs.harvard.eduPDF
Sequential Coordination of Deep Models for Learning Visual Arithmetic
arXiv preprint arXiv:1809.04988
(2018-02-15)
dblp.uni-trier.dePDF
An inference-based policy gradient method for learning options
Matthew J. A. Smith, Herke van Hoof and Joelle Pineau
ICML 2018
(2018-02-15)
proceedings.mlr.pressPDF
Reward Estimation for Variance Reduction in Deep Reinforcement Learning
Joshua Romoff, Peter Henderson, Alexandre Piché, Vincent François-Lavet and Joelle Pineau
Decoupling Dynamics and Reward for Transfer Learning.
Amy Zhang, Harsh Satija and Joelle Pineau
OptionGAN: Learning Joint Reward-Policy Options using Generative Adversarial Inverse Reinforcement Learning
Peter Henderson, Wei-Di Chang, Pierre-luc Bacon, David Meger, Joelle Pineau and Doina Precup
AAAI 2018
(2018-02-07)
ui.adsabs.harvard.eduPDF
Deep Reinforcement Learning that Matters
Peter Henderson, Riashat Islam, Joelle Pineau, David Meger, Doina Precup and Philip Bachman
AAAI 2018
(2018-02-07)
ui.adsabs.harvard.eduPDF

2018-01

A Deep Reinforcement Learning Chatbot (Short Version)
Iulian Vlad Serban, Chinnadhurai Sankar, Mathieu Germain, Saizheng Zhang, Zhouhan Lin, Sandeep Subramanian, Taesup Kim, Michael Pieper, Sarath Chandar, Nan Rosemary Ke, Sai Rajeswar, Alexandre de Brébisson, Jose M. R. Sotelo, Dendi Suhubdy, Vincent Michalski, Alexandre Nguyen, Joelle Pineau and Yoshua Bengio
arXiv preprint arXiv:1801.06700
(2018-01-20)
ui.adsabs.harvard.eduPDF
Genomic Prediction of Osteoporosis Using 426,000 Individuals from UK Biobank
Vincenzo Forgetta, Julyan Keller-Baruch, Marie Forest, Audrey Durand, Sahir Bhatnagar, John Kemp, John Morris, John Kanis, Douglas Kiel, Eugene Mccloskey, Helena Johansson, Nicholas Harvey, Dave Evans, Joelle Pineau, William Leslie, Celia M. T. Greenwood and J. Brent Richards
Journal of Bone and Mineral Research
(2018-01-01)
espace.library.uq.edu.au
Temporal Regularization for Markov Decision Process
NEURIPS 2018
(2018-01-01)
papers.nips.ccPDF
A Decision-Theoretic Approach for the Collaborative Control of a Smart Wheelchair
Mahmoud Ghorbel, Joelle Pineau, Richard Gourdeau, Shervin Javdani and Siddhartha S. Srinivasa
International Journal of Social Robotics
(2018-01-01)
europepmc.org
Streaming kernel regression with provably adaptive mean, variance, and regularization
Audrey Durand, Odalric-Ambrym Maillard and Joelle Pineau
Journal of Machine Learning Research
(2018-01-01)
ui.adsabs.harvard.eduPDF
Reward Estimation for Variance Reduction in Deep Reinforcement Learning.
Joshua Romoff, Alexandre Piché, Peter Henderson, Vincent François-Lavet and Joelle Pineau
ICLR 2018
(2018-01-01)
dblp.uni-trier.de
Extending Neural Generative Conversational Model using External Knowledge Sources
Prasanna Parthasarathi and Joelle Pineau

Publications collected and formatted using Paperoni