Mila > Team > Doina Precup

Doina Precup

Core Academic Member
Associate Professor, McGill University, DeepMind, Canada CIFAR AI Chair

Doina Precup teaches at McGill while conducting fundamental research on reinforcement learning, working in particular on AI applications in areas that have a social impact, such as health care. She’s interested in machine decision-making in situations where uncertainty is high.

She is a senior fellow of the Canadian Institute for Advanced Research, fellow of the Association for the Advancement of Artificial Intelligence and she also heads the Montreal office of Deepmind.

Specialist In: Artificial intelligence, machine learning, reinforcement learning, reasoning and planning under uncertainty, applications.

Publications

2021-12

Gradient Starvation: A Learning Proclivity in Neural Networks
Mohammad Pezeshki, Oumar Kaba, Yoshua Bengio, Aaron C. Courville, Doina Precup and Guillaume Lajoie
NEURIPS 2021
(2021-12-06)
papers.nips.ccPDF
Flexible Option Learning
Martin Klissarov and Doina Precup
NEURIPS 2021
(2021-12-06)
papers.nips.cc
On the Expressivity of Markov Reward
David Abel, Will Dabney, Anna Harutyunyan, Mark K. Ho, Michael L. Littman, Doina Precup and Satinder Singh
Temporally Abstract Partial Models
Khimya Khetarpal, Zafarali Ahmed, Gheorghe Comanici and Doina Precup
A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning
Mingde Zhao, Zhen Liu, Sitao Luan, Shuyuan Zhang, Doina Precup and Yoshua Bengio
Flow Network based Generative Models for Non-Iterative Diverse Candidate Generation
Emmanuel Bengio, Moksh Jain, Maksym Korablyov, Doina Precup and Yoshua Bengio

2021-11

Estimating treatment effect for individuals with progressive multiple sclerosis using deep learning
Jean-Pierre R. Falet, Joshua Durso-Finley, Brennan Nichyporuk, Julien Schroeter, Francesca Bovis, Maria-Pia Sormani, Doina Precup, Tal Arbel and Douglas Lorne Arnold
medRxiv
(2021-11-01)
www.medrxiv.org

2021-10

Temporal Abstraction in Reinforcement Learning with the Successor Representation.
Marlos C. Machado, André Barreto and Doina Precup
arXiv preprint arXiv:2110.05740
(2021-10-12)
ui.adsabs.harvard.eduPDF
Reward is enough
David Silver, Satinder P. Singh, Doina Precup and Richard S. Sutton
Artificial Intelligence
(2021-10-01)
www.sciencedirect.com

2021-09

Is Heterophily A Real Nightmare For Graph Neural Networks To Do Node Classification
Sitao Luan, Chenqing Hua, Qincheng Lu, Jiaqi Zhu, Mingde Zhao, Shuyuan Zhang, Xiao-Wen Chang and Doina Precup
arXiv preprint arXiv:2109.05641
(2021-09-12)
ui.adsabs.harvard.eduPDF
Where Did You Learn That From? Surprising Effectiveness of Membership Inference Attacks Against Temporally Correlated Data in Deep Reinforcement Learning.
Maziar Gomrokchi, Susan Amin, Hossein Aboutalebi, Alexander Wong and Doina Precup
arXiv preprint arXiv:2109.03975
(2021-09-08)
dblp.uni-trier.dePDF
A Survey of Exploration Methods in Reinforcement Learning.
Susan Amin, Maziar Gomrokchi, Harsh Satija, Herke van Hoof and Doina Precup
arXiv preprint arXiv:2109.00157
(2021-09-01)
ui.adsabs.harvard.eduPDF

2021-08

Policy Gradients Incorporating the Future
David Venuto, Elaine Lau, Doina Precup and Ofir Nachum
arXiv preprint arXiv:2108.02096
(2021-08-04)
ui.adsabs.harvard.eduPDF

2021-07

Preferential Temporal Difference Learning
Nishanth Anand and Doina Precup
A Deep Reinforcement Learning Approach to Marginalized Importance Sampling with the Successor Representation
Scott Fujimoto, David Meger and Doina Precup
Randomized Exploration in Reinforcement Learning with General Value Function Approximation
Haque Ishfaq, Qiwen Cui, Viet Nguyen, Alex Ayoub, Zhuoran Yang, Zhaoran Wang, Doina Precup and Lin Yang
ICML 2021
(2021-07-18)
proceedings.mlr.pressPDF
Locally Persistent Exploration in Continuous Control Tasks with Sparse Rewards
Susan Amin, Maziar Gomrokchi, Hossein Aboutalebi, Harsh Satija and Doina Precup

2021-06

Randomized Exploration for Reinforcement Learning with General Value Function Approximation
Haque Ishfaq, Qiwen Cui, Viet Nguyen, Alex Ayoub, Zhuoran Yang, Zhaoran Wang, Doina Precup and Lin F. Yang
arXiv preprint arXiv:2106.07841
(2021-06-15)
export.arxiv.orgPDF
Improving Long-Term Metrics in Recommendation Systems using Short-Horizon Reinforcement Learning.
Bogdan Mazoure, Paul Mineiro, Pavithra Srinath, Reza Sharifi Sedeh, Doina Precup and Adith Swaminathan
arXiv: Learning
(2021-06-01)
arxiv.orgPDF
Improving Long-Term Metrics in Recommendation Systems using Short-Horizon Offline RL.
Bogdan Mazoure, Paul Mineiro, Pavithra Srinath, Reza Sharifi Sedeh, Doina Precup and Adith Swaminathan
arXiv preprint arXiv:2106.00589
(2021-06-01)
ui.adsabs.harvard.eduPDF

2021-05

AndroidEnv: A Reinforcement Learning Platform for Android.
Daniel Toyama, Philippe Hamel, Anita Gergely, Gheorghe Comanici, Amelia Glaese, Zafarali Ahmed, Tyler Jackson, Shibl Mourad and Doina Precup
arXiv preprint arXiv:2105.13231
(2021-05-27)
ui.adsabs.harvard.eduPDF
Practical Marginalized Importance Sampling with the Successor Representation
Scott Fujimoto, David Meger and Doina Precup
(venue unknown)
(2021-05-04)
openreview.netPDF
Correcting Momentum in Temporal Difference Learning
Emmanuel Bengio, Joelle Pineau and Doina Precup
arXiv e-prints
(2021-05-04)
ui.adsabs.harvard.eduPDF
Offline Policy Optimization with Variance Regularization
Riashat Islam, Samarth Sinha, Homanga Bharadhwaj, Samin Yeasar Arnob, Zhuoran Yang, Zhaoran Wang, Animesh Garg, Lihong Li and Doina Precup
(venue unknown)
(2021-05-04)
openreview.netPDF
Conditional Networks
Anthony Ortiz, Kris Sankaran, Olac Fuentes, Christopher Kiekintveld, Pascal Vincent, Yoshua Bengio and Doina Precup
(venue unknown)
(2021-05-04)
openreview.net

2021-04

What is Going on Inside Recurrent Meta Reinforcement Learning Agents
Safa Alver and Doina Precup
arXiv preprint arXiv:2104.14644
(2021-04-29)
ui.adsabs.harvard.eduPDF

2021-03

Training a First-Order Theorem Prover from Synthetic Data
Vlad Firoiu, Eser Aygun, Ankit Anand, Zafarali Ahmed, Xavier Glorot, Laurent Orseau, Lei Zhang, Doina Precup and Shibl Mourad
arXiv preprint arXiv:2103.03798
(2021-03-05)
ui.adsabs.harvard.eduPDF

2021-02

Optimal Spectral-Norm Approximate Minimization of Weighted Finite Automata
Borja Balle, Clara Lacroce, Prakash Panangaden, Doina Precup and Guillaume Rabusseau
Variance Penalized On-Policy and Off-Policy Actor-Critic
Arushi Jain, Gandharv Patil, Ayush Jain, Khimya Khetarpal and Doina Precup

2021-01

Self-Supervised Attention-Aware Reinforcement Learning.
Haiping Wu, Khimya Khetarpal and Doina Precup
AAAI 2021
(2021-01-01)
dblp.uni-trier.de
Safe Option-Critic: Learning Safety in the Option-Critic Architecture
Arushi Jain, Khimya Khetarpal and Doina Precup

2020-12

Towards Continual Reinforcement Learning: A Review and Perspectives.
Khimya Khetarpal, Matthew Riemer, Irina Rish and Doina Precup
arXiv preprint arXiv:2012.13490
(2020-12-25)
ui.adsabs.harvard.eduPDF
Phylogenetic Manifold Regularization: A semi-supervised approach to predict transcription factor binding sites
Faizy Ahsan, Alexandre Drouin, Francois Laviolette, Doina Precup and Mathieu Blanchette
BIBM 2020
(2020-12-16)
dblp.uni-trier.de
Fast reinforcement learning with generalized policy updates
André Barreto, Shaobo Hou, Diana Borsa, David Silver and Doina Precup
Proceedings of the National Academy of Sciences of the United States of America
(2020-12-01)
europepmc.orgPDF

2020-11

Gradient Starvation: A Learning Proclivity in Neural Networks
Mohammad Pezeshki, Sékou-Oumar Kaba, Yoshua Bengio, Aaron Courville, Doina Precup and Guillaume Lajoie
arXiv preprint arXiv:2011.09468
(2020-11-18)
ui.adsabs.harvard.eduPDF
Diversity-Enriched Option-Critic.
Anand Kamat and Doina Precup
arXiv preprint arXiv:2011.02565
(2020-11-04)
ui.adsabs.harvard.eduPDF
A Study of Policy Gradient on a Class of Exactly Solvable Models.
Gavin McCracken, Colin Daniels, Rosie Zhao, Anna Brandenberger, Prakash Panangaden and Doina Precup
arXiv preprint arXiv:2011.01859
(2020-11-03)
ui.adsabs.harvard.eduPDF

2020-10

Connecting Weighted Automata, Tensor Networks and Recurrent Neural Networks through Spectral Learning.
arXiv preprint arXiv:2010.10029
(2020-10-19)
ui.adsabs.harvard.eduPDF
A Fully Tensorized Recurrent Neural Network.
Charles C. Onu, Jacob E. Miller and Doina Precup
arXiv preprint arXiv:2010.04196
(2020-10-08)
ui.adsabs.harvard.eduPDF

2020-09

Keynote Lecture Building Knowledge For AI AgentsWith Reinforcement Learning
ICCP 2020
(2020-09-03)
doi.org

2020-08

Training Matters: Unlocking Potentials of Deeper Graph Convolutional Neural Networks. (arXiv:2008.08838v1 [cs.LG])
Sitao Luan, Mingde Zhao, Xiao-Wen Chang and Doina Precup
arXiv Computer Science
(2020-08-21)
Complete the Missing Half: Augmenting Aggregation Filtering with Diversification for Graph Convolutional Networks.
Sitao Luan, Mingde Zhao, Chenqing Hua, Xiao-Wen Chang and Doina Precup
arXiv preprint arXiv:2008.08844
(2020-08-20)
ui.adsabs.harvard.eduPDF
Training Matters: Unlocking Potentials of Deeper Graph Convolutional Neural Networks.
Sitao Luan, Mingde Zhao, Xiao-Wen Chang and Doina Precup
arXiv preprint arXiv:2008.08838
(2020-08-20)
ui.adsabs.harvard.eduPDF

2020-07

What can I do here? A Theory of Affordances in Reinforcement Learning
Khimya Khetarpal, Zafarali Ahmed, Gheorghe Comanici, David Abel and Doina Precup
Invariant Causal Prediction for Block MDPs
Clare Lyle, Amy Zhang, Angelos Filos, Shagun Sodhani, Marta Kwiatkowska, Yarin Gal, Doina Precup and Joelle Pineau
ICML 2020
(2020-07-12)
proceedings.mlr.pressPDF
Interference and Generalization in Temporal Difference Learning
Emmanuel Bengio, Joelle Pineau and Doina Precup
SVRG for Policy Evaluation with Fewer Gradient Evaluations
Zilun Peng, Ahmed Touati, Pascal Vincent and Doina Precup

2020-06

Learning to Prove from Synthetic Theorems.
Eser Aygün, Zafarali Ahmed, Ankit Anand, Vlad Firoiu, Xavier Glorot, Laurent Orseau, Doina Precup and Shibl Mourad
arXiv preprint arXiv:2006.11259
(2020-06-19)
ui.adsabs.harvard.eduPDF
A Brief Look at Generalization in Visual Meta-Reinforcement Learning
Safa Alver and Doina Precup
arXiv preprint arXiv:2006.07262
(2020-06-12)
ui.adsabs.harvard.eduPDF
A Distributional Analysis of Sampling-Based Reinforcement Learning Algorithms.
Value Preserving State-Action Abstractions
David Abel, Nate Umbanhowar, Khimya Khetarpal, Dilip Arumugam, Doina Precup and Michael L. Littman
AISTATS 2020
(2020-06-03)
proceedings.mlr.pressPDF

2020-05

META-Learning State-based Eligibility Traces for More Sample-Efficient Policy Evaluation
Mingde Zhao, Sitao Luan, Ian Porada, Xiao-Wen Chang and Doina Precup
Option-Critic in Cooperative Multi-agent Systems
Jhelum Chakravorty, Patrick Nadeem Ward, Julien Roy, Maxime Chevalier-Boisvert, Sumana Basu, Andrei Lupu and Doina Precup
AAMAS 2020
(2020-05-05)
ui.adsabs.harvard.edu
Gifting in Multi-Agent Reinforcement Learning
Andrei Lupu and Doina Precup
AAMAS 2020
(2020-05-05)
dl.acm.org

2020-04

Gifting in Multi-Agent Reinforcement Learning (Student Abstract)
Andrei Lupu and Doina Precup
AAAI 2020
(2020-04-03)
aaai.org
Algorithmic Improvements for Deep Reinforcement Learning Applied to Interactive Fiction.
Vishal Jain, William Fedus, Hugo Larochelle, Doina Precup and Marc G. Bellemare
Options of Interest: Temporal Abstraction with Interest Functions
Khimya Khetarpal, Martin Klissarov, Maxime Chevalier-Boisvert, Pierre-Luc Bacon and Doina Precup

2020-03

Invariant Causal Prediction for Block MDPs.
Amy Zhang, Clare Lyle, Shagun Sodhani, Angelos Filos, Marta Kwiatkowska, Joelle Pineau, Yarin Gal and Doina Precup
arXiv preprint arXiv:2003.06016
(2020-03-12)
ui.adsabs.harvard.eduPDF
Multiple Kernel Learning-Based Transfer Regression for Electric Load Forecasting
Di Wu, Boyu Wang, Doina Precup and Benoit Boulet
IEEE Transactions on Smart Grid
(2020-03-01)
ieeexplore.ieee.org

2020-02

Policy Evaluation Networks.
Jean Harb, Tom Schaul, Doina Precup and Pierre-Luc Bacon
arXiv preprint arXiv:2002.11833
(2020-02-26)
ui.adsabs.harvard.eduPDF
oIRL: Robust Adversarial Inverse Reinforcement Learning with Temporally Extended Actions.
David Venuto, Jhelum Chakravorty, Leonard Boussioux, Junhao Wang, Gavin McCracken and Doina Precup
arXiv preprint arXiv:2002.09043
(2020-02-20)
ui.adsabs.harvard.eduPDF
Representation of Reinforcement Learning Policies in Reproducing Kernel Hilbert Spaces.
Bogdan Mazoure, Thang Doan, Tianyu Li, Vladimir Makarenkov, Joelle Pineau, Doina Precup and Guillaume Rabusseau
arXiv preprint arXiv:2002.02863
(2020-02-07)
ui.adsabs.harvard.eduPDF
Provably efficient reconstruction of policy networks.
Bogdan Mazoure, Thang Doan, Tianyu Li, Vladimir Makarenkov, Joelle Pineau, Doina Precup and Guillaume Rabusseau
arXiv: Learning
(2020-02-07)
dblp.uni-trier.dePDF
Assessment of Extubation Readiness Using Spontaneous Breathing Trials in Extremely Preterm Neonates.
Wissam Shalish, Lara Kanbar, Lajos Kovacs, Sanjay Chawla, Martin Keszler, Smita Rao, Samantha Latremouille, Doina Precup, Karen Brown, Robert E Kearney and Guilherme M Sant'Anna
JAMA Pediatrics
(2020-02-01)
europepmc.orgPDF

2020-01

Exploring Bayesian Deep Learning Uncertainty Measures for Segmentation of New Lesions in Longitudinal MRIs
Nazanin Mohammadi Sepahvand, Raghav Mehta, Douglas Lorne Arnold, Doina Precup and Tal Arbel
(venue unknown)
(2020-01-25)
openreview.netPDF
On Efficiency in Hierarchical Reinforcement Learning
Zheng Wen, Doina Precup, Morteza Ibrahimi, Andre Barreto, Benjamin Van Roy and Satinder Singh
NEURIPS 2020
(2020-01-01)
papers.nips.ccPDF
Forethought and Hindsight in Credit Assignment
Veronica Chelu, Doina Precup and Hado P. van Hasselt
Reward Propagation Using Graph Convolutional Networks
Martin Klissarov and Doina Precup
Learning to cooperate: Emergent communication in multi-agent navigation.
Ivana Kajic, Eser Aygün and Doina Precup
Exploring uncertainty measures in deep networks for Multiple sclerosis lesion detection and segmentation.
Tanya Nair, Doina Precup, Douglas L. Arnold and Tal Arbel
Value-driven Hindsight Modelling
Arthur Guez, Fabio Viola, Theophane Weber, Lars Buesing, Steven Kapturowski, Doina Precup, David Silver and Nicolas Heess
An Equivalence between Loss Functions and Non-Uniform Sampling in Experience Replay
Scott Fujimoto, David Meger and Doina Precup

2019-12

Shaping representations through communication: community size effect in artificial learning systems
Olivier Tieleman, Angeliki Lazaridou, Shibl Mourad, Charles Blundell and Doina Precup
arXiv preprint arXiv:1912.06208
(2019-12-12)
ui.adsabs.harvard.eduPDF
Doubly Robust Off-Policy Actor-Critic Algorithms for Reinforcement Learning.
Riashat Islam, Raihan Seraj, Samin Yeasar Arnob and Doina Precup
arXiv preprint arXiv:1912.05109
(2019-12-11)
ui.adsabs.harvard.eduPDF
Marginalized State Distribution Entropy Regularization in Policy Optimization
Riashat Islam, Zafarali Ahmed and Doina Precup
arXiv preprint arXiv:1912.05128
(2019-12-11)
ui.adsabs.harvard.eduPDF
Entropy Regularization with Discounted Future State Distribution in Policy Gradient Methods
Riashat Islam, Raihan Seraj, Pierre-Luc Bacon and Doina Precup
arXiv preprint arXiv:1912.05104
(2019-12-11)
ui.adsabs.harvard.eduPDF
Hindsight Credit Assignment
Anna Harutyunyan, Will Dabney, Thomas Mesnard, Mohammad Gheshlaghi Azar, Bilal Piot, Nicolas Heess, Hado P. van Hasselt, Gregory Wayne, Satinder Singh, Doina Precup and Remi Munos

2019-11

Option-Critic in Cooperative Multi-agent Systems
Jhelum Chakravorty, Nadeem Ward, Julien Roy, Maxime Chevalier-Boisvert, Sumana Basu, Andrei Lupu and Doina Precup
arXiv preprint arXiv:1911.12825
(2019-11-28)
arxiv.orgPDF
Sidewalk Environment for Visual Navigation
Martin Weiss, Simon Chamorro, Roger Girgis, Margaux Luck, Samira Kahou, Joseph Cohen, Derek Nowrouzezahrai, Doina Precup, Florian Golemo and Chris Pal
(venue unknown)
(2019-11-07)
zenodo.org
Efficient Planning under Partial Observability with Unnormalized Q Functions and Spectral Learning
Tianyu Li, Bogdan Mazoure, Doina Precup and Guillaume Rabusseau

2019-10

Navigation Agents for the Visually Impaired: A Sidewalk Simulator and Experiments
Martin Weiss, Simon Chamorro, Roger Girgis, Margaux Luck, Samira Ebrahimi Kahou, Joseph Paul Cohen, Derek Nowrouzezahrai, Doina Precup, Florian Golemo and Chris Pal
Actor Critic with Differentially Private Critic.
Jonathan Lebensold, William L. Hamilton, Borja Balle and Doina Precup
arXiv preprint arXiv:1910.05876
(2019-10-14)
ui.adsabs.harvard.eduPDF
Improving Pathological Structure Segmentation via Transfer Learning Across Diseases
Barleen Kaur, Paul Lemaître, Raghav Mehta, Nazanin Mohammadi Sepahvand, Doina Precup, Douglas L. Arnold and Tal Arbel
DART/MIL3ID@MICCAI
(2019-10-13)
link.springer.com
Early Prediction of Alzheimer's Disease Progression Using Variational Autoencoders.
Sumana Basu, Konrad Wagstyl, Azar Zandifar, D. Louis Collins, Adriana Romero and Doina Precup
MICCAI 2019
(2019-10-13)
doi.org
Recurrent Value Functions. (arXiv:1905.09562v1 [cs.LG])
Pierre Thodoroff, Nishanth Anand, Lucas Caccia, Doina Precup and Joelle Pineau
arXiv Computer Science
(2019-10-07)
Singular value automata and approximate minimization
Mathematical Structures in Computer Science
(2019-10-01)
www.cambridge.orgPDF
Augmenting learning using symmetry in a biologically-inspired domain
Shruti Mishra, Abbas Abdolmaleki, Arthur Guez, Piotr Trochim and Doina Precup
arXiv: Learning
(2019-10-01)
ui.adsabs.harvard.eduPDF

2019-09

Assessing Generalization in TD methods for Deep Reinforcement Learning
Emmanuel Bengio, Doina Precup and Joelle Pineau
(venue unknown)
(2019-09-25)
openreview.netPDF
Avoidance Learning Using Observational Reinforcement Learning
David Venuto, Leonard Boussioux, Junhao Wang, Rola Dali, Jhelum Chakravorty, Yoshua Bengio and Doina Precup
arXiv preprint arXiv:1909.11228
(2019-09-24)
ui.adsabs.harvard.eduPDF
Revisit Policy Optimization in Matrix Form.
Sitao Luan, Xiao-Wen Chang and Doina Precup
arXiv preprint arXiv:1909.09186
(2019-09-19)
ui.adsabs.harvard.eduPDF

2019-07

An Empirical Study of Batch Normalization and Group Normalization in Conditional Computation.
Vincent Michalski, Vikram Voleti, Samira Ebrahimi Kahou, Anthony Ortiz, Pascal Vincent, Chris Pal and Doina Precup
arXiv: Computer Vision and Pattern Recognition
(2019-07-31)
ui.adsabs.harvard.eduPDF
Learning Options with Interest Functions
Khimya Khetarpal and Doina Precup
AAAI 2019
(2019-07-17)
www.aaai.org
Leveraging Observations in Bandits: Between Risks and Benefits
Andrei Lupu, Audrey Durand and Doina Precup
AAAI 2019
(2019-07-17)
aimagazine.org
Self-supervised Learning of Distance Functions for Goal-Conditioned Reinforcement Learning
Srinivas Venkattaramanujam, Eric Crawford, Thang Doan and Doina Precup
arXiv preprint arXiv:1907.02998
(2019-07-05)
ui.adsabs.harvard.eduPDF

2019-06

Neural Transfer Learning for Cry-based Diagnosis of Perinatal Asphyxia
Charles C. Onu, Jonathan Lebensold, William L. Hamilton and Doina Precup

2019-05

Singular value automata and approximate minimization
Mathematical Structures in Computer Science
(2019-05-27)
ui.adsabs.harvard.eduPDF
Off-Policy Deep Reinforcement Learning without Exploration
Scott Fujimoto, David Meger and Doina Precup
Per-Decision Option Discounting
Anna Harutyunyan, Peter Vrancx, Philippe Hamel, Ann Nowe and Doina Precup
ICML 2019
(2019-05-24)
proceedings.mlr.pressPDF
Prediction of Disease Progression in Multiple Sclerosis Patients using Deep Learning Analysis of MRI Data
Adrian Tousignant, Paul Lemaître, Doina Precup, Douglas L. Arnold and Tal Arbel
International Conference on Medical Imaging with Deep Learning
(2019-05-24)
proceedings.mlr.pressPDF
Building Knowledge for AI Agents with Reinforcement Learning
AAMAS 2019
(2019-05-08)
dblp.uni-trier.de
Uncertainty Aware Learning from Demonstrations in Multiple Contexts using Bayesian Neural Networks
Sanjay Thakur, Herke van Hoof, Juan Camilo Gamboa Higuera, Doina Precup and David Meger

2019-04

Faster and More Accurate Trace-based Policy Evaluation via Overall Target Error Meta-Optimization
Mingde Zhao, Ian Porada, Sitao Luan, Xiaowen Chang and Doina Precup
(venue unknown)
(2019-04-25)
arxiv.org
META-Learning State-based {\lambda} for More Sample-Efficient Policy Evaluation
Mingde Zhao, Sitao Luan, Ian Porada, Xiao-Wen Chang and Doina Precup
arXiv: Learning
(2019-04-25)
arxiv.orgPDF
The Termination Critic
Anna Harutyunyan, Will Dabney, Diana Borsa, Nicolas Heess, Remi Munos and Doina Precup

2019-03

Learning proposals for sequential importance samplers using reinforced variational inference.
Zafarali Ahmed, Arjun Karuvally, Doina Precup and Simon Gravel
ICLR 2019
(2019-03-16)
dblp.uni-trier.dePDF

2019-02

The Impact of Time Interval between Extubation and Reintubation on Death or Bronchopulmonary Dysplasia in Extremely Preterm Infants.
Wissam Shalish, Lara Kanbar, Lajos Kovacs, Sanjay Chawla, Martin Keszler, Smita Rao, Bogdan Panaitescu, Alyse Laliberte, Doina Precup, Karen Brown, Robert E. Kearney and Guilherme M. Sant'Anna
The Journal of Pediatrics
(2019-02-01)
www.sciencedirect.com

2019-01

The Option Keyboard: Combining Skills in Reinforcement Learning
André Barreto, Diana Borsa, Shaobo Hou, Gheorghe Comanici, Eser Aygün, Philippe Hamel, Daniel Toyama, Jonathan J. Hunt, Shibl Mourad, David Silver and Doina Precup
Break the Ceiling: Stronger Multi-scale Deep Graph Convolutional Networks
Sitao Luan, Mingde Zhao, Xiao-Wen Chang and Doina Precup
Community size effect in artificial learning systems.
Olivier Tieleman, Angeliki Lazaridou, Shibl Mourad, Charles Blundell and Doina Precup
ViGIL@NeurIPS
(2019-01-01)
dblp.uni-trier.de
Learning Reliable Policies in the Bandit Setting with Application to Adaptive Clinical Trials.
Hossein Aboutalebi, Doina Precup and Tibor Schuster
IJCAI 2019
(2019-01-01)
dblp.uni-trier.dePDF
Temporally Extended Metrics for Markov Decision Processes.
AAAI 2019
(2019-01-01)
dblp.uni-trier.dePDF
Learning Modular Safe Policies in the Bandit Setting with Application to Adaptive Clinical Trials.
Hossein Aboutalebi, Doina Precup and Tibor Schuster

2018-12

Clustering-Oriented Representation Learning with Attractive-Repulsive Loss.
Kian Kenyon-Dean, Andre Cianflone, Lucas Page-Caccia, Guillaume Rabusseau, Jackie Chi Kit Cheung and Doina Precup
arXiv: Learning
(2018-12-18)
ui.adsabs.harvard.eduPDF
Prediction of Progression in Multiple Sclerosis Patients
Adrian Tousignant, Paul Lemaître, Doina Precup, Douglas Arnold and Tal Arbel
International Conference on Medical Imaging with Deep Learning -- Full Paper Track
(2018-12-13)
openreview.netPDF

2018-11

Environments for Lifelong Reinforcement Learning.
Khimya Khetarpal, Shagun Sodhani, Sarath Chandar and Doina Precup
arXiv preprint arXiv:1811.10732
(2018-11-26)
dblp.uni-trier.dePDF
The Barbados 2018 List of Open Issues in Continual Learning.
Tom Schaul, Hado van Hasselt, Joseph Modayil, Martha White, Adam White, Pierre-Luc Bacon, Jean Harb, Shibl Mourad, Marc G. Bellemare and Doina Precup
arXiv preprint arXiv:1811.07004
(2018-11-16)
ui.adsabs.harvard.eduPDF
Temporal Regularization in Markov Decision Process
arXiv: Learning
(2018-11-01)
ui.adsabs.harvard.eduPDF

2018-09

Where Off-Policy Deep Reinforcement Learning Fails
Scott Fujimoto, David Meger and Doina Precup
(venue unknown)
(2018-09-27)
openreview.netPDF
Shaping representations through communication
Olivier Tieleman, Angeliki Lazaridou, Shibl Mourad, Charles Blundell and Doina Precup
(venue unknown)
(2018-09-27)
openreview.netPDF

2018-08

A Semi-Markov Chain Approach to Modeling Respiratory Patterns Prior to Extubation in Preterm Infants.
Charles C. Onu, Lara J. Kanbar, Wissam Shalish, Karen A. Brown, Guilherme M. Sant'Anna, Robert E. Kearney and Doina Precup
arXiv: Signal Processing
(2018-08-24)
arxiv.orgPDF
Predicting Extubation Readiness in Extreme Preterm Infants based on Patterns of Breathing
Charles C. Onu, Lara J. Kanbar, Wissam Shalish, Karen A. Brown, Guilherme M. Sant'Anna, Robert E. Kearney and Doina Precup
arXiv preprint arXiv:1808.07991
(2018-08-24)
ui.adsabs.harvard.eduPDF
Undersampling and Bagging of Decision Trees in the Analysis of Cardiorespiratory Behavior for the Prediction of Extubation Readiness in Extremely Preterm Infants
Lara J. Kanbar, Charles C. Onu, Wissam Shalish, Karen A. Brown, Guilherme M. Sant'Anna, Robert E. Kearney and Doina Precup
arXiv preprint arXiv:1808.07992
(2018-08-24)
ui.adsabs.harvard.eduPDF

2018-07

Attend Before you Act: Leveraging human visual attention for continual learning.
Khimya Khetarpal and Doina Precup
arXiv preprint arXiv:1807.09664
(2018-07-25)
ui.adsabs.harvard.eduPDF
Leveraging Observational Learning for Exploration in Bandits
Andrei Lupu, Audrey Durand and Doina Precup
AAMAS 2018
(2018-07-09)
dblp.uni-trier.de
Eligibility Traces for Options
Ayush Jain and Doina Precup
AAMAS 2018
(2018-07-09)
dblp.uni-trier.de
Connecting Weighted Automata and Recurrent Neural Networks through Spectral Learning
Convergent Tree Backup and Retrace with Function Approximation
ICML 2018
(2018-07-03)
proceedings.mlr.pressPDF
Undersampling and Bagging of Decision Trees in the Analysis of Cardiorespiratory Behavior for the Prediction of Extubation Readiness in Extremely Preterm Infants
Lara J. Kanbar, Charles C. Onu, Wissam Shalish, Karen A. Brown, Guilherme M. SantrAnna, Doina Precup and Robert E. Kearney
EMBC 2018
(2018-07-01)
doi.orgPDF

2018-06

Diffusion-Based Approximate Value Functions
Martin Klissarov and Doina Precup
(venue unknown)
(2018-06-15)
openreview.netPDF
Resolving Event Coreference with Supervised Representation Learning and Clustering-Oriented Regularization.
Kian Kenyon-Dean, Jackie Chi Kit Cheung and Doina Precup

2018-05

Dyna Planning using a Feature Based Generative Model.
Ryan Faulkner and Doina Precup
arXiv preprint arXiv:1805.10129
(2018-05-23)
ui.adsabs.harvard.eduPDF
Learning Safe Policies with Expert Guidance
Jessie Huang, Fa Wu, Doina Precup and Yang Cai

2018-03

Nonlinear Weighted Finite Automata
AISTATS 2018
(2018-03-31)
dblp.uni-trier.dePDF
Constructing Temporal Abstractions Autonomously in Reinforcement Learning
Ai Magazine
(2018-03-27)
doi.org

2018-02

Disentangling the independently controllable factors of variation by interacting with the world
Valentin Thomas, Emmanuel Bengio, William Fedus, Jules Pondard, Philippe Beaudoin, Hugo Larochelle, Joelle Pineau, Doina Precup and Yoshua Bengio
arXiv preprint arXiv:1802.09484
(2018-02-26)
ui.adsabs.harvard.eduPDF
Learning with Options that Terminate Off-Policy
Anna Harutyunyan, Peter Vrancx, Pierre-luc Bacon, Doina Precup and Ann Nowe
AAAI 2018
(2018-02-07)
ui.adsabs.harvard.eduPDF
When Waiting is not an Option : Learning Options with a Deliberation Cost
Jean Harb, Pierre-luc Bacon, Martin Klissarov and Doina Precup
AAAI 2018
(2018-02-07)
dblp.uni-trier.dePDF
OptionGAN: Learning Joint Reward-Policy Options using Generative Adversarial Inverse Reinforcement Learning
Peter Henderson, Wei-Di Chang, Pierre-luc Bacon, David Meger, Joelle Pineau and Doina Precup
AAAI 2018
(2018-02-07)
ui.adsabs.harvard.eduPDF
Deep Reinforcement Learning that Matters
Peter Henderson, Riashat Islam, Joelle Pineau, David Meger, Doina Precup and Philip Bachman
AAAI 2018
(2018-02-07)
ui.adsabs.harvard.eduPDF

2018-01

Patterns of reintubation in extremely preterm infants: a longitudinal cohort study
Wissam Shalish, Lara Kanbar, Martin Keszler, Sanjay Chawla, Lajos Kovacs, Smita Rao, Bogdan A Panaitescu, Alyse Laliberte, Doina Precup, Karen Brown, Robert E Kearney and Guilherme M Sant'Anna
Pediatric Research
(2018-01-31)
www.nature.com
Learning Predictive State Representations From Non-Uniform Sampling.
Yuri Grinberg, Hossein Aboutalebi, Melanie Lyman-Abramovitch, Borja Balle and Doina Precup
AAAI 2018
(2018-01-01)
dblp.uni-trier.de
Imitation Upper Confidence Bound for Bandits on a Graph.
Andrei Lupu and Doina Precup
AAAI 2018
(2018-01-01)
dblp.uni-trier.de
Learning Robust Options.
Daniel J. Mankowitz, Timothy A. Mann, Pierre-Luc Bacon, Doina Precup and Shie Mannor
Temporal Regularization for Markov Decision Process
NEURIPS 2018
(2018-01-01)
papers.nips.ccPDF

Publications collected and formatted using Paperoni