Doina Precup

Membre Académique Principal
Doina Precup
Professeur agrégé, Professeure agrégée, McGill University, DeepMind
Doina Precup

Doina Precup enseigne à l’Université McGill tout en menant des recherches fondamentales sur l’apprentissage par renforcement, notamment sur les applications de l’IA dans des domaines ayant un impact social, tels que les soins de santé. Elle s’intéresse à la prise de décision de la machine dans des situations d’incertitude élevée.

Elle est membre de l’Institut canadien de recherches avancées, membre de l’Association pour l’avancement de l’intelligence artificielle et elle dirige également le bureau montréalais de Deepmind.

Spécialiste dans les domaines suivants :  intelligence artificielle, apprentissage machine, apprentissage par renforcement, raisonnement et planification sous incertitude, applications.

Publications

2020-11

Gradient Starvation: A Learning Proclivity in Neural Networks
Mohammad Pezeshki, Sékou-Oumar Kaba, Yoshua Bengio, Aaron Courville, Doina Precup and Guillaume Lajoie
arXiv preprint arXiv:2011.09468
(2020-11-18)
arxiv.orgPDF
Diversity-Enriched Option-Critic.
Anand Kamat and Doina Precup
arXiv preprint arXiv:2011.02565
(2020-11-04)
arxiv.orgPDF
A Study of Policy Gradient on a Class of Exactly Solvable Models
Gavin McCracken, Colin Daniels, Rosie Zhao, Anna Brandenberger, Prakash Panangaden and Doina Precup
arXiv preprint arXiv:2011.01859
(2020-11-03)
arxiv.orgPDF

2020-10

Forethought and Hindsight in Credit Assignment.
Veronica Chelu, Doina Precup and Hado van Hasselt
arXiv: Learning
(2020-10-26)
arxiv.orgPDF
Connecting Weighted Automata, Tensor Networks and Recurrent Neural Networks through Spectral Learning
arXiv preprint arXiv:2010.10029
(2020-10-19)
arxiv.orgPDF
Representation of Reinforcement Learning Policies in Reproducing Kernel Hilbert Spaces.
Bogdan Mazoure, Thang Doan, Tianyu Li, Vladimir Makarenkov, Joelle Pineau, Doina Precup and Guillaume Rabusseau
arXiv: Learning
(2020-10-15)
arxiv.orgPDF
A Fully Tensorized Recurrent Neural Network
Charles C. Onu, Jacob E. Miller and Doina Precup
arXiv preprint arXiv:2010.04196
(2020-10-08)
arxiv.orgPDF
Reward Propagation Using Graph Convolutional Networks
Martin Klissarov and Doina Precup
arXiv preprint arXiv:2010.02474
(2020-10-06)
arxiv.orgPDF

2020-09

Keynote Lecture Building Knowledge For AI AgentsWith Reinforcement Learning
ICCP 2020
(2020-09-03)
ieeexplore.ieee.org

2020-08

Complete the Missing Half: Augmenting Aggregation Filtering with Diversification for Graph Convolutional Networks
Sitao Luan, Mingde Zhao, Chenqing Hua, Xiao-Wen Chang and Doina Precup
arXiv preprint arXiv:2008.08844
(2020-08-20)
arxiv.orgPDF
Training Matters: Unlocking Potentials of Deeper Graph Convolutional Neural Networks.
Sitao Luan, Mingde Zhao, Xiao-Wen Chang and Doina Precup
arXiv preprint arXiv:2008.08838
(2020-08-20)
arxiv.orgPDF
Fast reinforcement learning with generalized policy updates
André Barreto, Shaobo Hou, Diana Borsa, David Silver and Doina Precup
Proceedings of the National Academy of Sciences of the United States of America
(2020-08-17)
syndication.highwire.org

2020-07

What can I do here? A Theory of Affordances in Reinforcement Learning
Khimya Khetarpal, Zafarali Ahmed, Gheorghe Comanici, David Abel and Doina Precup
An Equivalence between Loss Functions and Non-Uniform Sampling in Experience Replay.
Scott Fujimoto, David Meger and Doina Precup
arXiv preprint arXiv:2007.06049
(2020-07-12)
ui.adsabs.harvard.eduPDF
Invariant Causal Prediction for Block MDPs
Clare Lyle, Amy Zhang, Angelos Filos, Shagun Sodhani, Marta Kwiatkowska, Yarin Gal, Doina Precup and Joelle Pineau
ICML 2020
(2020-07-12)
icml.cc
Interference and Generalization in Temporal Difference Learning
Emmanuel Bengio, Joelle Pineau and Doina Precup
SVRG for Policy Evaluation with Fewer Gradient Evaluations
Zilun Peng, Ahmed Touati, Pascal Vincent and Doina Precup

2020-06

Learning to Prove from Synthetic Theorems.
Eser Aygün, Zafarali Ahmed, Ankit Anand, Vlad Firoiu, Xavier Glorot, Laurent Orseau, Doina Precup and Shibl Mourad
arXiv preprint arXiv:2006.11259
(2020-06-19)
dblp.uni-trier.dePDF
A Brief Look at Generalization in Visual Meta-Reinforcement Learning
Safa Alver and Doina Precup
arXiv preprint arXiv:2006.07262
(2020-06-12)
dblp.uni-trier.dePDF

2020-04

Gifting in Multi-Agent Reinforcement Learning (Student Abstract).
Andrei Lupu and Doina Precup
AAAI 2020
(2020-04-03)
aaai.org
Learning to cooperate: Emergent communication in multi-agent navigation.
Ivana Kajic, Eser Aygün and Doina Precup
arXiv preprint arXiv:2004.01097
(2020-04-02)
dblp.uni-trier.dePDF

2020-03

Invariant Causal Prediction for Block MDPs.
Amy Zhang, Clare Lyle, Shagun Sodhani, Angelos Filos, Marta Kwiatkowska, Joelle Pineau, Yarin Gal and Doina Precup
arXiv preprint arXiv:2003.06016
(2020-03-12)
arxiv.orgPDF
Multiple Kernel Learning-Based Transfer Regression for Electric Load Forecasting
Di Wu, Boyu Wang, Doina Precup and Benoit Boulet
IEEE Transactions on Smart Grid
(2020-03-01)
dblp.uni-trier.de

2020-02

Policy Evaluation Networks.
Jean Harb, Tom Schaul, Doina Precup and Pierre-Luc Bacon
arXiv preprint arXiv:2002.11833
(2020-02-26)
dblp.uni-trier.dePDF
oIRL: Robust Adversarial Inverse Reinforcement Learning with Temporally Extended Actions.
David Venuto, Jhelum Chakravorty, Leonard Boussioux, Junhao Wang, Gavin McCracken and Doina Precup
arXiv preprint arXiv:2002.09043
(2020-02-20)
ui.adsabs.harvard.eduPDF
Algorithmic Improvements for Deep Reinforcement Learning applied to Interactive Fiction
Vishal Jain, Liam Fedus, Hugo Larochelle, Doina Precup and Marc G. Bellemare
AAAI 2020
(2020-02-07)
aaai.orgPDF
Options of Interest: Temporal Abstraction with Interest Functions
Khimya Khetarpal, Martin Klissarov, Maxime Chevalier-Boisvert, Pierre-Luc Bacon and Doina Precup
Provably efficient reconstruction of policy networks.
Bogdan Mazoure, Thang Doan, Tianyu Li, Vladimir Makarenkov, Joelle Pineau, Doina Precup and Guillaume Rabusseau
arXiv preprint arXiv:2002.02863
(2020-02-07)
ui.adsabs.harvard.eduPDF
Assessment of Extubation Readiness Using Spontaneous Breathing Trials in Extremely Preterm Neonates.
Wissam Shalish, Lara Kanbar, Lajos Kovacs, Sanjay Chawla, Martin Keszler, Smita Rao, Samantha Latremouille, Doina Precup, Karen Brown, Robert E. Kearney and Guilherme M. Sant’Anna
JAMA Pediatrics
(2020-02-01)
jamanetwork.comPDF

2020-01

Forethought and Hindsight in Credit Assignment
Veronica Chelu, Doina Precup and Hado P. van Hasselt
NEURIPS 2020
(2020-01-01)
papers.nips.cc
An Equivalence between Loss Functions and Non-Uniform Sampling in Experience Replay
Scott Fujimoto, David Meger and Doina Precup
NEURIPS 2020
(2020-01-01)
papers.nips.cc
Reward Propagation Using Graph Convolutional Networks
Martin Klissarov and Doina Precup
NEURIPS 2020
(2020-01-01)
proceedings.neurips.cc
On Efficiency in Hierarchical Reinforcement Learning
Zheng Wen, Doina Precup, Morteza Ibrahimi, Andre Barreto, Benjamin Van Roy and Satinder Singh
NEURIPS 2020
(2020-01-01)
papers.nips.cc
Value-driven Hindsight Modelling
Arthur Guez, Fabio Viola, Theophane Weber, Lars Buesing, Steven Kapturowski, Doina Precup, David Silver and Nicolas Heess
NEURIPS 2020
(2020-01-01)
papers.nips.ccPDF
META-Learning State-based Eligibility Traces for More Sample-Efficient Policy Evaluation.
Mingde Zhao, Sitao Luan, Ian Porada, Xiao-Wen Chang and Doina Precup
Autonomous Agents and Multi-Agent Systems
(2020-01-01)
dblp.uni-trier.de[LATEST on arXiv: Learning (2020-05-18)]
Exploring uncertainty measures in deep networks for Multiple sclerosis lesion detection and segmentation.
Tanya Nair, Doina Precup, Douglas L. Arnold and Tal Arbel
A Distributional Analysis of Sampling-Based Reinforcement Learning Algorithms.
Gifting in Multi-Agent Reinforcement Learning.
Andrei Lupu and Doina Precup
Autonomous Agents and Multi-Agent Systems
(2020-01-01)
dblp.uni-trier.de
Value Preserving State-Action Abstractions.
David Abel, Nate Umbanhowar, Khimya Khetarpal, Dilip Arumugam, Doina Precup and Michael L. Littman
AISTATS 2020
(2020-01-01)
proceedings.mlr.press

2019-12

Shaping representations through communication: community size effect in artificial learning systems
Olivier Tieleman, Angeliki Lazaridou, Shibl Mourad, Charles Blundell and Doina Precup
arXiv preprint arXiv:1912.06208
(2019-12-12)
dblp.uni-trier.dePDF
Doubly Robust Off-Policy Actor-Critic Algorithms for Reinforcement Learning.
Riashat Islam, Raihan Seraj, Samin Yeasar Arnob and Doina Precup
arXiv preprint arXiv:1912.05109
(2019-12-11)
dblp.uni-trier.dePDF
Marginalized State Distribution Entropy Regularization in Policy Optimization
Riashat Islam, Zafarali Ahmed and Doina Precup
arXiv preprint arXiv:1912.05128
(2019-12-11)
ui.adsabs.harvard.eduPDF
Entropy Regularization with Discounted Future State Distribution in Policy Gradient Methods
Riashat Islam, Raihan Seraj, Pierre-Luc Bacon and Doina Precup
arXiv preprint arXiv:1912.05104
(2019-12-11)
ui.adsabs.harvard.eduPDF
The Option Keyboard: Combining Skills in Reinforcement Learning
Andre Barreto, Diana Borsa, Shaobo Hou, Gheorghe Comanici, Eser Aygun, Philippe Hamel, Daniel Toyama, Jonathan J Hunt, Shibl Mourad, David Silver and Doina Precup
NEURIPS 2019
(2019-12-08)
papers.nips.ccPDF
Break the Ceiling: Stronger Multi-scale Deep Graph Convolutional Networks
Sitao Luan, Mingde Zhao, Xiao-Wen Chang and Doina Precup
Hindsight Credit Assignment
Anna Harutyunyan, Will Dabney, Thomas Mesnard, Mohammad Gheshlaghi Azar, Bilal Piot, Nicolas Heess, Hado van Hasselt, Gregory Wayne, Satinder Singh, Doina Precup and Remi Munos

2019-11

Option-critic in cooperative multi-agent systems
Jhelum Chakravorty, Nadeem Ward, Julien Roy, Maxime Chevalier-Boisvert, Sumana Basu, Andrei Lupu and Doina Precup
arXiv preprint arXiv:1911.12825
(2019-11-28)
export.arxiv.orgPDF
Algorithmic Improvements for Deep Reinforcement Learning applied to Interactive Fiction
Vishal Jain, William Fedus, Hugo Larochelle, Doina Precup and Marc G. Bellemare
arXiv preprint arXiv:1911.12511
(2019-11-28)
arxiv.orgPDF
Option-critic in cooperative multi-agent systems
Jhelum Chakravorty, Patrick Nadeem Ward, Julien Roy, Maxime Chevalier-Boisvert, Sumana Basu, Andrei Lupu and Doina Precup
Autonomous Agents and Multi-Agent Systems
(2019-11-01)
dl.acm.org
Efficient Planning under Partial Observability with Unnormalized Q Functions and Spectral Learning
Tianyu Li, Bogdan Mazoure, Doina Precup and Guillaume Rabusseau

2019-10

Navigation Agents for the Visually Impaired: A Sidewalk Simulator and Experiments
Martin Weiss, Simon Chamorro, Roger Girgis, Margaux Luck, Samira Ebrahimi Kahou, Joseph Paul Cohen, Derek Nowrouzezahrai, Doina Precup, Florian Golemo and Chris Pal
Actor Critic with Differentially Private Critic.
Jonathan Lebensold, William L. Hamilton, Borja Balle and Doina Precup
arXiv preprint arXiv:1910.05876
(2019-10-14)
ui.adsabs.harvard.eduPDF
Improving Pathological Structure Segmentation via Transfer Learning Across Diseases
Barleen Kaur, Paul Lemaître, Raghav Mehta, Nazanin Mohammadi Sepahvand, Doina Precup, Douglas L. Arnold and Tal Arbel
DART/MIL3ID@MICCAI
(2019-10-13)
link.springer.com
Early Prediction of Alzheimer's Disease Progression Using Variational Autoencoders.
Sumana Basu, Konrad Wagstyl, Azar Zandifar, D. Louis Collins, Adriana Romero and Doina Precup
MICCAI 2019
(2019-10-13)
doi.org
Singular value automata and approximate minimization
Mathematical Structures in Computer Science
(2019-10-01)
ui.adsabs.harvard.eduPDF
Augmenting learning using symmetry in a biologically-inspired domain
Shruti Mishra, Abbas Abdolmaleki, Arthur Guez, Piotr Trochim and Doina Precup
arXiv preprint arXiv:1910.00528
(2019-10-01)
ui.adsabs.harvard.eduPDF

2019-09

Value-driven Hindsight Modelling.
Arthur Guez, Fabio Viola, Theophane Weber, Lars Buesing, Steven Kapturowski, Doina Precup, David Silver and Nicolas Heess
arXiv preprint arXiv:2002.08329
(2019-09-25)
dblp.uni-trier.dePDF
Assessing Generalization in TD methods for Deep Reinforcement Learning
Emmanuel Bengio, Doina Precup and Joelle Pineau
(venue unknown)
(2019-09-25)
openreview.netPDF
Avoidance Learning Using Observational Reinforcement Learning
David Venuto, Leonard Boussioux, Junhao Wang, Rola Dali, Jhelum Chakravorty, Yoshua Bengio and Doina Precup
arXiv preprint arXiv:1909.11228
(2019-09-24)
ui.adsabs.harvard.eduPDF
Revisit Policy Optimization in Matrix Form.
Sitao Luan, Xiao-Wen Chang and Doina Precup
arXiv preprint arXiv:1909.09186
(2019-09-19)
dblp.uni-trier.dePDF

2019-07

An Empirical Study of Batch Normalization and Group Normalization in Conditional Computation.
Vincent Michalski, Vikram Voleti, Samira Ebrahimi Kahou, Anthony Ortiz, Pascal Vincent, Chris Pal and Doina Precup
arXiv preprint arXiv:1908.00061
(2019-07-31)
dblp.uni-trier.dePDF
Learning Options with Interest Functions
Khimya Khetarpal and Doina Precup
AAAI 2019
(2019-07-17)
www.aaai.orgPDF
Self-supervised Learning of Distance Functions for Goal-Conditioned Reinforcement Learning
Srinivas Venkattaramanujam, Eric Crawford, Thang Doan and Doina Precup
arXiv preprint arXiv:1907.02998
(2019-07-05)
ui.adsabs.harvard.eduPDF

2019-06

Neural Transfer Learning for Cry-based Diagnosis of Perinatal Asphyxia
Charles C. Onu, Jonathan Lebensold, William L. Hamilton and Doina Precup
Off-Policy Deep Reinforcement Learning without Exploration
Scott Fujimoto, David Meger and Doina Precup
Per-Decision Option Discounting
Anna Harutyunyan, Peter Vrancx, Philippe Hamel, Ann Nowe and Doina Precup
ICML 2019
(2019-06-09)
proceedings.mlr.pressPDF

2019-05

Singular value automata and approximate minimization
Mathematical Structures in Computer Science
(2019-05-27)
ui.adsabs.harvard.eduPDF
Prediction of Disease Progression in Multiple Sclerosis Patients using Deep Learning Analysis of MRI Data
Adrian Tousignant, Paul Lemaître, Doina Precup, Douglas L. Arnold and Tal Arbel
International Conference on Medical Imaging with Deep Learning
(2019-05-24)
proceedings.mlr.pressPDF
Recurrent Value Functions.
Pierre Thodoroff, Nishanth Anand, Lucas Caccia, Doina Precup and Joelle Pineau
arXiv preprint arXiv:1905.09562
(2019-05-23)
dblp.uni-trier.dePDF
Building Knowledge for AI Agents with Reinforcement Learning
AAMAS 2019
(2019-05-08)
dblp.uni-trier.de
Uncertainty Aware Learning from Demonstrations in Multiple Contexts using Bayesian Neural Networks
Sanjay Thakur, Herke van Hoof, Juan Camilo Gamboa Higuera, Doina Precup and David Meger

2019-04

Faster and More Accurate Trace-based Policy Evaluation via Overall Target Error Meta-Optimization
Mingde Zhao, Ian Porada, Sitao Luan, Xiaowen Chang and Doina Precup
(venue unknown)
(2019-04-25)
arxiv.org
META-Learning State-based {\lambda} for More Sample-Efficient Policy Evaluation
Mingde Zhao, Sitao Luan, Ian Porada, Xiao-Wen Chang and Doina Precup
arXiv preprint arXiv:1904.11439
(2019-04-25)
ui.adsabs.harvard.eduPDF
The Termination Critic
Anna Harutyunyan, Will Dabney, Diana Borsa, Nicolas Heess, Remi Munos and Doina Precup

2019-03

Learning proposals for sequential importance samplers using reinforced variational inference.
Zafarali Ahmed, Arjun Karuvally, Doina Precup and Simon Gravel
ICLR 2019
(2019-03-16)
dblp.uni-trier.dePDF

2019-02

The Impact of Time Interval between Extubation and Reintubation on Death or Bronchopulmonary Dysplasia in Extremely Preterm Infants.
Wissam Shalish, Lara Kanbar, Lajos Kovacs, Sanjay Chawla, Martin Keszler, Smita Rao, Bogdan Panaitescu, Alyse Laliberte, Doina Precup, Karen Brown, Robert E. Kearney and Guilherme M. Sant'Anna
The Journal of Pediatrics
(2019-02-01)
www.sciencedirect.com

2019-01

Leveraging observations in bandits: Between risks and benefits
Andrei Lupu, Audrey Durand and Doina Precup
AAAI 2019
(2019-01-27)
www.aaai.orgPDF
Combined Reinforcement Learning via Abstract Representations
vincent francois-lavet, Yoshua Bengio, Doina Precup and Joelle Pineau
Community size effect in artificial learning systems.
Olivier Tieleman, Angeliki Lazaridou, Shibl Mourad, Charles Blundell and Doina Precup
ViGIL@NeurIPS
(2019-01-01)
dblp.uni-trier.dePDF
Learning Reliable Policies in the Bandit Setting with Application to Adaptive Clinical Trials.
Hossein Aboutalebi, Doina Precup and Tibor Schuster
IJCAI 2019
(2019-01-01)
dblp.uni-trier.dePDF
Temporally Extended Metrics for Markov Decision Processes.
AAAI 2019
(2019-01-01)
dblp.uni-trier.dePDF
Learning Modular Safe Policies in the Bandit Setting with Application to Adaptive Clinical Trials.
Hossein Aboutalebi, Doina Precup and Tibor Schuster

2018-12

Clustering-Oriented Representation Learning with Attractive-Repulsive Loss.
Kian Kenyon-Dean, Andre Cianflone, Lucas Page-Caccia, Guillaume Rabusseau, Jackie Chi Kit Cheung and Doina Precup
arXiv preprint arXiv:1812.07627
(2018-12-18)
ui.adsabs.harvard.eduPDF
Prediction of Progression in Multiple Sclerosis Patients
Adrian Tousignant, Paul Lemaître, Doina Precup, Douglas Arnold and Tal Arbel
International Conference on Medical Imaging with Deep Learning -- Full Paper Track
(2018-12-13)
openreview.netPDF
Learning safe policies with expert guidance
Jessie Huang, Fa Wu, Doina Precup and Yang Cai
Temporal Regularization for Markov Decision Process
NEURIPS 2018
(2018-12-03)
papers.nips.ccPDF

2018-11

Environments for Lifelong Reinforcement Learning.
Khimya Khetarpal, Shagun Sodhani, Sarath Chandar and Doina Precup
arXiv preprint arXiv:1811.10732
(2018-11-26)
dblp.uni-trier.dePDF
The Barbados 2018 List of Open Issues in Continual Learning.
Tom Schaul, Hado van Hasselt, Joseph Modayil, Martha White, Adam White, Pierre-Luc Bacon, Jean Harb, Shibl Mourad, Marc G. Bellemare and Doina Precup
arXiv preprint arXiv:1811.07004
(2018-11-16)
dblp.uni-trier.dePDF
Temporal Regularization in Markov Decision Process
arXiv preprint arXiv:1811.00429
(2018-11-01)
ui.adsabs.harvard.eduPDF

2018-09

Where Off-Policy Deep Reinforcement Learning Fails
Scott Fujimoto, David Meger and Doina Precup
(venue unknown)
(2018-09-27)
openreview.netPDF
Shaping representations through communication
Olivier Tieleman, Angeliki Lazaridou, Shibl Mourad, Charles Blundell and Doina Precup
(venue unknown)
(2018-09-27)
openreview.netPDF

2018-08

A Semi-Markov Chain Approach to Modeling Respiratory Patterns Prior to Extubation in Preterm Infants.
Charles C. Onu, Lara J. Kanbar, Wissam Shalish, Karen A. Brown, Guilherme M. Sant'Anna, Robert E. Kearney and Doina Precup
arXiv preprint arXiv:1808.07989
(2018-08-24)
export.arxiv.orgPDF
Predicting Extubation Readiness in Extreme Preterm Infants based on Patterns of Breathing
Charles C. Onu, Lara J. Kanbar, Wissam Shalish, Karen A. Brown, Guilherme M. Sant'Anna, Robert E. Kearney and Doina Precup
arXiv preprint arXiv:1808.07991
(2018-08-24)
export.arxiv.orgPDF
Undersampling and Bagging of Decision Trees in the Analysis of Cardiorespiratory Behavior for the Prediction of Extubation Readiness in Extremely Preterm Infants.
Lara J. Kanbar, Charles C. Onu, Wissam Shalish, Karen A. Brown, Guilherme M. Sant'Anna, Robert E. Kearney and Doina Precup
arXiv preprint arXiv:1808.07992
(2018-08-24)
arxiv.orgPDF

2018-07

Attend Before you Act: Leveraging human visual attention for continual learning.
Khimya Khetarpal and Doina Precup
arXiv preprint arXiv:1807.09664
(2018-07-25)
dblp.uni-trier.dePDF
Safe Option-Critic: Learning Safety in the Option-Critic Architecture
Arushi Jain, Khimya Khetarpal and Doina Precup
arXiv preprint arXiv:1807.08060
(2018-07-21)
ui.adsabs.harvard.eduPDF
Convergent Tree-Backup and Retrace with Function Approximation
ICML 2018
(2018-07-10)
proceedings.mlr.pressPDF
Leveraging Observational Learning for Exploration in Bandits
Andrei Lupu, Audrey Durand and Doina Precup
AAMAS 2018
(2018-07-09)
celweb.vuse.vanderbilt.edu
Eligibility Traces for Options
Ayush Jain and Doina Precup
AAMAS 2018
(2018-07-09)
celweb.vuse.vanderbilt.edu
Connecting Weighted Automata and Recurrent Neural Networks through Spectral Learning
Undersampling and Bagging of Decision Trees in the Analysis of Cardiorespiratory Behavior for the Prediction of Extubation Readiness in Extremely Preterm Infants
Lara J. Kanbar, Charles C. Onu, Wissam Shalish, Karen A. Brown, Guilherme M. SantrAnna, Doina Precup and Robert E. Kearney
EMBC 2018
(2018-07-01)
www.ncbi.nlm.nih.govPDF

2018-06

Diffusion-Based Approximate Value Functions
Martin Klissarov and Doina Precup
(venue unknown)
(2018-06-15)
openreview.netPDF

2018-05

Dyna Planning using a Feature Based Generative Model.
Ryan Faulkner and Doina Precup
arXiv preprint arXiv:1805.10129
(2018-05-23)
dblp.uni-trier.dePDF

2018-03

Nonlinear Weighted Finite Automata
AISTATS 2018
(2018-03-31)
proceedings.mlr.pressPDF
Constructing Temporal Abstractions Autonomously in Reinforcement Learning
Ai Magazine
(2018-03-27)
dblp.uni-trier.de

2018-02

Disentangling the independently controllable factors of variation by interacting with the world
Valentin Thomas, Emmanuel Bengio, William Fedus, Jules Pondard, Philippe Beaudoin, Hugo Larochelle, Joelle Pineau, Doina Precup and Yoshua Bengio
arXiv preprint arXiv:1802.09484
(2018-02-26)
ui.adsabs.harvard.eduPDF
Learning Robust Options
Daniel J. Mankowitz, Timothy A. Mann, Pierre-Luc Bacon, Doina Precup and Shie Mannor
arXiv preprint arXiv:1802.03236
(2018-02-09)
arxiv.orgPDF
Learning with Options that Terminate Off-Policy
Anna Harutyunyan, Peter Vrancx, Pierre-luc Bacon, Doina Precup and Ann Nowe
AAAI 2018
(2018-02-07)
ui.adsabs.harvard.eduPDF
When Waiting is not an Option : Learning Options with a Deliberation Cost
Jean Harb, Pierre-luc Bacon, Martin Klissarov and Doina Precup
AAAI 2018
(2018-02-07)
dblp.uni-trier.dePDF
Learning Predictive State Representations from Non-uniform Sampling
Yuri Grinberg, Hossein Aboutalebi, Melanie Lyman-Abramovitch, Borja Balle and Doina Precup
AAAI 2018
(2018-02-07)
dblp.uni-trier.dePDF
Learning Robust Options
Daniel Mankowitz, Timothy Mann, Shie Mannor, Doina Precup and Pierre-luc Bacon
AAAI 2018
(2018-02-07)
dblp.uni-trier.dePDF
OptionGAN: Learning Joint Reward-Policy Options using Generative Adversarial Inverse Reinforcement Learning
Peter Henderson, Wei-Di Chang, Pierre-luc Bacon, David Meger, Joelle Pineau and Doina Precup
AAAI 2018
(2018-02-07)
ui.adsabs.harvard.eduPDF
Deep Reinforcement Learning that Matters
Peter Henderson, Riashat Islam, Joelle Pineau, David Meger, Doina Precup and Philip Bachman
AAAI 2018
(2018-02-07)
ui.adsabs.harvard.eduPDF

2018-01

Patterns of reintubation in extremely preterm infants: a longitudinal cohort study.
Wissam Shalish, Lara Kanbar, Martin Keszler, Sanjay Chawla, Lajos Kovacs, Smita Rao, Bogdan A Panaitescu, Alyse Laliberte, Doina Precup, Karen Brown, Robert E Kearney and Guilherme M Sant'Anna
Pediatric Research
(2018-01-31)
www.nature.com
Imitation Upper Confidence Bound for Bandits on a Graph.
Andrei Lupu and Doina Precup
AAAI 2018
(2018-01-01)
dblp.uni-trier.de
Resolving Event Coreference with Supervised Representation Learning and Clustering-Oriented Regularization.
Kian Kenyon-Dean, Jackie Chi Kit Cheung and Doina Precup

Publications collected and formatted using Paperoni

array(1) { ["wp-wpml_current_language"]=> string(2) "fr" }