Doina Precup

Mila > À propos de Mila > Équipe > Doina Precup
Membre Académique Principal
Doina Precup
Professeur agrégé, Professeure agrégée, McGill University, DeepMind
Doina Precup

Doina Precup enseigne à l’Université McGill tout en menant des recherches fondamentales sur l’apprentissage par renforcement, notamment sur les applications de l’IA dans des domaines ayant un impact social, tels que les soins de santé. Elle s’intéresse à la prise de décision de la machine dans des situations d’incertitude élevée.

Elle est membre de l’Institut canadien de recherches avancées, membre de l’Association pour l’avancement de l’intelligence artificielle et elle dirige également le bureau montréalais de Deepmind.

Spécialiste dans les domaines suivants :  intelligence artificielle, apprentissage machine, apprentissage par renforcement, raisonnement et planification sous incertitude, applications.

Publications

2021-10

Reward is enough
David Silver, Satinder Singh, Doina Precup and Richard S. Sutton
Artificial Intelligence
(2021-10-01)
www.sciencedirect.com

2021-08

Temporally Abstract Partial Models
Khimya Khetarpal, Zafarali Ahmed, Gheorghe Comanici and Doina Precup
arXiv preprint arXiv:2108.03213
(2021-08-06)
ui.adsabs.harvard.eduPDF
Policy Gradients Incorporating the Future
David Venuto, Elaine Lau, Doina Precup and Ofir Nachum
arXiv preprint arXiv:2108.02096
(2021-08-04)
ui.adsabs.harvard.eduPDF

2021-07

Preferential Temporal Difference Learning
Nishanth Anand and Doina Precup
Randomized Exploration in Reinforcement Learning with General Value Function Approximation
Haque Ishfaq, Qiwen Cui, Viet Nguyen, Alex Ayoub, Zhuoran Yang, Zhaoran Wang, Doina Precup and Lin Yang
ICML 2021
(2021-07-18)
icml.cc
Locally Persistent Exploration in Continuous Control Tasks with Sparse Rewards
Susan Amin, Maziar Gomrokchi, Hossein Aboutalebi, Harsh Satija and Doina Precup

2021-06

Randomized Exploration for Reinforcement Learning with General Value Function Approximation
Haque Ishfaq, Qiwen Cui, Viet Nguyen, Alex Ayoub, Zhuoran Yang, Zhaoran Wang, Doina Precup and Lin F. Yang
arXiv preprint arXiv:2106.07841
(2021-06-15)
aps.arxiv.orgPDF
Flow Network based Generative Models for Non-Iterative Diverse Candidate Generation.
Emmanuel Bengio, Moksh Jain, Maksym Korablyov, Doina Precup and Yoshua Bengio
arXiv preprint arXiv:2106.04399
(2021-06-08)
dblp.uni-trier.dePDF
A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning
Mingde Zhao, Zhen Liu, Sitao Luan, Shuyuan Zhang, Doina Precup and Yoshua Bengio
arXiv: Artificial Intelligence
(2021-06-03)
ui.adsabs.harvard.eduPDF
Improving Long-Term Metrics in Recommendation Systems using Short-Horizon Offline RL.
Bogdan Mazoure, Paul Mineiro, Pavithra Srinath, Reza Sharifi Sedeh, Doina Precup and Adith Swaminathan
arXiv preprint arXiv:2106.00589
(2021-06-01)
dblp.uni-trier.dePDF

2021-05

AndroidEnv: A Reinforcement Learning Platform for Android.
Daniel Toyama, Philippe Hamel, Anita Gergely, Gheorghe Comanici, Amelia Glaese, Zafarali Ahmed, Tyler Jackson, Shibl Mourad and Doina Precup
arXiv preprint arXiv:2105.13231
(2021-05-27)
ui.adsabs.harvard.eduPDF
Practical Marginalized Importance Sampling with the Successor Representation
Scott Fujimoto, David Meger and Doina Precup
(venue unknown)
(2021-05-04)
openreview.netPDF
Correcting Momentum in Temporal Difference Learning
Emmanuel Bengio, Joelle Pineau and Doina Precup
arxiv:cs.LG
(2021-05-04)
dblp.uni-trier.dePDF
Offline Policy Optimization with Variance Regularization
Riashat Islam, Samarth Sinha, Homanga Bharadhwaj, Samin Yeasar Arnob, Zhuoran Yang, Zhaoran Wang, Animesh Garg, Lihong Li and Doina Precup
(venue unknown)
(2021-05-04)
openreview.netPDF
Conditional Networks
Anthony Ortiz, Kris Sankaran, Olac Fuentes, Christopher Kiekintveld, Pascal Vincent, Yoshua Bengio and Doina Precup
(venue unknown)
(2021-05-04)
openreview.net

2021-04

What is Going on Inside Recurrent Meta Reinforcement Learning Agents
Safa Alver and Doina Precup
arXiv preprint arXiv:2104.14644
(2021-04-29)
ui.adsabs.harvard.eduPDF

2021-03

Training a First-Order Theorem Prover from Synthetic Data.
Vlad Firoiu, Eser Aygün, Ankit Anand, Zafarali Ahmed, Xavier Glorot, Laurent Orseau, Lei Zhang, Doina Precup and Shibl Mourad
arXiv preprint arXiv:2103.03798
(2021-03-05)
dblp.uni-trier.dePDF

2021-02

Variance Penalized On-Policy and Off-Policy Actor-Critic.
Arushi Jain, Gandharv Patil, Ayush Jain, Khimya Khetarpal and Doina Precup

2021-01

Self-Supervised Attention-Aware Reinforcement Learning.
Haiping Wu, Khimya Khetarpal and Doina Precup
AAAI 2021
(2021-01-01)
dblp.uni-trier.de
Safe Option-Critic: Learning Safety in the Option-Critic Architecture
Arushi Jain, Khimya Khetarpal and Doina Precup

2020-12

Towards Continual Reinforcement Learning: A Review and Perspectives.
Khimya Khetarpal, Matthew Riemer, Irina Rish and Doina Precup
arXiv preprint arXiv:2012.13490
(2020-12-25)
ui.adsabs.harvard.eduPDF
Phylogenetic Manifold Regularization: A semi-supervised approach to predict transcription factor binding sites
Faizy Ahsan, Alexandre Drouin, Francois Laviolette, Doina Precup and Mathieu Blanchette
BIBM 2020
(2020-12-16)
dblp.uni-trier.de
Fast reinforcement learning with generalized policy updates
André Barreto, Shaobo Hou, Diana Borsa, David Silver and Doina Precup
Proceedings of the National Academy of Sciences of the United States of America
(2020-12-01)
europepmc.orgPDF

2020-11

Gradient Starvation: A Learning Proclivity in Neural Networks.
Mohammad Pezeshki, Sékou-Oumar Kaba, Yoshua Bengio, Aaron C. Courville, Doina Precup and Guillaume Lajoie
arXiv preprint arXiv:2011.09468
(2020-11-18)
ui.adsabs.harvard.eduPDF
Diversity-Enriched Option-Critic.
Anand Kamat and Doina Precup
arXiv preprint arXiv:2011.02565
(2020-11-04)
ui.adsabs.harvard.eduPDF
A Study of Policy Gradient on a Class of Exactly Solvable Models.
Gavin McCracken, Colin Daniels, Rosie Zhao, Anna Brandenberger, Prakash Panangaden and Doina Precup
arXiv preprint arXiv:2011.01859
(2020-11-03)
ui.adsabs.harvard.eduPDF

2020-10

Connecting Weighted Automata, Tensor Networks and Recurrent Neural Networks through Spectral Learning.
arXiv preprint arXiv:2010.10029
(2020-10-19)
dblp.uni-trier.dePDF
A Fully Tensorized Recurrent Neural Network.
Charles C. Onu, Jacob E. Miller and Doina Precup
arXiv preprint arXiv:2010.04196
(2020-10-08)
ui.adsabs.harvard.eduPDF

2020-09

Keynote Lecture Building Knowledge For AI AgentsWith Reinforcement Learning
ICCP 2020
(2020-09-03)
dblp.uni-trier.de

2020-08

Complete the Missing Half: Augmenting Aggregation Filtering with Diversification for Graph Convolutional Networks.
Sitao Luan, Mingde Zhao, Chenqing Hua, Xiao-Wen Chang and Doina Precup
arXiv preprint arXiv:2008.08844
(2020-08-20)
ui.adsabs.harvard.eduPDF
Training Matters: Unlocking Potentials of Deeper Graph Convolutional Neural Networks.
Sitao Luan, Mingde Zhao, Xiao-Wen Chang and Doina Precup
arXiv preprint arXiv:2008.08838
(2020-08-20)
ui.adsabs.harvard.eduPDF

2020-07

What can I do here? A Theory of Affordances in Reinforcement Learning
Khimya Khetarpal, Zafarali Ahmed, Gheorghe Comanici, David Abel and Doina Precup
Invariant Causal Prediction for Block MDPs
Clare Lyle, Amy Zhang, Angelos Filos, Shagun Sodhani, Marta Kwiatkowska, Yarin Gal, Doina Precup and Joelle Pineau
ICML 2020
(2020-07-12)
proceedings.mlr.press

2020-06

Learning to Prove from Synthetic Theorems.
Eser Aygün, Zafarali Ahmed, Ankit Anand, Vlad Firoiu, Xavier Glorot, Laurent Orseau, Doina Precup and Shibl Mourad
arXiv: Logic in Computer Science
(2020-06-19)
ui.adsabs.harvard.eduPDF
A Brief Look at Generalization in Visual Meta-Reinforcement Learning
Safa Alver and Doina Precup
arXiv preprint arXiv:2006.07262
(2020-06-12)
ui.adsabs.harvard.eduPDF
Value Preserving State-Action Abstractions
David Abel, Nate Umbanhowar, Khimya Khetarpal, Dilip Arumugam, Doina Precup and Michael L. Littman
AISTATS 2020
(2020-06-03)
proceedings.mlr.pressPDF

2020-05

META-Learning State-based Eligibility Traces for More Sample-Efficient Policy Evaluation
Mingde Zhao, Sitao Luan, Ian Porada, Xiao-Wen Chang and Doina Precup
Option-Critic in Cooperative Multi-agent Systems
Jhelum Chakravorty, Patrick Nadeem Ward, Julien Roy, Maxime Chevalier-Boisvert, Sumana Basu, Andrei Lupu and Doina Precup
AAMAS 2020
(2020-05-05)
ui.adsabs.harvard.edu
Gifting in Multi-Agent Reinforcement Learning
Andrei Lupu and Doina Precup
AAMAS 2020
(2020-05-05)
dl.acm.org

2020-04

Gifting in Multi-Agent Reinforcement Learning (Student Abstract)
Andrei Lupu and Doina Precup
AAAI 2020
(2020-04-03)
www.aaai.org

2020-03

Invariant Causal Prediction for Block MDPs.
Amy Zhang, Clare Lyle, Shagun Sodhani, Angelos Filos, Marta Kwiatkowska, Joelle Pineau, Yarin Gal and Doina Precup
arXiv preprint arXiv:2003.06016
(2020-03-12)
ui.adsabs.harvard.eduPDF
Multiple Kernel Learning-Based Transfer Regression for Electric Load Forecasting
Di Wu, Boyu Wang, Doina Precup and Benoit Boulet
IEEE Transactions on Smart Grid
(2020-03-01)
doi.org

2020-02

Policy Evaluation Networks.
Jean Harb, Tom Schaul, Doina Precup and Pierre-Luc Bacon
arXiv preprint arXiv:2002.11833
(2020-02-26)
ui.adsabs.harvard.eduPDF
oIRL: Robust Adversarial Inverse Reinforcement Learning with Temporally Extended Actions.
David Venuto, Jhelum Chakravorty, Leonard Boussioux, Junhao Wang, Gavin McCracken and Doina Precup
arXiv preprint arXiv:2002.09043
(2020-02-20)
ui.adsabs.harvard.eduPDF
Representation of Reinforcement Learning Policies in Reproducing Kernel Hilbert Spaces.
Bogdan Mazoure, Thang Doan, Tianyu Li, Vladimir Makarenkov, Joelle Pineau, Doina Precup and Guillaume Rabusseau
arXiv preprint arXiv:2002.02863
(2020-02-07)
arxiv.orgPDF
Provably efficient reconstruction of policy networks.
Bogdan Mazoure, Thang Doan, Tianyu Li, Vladimir Makarenkov, Joelle Pineau, Doina Precup and Guillaume Rabusseau
arXiv: Learning
(2020-02-07)
ui.adsabs.harvard.eduPDF
Assessment of Extubation Readiness Using Spontaneous Breathing Trials in Extremely Preterm Neonates.
Wissam Shalish, Lara Kanbar, Lajos Kovacs, Sanjay Chawla, Martin Keszler, Smita Rao, Samantha Latremouille, Doina Precup, Karen Brown, Robert E Kearney and Guilherme M Sant'Anna
JAMA Pediatrics
(2020-02-01)
europepmc.orgPDF

2020-01

On Efficiency in Hierarchical Reinforcement Learning
Zheng Wen, Doina Precup, Morteza Ibrahimi, Andre Barreto, Benjamin Van Roy and Satinder Singh
NEURIPS 2020
(2020-01-01)
papers.nips.ccPDF
Forethought and Hindsight in Credit Assignment
Veronica Chelu, Doina Precup and Hado P. van Hasselt
Reward Propagation Using Graph Convolutional Networks
Martin Klissarov and Doina Precup
Learning to cooperate: Emergent communication in multi-agent navigation.
Ivana Kajic, Eser Aygün and Doina Precup
Value-driven Hindsight Modelling
Arthur Guez, Fabio Viola, Theophane Weber, Lars Buesing, Steven Kapturowski, Doina Precup, David Silver and Nicolas Heess

2019-12

Shaping representations through communication: community size effect in artificial learning systems
Olivier Tieleman, Angeliki Lazaridou, Shibl Mourad, Charles Blundell and Doina Precup
arXiv preprint arXiv:1912.06208
(2019-12-12)
ui.adsabs.harvard.eduPDF
Doubly Robust Off-Policy Actor-Critic Algorithms for Reinforcement Learning.
Riashat Islam, Raihan Seraj, Samin Yeasar Arnob and Doina Precup
arXiv preprint arXiv:1912.05109
(2019-12-11)
ui.adsabs.harvard.eduPDF
Marginalized State Distribution Entropy Regularization in Policy Optimization
Riashat Islam, Zafarali Ahmed and Doina Precup
arXiv preprint arXiv:1912.05128
(2019-12-11)
ui.adsabs.harvard.eduPDF
Entropy Regularization with Discounted Future State Distribution in Policy Gradient Methods
Riashat Islam, Raihan Seraj, Pierre-Luc Bacon and Doina Precup
arXiv preprint arXiv:1912.05104
(2019-12-11)
ui.adsabs.harvard.eduPDF
Hindsight Credit Assignment
Anna Harutyunyan, Will Dabney, Thomas Mesnard, Mohammad Gheshlaghi Azar, Bilal Piot, Nicolas Heess, Hado P. van Hasselt, Gregory Wayne, Satinder Singh, Doina Precup and Remi Munos

2019-11

Option-Critic in Cooperative Multi-agent Systems
Jhelum Chakravorty, Nadeem Ward, Julien Roy, Maxime Chevalier-Boisvert, Sumana Basu, Andrei Lupu and Doina Precup
arXiv preprint arXiv:1911.12825
(2019-11-28)
arxiv.orgPDF

2019-10

Navigation Agents for the Visually Impaired: A Sidewalk Simulator and Experiments
Martin Weiss, Simon Chamorro, Roger Girgis, Margaux Luck, Samira Ebrahimi Kahou, Joseph Paul Cohen, Derek Nowrouzezahrai, Doina Precup, Florian Golemo and Chris Pal
Actor Critic with Differentially Private Critic.
Jonathan Lebensold, William L. Hamilton, Borja Balle and Doina Precup
arXiv preprint arXiv:1910.05876
(2019-10-14)
ui.adsabs.harvard.eduPDF
Improving Pathological Structure Segmentation via Transfer Learning Across Diseases
Barleen Kaur, Paul Lemaître, Raghav Mehta, Nazanin Mohammadi Sepahvand, Doina Precup, Douglas L. Arnold and Tal Arbel
DART/MIL3ID@MICCAI
(2019-10-13)
link.springer.com
Early Prediction of Alzheimer's Disease Progression Using Variational Autoencoders.
Sumana Basu, Konrad Wagstyl, Azar Zandifar, D. Louis Collins, Adriana Romero and Doina Precup
MICCAI 2019
(2019-10-13)
doi.org
Augmenting learning using symmetry in a biologically-inspired domain
Shruti Mishra, Abbas Abdolmaleki, Arthur Guez, Piotr Trochim and Doina Precup
arXiv preprint arXiv:1910.00528
(2019-10-01)
ui.adsabs.harvard.eduPDF

2019-09

Assessing Generalization in TD methods for Deep Reinforcement Learning
Emmanuel Bengio, Doina Precup and Joelle Pineau
(venue unknown)
(2019-09-25)
openreview.netPDF
Avoidance Learning Using Observational Reinforcement Learning
David Venuto, Leonard Boussioux, Junhao Wang, Rola Dali, Jhelum Chakravorty, Yoshua Bengio and Doina Precup
arXiv preprint arXiv:1909.11228
(2019-09-24)
ui.adsabs.harvard.eduPDF
Revisit Policy Optimization in Matrix Form.
Sitao Luan, Xiao-Wen Chang and Doina Precup
arXiv preprint arXiv:1909.09186
(2019-09-19)
ui.adsabs.harvard.eduPDF

2019-07

An Empirical Study of Batch Normalization and Group Normalization in Conditional Computation.
Vincent Michalski, Vikram Voleti, Samira Ebrahimi Kahou, Anthony Ortiz, Pascal Vincent, Chris Pal and Doina Precup
arXiv preprint arXiv:1908.00061
(2019-07-31)
ui.adsabs.harvard.eduPDF
Learning Options with Interest Functions
Khimya Khetarpal and Doina Precup
AAAI 2019
(2019-07-17)
www.aaai.org
Leveraging Observations in Bandits: Between Risks and Benefits
Andrei Lupu, Audrey Durand and Doina Precup
AAAI 2019
(2019-07-17)
aimagazine.org
Self-supervised Learning of Distance Functions for Goal-Conditioned Reinforcement Learning
Srinivas Venkattaramanujam, Eric Crawford, Thang Doan and Doina Precup
arXiv preprint arXiv:1907.02998
(2019-07-05)
ui.adsabs.harvard.eduPDF

2019-05

Singular value automata and approximate minimization
Mathematical Structures in Computer Science
(2019-05-27)
ui.adsabs.harvard.eduPDF
Per-Decision Option Discounting
Anna Harutyunyan, Peter Vrancx, Philippe Hamel, Ann Nowe and Doina Precup
ICML 2019
(2019-05-24)
proceedings.mlr.pressPDF
Recurrent Value Functions.
Pierre Thodoroff, Nishanth Anand, Lucas Caccia, Doina Precup and Joelle Pineau
arXiv preprint arXiv:1905.09562
(2019-05-23)
dblp.uni-trier.dePDF
Building Knowledge for AI Agents with Reinforcement Learning
AAMAS 2019
(2019-05-08)
dblp.uni-trier.de

2019-04

Faster and More Accurate Trace-based Policy Evaluation via Overall Target Error Meta-Optimization
Mingde Zhao, Ian Porada, Sitao Luan, Xiaowen Chang and Doina Precup
(venue unknown)
(2019-04-25)
arxiv.org
META-Learning State-based {\lambda} for More Sample-Efficient Policy Evaluation
Mingde Zhao, Sitao Luan, Ian Porada, Xiao-Wen Chang and Doina Precup
arXiv: Learning
(2019-04-25)
arxiv.orgPDF
The Termination Critic
Anna Harutyunyan, Will Dabney, Diana Borsa, Nicolas Heess, Remi Munos and Doina Precup

2019-03

Learning proposals for sequential importance samplers using reinforced variational inference.
Zafarali Ahmed, Arjun Karuvally, Doina Precup and Simon Gravel
ICLR 2019
(2019-03-16)
dblp.uni-trier.dePDF

2019-02

The Impact of Time Interval between Extubation and Reintubation on Death or Bronchopulmonary Dysplasia in Extremely Preterm Infants.
Wissam Shalish, Lara Kanbar, Lajos Kovacs, Sanjay Chawla, Martin Keszler, Smita Rao, Bogdan Panaitescu, Alyse Laliberte, Doina Precup, Karen Brown, Robert E. Kearney and Guilherme M. Sant'Anna
The Journal of Pediatrics
(2019-02-01)
www.sciencedirect.com

2019-01

The Option Keyboard: Combining Skills in Reinforcement Learning
André Barreto, Diana Borsa, Shaobo Hou, Gheorghe Comanici, Eser Aygün, Philippe Hamel, Daniel Toyama, Jonathan J. Hunt, Shibl Mourad, David Silver and Doina Precup
Break the Ceiling: Stronger Multi-scale Deep Graph Convolutional Networks
Sitao Luan, Mingde Zhao, Xiao-Wen Chang and Doina Precup
Community size effect in artificial learning systems.
Olivier Tieleman, Angeliki Lazaridou, Shibl Mourad, Charles Blundell and Doina Precup
ViGIL@NeurIPS
(2019-01-01)
dblp.uni-trier.de
Learning Reliable Policies in the Bandit Setting with Application to Adaptive Clinical Trials.
Hossein Aboutalebi, Doina Precup and Tibor Schuster
IJCAI 2019
(2019-01-01)
dblp.uni-trier.dePDF
Learning Modular Safe Policies in the Bandit Setting with Application to Adaptive Clinical Trials.
Hossein Aboutalebi, Doina Precup and Tibor Schuster

2018-11

Environments for Lifelong Reinforcement Learning.
Khimya Khetarpal, Shagun Sodhani, Sarath Chandar and Doina Precup
arXiv preprint arXiv:1811.10732
(2018-11-26)
dblp.uni-trier.dePDF
The Barbados 2018 List of Open Issues in Continual Learning.
Tom Schaul, Hado van Hasselt, Joseph Modayil, Martha White, Adam White, Pierre-Luc Bacon, Jean Harb, Shibl Mourad, Marc G. Bellemare and Doina Precup
arXiv preprint arXiv:1811.07004
(2018-11-16)
dblp.uni-trier.dePDF
Temporal Regularization in Markov Decision Process
arXiv: Learning
(2018-11-01)
ui.adsabs.harvard.eduPDF

2018-09

Where Off-Policy Deep Reinforcement Learning Fails
Scott Fujimoto, David Meger and Doina Precup
(venue unknown)
(2018-09-27)
openreview.netPDF
Shaping representations through communication
Olivier Tieleman, Angeliki Lazaridou, Shibl Mourad, Charles Blundell and Doina Precup
(venue unknown)
(2018-09-27)
openreview.netPDF

2018-08

A Semi-Markov Chain Approach to Modeling Respiratory Patterns Prior to Extubation in Preterm Infants.
Charles C. Onu, Lara J. Kanbar, Wissam Shalish, Karen A. Brown, Guilherme M. Sant'Anna, Robert E. Kearney and Doina Precup
arXiv: Signal Processing
(2018-08-24)
arxiv.orgPDF
Predicting Extubation Readiness in Extreme Preterm Infants based on Patterns of Breathing
Charles C. Onu, Lara J. Kanbar, Wissam Shalish, Karen A. Brown, Guilherme M. Sant'Anna, Robert E. Kearney and Doina Precup
arXiv preprint arXiv:1808.07991
(2018-08-24)
ui.adsabs.harvard.eduPDF
Undersampling and Bagging of Decision Trees in the Analysis of Cardiorespiratory Behavior for the Prediction of Extubation Readiness in Extremely Preterm Infants
Lara J. Kanbar, Charles C. Onu, Wissam Shalish, Karen A. Brown, Guilherme M. Sant'Anna, Robert E. Kearney and Doina Precup
arXiv preprint arXiv:1808.07992
(2018-08-24)
ui.adsabs.harvard.eduPDF

2018-07

Attend Before you Act: Leveraging human visual attention for continual learning.
Khimya Khetarpal and Doina Precup
arXiv preprint arXiv:1807.09664
(2018-07-25)
ui.adsabs.harvard.eduPDF
Leveraging Observational Learning for Exploration in Bandits
Andrei Lupu, Audrey Durand and Doina Precup
AAMAS 2018
(2018-07-09)
dblp.uni-trier.de
Eligibility Traces for Options
Ayush Jain and Doina Precup
AAMAS 2018
(2018-07-09)
dblp.uni-trier.de
Connecting Weighted Automata and Recurrent Neural Networks through Spectral Learning
Undersampling and Bagging of Decision Trees in the Analysis of Cardiorespiratory Behavior for the Prediction of Extubation Readiness in Extremely Preterm Infants
Lara J. Kanbar, Charles C. Onu, Wissam Shalish, Karen A. Brown, Guilherme M. SantrAnna, Doina Precup and Robert E. Kearney
EMBC 2018
(2018-07-01)
doi.orgPDF

2018-06

Diffusion-Based Approximate Value Functions
Martin Klissarov and Doina Precup
(venue unknown)
(2018-06-15)
openreview.netPDF

2018-05

Dyna Planning using a Feature Based Generative Model.
Ryan Faulkner and Doina Precup
arXiv preprint arXiv:1805.10129
(2018-05-23)
ui.adsabs.harvard.eduPDF
Learning Safe Policies with Expert Guidance
Jessie Huang, Fa Wu, Doina Precup and Yang Cai

2018-03

Nonlinear Weighted Finite Automata
AISTATS 2018
(2018-03-31)
dblp.uni-trier.dePDF
Constructing Temporal Abstractions Autonomously in Reinforcement Learning
Ai Magazine
(2018-03-27)
doi.org

2018-02

Disentangling the independently controllable factors of variation by interacting with the world
Valentin Thomas, Emmanuel Bengio, William Fedus, Jules Pondard, Philippe Beaudoin, Hugo Larochelle, Joelle Pineau, Doina Precup and Yoshua Bengio
arXiv preprint arXiv:1802.09484
(2018-02-26)
ui.adsabs.harvard.eduPDF
Learning with Options that Terminate Off-Policy
Anna Harutyunyan, Peter Vrancx, Pierre-luc Bacon, Doina Precup and Ann Nowe
AAAI 2018
(2018-02-07)
ui.adsabs.harvard.eduPDF
When Waiting is not an Option : Learning Options with a Deliberation Cost
Jean Harb, Pierre-luc Bacon, Martin Klissarov and Doina Precup
AAAI 2018
(2018-02-07)
dblp.uni-trier.dePDF
OptionGAN: Learning Joint Reward-Policy Options using Generative Adversarial Inverse Reinforcement Learning
Peter Henderson, Wei-Di Chang, Pierre-luc Bacon, David Meger, Joelle Pineau and Doina Precup
AAAI 2018
(2018-02-07)
ui.adsabs.harvard.eduPDF
Deep Reinforcement Learning that Matters
Peter Henderson, Riashat Islam, Joelle Pineau, David Meger, Doina Precup and Philip Bachman
AAAI 2018
(2018-02-07)
ui.adsabs.harvard.eduPDF

2018-01

Patterns of reintubation in extremely preterm infants: a longitudinal cohort study
Wissam Shalish, Lara Kanbar, Martin Keszler, Sanjay Chawla, Lajos Kovacs, Smita Rao, Bogdan A Panaitescu, Alyse Laliberte, Doina Precup, Karen Brown, Robert E Kearney and Guilherme M Sant'Anna
Pediatric Research
(2018-01-31)
www.nature.com
Learning Predictive State Representations From Non-Uniform Sampling.
Yuri Grinberg, Hossein Aboutalebi, Melanie Lyman-Abramovitch, Borja Balle and Doina Precup
AAAI 2018
(2018-01-01)
dblp.uni-trier.de
Imitation Upper Confidence Bound for Bandits on a Graph.
Andrei Lupu and Doina Precup
AAAI 2018
(2018-01-01)
dblp.uni-trier.de
Learning Robust Options.
Daniel J. Mankowitz, Timothy A. Mann, Pierre-Luc Bacon, Doina Precup and Shie Mannor
Temporal Regularization for Markov Decision Process
NEURIPS 2018
(2018-01-01)
papers.nips.ccPDF

Publications collected and formatted using Paperoni

array(1) { ["wp-wpml_current_language"]=> string(2) "fr" }