Doina Precup

Mila > À propos de Mila > Équipe > Doina Precup
Membre Académique Principal
Doina Precup
Professeur agrégé, Professeure agrégée, McGill University, DeepMind
Doina Precup

Doina Precup enseigne à l’Université McGill tout en menant des recherches fondamentales sur l’apprentissage par renforcement, notamment sur les applications de l’IA dans des domaines ayant un impact social, tels que les soins de santé. Elle s’intéresse à la prise de décision de la machine dans des situations d’incertitude élevée.

Elle est membre de l’Institut canadien de recherches avancées, membre de l’Association pour l’avancement de l’intelligence artificielle et elle dirige également le bureau montréalais de Deepmind.

Spécialiste dans les domaines suivants :  intelligence artificielle, apprentissage machine, apprentissage par renforcement, raisonnement et planification sous incertitude, applications.

Publications

2021-05

Reward Is Enough
David Silver, Satinder Singh, Doina Precup and Richard S. Sutton
Artificial Intelligence
(2021-05-24)
www.sciencedirect.com
Practical Marginalized Importance Sampling with the Successor Representation
Scott Fujimoto, David Meger and Doina Precup
(venue unknown)
(2021-05-04)
openreview.netPDF
Correcting Momentum in Temporal Difference Learning
Emmanuel Bengio, Joelle Pineau and Doina Precup
(venue unknown)
(2021-05-04)
openreview.netPDF
Offline Policy Optimization with Variance Regularization
Riashat Islam, Samarth Sinha, Homanga Bharadhwaj, Samin Yeasar Arnob, Zhuoran Yang, Zhaoran Wang, Animesh Garg, Lihong Li and Doina Precup
(venue unknown)
(2021-05-04)
openreview.netPDF
Conditional Networks
Anthony Ortiz, Kris Sankaran, Olac Fuentes, Christopher Kiekintveld, Pascal Vincent, Yoshua Bengio and Doina Precup
(venue unknown)
(2021-05-04)
openreview.net
What is Going on Inside Recurrent Meta Reinforcement Learning Agents
Safa Alver and Doina Precup
arXiv: Learning
(2021-05-02)
dblp.uni-trier.dePDF

2021-02

Optimal Spectral-Norm Approximate Minimization of Weighted Finite Automata.
Borja Balle, Clara Lacroce, Prakash Panangaden, Doina Precup and Guillaume Rabusseau
arXiv preprint arXiv:2102.06860
(2021-02-13)
dblp.uni-trier.dePDF

2021-01

Safe Option-Critic: Learning Safety in the Option-Critic Architecture
Arushi Jain, Khimya Khetarpal and Doina Precup

2020-12

Locally Persistent Exploration in Continuous Control Tasks with Sparse Rewards.
Susan Amin, Maziar Gomrokchi, Hossein Aboutalebi, Harsh Satija and Doina Precup
arXiv preprint arXiv:2012.13658
(2020-12-26)
dblp.uni-trier.dePDF
Towards Continual Reinforcement Learning: A Review and Perspectives.
Khimya Khetarpal, Matthew Riemer, Irina Rish and Doina Precup
arXiv preprint arXiv:2012.13490
(2020-12-25)
dblp.uni-trier.dePDF
Phylogenetic Manifold Regularization: A semi-supervised approach to predict transcription factor binding sites
Faizy Ahsan, Alexandre Drouin, Francois Laviolette, Doina Precup and Mathieu Blanchette
BIBM 2020
(2020-12-16)
dblp.uni-trier.de

2020-11

Gradient Starvation: A Learning Proclivity in Neural Networks.
Mohammad Pezeshki, Sékou-Oumar Kaba, Yoshua Bengio, Aaron C. Courville, Doina Precup and Guillaume Lajoie
arXiv preprint arXiv:2011.09468
(2020-11-18)
dblp.uni-trier.dePDF
Diversity-Enriched Option-Critic.
Anand Kamat and Doina Precup
arXiv preprint arXiv:2011.02565
(2020-11-04)
ui.adsabs.harvard.eduPDF
A Study of Policy Gradient on a Class of Exactly Solvable Models.
Gavin McCracken, Colin Daniels, Rosie Zhao, Anna Brandenberger, Prakash Panangaden and Doina Precup
arXiv preprint arXiv:2011.01859
(2020-11-03)
ui.adsabs.harvard.eduPDF

2020-10

Connecting Weighted Automata, Tensor Networks and Recurrent Neural Networks through Spectral Learning.
arXiv preprint arXiv:2010.10029
(2020-10-19)
dblp.uni-trier.dePDF
A Fully Tensorized Recurrent Neural Network
Charles C. Onu, Jacob E. Miller and Doina Precup
arXiv preprint arXiv:2010.04196
(2020-10-08)
arxiv.orgPDF

2020-09

Keynote Lecture Building Knowledge For AI AgentsWith Reinforcement Learning
ICCP 2020
(2020-09-03)
dblp.uni-trier.de

2020-08

Complete the Missing Half: Augmenting Aggregation Filtering with Diversification for Graph Convolutional Networks.
Sitao Luan, Mingde Zhao, Chenqing Hua, Xiao-Wen Chang and Doina Precup
arXiv: Learning
(2020-08-20)
ui.adsabs.harvard.eduPDF
Training Matters: Unlocking Potentials of Deeper Graph Convolutional Neural Networks.
Sitao Luan, Mingde Zhao, Xiao-Wen Chang and Doina Precup
arXiv preprint arXiv:2008.08838
(2020-08-20)
dblp.uni-trier.dePDF
Fast reinforcement learning with generalized policy updates
André Barreto, Shaobo Hou, Diana Borsa, David Silver and Doina Precup
Proceedings of the National Academy of Sciences of the United States of America
(2020-08-17)
europepmc.orgPDF

2020-07

What can I do here? A Theory of Affordances in Reinforcement Learning
Khimya Khetarpal, Zafarali Ahmed, Gheorghe Comanici, David Abel and Doina Precup
Invariant Causal Prediction for Block MDPs
Clare Lyle, Amy Zhang, Angelos Filos, Shagun Sodhani, Marta Kwiatkowska, Yarin Gal, Doina Precup and Joelle Pineau
ICML 2020
(2020-07-12)
proceedings.mlr.press
Interference and Generalization in Temporal Difference Learning
Emmanuel Bengio, Joelle Pineau and Doina Precup
SVRG for Policy Evaluation with Fewer Gradient Evaluations.
Zilun Peng, Ahmed Touati, Pascal Vincent and Doina Precup

2020-06

Learning to Prove from Synthetic Theorems.
Eser Aygün, Zafarali Ahmed, Ankit Anand, Vlad Firoiu, Xavier Glorot, Laurent Orseau, Doina Precup and Shibl Mourad
arXiv preprint arXiv:2006.11259
(2020-06-19)
dblp.uni-trier.dePDF
A Brief Look at Generalization in Visual Meta-Reinforcement Learning
Safa Alver and Doina Precup
arXiv preprint arXiv:2006.07262
(2020-06-12)
ui.adsabs.harvard.eduPDF
A Distributional Analysis of Sampling-Based Reinforcement Learning Algorithms.
Value Preserving State-Action Abstractions.
David Abel, Nate Umbanhowar, Khimya Khetarpal, Dilip Arumugam, Doina Precup and Michael L. Littman
AISTATS 2020
(2020-06-03)
proceedings.mlr.pressPDF

2020-05

META-Learning State-based Eligibility Traces for More Sample-Efficient Policy Evaluation
Mingde Zhao, Sitao Luan, Ian Porada, Xiao-Wen Chang and Doina Precup
Option-Critic in Cooperative Multi-agent Systems
Jhelum Chakravorty, Patrick Nadeem Ward, Julien Roy, Maxime Chevalier-Boisvert, Sumana Basu, Andrei Lupu and Doina Precup
AAMAS 2020
(2020-05-05)
dl.acm.org
Gifting in Multi-Agent Reinforcement Learning
Andrei Lupu and Doina Precup
AAMAS 2020
(2020-05-05)
dl.acm.org

2020-04

Gifting in Multi-Agent Reinforcement Learning (Student Abstract)
Andrei Lupu and Doina Precup
AAAI 2020
(2020-04-03)
www.aaai.org
Algorithmic Improvements for Deep Reinforcement Learning Applied to Interactive Fiction.
Vishal Jain, William Fedus, Hugo Larochelle, Doina Precup and Marc G. Bellemare
Options of Interest: Temporal Abstraction with Interest Functions
Khimya Khetarpal, Martin Klissarov, Maxime Chevalier-Boisvert, Pierre-Luc Bacon and Doina Precup

2020-03

Invariant Causal Prediction for Block MDPs.
Amy Zhang, Clare Lyle, Shagun Sodhani, Angelos Filos, Marta Kwiatkowska, Joelle Pineau, Yarin Gal and Doina Precup
arXiv preprint arXiv:2003.06016
(2020-03-12)
aps.arxiv.orgPDF
Multiple Kernel Learning-Based Transfer Regression for Electric Load Forecasting
Di Wu, Boyu Wang, Doina Precup and Benoit Boulet
IEEE Transactions on Smart Grid
(2020-03-01)
doi.org

2020-02

Policy Evaluation Networks.
Jean Harb, Tom Schaul, Doina Precup and Pierre-Luc Bacon
arXiv preprint arXiv:2002.11833
(2020-02-26)
ui.adsabs.harvard.eduPDF
oIRL: Robust Adversarial Inverse Reinforcement Learning with Temporally Extended Actions.
David Venuto, Jhelum Chakravorty, Leonard Boussioux, Junhao Wang, Gavin McCracken and Doina Precup
arXiv preprint arXiv:2002.09043
(2020-02-20)
ui.adsabs.harvard.eduPDF
Representation of Reinforcement Learning Policies in Reproducing Kernel Hilbert Spaces.
Bogdan Mazoure, Thang Doan, Tianyu Li, Vladimir Makarenkov, Joelle Pineau, Doina Precup and Guillaume Rabusseau
arXiv: Learning
(2020-02-07)
arxiv.orgPDF
Provably efficient reconstruction of policy networks.
Bogdan Mazoure, Thang Doan, Tianyu Li, Vladimir Makarenkov, Joelle Pineau, Doina Precup and Guillaume Rabusseau
arXiv preprint arXiv:2002.02863
(2020-02-07)
ui.adsabs.harvard.eduPDF
Assessment of Extubation Readiness Using Spontaneous Breathing Trials in Extremely Preterm Neonates.
Wissam Shalish, Lara Kanbar, Lajos Kovacs, Sanjay Chawla, Martin Keszler, Smita Rao, Samantha Latremouille, Doina Precup, Karen Brown, Robert E Kearney and Guilherme M Sant'Anna
JAMA Pediatrics
(2020-02-01)
europepmc.orgPDF

2020-01

On Efficiency in Hierarchical Reinforcement Learning
Zheng Wen, Doina Precup, Morteza Ibrahimi, Andre Barreto, Benjamin Van Roy and Satinder Singh
NEURIPS 2020
(2020-01-01)
papers.nips.ccPDF
Forethought and Hindsight in Credit Assignment
Veronica Chelu, Doina Precup and Hado P. van Hasselt
Reward Propagation Using Graph Convolutional Networks
Martin Klissarov and Doina Precup
Learning to cooperate: Emergent communication in multi-agent navigation.
Ivana Kajic, Eser Aygün and Doina Precup
Exploring uncertainty measures in deep networks for Multiple sclerosis lesion detection and segmentation.
Tanya Nair, Doina Precup, Douglas L. Arnold and Tal Arbel
Value-driven Hindsight Modelling
Arthur Guez, Fabio Viola, Theophane Weber, Lars Buesing, Steven Kapturowski, Doina Precup, David Silver and Nicolas Heess
An Equivalence between Loss Functions and Non-Uniform Sampling in Experience Replay
Scott Fujimoto, David Meger and Doina Precup

2019-12

Shaping representations through communication: community size effect in artificial learning systems
Olivier Tieleman, Angeliki Lazaridou, Shibl Mourad, Charles Blundell and Doina Precup
arXiv preprint arXiv:1912.06208
(2019-12-12)
dblp.uni-trier.dePDF
Doubly Robust Off-Policy Actor-Critic Algorithms for Reinforcement Learning.
Riashat Islam, Raihan Seraj, Samin Yeasar Arnob and Doina Precup
arXiv preprint arXiv:1912.05109
(2019-12-11)
dblp.uni-trier.dePDF
Marginalized State Distribution Entropy Regularization in Policy Optimization
Riashat Islam, Zafarali Ahmed and Doina Precup
arXiv preprint arXiv:1912.05128
(2019-12-11)
ui.adsabs.harvard.eduPDF
Entropy Regularization with Discounted Future State Distribution in Policy Gradient Methods
Riashat Islam, Raihan Seraj, Pierre-Luc Bacon and Doina Precup
arXiv preprint arXiv:1912.05104
(2019-12-11)
ui.adsabs.harvard.eduPDF
Hindsight Credit Assignment
Anna Harutyunyan, Will Dabney, Thomas Mesnard, Mohammad Gheshlaghi Azar, Bilal Piot, Nicolas Heess, Hado P. van Hasselt, Gregory Wayne, Satinder Singh, Doina Precup and Remi Munos

2019-11

Option-critic in cooperative multi-agent systems
Jhelum Chakravorty, Nadeem Ward, Julien Roy, Maxime Chevalier-Boisvert, Sumana Basu, Andrei Lupu and Doina Precup
arXiv preprint arXiv:1911.12825
(2019-11-28)
export.arxiv.orgPDF
Efficient Planning under Partial Observability with Unnormalized Q Functions and Spectral Learning
Tianyu Li, Bogdan Mazoure, Doina Precup and Guillaume Rabusseau

2019-10

Navigation Agents for the Visually Impaired: A Sidewalk Simulator and Experiments
Martin Weiss, Simon Chamorro, Roger Girgis, Margaux Luck, Samira Ebrahimi Kahou, Joseph Paul Cohen, Derek Nowrouzezahrai, Doina Precup, Florian Golemo and Chris Pal
Actor Critic with Differentially Private Critic.
Jonathan Lebensold, William L. Hamilton, Borja Balle and Doina Precup
arXiv preprint arXiv:1910.05876
(2019-10-14)
ui.adsabs.harvard.eduPDF
Improving Pathological Structure Segmentation via Transfer Learning Across Diseases
Barleen Kaur, Paul Lemaître, Raghav Mehta, Nazanin Mohammadi Sepahvand, Doina Precup, Douglas L. Arnold and Tal Arbel
DART/MIL3ID@MICCAI
(2019-10-13)
link.springer.com
Early Prediction of Alzheimer's Disease Progression Using Variational Autoencoders.
Sumana Basu, Konrad Wagstyl, Azar Zandifar, D. Louis Collins, Adriana Romero and Doina Precup
MICCAI 2019
(2019-10-13)
doi.org
Singular value automata and approximate minimization
Mathematical Structures in Computer Science
(2019-10-01)
ui.adsabs.harvard.eduPDF
Augmenting learning using symmetry in a biologically-inspired domain
Shruti Mishra, Abbas Abdolmaleki, Arthur Guez, Piotr Trochim and Doina Precup
arXiv preprint arXiv:1910.00528
(2019-10-01)
ui.adsabs.harvard.eduPDF

2019-09

Assessing Generalization in TD methods for Deep Reinforcement Learning
Emmanuel Bengio, Doina Precup and Joelle Pineau
(venue unknown)
(2019-09-25)
openreview.netPDF
Avoidance Learning Using Observational Reinforcement Learning
David Venuto, Leonard Boussioux, Junhao Wang, Rola Dali, Jhelum Chakravorty, Yoshua Bengio and Doina Precup
arXiv preprint arXiv:1909.11228
(2019-09-24)
ui.adsabs.harvard.eduPDF
Revisit Policy Optimization in Matrix Form.
Sitao Luan, Xiao-Wen Chang and Doina Precup
arXiv preprint arXiv:1909.09186
(2019-09-19)
dblp.uni-trier.dePDF

2019-07

An Empirical Study of Batch Normalization and Group Normalization in Conditional Computation.
Vincent Michalski, Vikram Voleti, Samira Ebrahimi Kahou, Anthony Ortiz, Pascal Vincent, Chris Pal and Doina Precup
arXiv preprint arXiv:1908.00061
(2019-07-31)
dblp.uni-trier.dePDF
Learning Options with Interest Functions
Khimya Khetarpal and Doina Precup
AAAI 2019
(2019-07-17)
www.aaai.org
Leveraging Observations in Bandits: Between Risks and Benefits
Andrei Lupu, Audrey Durand and Doina Precup
AAAI 2019
(2019-07-17)
aimagazine.org
Combined Reinforcement Learning via Abstract Representations
Vincent Francois-Lavet, Yoshua Bengio, Doina Precup and Joelle Pineau
Self-supervised Learning of Distance Functions for Goal-Conditioned Reinforcement Learning
Srinivas Venkattaramanujam, Eric Crawford, Thang Doan and Doina Precup
arXiv: Learning
(2019-07-05)
ui.adsabs.harvard.eduPDF

2019-06

Neural Transfer Learning for Cry-based Diagnosis of Perinatal Asphyxia
Charles C. Onu, Jonathan Lebensold, William L. Hamilton and Doina Precup

2019-05

Singular value automata and approximate minimization
Mathematical Structures in Computer Science
(2019-05-27)
ui.adsabs.harvard.eduPDF
Per-Decision Option Discounting
Anna Harutyunyan, Peter Vrancx, Philippe Hamel, Ann Nowe and Doina Precup
ICML 2019
(2019-05-24)
proceedings.mlr.pressPDF
Prediction of Disease Progression in Multiple Sclerosis Patients using Deep Learning Analysis of MRI Data
Adrian Tousignant, Paul Lemaître, Doina Precup, Douglas L. Arnold and Tal Arbel
International Conference on Medical Imaging with Deep Learning
(2019-05-24)
proceedings.mlr.pressPDF
Recurrent Value Functions.
Pierre Thodoroff, Nishanth Anand, Lucas Caccia, Doina Precup and Joelle Pineau
arXiv preprint arXiv:1905.09562
(2019-05-23)
ui.adsabs.harvard.eduPDF
Building Knowledge for AI Agents with Reinforcement Learning
AAMAS 2019
(2019-05-08)
dblp.uni-trier.de
Uncertainty Aware Learning from Demonstrations in Multiple Contexts using Bayesian Neural Networks
Sanjay Thakur, Herke van Hoof, Juan Camilo Gamboa Higuera, Doina Precup and David Meger

2019-04

Faster and More Accurate Trace-based Policy Evaluation via Overall Target Error Meta-Optimization
Mingde Zhao, Ian Porada, Sitao Luan, Xiaowen Chang and Doina Precup
(venue unknown)
(2019-04-25)
arxiv.org
META-Learning State-based {\lambda} for More Sample-Efficient Policy Evaluation
Mingde Zhao, Sitao Luan, Ian Porada, Xiao-Wen Chang and Doina Precup
arXiv preprint arXiv:1904.11439
(2019-04-25)
ui.adsabs.harvard.eduPDF
The Termination Critic
Anna Harutyunyan, Will Dabney, Diana Borsa, Nicolas Heess, Remi Munos and Doina Precup

2019-03

Learning proposals for sequential importance samplers using reinforced variational inference.
Zafarali Ahmed, Arjun Karuvally, Doina Precup and Simon Gravel
ICLR 2019
(2019-03-16)
dblp.uni-trier.dePDF

2019-02

The Impact of Time Interval between Extubation and Reintubation on Death or Bronchopulmonary Dysplasia in Extremely Preterm Infants.
Wissam Shalish, Lara Kanbar, Lajos Kovacs, Sanjay Chawla, Martin Keszler, Smita Rao, Bogdan Panaitescu, Alyse Laliberte, Doina Precup, Karen Brown, Robert E. Kearney and Guilherme M. Sant'Anna
The Journal of Pediatrics
(2019-02-01)
www.sciencedirect.com

2019-01

The Option Keyboard: Combining Skills in Reinforcement Learning
André Barreto, Diana Borsa, Shaobo Hou, Gheorghe Comanici, Eser Aygün, Philippe Hamel, Daniel Toyama, Jonathan J. Hunt, Shibl Mourad, David Silver and Doina Precup
NEURIPS 2019
(2019-01-01)
papers.nips.ccPDF
Break the Ceiling: Stronger Multi-scale Deep Graph Convolutional Networks
Sitao Luan, Mingde Zhao, Xiao-Wen Chang and Doina Precup
Community size effect in artificial learning systems.
Olivier Tieleman, Angeliki Lazaridou, Shibl Mourad, Charles Blundell and Doina Precup
ViGIL@NeurIPS
(2019-01-01)
dblp.uni-trier.dePDF
Learning Reliable Policies in the Bandit Setting with Application to Adaptive Clinical Trials.
Hossein Aboutalebi, Doina Precup and Tibor Schuster
IJCAI 2019
(2019-01-01)
dblp.uni-trier.dePDF
Temporally Extended Metrics for Markov Decision Processes.
AAAI 2019
(2019-01-01)
dblp.uni-trier.dePDF
Learning Modular Safe Policies in the Bandit Setting with Application to Adaptive Clinical Trials.
Hossein Aboutalebi, Doina Precup and Tibor Schuster

2018-12

Clustering-Oriented Representation Learning with Attractive-Repulsive Loss.
Kian Kenyon-Dean, Andre Cianflone, Lucas Page-Caccia, Guillaume Rabusseau, Jackie Chi Kit Cheung and Doina Precup
arXiv preprint arXiv:1812.07627
(2018-12-18)
ui.adsabs.harvard.eduPDF
Prediction of Progression in Multiple Sclerosis Patients
Adrian Tousignant, Paul Lemaître, Doina Precup, Douglas Arnold and Tal Arbel
International Conference on Medical Imaging with Deep Learning -- Full Paper Track
(2018-12-13)
openreview.netPDF
Off-Policy Deep Reinforcement Learning without Exploration
Scott Fujimoto, David Meger and Doina Precup

2018-11

Environments for Lifelong Reinforcement Learning.
Khimya Khetarpal, Shagun Sodhani, Sarath Chandar and Doina Precup
arXiv preprint arXiv:1811.10732
(2018-11-26)
dblp.uni-trier.dePDF
The Barbados 2018 List of Open Issues in Continual Learning.
Tom Schaul, Hado van Hasselt, Joseph Modayil, Martha White, Adam White, Pierre-Luc Bacon, Jean Harb, Shibl Mourad, Marc G. Bellemare and Doina Precup
arXiv preprint arXiv:1811.07004
(2018-11-16)
dblp.uni-trier.dePDF
Temporal Regularization in Markov Decision Process
arXiv preprint arXiv:1811.00429
(2018-11-01)
ui.adsabs.harvard.eduPDF

2018-09

Where Off-Policy Deep Reinforcement Learning Fails
Scott Fujimoto, David Meger and Doina Precup
(venue unknown)
(2018-09-27)
openreview.netPDF
Shaping representations through communication
Olivier Tieleman, Angeliki Lazaridou, Shibl Mourad, Charles Blundell and Doina Precup
(venue unknown)
(2018-09-27)
openreview.netPDF

2018-08

A Semi-Markov Chain Approach to Modeling Respiratory Patterns Prior to Extubation in Preterm Infants.
Charles C. Onu, Lara J. Kanbar, Wissam Shalish, Karen A. Brown, Guilherme M. Sant'Anna, Robert E. Kearney and Doina Precup
arXiv preprint arXiv:1808.07989
(2018-08-24)
arxiv.orgPDF
Predicting Extubation Readiness in Extreme Preterm Infants based on Patterns of Breathing
Charles C. Onu, Lara J. Kanbar, Wissam Shalish, Karen A. Brown, Guilherme M. Sant'Anna, Robert E. Kearney and Doina Precup
arXiv preprint arXiv:1808.07991
(2018-08-24)
ui.adsabs.harvard.eduPDF
Undersampling and Bagging of Decision Trees in the Analysis of Cardiorespiratory Behavior for the Prediction of Extubation Readiness in Extremely Preterm Infants.
Lara J. Kanbar, Charles C. Onu, Wissam Shalish, Karen A. Brown, Guilherme M. Sant'Anna, Robert E. Kearney and Doina Precup
arXiv preprint arXiv:1808.07992
(2018-08-24)
arxiv.orgPDF

2018-07

Attend Before you Act: Leveraging human visual attention for continual learning.
Khimya Khetarpal and Doina Precup
arXiv preprint arXiv:1807.09664
(2018-07-25)
dblp.uni-trier.dePDF
Leveraging Observational Learning for Exploration in Bandits
Andrei Lupu, Audrey Durand and Doina Precup
AAMAS 2018
(2018-07-09)
celweb.vuse.vanderbilt.edu
Eligibility Traces for Options
Ayush Jain and Doina Precup
AAMAS 2018
(2018-07-09)
dblp.uni-trier.de
Connecting Weighted Automata and Recurrent Neural Networks through Spectral Learning
Undersampling and Bagging of Decision Trees in the Analysis of Cardiorespiratory Behavior for the Prediction of Extubation Readiness in Extremely Preterm Infants
Lara J. Kanbar, Charles C. Onu, Wissam Shalish, Karen A. Brown, Guilherme M. SantrAnna, Doina Precup and Robert E. Kearney
EMBC 2018
(2018-07-01)
www.ncbi.nlm.nih.govPDF

2018-06

Diffusion-Based Approximate Value Functions
Martin Klissarov and Doina Precup
(venue unknown)
(2018-06-15)
openreview.netPDF
Resolving Event Coreference with Supervised Representation Learning and Clustering-Oriented Regularization.
Kian Kenyon-Dean, Jackie Chi Kit Cheung and Doina Precup

2018-05

Dyna Planning using a Feature Based Generative Model.
Ryan Faulkner and Doina Precup
arXiv preprint arXiv:1805.10129
(2018-05-23)
dblp.uni-trier.dePDF
Learning Safe Policies with Expert Guidance
Jessie Huang, Fa Wu, Doina Precup and Yang Cai

2018-03

Nonlinear Weighted Finite Automata
AISTATS 2018
(2018-03-31)
proceedings.mlr.pressPDF
Constructing Temporal Abstractions Autonomously in Reinforcement Learning
Ai Magazine
(2018-03-27)
dblp.uni-trier.de

2018-02

Disentangling the independently controllable factors of variation by interacting with the world
Valentin Thomas, Emmanuel Bengio, William Fedus, Jules Pondard, Philippe Beaudoin, Hugo Larochelle, Joelle Pineau, Doina Precup and Yoshua Bengio
arXiv preprint arXiv:1802.09484
(2018-02-26)
ui.adsabs.harvard.eduPDF
Learning with Options that Terminate Off-Policy
Anna Harutyunyan, Peter Vrancx, Pierre-luc Bacon, Doina Precup and Ann Nowe
AAAI 2018
(2018-02-07)
ui.adsabs.harvard.eduPDF
When Waiting is not an Option : Learning Options with a Deliberation Cost
Jean Harb, Pierre-luc Bacon, Martin Klissarov and Doina Precup
AAAI 2018
(2018-02-07)
dblp.uni-trier.dePDF
Learning Predictive State Representations from Non-uniform Sampling
Yuri Grinberg, Hossein Aboutalebi, Melanie Lyman-Abramovitch, Borja Balle and Doina Precup
AAAI 2018
(2018-02-07)
dblp.uni-trier.dePDF
OptionGAN: Learning Joint Reward-Policy Options using Generative Adversarial Inverse Reinforcement Learning
Peter Henderson, Wei-Di Chang, Pierre-luc Bacon, David Meger, Joelle Pineau and Doina Precup
AAAI 2018
(2018-02-07)
ui.adsabs.harvard.eduPDF
Deep Reinforcement Learning that Matters
Peter Henderson, Riashat Islam, Joelle Pineau, David Meger, Doina Precup and Philip Bachman
AAAI 2018
(2018-02-07)
ui.adsabs.harvard.eduPDF

2018-01

Patterns of reintubation in extremely preterm infants: a longitudinal cohort study
Wissam Shalish, Lara Kanbar, Martin Keszler, Sanjay Chawla, Lajos Kovacs, Smita Rao, Bogdan A Panaitescu, Alyse Laliberte, Doina Precup, Karen Brown, Robert E Kearney and Guilherme M Sant'Anna
Pediatric Research
(2018-01-31)
www.nature.com
Imitation Upper Confidence Bound for Bandits on a Graph.
Andrei Lupu and Doina Precup
AAAI 2018
(2018-01-01)
dblp.uni-trier.de
Learning Robust Options.
Daniel J. Mankowitz, Timothy A. Mann, Pierre-Luc Bacon, Doina Precup and Shie Mannor
Temporal Regularization for Markov Decision Process
NEURIPS 2018
(2018-01-01)
papers.nips.ccPDF

Publications collected and formatted using Paperoni

array(1) { ["wp-wpml_current_language"]=> string(2) "fr" }