Publications

Implicit Regularization in Deep Learning: A View from Function Space

Aristide Baratin

Thomas George

César Laurent

2020-08-03

ArXiv (preprint)

arxiv.org

BDD-based optimization for the quadratic stable set problem

Jaime E. González

André Augusto Cire

Andrea Lodi

Louis-Martin Rousseau

2020-08-01

Discrete Optimization (published)

doi.org

Optimal Local and Remote Controllers With Unreliable Uplink Channels: An Elementary Proof

Mohammad Afshari

Aditya Mahajan

Recently, a model of a decentralized control system with local and remote controllers connected over unreliable channels was presented in [1]. The model has a nonclassical information structure that is not partially nested. Nonetheless, it is shown in [1] that the optimal control strategies are linear functions of the state estimate (which is a nonlinear function of the observations). Their proof is based on a fairly sophisticated dynamic programming argument. In this article, we present an alternative and elementary proof of the result which uses common information-based conditional independence and completion of squares.

2020-08-01

IEEE Transactions on Automatic Control (published)

doi.org

arxiv.org

Precision, Equity, and Public Health and Epidemiology Informatics – A Scoping Review

David Buckeridge

2020-08-01

Yearbook of Medical Informatics (published)

doi.org

Renewal Monte Carlo: Renewal Theory-Based Reinforcement Learning

Jayakumar Subramanian

Aditya Mahajan

An online reinforcement learning algorithm called renewal Monte Carlo (RMC) is presented. RMC works for infinite horizon Markov decision processes with a designated start state. RMC is a Monte Carlo algorithm that retains the key advantages of Monte Carlo (viz., simplicity, ease of implementation, and low bias) while circumventing the main drawbacks of Monte Carlo (viz., high variance and delayed updates). Given a parameterized policy

2020-08-01

IEEE Transactions on Automatic Control (published)

doi.org

Neuronal activity remodels the F-actin based submembrane lattice in dendrites but not axons of hippocampal neurons

Flavie Lavoie-Cardinal

Anthony Bilodeau

Mado Lemieux

Marc-André Gardner

Theresa Wiesner

Gabrielle Laramée

Christian Gagné

Paul De Koninck

2020-07-20

Scientific Reports (published)

doi.org

Survey on Applications of Multi-Armed and Contextual Bandits

Djallel Bouneffouf

Irina Rish

Charu Aggarwal

In recent years, the multi-armed bandit (MAB) framework has attracted a lot of attention in various applications, from recommender systems and information retrieval to healthcare and finance. This success is due to its stellar performance combined with attractive properties, such as learning from less feedback. The multi-armed bandit field is currently experiencing a renaissance, as novel problem settings and algorithms motivated by various practical applications are being introduced, building on top of the classical bandit problem. This article aims to provide a comprehensive review of top recent developments in multiple real-life applications of the multi-armed bandit. Specifically, we introduce a taxonomy of common MAB-based applications and summarize the state-of-the-art for each of those domains. Furthermore, we identify important current trends and provide new perspectives pertaining to the future of this burgeoning field.

2020-07-19

2020 IEEE Congress on Evolutionary Computation (CEC) (published)

doi.org

Extendable and invertible manifold learning with geometry regularized autoencoders

Andres F. Duque Correa

Sacha Morin

Guy Wolf

Kevin R. Moon

A fundamental task in data exploration is to extract simplified low dimensional representations that capture intrinsic geometry in data, especially for faithfully visualizing data in two or three dimensions. Common approaches to this task use kernel methods for manifold learning. However, these methods typically only provide an embedding of fixed input data and cannot extend to new data points. Autoencoders have also recently become popular for representation learning. But while they naturally compute feature extractors that are both extendable to new data and invertible (i.e., reconstructing original features from latent representation), they have limited capabilities to follow global intrinsic geometry compared to kernel-based manifold learning. We present a new method for integrating both approaches by incorporating a geometric regularization term in the bottleneck of the autoencoder. Our regularization, based on the diffusion potential distances from the recently-proposed PHATE visualization method, encourages the learned latent representation to follow intrinsic data geometry, similar to manifold learning algorithms, while still enabling faithful extension to new data and reconstruction of data in the original feature space from latent coordinates. We compare our approach with leading kernel methods and autoencoder models for manifold learning to provide qualitative and quantitative evidence of our advantages in preserving intrinsic structure, out of sample extension, and reconstruction. Our method is easily implemented for big-data applications, whereas other methods are limited in this regard.

2020-07-14

ArXiv (preprint)

doi.org

arxiv.org

Multi-Task Reinforcement Learning as a Hidden-Parameter Block MDP

Amy Zhang

Shagun Sodhani

Khimya Khetarpal

Joelle Pineau

Multi-task reinforcement learning is a rich paradigm where information from previously seen environments can be leveraged for better performance and improved sample-efficiency in new environments. In this work, we leverage ideas of common structure underlying a family of Markov decision processes (MDPs) to improve performance in the few-shot regime. We use assumptions of structure from Hidden-Parameter MDPs and Block MDPs to propose a new framework, HiP-BMDP, and approach for learning a common representation and universal dynamics model. To this end, we provide transfer and generalization bounds based on task and state similarity, along with sample complexity bounds that depend on the aggregate number of samples across tasks, rather than the number of tasks, a significant improvement over prior work. To demonstrate the efficacy of the proposed method, we empirically compare and show improvements against other multi-task and meta-reinforcement learning baselines.

2020-07-14

ArXiv (preprint)

arxiv.org

Chaotic Continual Learning

Touraj Laleh

Mojtaba Faramarzi

Irina Rish

Sarath Chandar Anbil Parthipan

Training a deep neural network requires the model to go over training data for several epochs and update network parameters. In continual learning, this process results in catastrophic forgetting, which is one of the core issues of this domain. Most proposed approaches for this issue try to compensate for the effects of parameter updates in the batch incremental setup, in which the training model visits a lot of samples for several epochs. However, it is not realistic to expect that training data will always be fed to the model in a batch incremental setup. This paper proposes a chaotic stream learner that mimics the chaotic behavior of biological neurons and does not update network parameters. In addition, it can work with fewer samples compared to deep learning models in a stream learning setup. Our experiments on MNIST, CIFAR10, and Omniglot show that the chaotic stream learner has less catastrophic forgetting by its nature in comparison to a CNN model in continual learning.

2020-07-13

ICML.cc/2020/Workshop/LifelongML (unknown)

openreview.net

Historical Issue Data of Projects on Jira

A. Nicholson

Deeksha M. Arya

Jin Guo

2020-07-13

(published)

doi.org
