Publications

Equivariant Networks for Pixelized Spheres

Pixelizations of Platonic solids such as the cube and icosahedron have been widely used to represent spherical data, from climate records to Cosmic Microwave Background maps. Platonic solids have well-known global symmetries. Once we pixelize each face of the solid, each face also possesses its own local symmetries in the form of Euclidean isometries. One way to combine these symmetries is through a hierarchy. However, this approach does not adequately model the interplay between the two levels of symmetry transformations. We show how to model this interplay using ideas from group theory, identify the equivariant linear maps, and introduce equivariant padding that respects these symmetries. Deep networks that use these maps as their building blocks generalize gauge equivariant CNNs on pixelized spheres. These deep networks achieve state-of-the-art results on semantic segmentation for climate data and omnidirectional image processing. Code is available at https://git.io/JGiZA.

2021-06-30

Proceedings of the 38th International Conference on Machine Learning (published)

doi.org

proceedings.mlr.press

Guest Editorial Explainable AI: Towards Fairness, Accountability, Transparency and Trust in Healthcare

Arash Shaban-Nejad

Martin Michalowski

John S. Brownstein

David L Buckeridge

2021-06-30

IEEE Journal of Biomedical and Health Informatics (published)

doi.org

Locally Persistent Exploration in Continuous Control Tasks with Sparse Rewards

A major challenge in reinforcement learning is the design of exploration strategies, especially for environments with sparse reward structures and continuous state and action spaces. Intuitively, if the reinforcement signal is very scarce, the agent should rely on some form of short-term memory in order to cover its environment efficiently. We propose a new exploration method, based on two intuitions: (1) the choice of the next exploratory action should depend not only on the (Markovian) state of the environment, but also on the agent's trajectory so far, and (2) the agent should utilize a measure of spread in the state space to avoid getting stuck in a small region. Our method leverages concepts often used in statistical physics to provide explanations for the behavior of simplified (polymer) chains in order to generate persistent (locally self-avoiding) trajectories in state space. We discuss the theoretical properties of locally self-avoiding walks and their ability to provide a kind of short-term memory through a decaying temporal correlation within the trajectory. We provide empirical evaluations of our approach in a simulated 2D navigation task, as well as higher-dimensional MuJoCo continuous control locomotion tasks with sparse rewards.

2021-06-30

Proceedings of the 38th International Conference on Machine Learning (published)

doi.org

proceedings.mlr.press

Out-of-Distribution Generalization via Risk Extrapolation

David Krueger

Ethan Caballero

Joern-Henrik Jacobsen

Rémi Le Priol

Distributional shift is one of the major obstacles when transferring machine learning prediction systems from the lab to the real world. To tackle this problem, we assume that variation across training domains is representative of the variation we might encounter at test time, but also that shifts at test time may be more extreme in magnitude. In particular, we show that reducing differences in risk across training domains can reduce a model's sensitivity to a wide range of extreme distributional shifts, including the challenging setting where the input contains both causal and anti-causal elements. We motivate this approach, Risk Extrapolation (REx), as a form of robust optimization over a perturbation set of extrapolated domains (MM-REx), and propose a penalty on the variance of training risks (V-REx) as a simpler variant. We prove that variants of REx can recover the causal mechanisms of the targets, while also providing some robustness to changes in the input distribution ("covariate shift"). By appropriately trading off robustness to causally induced distributional shifts and covariate shift, REx is able to outperform alternative methods such as Invariant Risk Minimization in situations where these types of shift co-occur.
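
As a rough illustration of the "penalty on the variance of training risks" idea, the sketch below combines per-domain risks into a single objective. It is not taken from the paper's code: the function name `v_rex_objective`, the weight `beta`, the use of the summed risk as the base term, and the choice of population variance are all assumptions made for this example.

```python
from statistics import pvariance


def v_rex_objective(domain_risks, beta=1.0):
    """Summed per-domain risk plus beta times the variance of those risks.

    domain_risks: one scalar risk (e.g. mean loss) per training domain.
    beta: hypothetical weight on the variance penalty (an assumption here).
    """
    if not domain_risks:
        raise ValueError("need at least one training domain")
    # Population variance is used for simplicity; this is an assumption.
    return sum(domain_risks) + beta * pvariance(domain_risks)


# Same total risk, but the second model's risk differs across domains,
# so it is penalized more heavily.
equal = v_rex_objective([0.5, 0.5, 0.5], beta=10.0)
unequal = v_rex_objective([0.2, 0.5, 0.8], beta=10.0)
```

With a large `beta`, the objective favors models whose risks are similar across training domains, which is the behavior the abstract describes.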

2021-06-30

Proceedings of the 38th International Conference on Machine Learning (published)

doi.org

proceedings.mlr.press

RNN with Particle Flow for Probabilistic Spatio-temporal Forecasting

Soumyasundar Pal

Liheng Ma

Yingxue Zhang

Mark J. Coates

Spatio-temporal forecasting has numerous applications in analyzing wireless, traffic, and financial networks. Many classical statistical models often fall short in handling the complexity and high non-linearity present in time-series data. Recent advances in deep learning allow for better modelling of spatial and temporal dependencies. While most of these models focus on obtaining accurate point forecasts, they do not characterize the prediction uncertainty. In this work, we consider the time-series data as a random realization from a nonlinear state-space model and target Bayesian inference of the hidden states for probabilistic forecasting. We use particle flow as the tool for approximating the posterior distribution of the states, as it is shown to be highly effective in complex, high-dimensional settings. Thorough experimentation on several real-world time-series datasets demonstrates that our approach provides better characterization of uncertainty while maintaining comparable accuracy to the state-of-the-art point forecasting methods.

2021-06-30

Proceedings of the 38th International Conference on Machine Learning (published)

proceedings.mlr.press

Smart About Meds (SAM): a pilot randomized controlled trial of a mobile application to improve medication adherence following hospital discharge

Bettina Habib

David Buckeridge

Melissa Bustillo

Santiago Nicolas Marquez

Manish Thakur

Thai Tran

Daniala L. Weir

Robyn Tamblyn

The objectives of this pilot study were (1) to assess the feasibility of a larger evaluation of Smart About Meds (SAM), a patient-centered medication management mobile application, and (2) to evaluate SAM’s potential to improve outcomes of interest, including adherence to medication changes made at hospital discharge and the occurrence of adverse events. We conducted a pilot randomized controlled trial among patients discharged from internal medicine units of an academic health center between June 2019 and March 2020. Block randomization was used to randomize patients to intervention (received access to SAM at discharge) or control (received usual care). Patients were followed for 30 days post-discharge, during which app use was recorded. Pharmacy claims data were used to measure adherence to medication changes made at discharge, and physician billing data were used to identify emergency department visits and hospital readmissions during follow-up. Forty-nine patients were eligible for inclusion in the study at hospital discharge (23 intervention, 26 control). In the 30 days post-discharge, 15 (65.2%) intervention patients used the SAM app. During this period, intervention patients adhered to a larger proportion of medication changes (83.7%) than control patients (77.8%), including newly prescribed medications (72.7% vs 61.7%) and dose changes (90.9% vs 81.8%). A smaller proportion of intervention patients (8.7%) were readmitted to hospital during follow-up than control patients (15.4%). The high uptake of SAM among intervention patients supports the feasibility of a larger trial. Results also suggest that SAM has the potential to enhance adherence to medication changes and reduce the risk of downstream adverse events. This hypothesis needs to be tested in a larger trial. Clinicaltrials.gov, registration number NCT04676165.

2021-06-30

JAMIA Open (published)

doi.org

Measures of balance in combinatorial optimization

Philippe Olivier

Andrea Lodi

Gilles Pesant

2021-06-24

4OR (published)

doi.org

Deep learning for AI

Yoshua Bengio

Yann LeCun

Geoffrey Hinton

How can neural networks learn the rich internal representations required for difficult tasks such as recognizing objects or understanding language?

2021-06-20

Communications of the ACM (published)

doi.org

Large-Scale Intrinsic Functional Brain Organization Emerges from Three Canonical Spatiotemporal Patterns

Taylor Bolt

Jason S. Nomi

Danilo Bzdok

Catie Chang

B.T. Yeo

Lucina Q. Uddin

Shella Keilholz

2021-06-19

(published)

doi.org

Digitizing a sustainable future

Lucia A. Reisch

Lucas Joppa

Peter Howson

Artur Gil

Panayiota Alevizou

Nina Michaelidou

Ruby Appiah-Campbell

Tilman Santarius

Susanne Köhler

Massimo Pizzol

Pia-Johanna Schweizer

Dipti Srinivasan

Lynn H. Kaack

Priya L. Donti

David Rolnick

2021-06-17

One Earth (published)

doi.org

The Cost of Untracked Diversity in Brain-Imaging Prediction

Oualid Benkarim

Casey Paquola

Bo-yong Park

Valeria Kebets

Seok-Jun Hong

Reinder Vos de Wael

Shaoshi Zhang

B.T. Thomas Yeo

Michael Eickenberg

Tian Ge

Jean-Baptiste Poline

Boris Bernhardt

Danilo Bzdok

Brain-imaging research enjoys increasing adoption of supervised machine learning for single-subject disease classification. Yet, the success of these algorithms likely depends on population diversity, including demographic differences and other factors that may be outside of primary scientific interest. Here, we capitalize on propensity scores as a composite confound index to quantify diversity due to major sources of population stratification. We delineate the impact of population heterogeneity on the predictive accuracy and pattern stability in two separate clinical cohorts: the Autism Brain Imaging Data Exchange (ABIDE, n=297) and the Healthy Brain Network (HBN, n=551). Across various analysis scenarios, our results uncover the extent to which cross-validated prediction performances are interlocked with diversity. The instability of extracted brain patterns attributable to diversity is located preferentially to the default mode network. Our collective findings highlight the limitations of prevailing deconfounding practices in mitigating the full consequences of population diversity.

2021-06-16

bioRxiv (preprint)

doi.org

Improving Continuous Normalizing Flows using a Multi-Resolution Framework

Vikram Voleti

Chris Finlay

Adam Oberman

Christopher Pal

Recent work has shown that Continuous Normalizing Flows (CNFs) can serve as generative models of images with exact likelihood calculation and invertible generation/density estimation. In this work we introduce a Multi-Resolution variant of such models (MRCNF). We introduce a transformation between resolutions that allows for no change in the log likelihood. We show that this approach yields comparable likelihood values for various image datasets, with improved performance at higher resolutions, with fewer parameters, using only 1 GPU.

2021-06-14

ICML.cc/2021/Workshop/INNF (poster)

openreview.net
