Publications

Importance of Empirical Sample Complexity Analysis for Offline Reinforcement Learning

Samin Yeasar Arnob

Riashat Islam

Doina Precup

2021-12-31

ArXiv (preprint)

arxiv.org

Machine learning application development: practitioners’ insights

Md. Saidur Rahman

Foutse Khomh

Alaleh Hamidi

Jinghui Cheng

Giuliano Antoniol

Hironori Washizaki

2021-12-31

ArXiv (preprint)

doi.org

arxiv.org

Single-Shot Pruning for Offline Reinforcement Learning

Samin Yeasar Arnob

Riyasat Ohib

Sergey Plis

Doina Precup

2021-12-31

ArXiv (preprint)

arxiv.org

Heterogeneous Crowd Simulation Using Parametric Reinforcement Learning

Kaidong Hu

Michael Brandon Haworth

Glen Berseth

Vladimir Pavlovic

Petros Faloutsos

Mubbasir. T. Kapadia

Agent-based synthetic crowd simulation affords the cost-effective large-scale simulation and animation of interacting digital humans. Model-… (see more)based approaches have successfully generated a plethora of simulators with a variety of foundations. However, prior approaches have been based on statically defined models predicated on simplifying assumptions, limited video-based datasets, or homogeneous policies. Recent works have applied reinforcement learning to learn policies for navigation. However, these approaches may learn static homogeneous rules, are typically limited in their generalization to trained scenarios, and limited in their usability in synthetic crowd domains. In this article, we present a multi-agent reinforcement learning-based approach that learns a parametric predictive collision avoidance and steering policy. We show that training over a parameter space produces a flexible model across crowd configurations. That is, our goal-conditioned approach learns a parametric policy that affords heterogeneous synthetic crowds. We propose a model-free approach without centralization of internal agent information, control signals, or agent communication. The model is extensively evaluated. The results show policy generalization across unseen scenarios, agent parameters, and out-of-distribution parameterizations. The learned model has comparable computational performance to traditional methods. Qualitatively the model produces both expected (laminar flow, shuffling, bottleneck) and unexpected (side-stepping) emergent qualitative behaviours, and quantitatively the approach is performant across measures of movement quality.

2021-12-29

IEEE Transactions on Visualization and Computer Graphics (published)

doi.org

Heterogeneous Crowd Simulation Using Parametric Reinforcement Learning

Kaidong Hu

Brandon Haworth

Glen Berseth

Vladimir Pavlovic

Petros Faloutsos

Mubbasir Kapadia

Agent-based synthetic crowd simulation affords the cost-effective large-scale simulation and animation of interacting digital humans. Model-… (see more)based approaches have successfully generated a plethora of simulators with a variety of foundations. However, prior approaches have been based on statically defined models predicated on simplifying assumptions, limited video-based datasets, or homogeneous policies. Recent works have applied reinforcement learning to learn policies for navigation. However, these approaches may learn static homogeneous rules, are typically limited in their generalization to trained scenarios, and limited in their usability in synthetic crowd domains. In this article, we present a multi-agent reinforcement learning-based approach that learns a parametric predictive collision avoidance and steering policy. We show that training over a parameter space produces a flexible model across crowd configurations. That is, our goal-conditioned approach learns a parametric policy that affords heterogeneous synthetic crowds. We propose a model-free approach without centralization of internal agent information, control signals, or agent communication. The model is extensively evaluated. The results show policy generalization across unseen scenarios, agent parameters, and out-of-distribution parameterizations. The learned model has comparable computational performance to traditional methods. Qualitatively the model produces both expected (laminar flow, shuffling, bottleneck) and unexpected (side-stepping) emergent qualitative behaviours, and quantitatively the approach is performant across measures of movement quality.

2021-12-29

IEEE Transactions on Visualization and Computer Graphics (published)

doi.org

Single Allocation Hub Location with Heterogeneous Economies of Scale

Borzou Rostami

Masoud Chitsaz

Okan Arslan

Gilbert Laporte

Andrea Lodi

The economies of scale in hub location is usually modeled by a constant parameter, which captures the benefits companies obtain through cons… (see more)olidation. In their article “Single allocation hub location with heterogeneous economies of scale,” Rostami et al. relax this assumption and consider hub-hub connection costs as piecewise linear functions of the flow amounts. This spoils the triangular inequality property of the distance matrix, making the classical flow-based model invalid and further complicates the problem. The authors tackle the challenge by building a mixed-integer quadratically constrained program and by developing a methodology based on constructing Lagrangian function, linear dual functions, and specialized polynomial-time algorithms to generate enhanced cuts. The developed method offers a new strategy in Benders-type decomposition through relaxing a set of complicating constraints in subproblems when such relaxation is tight. The results confirm the efficacy of the solution methods in solving large-scale problem instances.

2021-12-28

Operational Research (published)

doi.org

Multi-Domain Balanced Sampling Improves Out-of-Distribution Generalization of Chest X-ray Pathology Prediction Models

Enoch Amoatey Tetteh

Joseph D Viviano

Yoshua Bengio

David Scott Krueger

Joseph Paul Cohen

Learning models that generalize under different distribution shifts in medical imaging has been a long-standing research challenge. There ha… (see more)ve been several proposals for efficient and robust visual representation learning among vision research practitioners, especially in the sensitive and critical biomedical domain. In this paper, we propose an idea for out-of-distribution generalization of chest X-ray pathologies that uses a simple balanced batch sampling technique. We observed that balanced sampling between the multiple training datasets improves the performance over baseline models trained without balancing.

2021-12-27

ArXiv (preprint)

arxiv.org

COVID-19 Seroprevalence in Canada Modelling Waning and Boosting COVID-19 Immunity in Canada a Canadian Immunization Research Network Study

David W. Dick

Lauren Childs

Zhilan Feng

Jing Li

Gergely Röst

David Buckeridge

Nick H. Ogden

Jane Heffernan

2021-12-23

Vaccines (published)

doi.org

Fall 2021 Resurgence and COVID-19 Seroprevalence in Canada: Modelling waning and boosting COVID-19 immunity in Canada, A Canadian Immunization Research Network Study

David W. Dick

Lauren Childs

Zhilan Feng

Jing Li

Gergely Röst

David Buckeridge

Nick H. Ogden

Jane Heffernan

2021-12-23

Vaccines (published)

doi.org

Generative Models of Brain Dynamics -- A review

Mahta Ramezanian Panahi

Germán Abrevaya

Jean-Christophe Gagnon-Audet

Vikram Voleti

Irina Rish

Guillaume Dumas

The principled design and discovery of biologically- and physically-informed models of neuronal dynamics has been advancing since the mid-tw… (see more)entieth century. Recent developments in artificial intelligence (AI) have accelerated this progress. This review article gives a high-level overview of the approaches across different scales of organization and levels of abstraction. The studies covered in this paper include fundamental models in computational neuroscience, nonlinear dynamics, data-driven methods, as well as emergent practices. While not all of these models span the intersection of neuroscience, AI, and system dynamics, all of them do or can work in tandem as generative models, which, as we argue, provide superior properties for the analysis of neuroscientific data. We discuss the limitations and unique dynamical traits of brain data and the complementary need for hypothesis- and data-driven modeling. By way of conclusion, we present several hybrid generative models from recent literature in scientific machine learning, which can be efficiently deployed to yield interpretable models of neural dynamics.

2021-12-22

ArXiv (preprint)

arxiv.org

Recovery after stroke: the severely impaired are a distinct group

Anna K. Bonkhoff

Thomas Hope

Danilo Bzdok

Adrian G Guggisberg

Rachel L Hawe

Sean P Dukelow

F. Chollet

D. X. Lin

Christian Grefkes

Howard Bowman

2021-12-22

Journal of Neurology Neurosurgery & Psychiatry (published)

doi.org

The Myelin‐Weighted Connectome in Parkinson's Disease

Tommy Boshkovski

Julien Cohen-Adad

Bratislav Mišić

Isabelle Arnulf

Jean‐Christophe Corvol

Marie Vidailhet

Stéphane Lehéricy

Nikola Stikov

Matteo Mancini

2021-12-22

Movement Disorders (published)

doi.org

AI Advantage

Leveraging AI for a Sustainable Future

Mila AI Policy Fellowship

AI Advantage

Leveraging AI for a Sustainable Future

Publications

AI Advantage

Leveraging AI for a Sustainable Future

Mila AI Policy Fellowship

AI Advantage

Leveraging AI for a Sustainable Future

Popular keywords:

Publications