Publications

Halting Time is Predictable for Large Models: A Universality Property and Average-case Analysis
Average-case analysis computes the complexity of an algorithm averaged over all possible inputs. Compared to worst-case analysis, it is more… (voir plus) representative of the typical behavior of an algorithm, but remains largely unexplored in optimization. One difficulty is that the analysis can depend on the probability distribution of the inputs to the model. However, we show that this is not the case for a class of large-scale problems trained with first-order methods including random least squares and one-hidden layer neural networks with random weights. In fact, the halting time exhibits a universality property: it is independent of the probability distribution. With this barrier for average-case analysis removed, we provide the first explicit average-case convergence rates showing a tighter complexity not captured by traditional worst-case analysis. Finally, numerical simulations suggest this universality property holds for a more general class of algorithms and problems.
Geographic concentration of SARS-CoV-2 cases by social determinants of health in metropolitan areas in Canada: a cross-sectional study
Yiqing Xia
Huiting Ma
Gary Moloney
Héctor A. Velásquez García
Monica Sirski
Naveed Z. Janjua
David Vickers
Tyler Williamson
Alan Katz
Kristy Yiu
Kristy Yiu
Rafal Kustra
David L. Buckeridge
Marc Brisson
Stefan D. Baral
Sharmistha Mishra
Mathieu Maheu-Giroux
The feature of geographical concentration of COVID-19 cases was consistent across CMAs, but the pattern by social determinants varied. Geogr… (voir plus)aphically-prioritized allocation of resources and services should be tailored to the local drivers of inequalities in transmission in response to SARS-CoV-2’s resurgence.
Erratum to: Rapid simultaneous acquisition of macromolecular tissue volume, susceptibility, and relaxometry maps (Magn Reson Med. 2022;87:781‐790.)
Fang Frank Yu
Susie Yi Huang
Ashwin Kumar
Thomas Witzel
Congyu Liao
Tanguy Duval
Berkin Bilgic
The Brain-Computer Metaphor Debate Is Useless: A Matter of Semantics
Blake A. Richards
Timothy P. Lillicrap
It is commonly assumed that usage of the word “computer” in the brain sciences reflects a metaphor. However, there is no single definiti… (voir plus)on of the word “computer” in use. In fact, based on the usage of the word “computer” in computer science, a computer is merely some physical machinery that can in theory compute any computable function. According to this definition the brain is literally a computer; there is no metaphor. But, this deviates from how the word “computer” is used in other academic disciplines. According to the definition used outside of computer science, “computers” are human-made devices that engage in sequential processing of inputs to produce outputs. According to this definition, brains are not computers, and arguably, computers serve as a weak metaphor for brains. Thus, we argue that the recurring brain-computer metaphor debate is actually just a semantic disagreement, because brains are either literally computers or clearly not very much like computers at all, depending on one's definitions. We propose that the best path forward is simply to put the debate to rest, and instead, have researchers be clear about which definition they are using in their work. In some circumstances, one can use the definition from computer science and simply ask, what type of computer is the brain? In other circumstances, it is important to use the other definition, and to clarify the ways in which our brains are radically different from the laptops, smartphones, and servers that surround us in modern life.
Minimizing Entropy to Discover Good Solutions to Recurrent Mixed Integer Programs
Charly Robinson La Rocca
Jean-François Cordeau
Current state-of-the-art solvers for mixed-integer programming (MIP) problems are designed to perform well on a wide range of problems. Howe… (voir plus)ver, for many real-world use cases, problem instances come from a narrow distribution. This has motivated the development of specialized methods that can exploit the information in historical datasets to guide the design of heuristics. Recent works have shown that machine learning (ML) can be integrated with an MIP solver to inject domain knowledge and efficiently close the optimality gap. This hybridization is usually done with deep learning (DL), which requires a large dataset and extensive hyperparameter tuning to perform well. This paper proposes an online heuristic that uses the notion of entropy to efficiently build a model with minimal training data and tuning. We test our method on the locomotive assignment problem (LAP), a recurring real-world problem that is challenging to solve at scale. Experimental results show a speed up of an order of magnitude compared to a general purpose solver (CPLEX) with a relative gap of less than 2%. We also observe that for some instances our method can discover better solutions than CPLEX within the time limit.
RECOVER: sequential model optimization platform for combination drug repurposing identifies novel synergistic compounds in vitro
Thomas Gaudelet
Andrew Anighoro
Torsten Gross
Francisco Martínez-Peña
Eileen L. Tang
S. SurajM
Cristian Regep
Jeremy B.R. Hayter
N. Valiante
Almer M. van der Sloot
Mike Tyers
Charles E.S. Roberts
Michael M. Bronstein
Luke Lee Lairson
Jake P. Taylor-King
Tackling Climate Change with Machine Learning
Priya L. Donti
Lynn H. Kaack
Kelly Kochanski
Alexandre Lacoste
Andrew Slavin Ross
Nikola Milojevic-Dupont
Natasha Jaques
Anna Waldman-Brown
Alexandra Luccioni
Evan D. Sherwin
S. Karthik Mukkavilli
Konrad P. Kording
Carla Gomes
Andrew Y. Ng
Demis Hassabis
John C. Platt
Felix Creutzig … (voir 2 de plus)
Jennifer Chayes
Climate change is one of the greatest challenges facing humanity, and we, as machine learning experts, may wonder how we can help. Here we d… (voir plus)escribe how machine learning can be a powerful tool in reducing greenhouse gas emissions and helping society adapt to a changing climate. From smart grids to disaster management, we identify high impact problems where existing gaps can be filled by machine learning, in collaboration with other fields. Our recommendations encompass exciting research questions as well as promising business opportunities. We call on the machine learning community to join the global effort against climate change.
Exploration with Multi-Sample Target Values for Distributional Reinforcement Learning
Michael Teng
Michiel van de Panne
Frank N. Wood
Distributional reinforcement learning (RL) aims to learn a value-network that predicts the full distribution of the returns for a given stat… (voir plus)e, often modeled via a quantile-based critic. This approach has been successfully integrated into common RL methods for continuous control, giving rise to algorithms such as Distributional Soft Actor-Critic (DSAC). In this paper, we introduce multi-sample target values (MTV) for distributional RL, as a principled replacement for single-sample target value estimation, as commonly employed in current practice. The improved distributional estimates further lend themselves to UCB-based exploration. These two ideas are combined to yield our distributional RL algorithm, E2DC (Extra Exploration with Distributional Critics). We evaluate our approach on a range of continuous control tasks and demonstrate state-of-the-art model-free performance on difficult tasks such as Humanoid control. We provide further insight into the method via visualization and analysis of the learned distributions and their evolution during training.
TIML: Task-Informed Meta-Learning for Agriculture
Labeled datasets for agriculture are extremely spatially imbalanced. When developing algorithms for data-sparse regions, a natural approach … (voir plus)is to use transfer learning from data-rich regions. While standard transfer learning approaches typically leverage only direct inputs and outputs, geospatial imagery and agricultural data are rich in metadata that can inform transfer learning algorithms, such as the spatial coordinates of data-points or the class of task being learned. We build on previous work exploring the use of meta-learning for agricultural contexts in data-sparse regions and introduce task-informed meta-learning (TIML), an augmentation to model-agnostic meta-learning which takes advantage of task-specific metadata. We apply TIML to crop type classification and yield estimation, and find that TIML significantly improves performance compared to a range of benchmarks in both contexts, across a diversity of model architectures. While we focus on tasks from agriculture, TIML could offer benefits to any meta-learning setup with task-specific metadata, such as classification of geo-tagged images and species distribution modelling.
Quantum-Inspired Interpretable AI-Empowered Decision Support System for Detection of Early-Stage Rheumatoid Arthritis in Primary Care Using Scarce Dataset
Samira Abbasgholizadeh Rahimi
Mojtaba Kolahdoozi
Jose L. Salmeron
Amir Mohammad Navali
Alireza Sadeghpour
Seyed Amir Mir Mohammadi
Rheumatoid arthritis (RA) is a chronic inflammatory and long-term autoimmune disease that can lead to joint and bone erosion. This can lead … (voir plus)to patients’ disability if not treated in a timely manner. Early detection of RA in settings such as primary care (as the first contact with patients) can have an important role on the timely treatment of the disease. We aim to develop a web-based Decision Support System (DSS) to provide a proper assistance for primary care providers in early detection of RA patients. Using Sparse Fuzzy Cognitive Maps, as well as quantum-learning algorithm, we developed an online web-based DSS to assist in early detection of RA patients, and subsequently classify the disease severity into six different levels. The development process was completed in collaborating with two specialists in orthopedic as well as rheumatology orthopedic surgery. We used a sample of anonymous patient data for development of our model which was collected from Shohada University Hospital, Tabriz, Iran. We compared the results of our model with other machine learning methods (e.g., linear discriminant analysis, Support Vector Machines, and K-Nearest Neighbors). In addition to outperforming other methods of machine learning in terms of accuracy when all of the clinical features are used (accuracy of 69.23%), our model identified the relation of the different features with each other and gave higher explainability comparing to the other methods. For future works, we suggest applying the proposed model in different contexts and comparing the results, as well as assessing its usefulness in clinical practice.
Active Learning for Capturing Human Decision Policies in a Data Frugal Context
Loïc Grossetête
Alexandre Marois
Bénédicte Chatelais
Daniel Lafond
Evaluation of a prenatal screening decision aid: A mixed methods pilot study.
Titilayo Tatiana Agbadje
S. A. Rahimi
Mélissa Côté
Andrée-Anne Tremblay
Mariama Penda Diallo
Hélène Elidor
Alex Poulin Herron
Codjo Djignefa Djade
France Légaré