David Rolnick

Biographie

David Rolnick est professeur adjoint et titulaire d’une chaire en IA Canada-CIFAR à l'École d'informatique de l'Université McGill et membre académique principal de Mila – Institut québécois d’intelligence artificielle. Ses travaux portent sur les applications de l'apprentissage automatique dans la lutte contre le changement climatique. Il est cofondateur et président de Climate Change AI et codirecteur scientifique de Sustainability in the Digital Age. David Rolnick a obtenu un doctorat en mathématiques appliquées du Massachusetts Institute of Technology (MIT). Il a été chercheur postdoctoral en sciences mathématiques à la National Science Foundation (NSF), chercheur diplômé à la NSF et boursier Fulbright. Il a figuré sur la liste des « 35 innovateurs de moins de 35 ans » de la MIT Technology Review en 2021.

Étudiants actuels

Aditya Aditya

Collaborateur·rice de recherche

Collaborateur·rice alumni - McGill

Collaborateur·rice de recherche - Cambridge University

Co-superviseur⋅e :

Postdoctorat - McGill

Michael Bunsen

Collaborateur·rice de recherche - McGill

Shahana Chatterjee

Collaborateur·rice de recherche - N/A

Co-superviseur⋅e :

Yuyan Chen

Doctorat - McGill

Eya Cherif

Collaborateur·rice de recherche - Leipzig University

Othmane Echchabi

Maîtrise recherche - McGill

Collaborateur·rice de recherche

Mohamed Elabbas

Collaborateur·rice de recherche

Jannik Endres

Collaborateur·rice de recherche

Jacopo Ghirri

Visiteur de recherche indépendant - Politecnico di Milano

Paula Harder

Visiteur de recherche indépendant

Collaborateur·rice de recherche - Johannes Kepler University

Christina Isaicu Isaicu

Collaborateur·rice de recherche - University of Amsterdam

Gaurav Iyer

Maîtrise recherche - McGill

Doctorat - McGill

Devin Kwok

Doctorat - McGill

Visiteur de recherche indépendant - Université de Montréal

Pierre-Louis Lemaire

Collaborateur·rice de recherche - Polytechnique Montréal

Superviseur⋅e principal⋅e :

Alex Hernandez-Garcia

Joshi Manoj

Collaborateur·rice de recherche - University of East Anglia

David Mickisch

Collaborateur·rice de recherche

Juan Nathaniel Nathaniel

Collaborateur·rice de recherche - Columbia university

Postdoctorat - McGill

Co-superviseur⋅e :

Lena Podina

Collaborateur·rice de recherche - University of Waterloo

Co-superviseur⋅e :

Venkatesh Ramesh

Collaborateur·rice alumni - UdeM

Marlena Reil

Maîtrise recherche - McGill

Carla Roesch

Collaborateur·rice de recherche - Columbia university

Rasha Saha

Maîtrise recherche - McGill

Luca Marie Schmidt

Collaborateur·rice de recherche - University of Tübingen

Visiteur de recherche indépendant

Ilija Trajković

Collaborateur·rice de recherche - Karlsruhe Institute of Technology

Doctorat - McGill

Collaborateur·rice alumni - UdeM

Superviseur⋅e principal⋅e :

Collaborateur·rice de recherche

Catherine Villeneuve

Doctorat - McGill

Shan Zhao

Collaborateur·rice de recherche - Technical University of Munich

Démocratiser l'accès aux données satellitaires grâce à l'IA

Billets de blogue

diagram illustrating how the AI foundation model for Earth observation, Galileo, works

21 octobre 2025

par

Gabriel Tseng

David Rolnick

Lire l'article

Publications

BATIS: Bayesian Approaches for Targeted Improvement of Species Distribution Models

Catherine Villeneuve

Benjamin Akera

Mélisande Teng

Species distribution models (SDMs), which aim to predict species occurrence based on environmental variables, are widely used to monitor and… (voir plus) respond to biodiversity change. Recent deep learning advances for SDMs have been shown to perform well on complex and heterogeneous datasets, but their effectiveness remains limited by spatial biases in the data. In this paper, we revisit deep SDMs from a Bayesian perspective and introduce BATIS, a novel and practical framework wherein prior predictions are updated iteratively using limited observational data. Models must appropriately capture both aleatoric and epistemic uncertainty to effectively combine fine-grained local insights with broader ecological patterns. We benchmark an extensive set of uncertainty quantification approaches on a novel dataset including citizen science observations from the eBird platform. Our empirical study shows how Bayesian deep learning approaches can greatly improve the reliability of SDMs in data-scarce locations, which can contribute to ecological understanding and conservation efforts.

2026-03-13

AAAI Conference on Artificial Intelligence (publié)

Benchmarking the geographic generalization of deep learning models for precipitation downscaling

Paula Harder

Luca Schmidt

Francis Pelletier

Nicole Ludwig

Matthew Chantry

Christian Lessig

Alex Hernandez-Garcia

Earth System Models (ESM) are our main tool for projecting the impacts of climate change. However, running these models at sufficient resolu… (voir plus)tion for local-scale risk-assessments is not computationally feasible. Deep learning-based super-resolution models offer a promising solution to downscale ESM outputs to higher resolutions by learning from data. Yet, due to regional variations in climatic processes, these models typically require retraining for each geographical area–demanding high-resolution observational data, which is unevenly available across the globe. This highlights the need to assess how well these models generalize across geographic regions. To address this, we introduce RainShift, a dataset and benchmark for evaluating downscaling under geographic distribution shifts. We evaluate state-of-the-art downscaling approaches including GANs and diffusion models in generalizing across data gaps between the Global North and Global South. Our findings reveal substantial performance drops in out-of-distribution regions, depending on model and geographic area. While expanding the training domain generally improves generalization, it is insufficient to overcome shifts between geographically distinct regions. We show that addressing these shifts through, for example, domain adaptation can improve spatial generalization. Our work advances the global applicability of downscaling methods and represents a step toward reducing inequities in access to high-resolution climate information.

2026-01-26

Scientific Reports (publié)

In-Context Reinforcement Learning through Bayesian Fusion of Context and Value Prior

Anaïs Berkes

In-context reinforcement learning (ICRL) promises fast adaptation to unseen environments without parameter updates, but current methods eith… (voir plus)er cannot improve beyond the training distribution or require near-optimal data, limiting practical adoption. We introduce SPICE, a Bayesian ICRL method that learns a prior over Q-values via deep ensemble and updates this prior at test-time using in-context information through Bayesian updates. To recover from poor priors resulting from training on sub-optimal data, our online inference follows an Upper-Confidence Bound rule that favours exploration and adaptation. We prove that SPICE achieves regret-optimal behaviour in both stochastic bandits and finite-horizon MDPs, even when pretrained only on suboptimal trajectories. We validate these findings empirically across bandit and control benchmarks. SPICE achieves near-optimal decisions on unseen tasks, substantially reduces regret compared to prior ICRL and meta-RL approaches while rapidly adapting to unseen tasks and remaining robust under distribution shift.

2025-12-31

arXiv (prépublication)

Adsorption energies are necessary but not sufficient to identify good catalysts

Alexander Davis

Alexandre AGM Duval

Oleksandr Voznyy

Alex Hern'andez-Garcia

Félix Therrien

2025-12-04

ArXiv (prépublication)

Deploying Geospatial Foundation Models in the Real World: Lessons from WorldCereal

Christina Butsko

Gabriel Tseng

Kristof Van Tricht

Giorgia Milli

Ruben Cartuyvels

Inbal Becker Reshef

Zoltan Szantoi

Hannah Kerner

The increasing availability of geospatial foundation models has the potential to transform remote sensing applications such as land cover cl… (voir plus)assification, environmental monitoring, and change detection. Despite promising benchmark results, the deployment of these models in operational settings is challenging and rare. Standardized evaluation tasks often fail to capture real-world complexities relevant for end-user adoption such as data heterogeneity, resource constraints, and application-specific requirements. This paper presents a structured approach to integrate geospatial foundation models into operational mapping systems. Our protocol has three key steps: defining application requirements, adapting the model to domain-specific data and conducting rigorous empirical testing. Using the Presto model in a case study for crop mapping, we demonstrate that fine-tuning a pre-trained model significantly improves performance over conventional supervised methods. Our results highlight the model’s strong spatial and temporal generalization capabilities. Our protocol provides a replicable blueprint for practitioners and lays the groundwork for future research to operationalize foundation models in diverse remote sensing applications. Application of the protocol to the WorldCereal global crop-mapping system showcases the framework’s scalability.

2025-12-01

Proceedings of The TerraBytes {ICML} Workshop: Towards global datasets and models for Earth Observation (publié)

proceedings.mlr.press

On Global Applicability and Location Transferability of Generative Deep Learning Models for Precipitation Downscaling

Paula Harder

Christian Lessig

Matthew Chantry

Francis Pelletier

2025-11-30

ArXiv (prépublication)

A HOT Dataset: 150,000 Buildings for HVAC Operations Transfer Research

Anaïs Berkes

Donna Vakalis

2025-11-10

Proceedings of the 12th ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation (publié)

A HOT Dataset: 150,000 Buildings for HVAC Operations Transfer Research

Anaïs Berkes

Donna Vakalis

About 12% of global energy consumption is attributable to heating, ventilation, and air conditioning (HVAC) systems in buildings [11]. Machi… (voir plus)ne learning-based intelligent HVAC control offers significant energy efficiency potential, but progress is constrained by limited data for training and evaluating performance across different kinds of buildings. Existing datasets primarily target energy prediction rather than control applications, forcing studies to rely on limited building sets or single-variable perturbations that fail to capture real-world complexity. We present HOT (HVAC Operations Transfer), the first large-scale open-source dataset purpose-built for research into transfer learning in building control. HOT contains 159,744 unique building-weather combinations with systematic variations across envelope properties, occupancy patterns, and climate conditions spanning all 19 ASHRAE climate zones across 76 global locations. We formalise a comprehensive similarity-based framework with quantitative metrics for assessing transfer feasibility between source and target buildings across multiple context dimensions. Our key contributions: (1) a large-scale, open dataset and tooling enabling systematic, multi-variable transfer studies across 19 climate zones; (2) a quantitative similarity framework spanning geometry, thermal, climate, and function; and (3) zero-shot climate transfer experiments showing why realistic context variation matters for HVAC control.

2025-11-10

Proceedings of the 12th ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation (publié)

Alberta Wells Dataset: Pinpointing Oil and Gas Wells from Satellite Imagery

Pratinav Seth

Michelle Lin

Brefo Dwamena Yaw

Jade Boutot

Mary Kang

Millions of abandoned oil and gas wells are scattered across the world, leaching methane into the atmosphere and toxic compounds into the gr… (voir plus)oundwater. Many of these locations are unknown, preventing the wells from being plugged and their polluting effects averted. Remote sensing is a relatively unexplored tool for pinpointing abandoned wells at scale. We introduce the first large-scale benchmark dataset for this problem, leveraging medium-resolution multi-spectral satellite imagery from Planet Labs. Our curated dataset comprises over 213,000 wells (abandoned, suspended, and active) from Alberta, a region with especially high well density, sourced from the Alberta Energy Regulator and verified by domain experts. We evaluate baseline algorithms for well detection and segmentation, showing the promise of computer vision approaches but also significant room for improvement.

2025-10-05

Proceedings of the 42nd International Conference on Machine Learning (publié)

proceedings.mlr.press

HVAC-SPICE: Value-Uncertainty In-Context RL with Thompson Sampling for Zero-Shot HVAC Control

Anaïs Berkes

Urban buildings consume 40\% of global energy, yet most rely on inefficient rule-based HVAC systems due to the impracticality of deploying a… (voir plus)dvanced controllers across diverse building stock. In-context reinforcement learning (ICRL) offers promise for rapid deployment without per-building training, but standard supervised learning objectives that maximise likelihood of training actions inherit behaviour-policy bias and provide weak exploration under the distribution shifts common when transferring across buildings and climates. We present SPICE (Sampling Policies In-Context with Ensemble uncertainty), a novel ICRL method specifically designed for zero-shot building control that addresses these fundamental limitations. SPICE introduces two key methodological innovations: (i) a propensity-corrected, return-aware training objective that prioritises high-advantage, high-uncertainty actions to enable improvement beyond suboptimal training demonstrations, and (ii) lightweight value ensembles with randomised priors that provide explicit uncertainty estimates for principled episode-level Thompson sampling. At deployment, SPICE samples one value head per episode and acts greedily, resulting in temporally coherent exploration without test-time gradients or building-specific models. We establish a comprehensive experimental protocol using the HOT dataset to evaluate SPICE across diverse building types and climate zones, focusing on the energy efficiency, occupant comfort, and zero-shot transfer capabilities that are critical for urban-scale deployment.

2025-09-29

NeurIPS.cc/2025/Workshop/UrbanAI (poster)

openreview.net

Graph Dreamer: Temporal Graph World Models for Sample-Efficient and Generalisable Reinforcement Learning

Anaïs Berkes

Donna Vakalis

2025-09-21

NeurIPS.cc/2025/Workshop/WiML (publié)

openreview.net

Catalyst GFlowNet for electrocatalyst design: A hydrogen evolution reaction case study

Alexandre AGM Duval

Alex Hernandez-Garcia

Félix Therrien

Efficient and inexpensive energy storage is essential for accelerating the adoption of renewable energy and ensuring a stable supply, despit… (voir plus)e fluctuations in sources such as wind and solar. Electrocatalysts play a key role in hydrogen energy storage (HES), allowing the energy to be stored as hydrogen. However, the development of affordable and high-performance catalysts for this process remains a significant challenge. We introduce Catalyst GFlowNet, a generative model that leverages machine learning-based predictors of formation and adsorption energy to design crystal surfaces that act as efficient catalysts. We demonstrate the performance of the model through a proof-of-concept application to the hydrogen evolution reaction, a key reaction in HES, for which we successfully identified platinum as the most efficient known catalyst. In future work, we aim to extend this approach to the oxygen evolution reaction, where current optimal catalysts are expensive metal oxides, and open the search space to discover new materials. This generative modeling framework offers a promising pathway for accelerating the search for novel and efficient catalysts.

2025-09-19

NeurIPS.cc/2025/Workshop/AI4Mat (poster)