Publications

Scalable Option Learning in High-Throughput Environments

Mikael Henaff

Michael Matthews

Michael G. Rabbat

Hierarchical reinforcement learning (RL) has the potential to enable effective decision-making over long timescales. Existing approaches, wh… (see more)ile promising, have yet to realize the benefits of large-scale training. In this work, we identify and solve several key challenges in scaling hierarchical RL to high-throughput environments. We propose Scalable Option Learning (SOL), a highly scalable hierarchical RL algorithm which achieves a 25x higher throughput compared to existing hierarchical methods. We train our hierarchical agents using 20 billion frames of experience on the complex game of NetHack, significantly surpassing flat agents and demonstrating positive scaling trends. We also validate our algorithm on MiniHack and Mujoco environments, showcasing its general applicability. Our code is open sourced at github.com/facebookresearch/sol.

2025-08-29

ArXiv (preprint)

doi.org

arxiv.org

A Transparent and Generalizable Deep Learning Framework for Genomic Ancestry Prediction

Camille Rochefort-Boulanger

Matthew Scicluna

Raphaël Poujol

Jean-Christophe Grenier

Pierre Luc Carrier

Sébastien Lemieux

Julie G Hussin

1 Accurately capturing genetic ancestry is critical for ensuring reproducibility and fairness in genomic st… (see more)udies and downstream health research. This study aims to address the prediction of ancestry from genetic data using deep learning, with a focus on generalizability across datasets with diverse populations and on explainability to improve model transparency. We adapt the Diet Network, a deep learning architecture proven effective in handling high-dimensional data, to learn population ancestry from single nucleotide polymorphisms (SNPs) data using the populational Thousand Genomes Project dataset. Our results highlight the model’s ability to generalize to diverse populations in the CARTaGENE and Montreal Heart Institute biobanks and that predictions remain robust to high levels of missing SNPs. We show that, despite the lack of North African populations in the training dataset, the model learns latent representations that reflect meaningful population structure for North African individuals in the biobanks. To improve model transparency, we apply Saliency Maps, DeepLift, GradientShap and Integrated Gradients attribution techniques and evaluate their performance in identifying SNPs leveraged by the model. Using DeepLift, we show that model’s predictions are driven by population-specific signals consistent with those identified by traditional population genetics metrics. This work presents a generalizable and interpretable deep learning framework for genetic ancestry inference in large-scale biobanks with genetic data. By enabling more widespread genomic ancestry characterization in these cohorts, this study contributes practical tools for integrating genetic data into downstream biomedical applications, supporting more inclusive and equitable healthcare solutions.

2025-08-29

bioRxiv (preprint)

doi.org

Assessing the exposure of buildings to long-term sea level rise across the Global South

M. Willard-Stepan

N. Gomez

Jeffrey A. Cardille

E. D. Galbraith

E. M. Bennett

2025-08-28

npj Urban Sustainability (published)

doi.org

Distributed Combined Space Partitioning and Network Flow Optimization: an Optimal Transport Approach (Extended Version)

Théo Laurentin

Patrick Coirault

Emmanuel Moulay

Antoine Lesage-Landry

Jerome Le Ny

2025-08-28

ArXiv (preprint)

arxiv.org

Aperiodic and Periodic EEG Component Lifespan Trajectories: Monotonic Decrease versus Growth-then-Decline

Min Li

Ying Wang

Yaqi Chen

Adrien E. E. Dubois

Gangyong Jia

Q. M. Jonathan Wu

Maria L. Bringas-Vega

Guillaume Dumas

Pedro A. Valdés‐Sosa

1.1 Unraveling the lifespan trajectories of human brain development is critical for understanding brain health and … (see more)disease. Recent research demonstrates that electroencephalography signals are composed of periodic and aperiodic components reflecting distinct physiological substrates. This dissociation raises the possibility that they follow different developmental tendencies. Here, we delineate the lifespan trajectories of aperiodic and periodic neural oscillations using a large international cohort (N=1,563, ages 5–95, resting state, eyes closed). We reveal two fundamental developmental patterns: a Monotonic decrease in aperiodic activity and a Growth-and-Decline pattern for periodic activity. Both components have inflections around age 20 and transition to a stable senescent phase around age 40. Spatially, anterior regions mainly exhibit aperiodic activity, while periodic activity concentrate on posterior regions and these patterns remain stable throughout life. Crucially, multimodal analysis shows these trajectories map onto distinct biological substrates. The periodic component’s Growth and Decline trajectory aligns with GABAergic function and myelination. In contrast, the monotonically decreasing trajectory of aperiodic activity mirrors fundamental biomarkers of biological aging, such as DNA methylation and telomere length. Transforming age to a logarithmic scale simplifies these nonlinear trajectories into a linear decreasing and a piecewise concave linear model for aperiodic and periodic components. This form provides a robust and parsimonious framework for quantifying maturation and identifying neurological deviations. We delineate distinct lifespan trajectories of aperiodic and periodic neural activity in a large-scale international cohort (N=1,563, ages 5–95). Aperiodic activity undergoes a Monotonic Decrease with age. In contrast, periodic activity follows a Growth-then-Decline trajectory, peaking in early adulthood. Both trajectories feature a critical transition around age 20 and stabilize into a protracted senescent phase from approximately 40 onward. These neural trajectories map onto distinct biological substrates: periodic activity tracks integrative functions (myelination, GABAergic, and aperiodic decline mirrors fundamental aging processes (DNA methylation). A stable pattern observed throughout the lifespan is the spatial segregation of neural activity, where aperiodic signals are dominant in anterior regions and periodic signals are concentrated in posterior ones. Logarithmically transforming age linearized the developmental trajectories, yielding a monotonic decline for the aperiodic component and a concave piecewise for the periodic one. This process establishes robust linear norms for the personalized assessment of brain dysfunction.

2025-08-26

bioRxiv (preprint)

doi.org

R3Mem: Bridging Memory Retention and Retrieval via Reversible Compression.

Xiaoqiang Wang 0007

Suyuchen Wang

Yun Zhu

Bang Liu

2025-08-26

DBLP.org (unknown)

openreview.net

Predictive Performance Precision Analysis in Medicine: Identification of low-confidence predictions at patient and profile levels (MED3pa I)

Olivier Lefebvre

Félix Camirand Lemyre

Jean-François Ethier

Lyna Hiba Chikouche

Ludmila Amriou

Dan Poenaru

Martin Vallières

Artificial Intelligence models are increasingly used in healthcare, yet global performance metrics can mask variations in reliability across… (see more) individual patients or subgroups with shared attributes, called patient profiles . This study introduces MED3pa, a method that identifies when models are less reliable, allowing clinicians to better assess model limitations. We propose a framework that estimates predictive confidence using three combined approaches: Individualized (IPC), Aggregated (APC), and Mixed Predictive Confidence (MPC). IPC estimates confidence for each patient, APC assesses it across profiles, and MPC combines both. We evaluate our method on four datasets: one simulated, two public, and one private clinical dataset. Metrics by Declaration Rate (MDR) curves show how performance changes when retaining only the most confident predictions, while interpretable decision trees reveal profiles with higher or lower model confidence. We demonstrate our method in internal, temporal, and external validation settings, as well as through a clinical example. In internal validation, limiting predictions to the 93% most confident cases improved sensitivity by 14.3% and the AUC by 5.1%. In the clinical example, MED3pa identified a patient profile with high misclassification risk, demonstrating its potential for safer deployment. By identifying low-confidence predictions, our framework improves model reliability in clinical settings. It can be integrated into decision support systems to help clinicians make more informed decisions. Confidence thresholds help balance model performance with the proportion of patients for whom predictions are considered reliable. Better leveraging confidence in model predictions could improve reliability and trustworthiness, supporting safer and more effective use in healthcare.

2025-08-25

medRxiv (preprint)

doi.org

Source-free Domain Adaptation Requires Penalized Diversity

Laya Rafiee Sevyeri

Ivaxi Sheth

Farhood Farahnak

Alexandre See

Samira Ebrahimi Kahou

Thomas Fevens

Mohammad Havaei

While neural networks are capable of achieving human-like performance in many tasks such as image classification, the impressive performance… (see more) of each model is limited to its own dataset. Source-free domain adaptation (SFDA) was introduced to address knowledge transfer between different domains in the absence of source data, thus, increasing data privacy. Diversity in representation space can be vital to a model`s adaptability in varied and difficult domains. In unsupervised SFDA, the diversity is limited to learning a single hypothesis on the source or learning multiple hypotheses with a shared feature extractor. Motivated by the improved predictive performance of ensembles, we propose a novel unsupervised SFDA algorithm that promotes representational diversity through the use of separate feature extractors with Distinct Backbone Architectures (DBA). Although diversity in feature space is increased, the unconstrained mutual information (MI) maximization may potentially introduce amplification of weak hypotheses. Thus we introduce the Weak Hypothesis Penalization (WHP) regularizer as a mitigation strategy. Our work proposes Penalized Diversity (PD) where the synergy of DBA and WHP is applied to unsupervised source-free domain adaptation for covariate shift. In addition, PD is augmented with a weighted MI maximization objective for label distribution shift. Empirical results on natural, synthetic, and medical domains demonstrate the effectiveness of PD under different distributional shifts.

2025-08-25

Machine Learning (published)

doi.org

arxiv.org

Posttraumatic Growth in Intensive Care Unit Health Care Professionals After COVID-19

Elie Azoulay

Laurent Argaud

Vincent Labbé

Guillaume Dumas

Fabrice Bruneel

Mercé Jourdain

Christophe Guitton

Amelie Seguin

Samir Jaber

David Schnell

Isabelle Vinatier

Fanny Ardisson

Michel Ramakers

Antoine Herault

Olivier Lesieur

Alain Cariou

Antoine Vieillard-Baron

Olivier Guisset

Frédéric Pochard

Michael Darmon … (see 1 more)

Nancy Kentish-Barnes

2025-08-24

JAMA Network Open (published)

doi.org

Uncovering executive function profiles within interindividual variability: A data driven clustering exploration of design fluency in school-aged children

Myriam Sahraoui

Karim Jerbi CoCo Lab

Vanessa Hadid

Bruno Gauthier

2025-08-23

bioRxiv (preprint)

doi.org

Communication Efficient LLM Pre-training with SparseLoCo

Amir M. Sarfi

Benjamin Therien

Joel Lidin

Eugene Belilovsky

2025-08-20

ArXiv (preprint)

doi.org

arxiv.org

Low-dimensional embeddings of high-dimensional data

Cyril de Bodt

Alex Diaz-Papkovich

Michael Bleher

Kerstin Bunte

Corinna Coupette

Sebastian Damrich

Enrique Fita Sanmartin

Fred Hamprecht

EmHoke-'Agnes Horv'at

Dhruv Kohli

Smita Krishnaswamy

John A. Lee 0001

Boudewijn P. F. Lelieveldt

Leland McInnes

Ian T. Nabney

Maximilian Noichl

Pavlin G. Polivcar

Bastian Rieck

Guy Wolf

Gal Mishne … (see 1 more)

Dmitry Kobak

Large collections of high-dimensional data have become nearly ubiquitous across many academic fields and application domains, ranging from b… (see more)iology to the humanities. Since working directly with high-dimensional data poses challenges, the demand for algorithms that create low-dimensional representations, or embeddings, for data visualization, exploration, and analysis is now greater than ever. In recent years, numerous embedding algorithms have been developed, and their usage has become widespread in research and industry. This surge of interest has resulted in a large and fragmented research field that faces technical challenges alongside fundamental debates, and it has left practitioners without clear guidance on how to effectively employ existing methods. Aiming to increase coherence and facilitate future work, in this review we provide a detailed and critical overview of recent developments, derive a list of best practices for creating and using low-dimensional embeddings, evaluate popular approaches on a variety of datasets, and discuss the remaining challenges and open problems in the field.

2025-08-20

ArXiv (preprint)

doi.org

arxiv.org

Mila on Udemy

AI Policy Fellowship Publications

Mila Ventures Launchpad

Publications

Mila on Udemy

AI Policy Fellowship Publications

Mila Ventures Launchpad

Popular keywords:

Publications