Publications

GraIP: A Benchmarking Framework For Neural Graph Inverse Problems

Semih Cantürk

Andrei Manolache

Arman Mielke

Chendi Qian

Antoine Siraudin

Christopher Morris

Mathias Niepert

Guy Wolf

A wide range of graph learning tasks, such as structure discovery, temporal graph analysis, and combinatorial optimization, focus on inferring graph structures from data, rather than making predictions on given graphs. However, the respective methods to solve such problems are often developed in an isolated, task-specific manner and thus lack a unifying theoretical foundation. Here, we provide a stepping stone towards the formation of such a foundation and further development by introducing the Neural Graph Inverse Problem (GraIP) conceptual framework, which formalizes and reframes a broad class of graph learning tasks as inverse problems. Unlike discriminative approaches that directly predict target variables from given graph inputs, the GraIP paradigm addresses inverse problems, i.e., it relies on observational data and aims to recover the underlying graph structure by reversing the forward process, such as message passing or network dynamics, that produced the observed outputs. We demonstrate the versatility of GraIP across various graph learning tasks, including rewiring, causal discovery, and neural relational inference. We also propose benchmark datasets and metrics for each GraIP domain considered, and characterize and empirically evaluate existing baseline methods used to solve them. Overall, our unifying perspective bridges seemingly disparate applications and provides a principled approach to structural learning in constrained and combinatorial settings while encouraging cross-pollination of existing methods across graph inverse problems.

2026-01-25

arXiv (preprint)

arxiv.org

In silico Neutron Relative Biological Effectiveness Estimations For Pre-DNA Repair And Post-DNA Repair Endpoints

Nicolas Desjardins

J. Kildea

2026-01-22

Physics in Medicine & Biology (published)

doi.org

Foundation models for electrocardiogram interpretation: clinical implications

Alexis Nolin-Lapalme

Achille Sowa

Jacques Delfrate

Olivier Tastet

Denis Corbin

Merve Kulbay

Derman Ozdemir

Marie-Jeanne Noël

François-Christophe Marois-Blanchet

François Harvey

Surbhi Sharma

Minhaj Ansari

I-Min Chiu

Valentina D'souza

Sam F. Friedman

Michael Chassé

Brian J. Potter

Jonathan Afilalo

Pierre Adil Elias

Gilbert Jabbour

Mourad Bahani

Marie-Pierre Dubé

Patrick M. Boyle

Neal A. Chatterjee

Joshua Barrios

Geoffrey H. Tison

David Ouyang

Mahnaz Maddah

Shaan Khurshid

Julia Cadrin-Tourigny

Rafik Tadros

Julie Hussin

Robert Avram

The 12-lead electrocardiogram (ECG) remains a cornerstone of cardiac diagnostics, yet existing artificial intelligence (AI) solutions for automated interpretation often lack generalizability, remain closed source, and are primarily trained using supervised learning (SL), which requires extensive labelled datasets and may limit adaptability across diverse clinical settings. Self-supervised learning (SSL) can potentially overcome these limitations by learning robust representations from unlabelled data. To address these challenges, this study developed and compared two open-source foundational ECG models: DeepECG-SL, a supervised multilabel ECG model, and DeepECG-SSL, a self-supervised model. Both models were trained on over 1 million ECGs using a standardized preprocessing pipeline and automated free-text extraction from ECG reports to predict 77 cardiac conditions. DeepECG-SSL leveraged unlabelled data through self-supervised contrastive learning and masked lead modelling before fine-tuning for downstream tasks, while DeepECG-SL was trained directly on labelled diagnostic data in an end-to-end fashion. Performance was evaluated across seven private, multilingual healthcare systems and four public ECG repositories, with assessment of fairness by age and sex, and investigation of privacy vulnerabilities as well as memory and compute requirements. DeepECG-SSL achieved micro-averaged area under the receiver operating characteristic curves (AUROCs) across all 77 cardiac conditions for ECG interpretation of 0.990 (95% confidence interval [CI]: 0.990, 0.990) on the internal dataset (MHI-ds), 0.981 (95% CI: 0.981, 0.981) on external public datasets (UKB, CLSA, MIMIC-IV and PTB), and 0.983 (95% CI: 0.983, 0.983) on external private datasets (UW, UCSF, JGH, NYP, MGH, CSH and CHUM), while DeepECG-SL demonstrated AUROCs of 0.992 (95% CI: 0.992, 0.992), 0.980 (95% CI: 0.980, 0.980), and 0.983 (95% CI: 0.983, 0.984), respectively. Fairness analyses revealed minimal disparities (true-positive rate and false-positive rate difference <0.1) across age and sex groups for both models. DeepECG-SSL demonstrated superior performance on limited-data digital biomarker tasks, with the largest improvements in long QT syndrome (LQTS) genotype classification (AUROC 0.931 vs 0.850, P = 0.026, n = 127 ECGs) and 5-year atrial fibrillation risk prediction (AUROC 0.742 vs 0.734, P < 0.001, n = 132 050 ECGs), while achieving superior performance in left ventricular ejection fraction ≤40% classification (AUROC 0.926 vs 0.917, P < 0.001, n = 25 252 ECGs) and comparable performance in LQTS detection (AUROC 0.767 vs 0.735, P = 0.117, n = 934 ECGs). This study establishes SSL as a promising paradigm for ECG analysis, particularly in settings with limited annotated data, enhancing accessibility, generalizability, and fairness in AI-driven cardiac diagnostics. By releasing model weights, preprocessing tools, and validation code, this work aims to support robust, data-efficient AI diagnostics across diverse clinical environments and questions.

2026-01-21

European Heart Journal (published)

doi.org

Monitoring morphometric drift in lifelong learning segmentation of the spinal cord.

Enamundram Naga Karthik

Sandrine Bédard

Jan Valosek

Christoph S. Aigner

Élise Bannier

Josef Bednařík

Virginie Callot

Anna Combes

Armin Curt

Gergely David

Falk Eippert

Lynn Farner

Michael G Fehlings

Patrick Freund

Tobias Granberg

Cristina Granziera

RHSCIR Network Imaging Group

Ulrike Horn

Tomáš Horák

Suzanne Humphreys

Markus Hupp

Anne Kerbrat

Nawal Kinany

Shannon Kolind

Petr Kudlička

Anna Lebret

Lisa Eunyoung Lee

Caterina Mainero

Allan R. Martin

Megan McGrath

Govind Nair

Kristin P. O'Grady

Jiwon Oh

Russell Ouellette

Nikolai Pfender

Dario Pfyffer

Pierre-François Pradat

Alexandre Prat

Emanuele Pravatà

Daniel S. Reich

Ilaria Ricchi

Naama Rotem-Kohavi

Simon Schading-Sassenhausen

Maryam Seif

Andrew Smith

Seth A Smith

Grace Sweeney

Roger Tam

Anthony Traboulsee

Constantina Andrada Treaba

Charidimos Tsagkas

Zachary Vavasour

Dimitri Van De Ville

Kenneth Arnold Weber II

Sarath Chandar

Julien Cohen-Adad

Morphometric measures derived from spinal cord segmentations can serve as diagnostic and prognostic biomarkers in neurological diseases and injuries affecting the spinal cord. For instance, the spinal cord cross-sectional area can be used to monitor cord atrophy in multiple sclerosis and to characterize compression in degenerative cervical myelopathy. While automatic segmentation methods that are robust to a wide variety of contrasts and pathologies have been developed over the past few years, whether their predictions are stable as the model is updated using new datasets has not been assessed. This is particularly important for deriving normative values from healthy participants. In this study, we present a spinal cord segmentation model trained on a multisite (n=75) dataset, including 9 different MRI contrasts and several spinal cord pathologies. We also introduce a lifelong learning framework to automatically monitor the morphometric drift as the model is updated using additional datasets. The framework is triggered by an automatic GitHub Actions workflow every time a new model is created, recording the morphometric values derived from the model's predictions over time. As a real-world application of the proposed framework, we employed the spinal cord segmentation model to update a recently introduced normative database of healthy participants containing commonly used measures of spinal cord morphometry. Results showed that: (i) our model performs well compared to its previous versions and existing pathology-specific models on the lumbar spinal cord, images with severe compression, and in the presence of intramedullary lesions and/or atrophy, achieving an average Dice score of 0.95 ± 0.03; (ii) the automatic workflow for monitoring morphometric drift provides a quick feedback loop for developing future segmentation models; and (iii) the scaling factor required to update the database of morphometric measures is nearly constant among slices across the given vertebral levels, showing minimum drift between the current and previous versions of the model monitored by the framework. The model is freely available in Spinal Cord Toolbox v7.0.

2026-01-21

Imaging Neuroscience (published)

doi.org

openreview.net

Analog-to-Stochastic Converter Using Magnetic Tunnel Junction Devices for Vision Chips

Naoya Onizawa

Daisaku Katagiri

Warren J. Gross

Takahiro Hanyu

This paper introduces an analog-to-stochastic converter using a magnetic tunnel junction (MTJ) device for vision chips based on stochastic computation. Stochastic computation has been recently exploited for area-efficient hardware implementation, such as low-density parity-check (LDPC) decoders and image processors. However, power- and area-hungry two-step (analog-to-digital and digital-to-stochastic) converters are required for the analog-to-stochastic signal conversion. To realize a one-step conversion, an MTJ device is used as it inherently exhibits a probabilistic switching behavior between two resistance states. Exploiting the device-based probabilistic behavior, analog signals can be directly and area-efficiently converted to stochastic signals to mitigate the signal-conversion overhead. The analog-to-stochastic signal conversion is theoretically described and the conversion characteristic is evaluated using device and circuit parameters. In addition, the resistance variability of the MTJ device is considered in order to compensate for the variability effect on the signal conversion. Based on the theoretical analysis, the analog-to-stochastic converter is designed in 90nm CMOS and 100nm MTJ technologies and is verified using a SPICE simulator (NS-SPICE) that handles both transistors and MTJ devices.

2026-01-20

arXiv (preprint)

doi.org

arxiv.org

Divergent creativity in humans and large language models

Antoine Bellemare-Pepin

François Lespinasse

Philipp Thölke

Yann Harel

Kory Mathewson

Jay A. Olson

Yoshua Bengio

Karim Jerbi

Psychology Department

U. Montréal

Montreal

Qc

Canada

Music department

C. University

Sociology

Anthropology department

Mila

Department of Psychology

University of Toronto Mississauga

Mississauga

On

Department of Computer Science

Operations Research

Unique Center

The recent surge of Large Language Models (LLMs) has led to claims that they are approaching a level of creativity akin to human capabilities. This idea has sparked a blend of excitement and apprehension. However, a critical piece that has been missing in this discourse is a systematic evaluation of LLMs’ semantic diversity, particularly in comparison to human divergent thinking. To bridge this gap, we leverage recent advances in computational creativity to analyze semantic divergence in both state-of-the-art LLMs and a substantial dataset of 100,000 humans. These divergence-based measures index associative thinking (the ability to access and combine remote concepts in semantic space), an established facet of creative cognition. We benchmark performance on the Divergent Association Task (DAT) and across multiple creative-writing tasks (haiku, story synopses, and flash fiction), using identical, objective scoring. We found evidence that LLMs can surpass average human performance on the DAT, and approach human creative writing abilities, yet they remain below the mean creativity scores observed among the more creative segment of human participants. Notably, even the top-performing LLMs are still largely surpassed by the aggregated top half of human participants, underscoring a ceiling that current LLMs still fail to surpass. We also systematically varied linguistic strategy prompts and temperature, observing reliable gains in semantic divergence for several models. Our human-machine benchmarking framework addresses the polemic surrounding the imminent replacement of human creative labor by AI, disentangling the quality of the respective creative linguistic outputs using established objective measures. While prompting deeper exploration of the distinctive elements of human inventive thought compared to those of AI systems, we lay out a series of techniques to improve their outputs with respect to semantic diversity, such as prompt design and hyper-parameter tuning.

2026-01-20

Scientific Reports (published)

doi.org

arxiv.org

Spatial analysis of healthcare services availability and demand for people aged 65 and over in Québec

Juliette Duc

Nevena Veljanovic

Sébastien Barbat-Artigas

David L. Buckeridge

Delphine Bosson-Rieutort

As people age, their healthcare needs increase and become more complex, requiring a corresponding increase in healthcare and services use. Moreover, heterogeneity of healthcare needs and availability can be observed among the health regions within Canadian provinces, especially between rural and urban regions. The province of Québec has received limited attention in this regard. This study aims to describe and compare healthcare services location and aging population healthcare demand across Québec. We used data from Données Québec to describe the distribution of available healthcare (such as facilities, their services and capacity) and potential demand of services (represented by the location of the aged population) and mapped their relationship based on urbanization level. Analyses were performed using QGIS and R software. We found a substantial variability of the population aged 65 and over, the number of facilities, the number and type of services, and long-term care (LTC) beds between regions in Québec. The number of LTC beds was significantly correlated with the number of people aged 65 and over (R² = 0.88, p 0.001), but not with their proportion. LTC accommodation is a service offered mostly in urban areas, especially in the Montréal region.

2026-01-20

Research in Health Services & Regions (published)

doi.org

Diffusion Large Language Models for Black-Box Optimization

Ye Yuan

Can Chen

Zipeng Sun

Dinghuai Zhang

Christopher Pal

Xue Liu

Offline black-box optimization (BBO) aims to find optimal designs based solely on an offline dataset of designs and their labels. Such scenarios frequently arise in domains like DNA sequence design and robotics, where only a few labeled data points are available. Traditional methods typically rely on task-specific proxy or generative models, overlooking the in-context learning capabilities of pre-trained large language models (LLMs). Recent efforts have adapted autoregressive LLMs to BBO by framing task descriptions and offline datasets as natural language prompts, enabling direct design generation. However, these designs often contain bidirectional dependencies, which left-to-right models struggle to capture. In this paper, we explore diffusion LLMs for BBO, leveraging their bidirectional modeling and iterative refinement capabilities. This motivates our in-context denoising module: we condition the diffusion LLM on the task description and the offline dataset, both formatted in natural language, and prompt it to denoise masked designs into improved candidates. To guide the generation toward high-performing designs, we introduce masked diffusion tree search, which casts the denoising process as a step-wise Monte Carlo Tree Search that dynamically balances exploration and exploitation. Each node represents a partially masked design, each denoising step is an action, and candidates are evaluated via expected improvement under a Gaussian Process trained on the offline dataset. Our method, dLLM, achieves state-of-the-art results in few-shot settings on design-bench.

2026-01-19

arXiv (preprint)

doi.org

arxiv.org

Enhancing link prediction in biomedical knowledge graphs with BioPathNet

Emy Yue Hu

Svitlana Oleshko

Samuele Firmani

Hui Cheng

Zhaocheng Zhu

Maria Ulmer

Matthias Arnold

Maria Colomé-Tatché

Jian Tang

Sophie Xhonneux

Annalisa Marsico

Understanding complex interactions in biomedical networks is crucial for advancements in biomedicine, but traditional link prediction (LP) methods are limited in capturing this complexity. We present BioPathNet, a graph neural network framework based on the neural Bellman–Ford network (NBFNet), addressing limitations of traditional representation-based learning methods through path-based reasoning for LP in biomedical knowledge graphs. Unlike node-embedding frameworks, BioPathNet learns representations between node pairs by considering all relations along paths, enhancing prediction accuracy and interpretability, and allowing visualization of influential paths and biological validation. BioPathNet leverages a background regulatory graph for enhanced message passing and uses stringent negative sampling to improve precision and scalability. BioPathNet outperforms or matches existing methods across diverse tasks including gene function annotation, drug–disease indication, synthetic lethality and lncRNA–target interaction prediction. Our study identifies promising additional drug indications for diseases such as acute lymphoblastic leukaemia and Alzheimer’s disease, validated by medical experts and clinical trials. In addition, we prioritize putative synthetic lethal gene pairs and regulatory lncRNA–target interactions. BioPathNet’s interpretability will enable researchers to trace prediction paths and gain molecular insights.

2026-01-19

Nature Biomedical Engineering (published)

doi.org

Modeling and Simulation of Neocortical Micro- and Mesocircuitry. Part I: Anatomy

Michael W. Reimann

Sirio Bolaños-Puchet

Jean-Denis Courcol

Daniela Egas Santander

Alexis Arnaudon

Benoît Coste

Fabien Delalondre

Thomas Delemontex

Adrien Devresse

Hugo Dictus

Alexander Dietz

András Ecker

Cyrille Favreau

Gianluca Ficarelli

Michael Gevaert

Juan B. Hernando

Joni Herttuainen

James B. Isbister

Lida Kanari

Daniel Keller

James King

Pramod Kumbhar

Samuel Lapere

Jānis Lazovskis

Huanxiang Lu

Nicolas Ninin

Fernando Pereira

Judit Planas

Christoph Pokorny

Juan Luis Riquelme

Armando Romani

Ying Shi

Jason P. Smith

Vishal Sood

Mohit Srivastava

Werner Van Geit

Liesbeth Vanherpe

Matthias Wolf

Ran Levi

Kathryn Hess

Felix Schürmann

Eilif B. Muller

Henry Markram

Srikanth Ramaswamy

The function of the neocortex is fundamentally determined by its repeating microcircuit motif, but also by its rich, interregional connectivity. We present a data-driven computational model of the anatomy of non-barrel primary somatosensory cortex of juvenile rat, integrating whole-brain scale data while providing cellular and subcellular specificity. The model consists of 4.2 million morphologically detailed neurons, placed in a digital brain atlas. They are connected by 14.2 billion synapses, comprising local, long-range and extrinsic connectivity. We delineated the limits of determining connectivity from anatomy, finding that it reproduces the targeting of PV+ and VIP+ interneurons only with explicitly added specificity, but that of Sst+ neurons even without it. Globally, connectivity was characterized by local clusters tied together through hub neurons in layer 5, demonstrating how local and interregional connectivity are complicit, inseparable networks. A 211,712-neuron subvolume of the model has been made freely and openly available to the community.

2026-01-19

eLife (published)

doi.org

Modeling and Simulation of Neocortical Micro- and Mesocircuitry. Part II: Physiology and Experimentation

James B. Isbister

András Ecker

Christoph Pokorny

Sirio Bolaños-Puchet

Daniela Egas Santander

Alexis Arnaudon

Omar Awile

Natali Barros-Zulaica

Jorge Blanco Alonso

Elvis Boci

Giuseppe Chindemi

Jean-Denis Courcol

Tanguy Damart

Thomas Delemontex

Alexander Dietz

Gianluca Ficarelli

Michael Gevaert

Joni Herttuainen

Genrich Ivaska

Weina Ji

Daniel Keller

James King

Pramod Kumbhar

Samuel Lapere

Polina Litvak

Darshan Mandge

Eilif B. Muller

Fernando Pereira

Judit Planas

Rajnish Ranjan

Maria Reva

Armando Romani

Christian Rössert

Felix Schürmann

Vishal Sood

Aleksandra Teska

Anıl Tuncel

Werner Van Geit

Matthias Wolf

Henry Markram

Srikanth Ramaswamy

Michael W. Reimann

Cortical dynamics underlie many cognitive processes and emerge from complex multi-scale interactions, which can be studied in large-scale, biophysically detailed models. We present a model comprising eight somatosensory cortex subregions, 4.2 million morphologically and electrically detailed neurons, and 13.2 billion local and long-range synapses. In silico tools enabled reproduction and extension of complex laboratory experiments under a single parameterization, providing strong validation. We reproduced millisecond-precise stimulus-responses, stimulus-encoding under targeted optogenetic activation, and selective propagation of stimulus-evoked activity to downstream areas. The model’s direct correspondence with biology generated predictions about how multiscale organisation shapes activity. We predict that structural and functional recurrency increases towards deeper layers and that stronger innervation by long-range connectivity increases local correlated activity. The model also predicts the role of inhibitory interneuron types in stimulus encoding, and of different layers in driving layer 2/3 stimulus responses. Simulation tools and a large subvolume of the model are made available.

2026-01-19

eLife (published)

doi.org
