Portrait of Danilo Bzdok

Danilo Bzdok

Core Academic Member

bzdokdan@mila.quebec

Canada CIFAR AI Chair

Associate Professor, McGill University, Department of Biomedical Engineering

Research Topics

Computational Biology

Deep Learning

Large Language Models (LLM)

Natural Language Processing

Biography

Danilo Bzdok is a computer scientist and medical doctor by training with a unique dual background in systems neuroscience and machine learning algorithms. After training at RWTH Aachen University (Germany), Université de Lausanne (Switzerland) and Harvard Medical School, Bzdok completed two doctoral degrees, one in neuroscience at Forschungszentrum Jülich in Germany, and another in computer science (machine learning statistics) at INRIA–Saclay and the Neurospin brain imaging centre in Paris.

Danilo is currently an associate professor at McGill University’s Faculty of Medicine and a Canada CIFAR AI Chair at Mila – Quebec Artificial Intelligence Institute. His interdisciplinary research centres around narrowing knowledge gaps in the brain basis of human-defining types of thinking in order to uncover key computational design principles underlying human intelligence.

Current Students

Badr Ait Hammou

Postdoctorate - McGill University

Karan Bali Bali

PhD - McGill University

PhD - McGill University

Yanis Bencheikh

Master's Research - HEC Montréal

Co-supervisor :

Anwesha Bhattacharya

PhD - McGill University

Pedro Carneiro Carneiro

PhD - McGill University

PhD - McGill University

Ryan McPhedrain McPhedrain

Postdoctorate - McGill University

Master's Research - McGill University

Sepehr Radmannia

Independent visiting researcher - McGill University

PhD - McGill University

Chloé Savignac

PhD - McGill University

PhD - McGill University

PhD - McGill University

PhD - McGill University

PhD - McGill University

Blog Posts

A tab representation of the Pre trained CLIP Vision Transformer

October 8, 2025

Why AI Models Hallucinate and How to Fix Them

by

Read the article

March 25, 2025

Using LLMs to better understand autism diagnosis

by

Read the article

Towards Precision Medicine: Understanding Inference and Prediction Divergence in Biomedicine

September 8, 2020

Towards Precision Medicine: Understanding Inference and Prediction Divergence in Biomedicine

by

Read the article

Publications

Steering CLIP's vision transformer with sparse autoencoders

Ethan Goldfarb

Lorenz Hufe

Yossi Gandelsman

Robert Graham

Wojciech Samek

While vision models are highly capable, their internal mechanisms remain poorly understood-- a challenge which sparse autoencoders (SAEs) ha… (see more)ve helped address in language, but which remains underexplored in vision. We address this gap by training SAEs on CLIP's vision transformer and uncover key differences between vision and language processing, including distinct sparsity patterns for SAEs trained across layers and token types. We then provide the first systematic analysis of the steerability of CLIP's vision transformer by introducing metrics to quantify how precisely SAE features can be steered to affect the model's output. We find that 10-15% of neurons and features are steerable, with SAEs providing thousands more steerable features than the base model. Through targeted suppression of SAE features, we then demonstrate improved performance on three vision disentanglement tasks (CelebA, Waterbirds, and typographic attacks), finding optimal disentanglement in middle model layers, and achieving state-of-the-art performance on defense against typographic attacks. We release our CLIP SAE models and code to support future research in vision transformer interpretability.

2025-03-30

thecvf.com/CVPR/2025/Workshop/MIV (poster)

Large language models deconstruct the clinical intuition behind diagnosing autism

Emmett Rabot

Eugene Belilovsky

L. Mottron

2025-03-01

Cell (published)

A hierarchical Bayesian brain parcellation framework for fusion of functional imaging datasets

Da Zhi

Ladan Shahshahani

Caroline Nettekoven

Ana Lúısa Pinho

Jörn Diedrichsen

2025-01-02

Imaging Neuroscience (published)

Estimating Unknown Population Sizes Using the Hypergeometric Distribution

The multivariate hypergeometric distribution describes sampling without replacement from a discrete population of elements divided into mult… (see more)iple categories. Addressing a gap in the literature, we tackle the challenge of estimating discrete distributions when both the total population size and the category sizes are unknown. Here, we propose a novel solution using the hypergeometric likelihood to solve this estimation problem, even in the presence of severe under-sampling. Our approach accounts for a data generating process where the ground-truth is a mixture of distributions conditional on a continuous latent variable, as seen in collaborative filtering, using the variational autoencoder framework. Empirical data simulation demonstrates that our method outperforms other likelihood functions used to model count data, both in terms of accuracy of population size estimate and learning an informative latent space. We showcase our method’s versatility through applications in NLP, by inferring and estimating the complexity of latent vocabularies in reading passage excerpts, and in biology, by accurately recovering the true number of gene transcripts from sparse single-cell genomics data.

2024-07-08

Proceedings of the 41st International Conference on Machine Learning (published)

proceedings.mlr.press

Supervised latent factor modeling isolates cell-type-specific transcriptomic modules that underlie Alzheimer’s disease progression

Yasser Iturria-Medina

Jo Anne Stratton

Smita Krishnaswamy

David A. Bennett

2024-05-17

Communications Biology (published)

Distinctive whole-brain cell types predict tissue damage patterns in thirteen neurodegenerative conditions

Veronika Pak

Quadri Adewale

Mahsa Dadar

Yashar Zeighami

Yasser Iturria-Medina

For over a century, brain research narrative has mainly centered on neuron cells. Accordingly, most neurodegenerative studies focus on neuro… (see more)nal dysfunction and their selective vulnerability, while we lack comprehensive analyses of other major cell types’ contribution. By unifying spatial gene expression, structural MRI, and cell deconvolution, here we describe how the human brain distribution of canonical cell types extensively predicts tissue damage in thirteen neurodegenerative conditions, including early-and late-onset Alzheimer’s disease, Parkinson’s disease, dementia with Lewy bodies, amyotrophic lateral sclerosis, mutations in presenilin-1, and three clinical variants of frontotemporal lobar degeneration (behavioural variant, semantic and non-fluent primary progressive aphasia) along with associated 3-repeat and 4-repeat tauopathies and TDP43 proteinopathies types A and C. We reconstructed comprehensive whole-brain reference maps of cellular abundance for six major cell types and identified characteristic axes of spatial overlapping with atrophy. Our results support the strong mediating role of non-neuronal cells, primarily microglia and astrocytes, in spatial vulnerability to tissue loss in neurodegeneration, with distinct and shared across-disorders pathomechanisms. These observations provide critical insights into the multicellular pathophysiology underlying spatiotemporal advance in neurodegeneration. Notably, they also emphasize the need to exceed the current neuro-centric view of brain diseases, supporting the imperative for cell-specific therapeutic targets in neurodegeneration.

2024-02-23

bioRxiv (preprint)

Data science opportunities of large language models for neuroscience and biomedicine

Andrew Thieme

Oleksiy Levkovskyy

Paul Wren

Thomas Ray

2024-02-01

Neuron (published)

Data science opportunities of large language models for neuroscience and biomedicine

Andrew Thieme

Oleksiy Levkovskyy

Paul Wren

Thomas Ray

2024-02-01

Neuron (published)

Data science opportunities of large language models for neuroscience and biomedicine

Andrew Thieme

Oleksiy Levkovskyy

Paul Wren

Thomas Ray

2024-02-01

Neuron (published)

Data science opportunities of large language models for neuroscience and biomedicine

Andrew Thieme

Oleksiy Levkovskyy

Paul Wren

Thomas Ray

2024-02-01

Neuron (published)

Performance reserves in brain-imaging-based phenotype prediction

Marc-Andre Schulz

Stefan Haufe

John-Dylan Haynes

Kerstin Ritter

Machine learning studies have shown that various phenotypes can be predicted from structural and functional brain images. However, in most s… (see more)uch studies, prediction performance ranged from moderate to disappointing. It is unclear whether prediction performance will substantially improve with larger sample sizes or whether insufficient predictive information in brain images impedes further progress. Here, we systematically assess the effect of sample size on prediction performance using sample sizes far beyond what is possible in common neuroimaging studies. We project 3-9 fold improvements in prediction performance for behavioral and mental health phenotypes when moving from one thousand to one million samples. Moreover, we find that moving from single imaging modalities to multimodal input data can lead to further improvements in prediction performance, often on par with doubling the sample size. Our analyses reveal considerable performance reserves for neuroimaging-based phenotype prediction. Machine learning models may benefit much more from extremely large neuroimaging datasets than currently believed.

2023-12-29

Cell reports (published)

Aberrant functional brain network organization is associated with relapse during 1‐year follow‐up in alcohol‐dependent patients

Justin Böhmer

Pablo Reinhardt

Maria Garbusow

Michael Marxen

Michael N. Smolka

Ulrich S. Zimmermann

Andreas Heinz

Eva Friedel

Johann D. Kruschwitz

Henrik Walter

2023-10-02

Addiction Biology (published)