Publications

Personalized Prediction of Future Lesion Activity and Treatment Effect in Multiple Sclerosis from Baseline MRI

Joshua D. Durso-Finley

Jean-Pierre R. Falet

Douglas Arnold

Precision medicine for chronic diseases such as multiple sclerosis (MS) involves choosing a treatment which best balances efficacy and side … (see more)effects/preferences for individual patients. Making this choice as early as possible is important, as delays in finding an effective therapy can lead to irreversible disability accrual. To this end, we present the first deep neural network model for individualized treatment decisions from baseline magnetic resonance imaging (MRI) (with clinical information if available) for MS patients which (a) predicts future new and enlarging T2 weighted (NE-T2) lesion counts on follow-up MRI on multiple treatments and (b) estimates the conditional average treatment effect (CATE), as defined by the predicted future suppression of NE-T2 lesions, between different treatment options relative to placebo. Our model is validated on a proprietary federated dataset of 1817 multi-sequence MRIs acquired from MS patients during four multi-centre randomized clinical trials. Our framework achieves high average precision in the binarized regression of future NE-T2 lesions on five different treatments, identifies heterogeneous treatment effects, and provides a personalized treatment recommendation that accounts for treatment-associated risk (side effects, patient preference, administration difficulties,...).

2022-12-04

Proceedings of The 5th International Conference on Medical Imaging with Deep Learning (published)

doi.org

openreview.net

Segmentation-Consistent Probabilistic Lesion Counting

Julien Schroeter

Chelsea Myers-Colet

Douglas Arnold

Tal Arbel

Lesion counts are important indicators of disease severity, patient prognosis, and treatment efficacy, yet counting as a task in medical ima… (see more)ging is often overlooked in favor of segmentation. This work introduces a novel continuously differentiable function that maps lesion segmentation predictions to lesion count probability distributions in a consistent manner. The proposed end-to-end approach—which consists of voxel clustering, lesion-level voxel probability aggregation, and Poisson-binomial counting—is non-parametric and thus offers a robust and consistent way to augment lesion segmentation models with post hoc counting capabilities. Experiments on Gadolinium-enhancing lesion counting demonstrate that our method outputs accurate and well-calibrated count distributions that capture meaningful uncertainty information. They also reveal that our model is suitable for multi-task learning of lesion segmentation, is efficient in low data regimes, and is robust to adversarial attacks.

2022-12-04

Proceedings of The 5th International Conference on Medical Imaging with Deep Learning (published)

doi.org

openreview.net

Tackling hypo and hyper sensory processing heterogeneity in autism: From clinical stratification to genetic pathways

Aline Lefebvre

Julian Tillmann

Freddy Cliquet

Frederique Amsellem

Anna Maruani

Claire Leblond

Anita Beggiato

David Germanaud

Anouck Amestoy

Myriam Ly‐Le Moal

Daniel Umbricht

Christopher H. Chatham

Lorraine Murtagh

Manuel Bouvard

Marion Leboyer

Tony Charman

Thomas Bourgeron

Richard Delorme

Guillaume Dumas

2022-12-04

Autism Research (published)

doi.org

CrossSplit: Mitigating Label Noise Memorization through Data Splitting

Jihye Kim

Aristide Baratin

Yan Zhang

Simon Lacoste-Julien

We approach the problem of improving robustness of deep learning algorithms in the presence of label noise. Building upon existing label cor… (see more)rection and co-teaching methods, we propose a novel training procedure to mitigate the memorization of noisy labels, called CrossSplit, which uses a pair of neural networks trained on two disjoint parts of the labelled dataset. CrossSplit combines two main ingredients: (i) Cross-split label correction. The idea is that, since the model trained on one part of the data cannot memorize example-label pairs from the other part, the training labels presented to each network can be smoothly adjusted by using the predictions of its peer network; (ii) Cross-split semi-supervised training. A network trained on one part of the data also uses the unlabeled inputs of the other part. Extensive experiments on CIFAR-10, CIFAR-100, Tiny-ImageNet and mini-WebVision datasets demonstrate that our method can outperform the current state-of-the-art in a wide range of noise ratios.

2022-12-03

ArXiv (preprint)

doi.org

arxiv.org

Performative Prediction in Time Series: A Case Study

Rupali Bhati

Jennifer Jones

David Langelier

Anthony Reiman

Jonathan Greenland

Kristin Campbell

Audrey Durand

2022-12-02

NeurIPS.cc/2022/Workshop/TS4H (poster)

openreview.net

Active Keyword Selection to Track Evolving Topics on Twitter

Sacha Lévy

Farimah Poursafaei

Kellin Pelrine

Reihaneh Rabbany

How can we study social interactions on evolving topics at a mass scale? Over the past decade, researchers from diverse fields such as econo… (see more)mics, political science, and public health have often done this by querying Twitter's public API endpoints with hand-picked topical keywords to search or stream discussions. However, despite the API's accessibility, it remains difficult to select and update keywords to collect high-quality data relevant to topics of interest. In this paper, we propose an active learning method for rapidly refining query keywords to increase both the yielded topic relevance and dataset size. We leverage a large open-source COVID-19 Twitter dataset to illustrate the applicability of our method in tracking Tweets around the key sub-topics of Vaccine, Mask, and Lockdown. Our experiments show that our method achieves an average topic-related keyword recall 2x higher than baselines. We open-source our code along with a web interface for keyword selection to make data collection from Twitter more systematic for researchers.

2022-12-01

2022 IEEE International Conference on Data Mining Workshops (ICDMW) (published)

doi.org

arxiv.org

APOE alleles are associated with sex-specific structural differences in brain regions affected in Alzheimer’s disease and related dementia

Chloé Savignac

Sylvia Villeneuve

AmanPreet Badhwar

Karin Saltoun

Kimia Shafighi

Chris Zajner

Vaibhav Sharma

Sarah A. Gagliano Taliun

Sali Farhan

Judes Poirier

Danilo Bzdok

2022-12-01

PLoS Biology (published)

doi.org

Autism incidence and spatial analysis in more than 7 million pupils in English schools: a retrospective, longitudinal, school registry study.

Andres Roman-Urrestarazu

Justin Christopher Yang

R. van Kessel

Varun Warrier

Guillaume Dumas

H. Jongsma

Gabriel Gatica-bahamonde

Carrie Allison

F. Matthews

Simon Baron-Cohen

Carol Brayne

2022-12-01

The Lancet Child & Adolescent Health (published)

doi.org

Evaluating the Faithfulness of Importance Measures in NLP by Recursively Masking Allegedly Important Tokens and Retraining

Andreas Madsen

Nicholas Meade

Vaibhav Adlakha

Siva Reddy

To explain NLP models a popular approach is to use importance measures, such as attention, which inform input tokens are important for makin… (see more)g a prediction. However, an open question is how well these explanations accurately reflect a model's logic, a property called faithfulness. To answer this question, we propose Recursive ROAR, a new faithfulness metric. This works by recursively masking allegedly important tokens and then retraining the model. The principle is that this should result in worse model performance compared to masking random tokens. The result is a performance curve given a masking-ratio. Furthermore, we propose a summarizing metric using relative area-between-curves (RACU), which allows for easy comparison across papers, models, and tasks. We evaluate 4 different importance measures on 8 different datasets, using both LSTM-attention models and RoBERTa models. We find that the faithfulness of importance measures is both model-dependent and task-dependent. This conclusion contradicts previous evaluations in both computer vision and faithfulness of attention literature.

2022-12-01

Findings of the Association for Computational Linguistics: EMNLP 2022 (published)

doi.org

arxiv.org

Implementing automation in deep brain stimulation: has the time come?

Marco Bonizzato

Alfonso Fasano

2022-12-01

The Lancet Digital Health (published)

doi.org

Improving Passage Retrieval with Zero-Shot Question Generation

Devendra Singh Sachan