Publications

A cry for help: Early detection of brain injury in newborns

Charles Onu

Samantha Latremouille

Arsenii Gorin

Junhao Wang

Uchenna Ekwochi

P. Ubuane

O. Kehinde

Muhammad A. Salisu

Datonye Briggs

Yoshua Bengio

Doina Precup

2023-10-11

ArXiv (preprint)

doi.org

arxiv.org

Going beyond the means: Exploring the role of bias from digital determinants of health in technologies

Marie-Laure Charpignon

Adrien Carrel

Yihang Jiang

Teddy Kwaga

Beatriz Cantada

Terry Hyslop

Christopher E. Cox

Krista Haines

Valencia Koomson

Guillaume Dumas

Michael Morley

Jessilyn Dunn

An-Kwok Ian Wong

2023-10-11

PLOS Digital Health (published)

doi.org

AAPM Medical Physics Practice Guideline 14.a: Yttrium‐90 microsphere radioembolization

Nathan C. Busse

Muthana S. A. L. Al‐Ghazi

Nadine Abi‐Jaoudeh

Diane Alvarez

Ahmet S. Ayan

Erli Chen

Michael D. Chuong

William A. Dezarn

S. Enger

Stephen A. Graves

Robert F. Hobbs

Mary Ellen Jafari

S. Peter Kim

Nichole M. Maughan

Andrew M. Polemi

Jennifer R. Stickel

2023-10-10

Journal of Applied Clinical Medical Physics (published)

doi.org

A general framework for the practical disintegration of PAC-Bayesian bounds

Paul Viallard

Pascal Germain

Amaury Habrard

Emilie Morvant

2023-10-10

Machine-mediated learning (published)

doi.org

arxiv.org

Language-Guided Reinforcement Learning for Hard Attention in Few-Shot Learning

Bahareh Nikpour

Narges Armanfard

Attention mechanisms have demonstrated significant potential in enhancing learning models by identifying key portions of input data, particularly in scenarios with limited training samples. Inspired by human perception, we propose that focusing on essential data segments, rather than the entire dataset, can improve the accuracy and reliability of the learning models. However, identifying these critical data segments, or "hard attention finding," is challenging, especially in few-shot learning, due to the scarcity of training data and the complexity of model parameters. To address this, we introduce LaHA, a novel framework that leverages language-guided deep reinforcement learning to identify and utilize informative data regions, thereby improving both interpretability and performance. Extensive experiments on benchmark datasets validate the effectiveness of LaHA.

2023-10-10

ArXiv (preprint)

doi.org

arxiv.org

Large Language Models can Learn Rules

Zhaocheng Zhu

Yuan Xue

Xinyun Chen

Denny Zhou

Jian Tang

Dale Schuurmans

Hanjun Dai

2023-10-10

ArXiv (preprint)

doi.org

openreview.net

A deep learning benchmark for first break detection from hardrock seismic reflection data

Pierre-Luc St-Charles

Bruno Rousseau

Joumana Ghosn

Gilles Bellefleur

Ernst Schetselaar

Deep learning techniques are used to tackle a variety of tasks related to seismic data processing and interpretation. Although many works have shown the benefits of deep learning, assessing the generalization capabilities of proposed methods for data acquired in different conditions and geologic environments remains challenging. This is especially true for applications in hardrock environments. The primary factors that impede the adoption of machine learning in geosciences include the lack of publicly available and labeled data sets and the use of inadequate evaluation methodologies. Because machine learning models are prone to overfit and underperform when the data used to train them are site specific, the applicability of these models on new survey data that could be considered “out-of-distribution” is rarely addressed. This is unfortunate, as evaluating predictive models in out-of-distribution settings can provide a good insight into their usefulness in real-world use cases. To tackle these issues, we develop a simple benchmarking methodology for first break picking to evaluate the transferability of deep learning models that are trained across different environments and acquisition conditions. For this, we consider a reflection seismic survey data set acquired at five distinct hardrock mining sites combined with annotations for first break picking. We train and evaluate a baseline deep learning solution based on a U-Net for future comparisons and discuss potential improvements to this approach.

2023-10-09

Geophysics (published)

doi.org

Debiasing Counterfactuals in the Presence of Spurious Correlations

Amar Kumar

Nima Fathi

Raghav Mehta

Brennan Nichyporuk

Jean-Pierre R. Falet

Sotirios A. Tsaftaris

Tal Arbel

Deep learning models can perform well in complex medical imaging classification tasks, even when basing their conclusions on spurious correlations (i.e. confounders), should they be prevalent in the training dataset, rather than on the causal image markers of interest. This would thereby limit their ability to generalize across the population. Explainability based on counterfactual image generation can be used to expose the confounders but does not provide a strategy to mitigate the bias. In this work, we introduce the first end-to-end training framework that integrates both (i) popular debiasing classifiers (e.g. distributionally robust optimization (DRO)) to avoid latching onto the spurious correlations and (ii) counterfactual image generation to unveil generalizable imaging markers of relevance to the task. Additionally, we propose a novel metric, Spurious Correlation Latching Score (SCLS), to quantify the extent of the classifier reliance on the spurious correlation as exposed by the counterfactual images. Through comprehensive experiments on two public datasets (with the simulated and real visual artifacts), we demonstrate that the debiasing method: (i) learns generalizable markers across the population, and (ii) successfully ignores spurious correlations and focuses on the underlying disease pathology.

2023-10-08

Clinical Image-Based Procedures, Fairness of AI in Medical Imaging, and Ethical and Philosophical Issues in Medical Imaging (published)

doi.org

openreview.net

On the effectiveness of log representation for log-based anomaly detection

Xingfang Wu

Heng Li

Foutse Khomh

2023-10-08

Empirical Software Engineering (published)

doi.org

arxiv.org

Improving Image-Based Precision Medicine with Uncertainty-Aware Causal Models

Joshua D. Durso-Finley

Jean-Pierre R. Falet

Raghav Mehta

Douglas Arnold

Nick Pawlowski

Tal Arbel

Image-based precision medicine aims to personalize treatment decisions based on an individual's unique imaging features so as to improve their clinical outcome. Machine learning frameworks that integrate uncertainty estimation as part of their treatment recommendations would be safer and more reliable. However, little work has been done in adapting uncertainty estimation techniques and validation metrics for precision medicine. In this paper, we use Bayesian deep learning for estimating the posterior distribution over factual and counterfactual outcomes on several treatments. This allows for estimating the uncertainty for each treatment option and for the individual treatment effects (ITE) between any two treatments. We train and evaluate this model to predict future new and enlarging T2 lesion counts on a large, multi-center dataset of MR brain images of patients with multiple sclerosis, exposed to several treatments during randomized controlled trials. We evaluate the correlation of the uncertainty estimate with the factual error, and, given the lack of ground truth counterfactual outcomes, demonstrate how uncertainty for the ITE prediction relates to bounds on the ITE error. Lastly, we demonstrate how knowledge of uncertainty could modify clinical decision-making to improve individual patient and clinical trial outcomes.

2023-10-07

OpenReview.net/Archive (published)

doi.org

openreview.net

MDFD: Study of Distributed Non-IID Scenarios and Frechet Distance-Based Evaluation

Wei Wang

Mingwei Zhang

Ziwen Wu

Qianxi Chen

Yuemei Li

With the development of distributed machine learning and federated learning, the solution to the data island problem is promoted. People use computer clusters to train machine learning models on data distributed in different regions. In the early stage of research, researchers usually assume that the data sets of each node are independent and identically distributed (IID), but this is a strong assumption, which is challenging to meet in practical applications. Therefore, research on non-IID has become a hot spot in recent years. However, there is no uniform standard for designing and evaluating non-IID scenarios. This paper proposes a Frechet distance-independent non-IID distribution dataset metric MDFD. And we conducted experiments on different types of distributed machine-learning methods in different non-IID scenarios to verify the effectiveness of MDFD.

2023-10-07

International Conference on Information Photonics (published)

doi.org

Mitigating Calibration Bias Without Fixed Attribute Grouping for Improved Fairness in Medical Imaging Analysis

Changjian Shui

Justin Szeto

Raghav Mehta

Douglas L. Arnold

Tal Arbel

2023-10-07

OpenReview (published)

doi.org