Publications

Towards Robust Saliency Maps

Nham Le

Arie Gurfinkel

Chuqin Geng

Saliency maps are one of the most popular tools to interpret the operation of a neural network: they compute input features deemed relevant … (see more)to the final prediction, which are often subsets of pixels that are easily understandable by a human being. However, it is known that relying solely on human assessment to judge a saliency map method can be misleading. In this work, we propose a new neural network verification specification called saliency-robustness, which aims to use formal methods to prove a relationship between Vanilla Gradient (VG) -- a simple yet surprisingly effective saliency map method -- and the network's prediction: given a network, if an input

2024-09-05

ACML.org/2024/Conference (published)

proceedings.mlr.press

openreview.net

Nteasee: A mixed methods study of expert and general population perspectives on deploying AI for health in African countries

Mercy Nyamewaa Asiedu

Iskandar Haykel

Awa Dieng

K. Kauer

Tousif Ahmed

Florence Ofori

Charisma Chan

Stephen R. Pfohl

Negar Rostamzadeh

Katherine Heller

2024-09-04

ArXiv (preprint)

doi.org

arxiv.org

Nteasee: Understanding Needs in AI for Health in Africa -- A Mixed-Methods Study of Expert and General Population Perspectives

Mercy Nyamewaa Asiedu

Iskandar Haykel

Awa Dieng

K. Kauer

Tousif Ahmed

Florence Ofori

Charisma Chan

Stephen R. Pfohl

Negar Rostamzadeh

Katherine Heller

Artificial Intelligence (AI) for health has the potential to significantly change and improve healthcare. However in most African countries,… (see more) identifying culturally and contextually attuned approaches for deploying these solutions is not well understood. To bridge this gap, we conduct a qualitative study to investigate the best practices, fairness indicators, and potential biases to mitigate when deploying AI for health in African countries, as well as explore opportunities where artificial intelligence could make a positive impact in health. We used a mixed methods approach combining in-depth interviews (IDIs) and surveys. We conduct 1.5-2 hour long IDIs with 50 experts in health, policy, and AI across 17 countries, and through an inductive approach we conduct a qualitative thematic analysis on expert IDI responses. We administer a blinded 30-minute survey with case studies to 672 general population participants across 5 countries in Africa and analyze responses on quantitative scales, statistically comparing responses by country, age, gender, and level of familiarity with AI. We thematically summarize open-ended responses from surveys. Our results find generally positive attitudes, high levels of trust, accompanied by moderate levels of concern among general population participants for AI usage for health in Africa. This contrasts with expert responses, where major themes revolved around trust/mistrust, ethical concerns, and systemic barriers to integration, among others. This work presents the first-of-its-kind qualitative research study of the potential of AI for health in Africa from an algorithmic fairness angle, with perspectives from both experts and the general population. We hope that this work guides policymakers and drives home the need for further research and the inclusion of general population perspectives in decision-making around AI usage.

2024-09-04

ArXiv (preprint)

doi.org

arxiv.org

Reputation Gaming in Crowd Technical Knowledge Sharing

Iren Mazloomzadeh

Gias Uddin

Foutse Khomh

Ashkan Sami

Stack Overflow incentive system awards users with reputation scores to ensure quality. The decentralized nature of the forum may make the in… (see more)centive system prone to manipulation. This paper offers, for the first time, a comprehensive study of the reported types of reputation manipulation scenarios that might be exercised in Stack Overflow and the prevalence of such reputation gamers by a qualitative study of 1,697 posts from meta Stack Exchange sites. We found four different types of reputation fraud scenarios, such as voting rings where communities form to upvote each other repeatedly on similar posts. We developed algorithms that enable platform managers to automatically identify these suspicious reputation gaming scenarios for review. The first algorithm identifies isolated/semi-isolated communities where probable reputation frauds may occur mostly by collaborating with each other. The second algorithm looks for sudden unusual big jumps in the reputation scores of users. We evaluated the performance of our algorithms by examining the reputation history dashboard of Stack Overflow users from the Stack Overflow website. We observed that around 60-80% of users flagged as suspicious by our algorithms experienced reductions in their reputation scores by Stack Overflow.

2024-09-04

ACM Transactions on Software Engineering and Methodology (published)

doi.org

Advancing EDGE Zones to identify spatial conservation priorities of tetrapod evolutionary history

Sebastian Pipins

Jonathan E. M. Baillie

Alex Bowmer

Laura J. Pollock

Nisha Owen

Rikki Gumbs

2024-09-03

Nature Communications (published)

doi.org

Advancing EDGE Zones to identify spatial conservation priorities of tetrapod evolutionary history

Sebastian Pipins

Jonathan E. M. Baillie

Alex Bowmer

Laura J. Pollock

Nisha Owen

Rikki Gumbs

2024-09-03

Nature Communications (published)

doi.org

Foundational Challenges in Assuring Alignment and Safety of Large Language Models

Usman Anwar

Abulhair Saparov

Javier Rando

Daniel Paleka

Miles Turpin

Peter Hase

Ekdeep Singh Lubana

Erik Jenner

Stephen Casper

Oliver Sourbut

Benjamin L. Edelman

Zhaowei Zhang

Mario Günther

Anton Korinek

Jose Hernandez-Orallo

Lewis Hammond

Eric J Bigelow

Alexander Pan

Lauro Langosco

Tomasz Korbak … (see 22 more)

Heidi Chenyu Zhang

Ruiqi Zhong

Sean O hEigeartaigh

Gabriel Recchia

Giulio Corsi

Alan Chan

Markus Anderljung

Lilian Edwards

Aleksandar Petrov

Yoshua Bengio

Christian Schroeder de Witt

Danqi Chen

Samuel Albanie

Sumeet Ramesh Motwani

Tegan Maharaj

Jakob Nicolaus Foerster

Philip Torr

Florian Tramèr

He He

Atoosa Kasirzadeh

Yejin Choi

David Scott Krueger

2024-09-02

TMLR (accepted)

doi.org

openreview.net

Online Convex Optimization for On-Board Routing in High-Throughput Satellites

Olivier B'elanger

Jean-Luc Lupien

Olfa Ben Yahia

Stéphane Martel

Antoine Lesage-Landry

Gunes Karabulut Kurt

The rise in low Earth orbit (LEO) satellite Internet services has led to increasing demand, often exceeding available data rates and comprom… (see more)ising the quality of service. While deploying more satellites offers a short-term fix, designing higher-performance satellites with enhanced transmission capabilities provides a more sustainable solution. Achieving the necessary high capacity requires interconnecting multiple modem banks within a satellite payload. However, there is a notable gap in research on internal packet routing within extremely high-throughput satellites. To address this, we propose a real-time optimal flow allocation and priority queue scheduling method using online convex optimization-based model predictive control. We model the problem as a multi-commodity flow instance and employ an online interior-point method to solve the routing and scheduling optimization iteratively. This approach minimizes packet loss and supports real-time rerouting with low computational overhead. Our method is tested in simulation on a next-generation extremely high-throughput satellite model, demonstrating its effectiveness compared to a reference batch optimization and to traditional methods.

2024-09-02

ArXiv (preprint)

doi.org

arxiv.org

THInC: A Theory-Driven Framework for Computational Humor Detection

Victor De Marez

Thomas Winters

Ayla Rigouts Terryn

Humor is a fundamental aspect of human communication and cognition, as it plays a crucial role in social engagement. Although theories about… (see more) humor have evolved over centuries, there is still no agreement on a single, comprehensive humor theory. Likewise, computationally recognizing humor remains a significant challenge despite recent advances in large language models. Moreover, most computational approaches to detecting humor are not based on existing humor theories. This paper contributes to bridging this long-standing gap between humor theory research and computational humor detection by creating an interpretable framework for humor classification, grounded in multiple humor theories, called THInC (Theory-driven Humor Interpretation and Classification). THInC ensembles interpretable GA2M classifiers, each representing a different humor theory. We engineered a transparent flow to actively create proxy features that quantitatively reflect different aspects of theories. An implementation of this framework achieves an F1 score of 0.85. The associative interpretability of the framework enables analysis of proxy efficacy, alignment of joke features with theories, and identification of globally contributing features. This paper marks a pioneering effort in creating a humor detection framework that is informed by diverse humor theories and offers a foundation for future advancements in theory-driven humor classification. It also serves as a first step in automatically comparing humor theories in a quantitative manner.

2024-09-02

ArXiv (preprint)

doi.org

arxiv.org

THInC: A Theory-Driven Framework for Computational Humor Detection

Victor De Marez

Thomas Winters

Ayla Rigouts Terryn

Humor is a fundamental aspect of human communication and cognition, as it plays a crucial role in social engagement. Although theories about… (see more) humor have evolved over centuries, there is still no agreement on a single, comprehensive humor theory. Likewise, computationally recognizing humor remains a significant challenge despite recent advances in large language models. Moreover, most computational approaches to detecting humor are not based on existing humor theories. This paper contributes to bridging this long-standing gap between humor theory research and computational humor detection by creating an interpretable framework for humor classification, grounded in multiple humor theories, called THInC (Theory-driven Humor Interpretation and Classification). THInC ensembles interpretable GA2M classifiers, each representing a different humor theory. We engineered a transparent flow to actively create proxy features that quantitatively reflect different aspects of theories. An implementation of this framework achieves an F1 score of 0.85. The associative interpretability of the framework enables analysis of proxy efficacy, alignment of joke features with theories, and identification of globally contributing features. This paper marks a pioneering effort in creating a humor detection framework that is informed by diverse humor theories and offers a foundation for future advancements in theory-driven humor classification. It also serves as a first step in automatically comparing humor theories in a quantitative manner.

2024-09-02

ArXiv (preprint)

doi.org

arxiv.org

Audio Editing with Non-Rigid Text Prompts

Francesco Paissan

Zhepei Wang

Mirco Ravanelli

Paris Smaragdis

Cem Subakan

In this paper, we explore audio-editing with non-rigid text edits. We show that the proposed editing pipeline is able to create audio edits … (see more)that remain faithful to the input audio. We explore text prompts that perform addition, style transfer, and in-painting. We quantitatively and qualitatively show that the edits are able to obtain results which outperform Audio-LDM, a recently released text-prompted audio generation model. Qualitative inspection of the results points out that the edits given by our approach remain more faithful to the input audio in terms of keeping the original onsets and offsets of the audio events.

2024-09-01

Interspeech 2024 (published)

doi.org

arxiv.org

Clinical Care Trajectory Assessment of Children with Congenital Diaphragmatic Hernia and Neurodevelopmental Impairment

Alexandra Dimmer

Gabriel Altit

Sabrina Beauseigle

Elena Guadagno

Louise Koclas

Katryn Paquette

Ana Sant’Anna

Adam Shapiro

Dan Poenaru

Pramod Puligandla

2024-09-01

Journal of Pediatric Surgery (published)

doi.org

AI Advantage

Mila AI Policy Fellowship

Strategic Priorities

AI Advantage

Mila AI Policy Fellowship

Publications

AI Advantage

Mila AI Policy Fellowship

Strategic Priorities

AI Advantage

Mila AI Policy Fellowship

Popular keywords:

Publications