Publications

Affirmative safety: An approach to risk management for high-risk AI

Akash Wasil

Joshua Clymer

David M. Krueger

Emily Dardaman

Simeon Campos

Evan Murphy

Prominent AI experts have suggested that companies developing high-risk AI systems should be required to show that such systems are safe bef… (voir plus)ore they can be developed or deployed. The goal of this paper is to expand on this idea and explore its implications for risk management. We argue that entities developing or deploying high-risk AI systems should be required to present evidence of affirmative safety: a proactive case that their activities keep risks below acceptable thresholds. We begin the paper by highlighting global security risks from AI that have been acknowledged by AI experts and world governments. Next, we briefly describe principles of risk management from other high-risk fields (e.g., nuclear safety). Then, we propose a risk management approach for advanced AI in which model developers must provide evidence that their activities keep certain risks below regulator-set thresholds. As a first step toward understanding what affirmative safety cases should include, we illustrate how certain kinds of technical evidence and operational evidence can support an affirmative safety case. In the technical section, we discuss behavioral evidence (evidence about model outputs), cognitive evidence (evidence about model internals), and developmental evidence (evidence about the training process). In the operational section, we offer examples of organizational practices that could contribute to affirmative safety cases: information security practices, safety culture, and emergency response capacity. Finally, we briefly compare our approach to the NIST AI Risk Management Framework. Overall, we hope our work contributes to ongoing discussions about national and global security risks posed by AI and regulatory approaches to address these risks.

2024-04-13

ArXiv (prépublication)

doi.org

arxiv.org

CryCeleb: A Speaker Verification Dataset Based on Infant Cry Sounds

David Budaghyan

Arsenii Gorin

Cem Subakan

Charles C. Onu

Doina Precup

This paper describes the Ubenwa CryCeleb dataset - a labeled collection of infant cries - and the accompanying CryCeleb 2023 task, which is … (voir plus)a public speaker verification challenge based on cry sounds. We released more than 6 hours of manually segmented cry sounds from 786 newborns for academic use, aiming to encourage research in infant cry analysis. The inaugural public competition attracted 59 participants, 11 of whom improved the baseline performance. The top-performing system achieved a significant improvement scoring 25.8% equal error rate, which is still far from the performance of state-of-the-art adult speaker verification systems. Therefore, we believe there is room for further research on this dataset, potentially extending beyond the verification task.

2024-04-13

ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (publié)

doi.org

arxiv.org

Directed Scattering for Knowledge Graph-Based Cellular Signaling Analysis

Aarthi Venkat

Joyce Chew

Ferran Cardoso Rodriguez

Christopher J. Tape

Michael Perlmutter

Smita Krishnaswamy

Directed graphs are a natural model for many phenomena, in particular scientific knowledge graphs such as molecular interaction or chemical … (voir plus)reaction networks that define cellular signaling relationships. In these situations, source nodes typically have distinct biophysical properties from sinks. Due to their ordered and unidirectional relationships, many such networks also have hierarchical and multiscale structure. However, the majority of methods performing node- and edge-level tasks in machine learning do not take these properties into account, and thus have not been leveraged effectively for scientific tasks such as cellular signaling network inference. We propose a new framework called Directed Scattering Autoencoder (DSAE) which uses a directed version of a geometric scattering transform, combined with the non-linear dimensionality reduction properties of an autoencoder and the geometric properties of the hyperbolic space to learn latent hierarchies. We show this method outperforms numerous others on tasks such as embedding directed graphs and learning cellular signaling networks.

2024-04-13

ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (publié)

doi.org

arxiv.org

Focal Modulation Networks for Interpretable Sound Classification

Luca Della Libera

Cem Subakan

Mirco Ravanelli

The increasing success of deep neural networks has raised concerns about their inherent black-box nature, posing challenges related to inter… (voir plus)pretability and trust. While there has been extensive exploration of interpretation techniques in vision and language, interpretability in the audio domain has received limited attention, primarily focusing on post-hoc explanations. This paper addresses the problem of interpretability by-design in the audio domain by utilizing the recently proposed attention-free focal modulation networks (FocalNets). We apply FocalNets to the task of environmental sound classification for the first time and evaluate their interpretability properties on the popular ESC-50 dataset. Our method outperforms a similarly sized vision transformer both in terms of accuracy and interpretability. Furthermore, it is competitive against PIQ, a method specifically designed for post-hoc interpretation in the audio domain.

2024-04-13

2024 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW) (publié)

doi.org

arxiv.org

Resource-Efficient Separation Transformer

Luca Della Libera

Cem Subakan

Mirco Ravanelli

Samuele Cornell

Frédéric Lepoutre

François Grondin

Transformers have recently achieved state-of-the-art performance in speech separation. These models, however, are computationally demanding … (voir plus)and require a lot of learnable parameters. This paper explores Transformer-based speech separation with a reduced computational cost. Our main contribution is the development of the Resource-Efficient Separation Transformer (RE-SepFormer), a self-attention-based architecture that reduces the computational burden in two ways. First, it uses non-overlapping blocks in the latent space. Second, it operates on compact latent summaries calculated from each chunk. The RE-SepFormer reaches a competitive performance on the popular WSJ0-2Mix and WHAM! datasets in both causal and non-causal settings. Remarkably, it scales significantly better than the previous Transformer-based architectures in terms of memory and inference time, making it more suitable for processing long mixtures.

2024-04-13

ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (publié)

doi.org

arxiv.org

SKILL: Similarity-aware Knowledge distILLation for Speech Self-Supervised Learning

Luca Zampierin

Ghouthi Boukli hacene

Bac Nguyen

Mirco Ravanaelli

2024-04-13

2024 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW) (publié)

doi.org

arxiv.org

Towards Practical Tool Usage for Continually Learning LLMs

Jerry Huang

Prasanna Parthasarathi

Mehdi Rezagholizadeh

A. Chandar

Large language models (LLMs) show an innate skill for solving language based tasks. But insights have suggested an inability to adjust for i… (voir plus)nformation or task-solving skills becoming outdated, as their knowledge, stored directly within their parameters, remains static in time. Tool use helps by offloading work to systems that the LLM can access through an interface, but LLMs that use them still must adapt to nonstationary environments for prolonged use, as new tools can emerge and existing tools can change. Nevertheless, tools require less specialized knowledge, therefore we hypothesize they are better suited for continual learning (CL) as they rely less on parametric memory for solving tasks and instead focus on learning when to apply pre-defined tools. To verify this, we develop a synthetic benchmark and follow this by aggregating existing NLP tasks to form a more realistic testing scenario. While we demonstrate scaling model size is not a solution, regardless of tool usage, continual learning techniques can enable tool LLMs to both adapt faster while forgetting less, highlighting their potential as continual learners.

2024-04-13

ArXiv (prépublication)

doi.org

arxiv.org

Why People Contribute Software Documentation

Deeksha M. Arya

Jin L.C. Guo

Martin P. Robillard

2024-04-13

IEEE/ACM International Conference on Connected Health: Applications, Systems and Engineering Technologies (publié)

doi.org

Assessing Numerical Analysis Performance with the Practi Mobile App

Maria Cutumisu

Kristin Garn

Raymond J. Spiteri

2024-04-11

Education sciences (publié)

doi.org

Model-independent Approach of the JUNO 8B Solar Neutrino Program

Jun-Zhang Zhao

Bin Yue

Haoqi Lu

Yufeng Li

J. Ling

Zeyuan Yu

Angel Abusleme

Thomas Adam

Shakeel Ahmad

Rizwan Ahmed

Sebastiano Aiello

Muhammad Akram

Abid Aleem

Tsagkarakis Alexandros

Fengpeng An

Q. An

Giuseppe Andronico

Nikolay Anfimov

Vito Antonelli

Tatiana Antoshkina … (voir 477 de plus)

Burin Asavapibhop

J. Andr'e

Didier Auguste

Weidong Bai

Nikita Balashov

Wander Baldini

Andrea Barresi

Davide Basilico

Eric Baussan

Marco Bellato

Antonio Bergnoli

Thilo Birkenfeld

Sylvie Blin

D. Blum

Simon Blyth

Anastasia Bolshakova

Mathieu Bongrand

Clément Bordereau

Dominique Breton

Augusto Brigatti

Riccardo Brugnera

Riccardo Bruno

Antonio Budano

Jose Busto

I. Butorov

Anatael Cabrera

Barbara Caccianiga

Hao Cai

Xiao Cai

Yanke Cai

Zucong Cai

Riccardo Callegari

Antonio Cammi

Agustin Campeny

Guofu Cao

Jun Cao

Rossella Caruso

C. Cerna

Chi Chan

Jinfan Chang

Yun Chang

Guoming Chen

Pingping Chen

Po-An Chen

Shaomin Chen

Xurong Chen

Yixue Chen

Yu Chen

Zhiyuan Chen

Zikang Chen

Jie Cheng

Yaping Cheng

Alexander Chepurnov

Alexey Chetverikov

Davide Chiesa

Pietro Chimenti

Artem Chukanov

Gérard Claverie

Catia Clementi

Barbara Clerbaux

Marta Colomer Molla

Selma Conforti Di Lorenzo

Daniele Corti

Flavio Dal Corso

Olivia Dalager

C. Taille

Z. Y. Deng

Ziyan Deng

Wilfried Depnering

Marco Diaz

Xuefeng Ding

Yayun Ding

Bayu Dirgantara

Sergey Dmitrievsky

Tadeas Dohnal

Dmitry Dolzhikov

Georgy Donchenko

Jianmeng Dong

Evgeny Doroshkevich

Marcos Dracos

Frédéric Druillole

Ran Du

S. X. Du

Stefano Dusini

Martin Dvorak

Timo Enqvist

H. Enzmann

Andrea Fabbri

Dongsheng Fan

Lei Fan

Jian Fang

Wen Fang

Marco Fargetta

Dmitry Fedoseev

Zheng-hao Fei

Li-Cheng Feng

Qichun Feng

R. Ford

Amélie Fournier

H. Gan

Feng Gao

Alberto Garfagnini

Arsenii Gavrikov

Marco Giammarchi

Nunzio Giudice

Maxim Gonchar

G. Gong

Hui Gong

Yuri Gornushkin

A. Gottel

Marco Grassi

Maxim Gromov

Vasily Gromov

M. H. Gu

Xiang Zhou

Yunting Gu

Mengyun Guan

Yuduo Guan

Nunzio Guardone

Cong Guo

Jingyuan Guo

Wanlei Guo

Xinheng Guo

Yuhang Guo

Paul Hackspacher

Caren Hagner

Ran Han

Yang Han

Miao He

W. He

Tobias Heinz

Patrick Hellmuth

Yue-kun Heng

Rafael Herrera

Yuenkeung Hor

Shaojing Hou

Yee Hsiung

Bei-Zhen Hu

Hang Hu

Jianrun Hu

Jun Hu

Shouyang Hu

T. Hu

Yuxiang Hu

Zhuojun Hu

Guihong Huang

Hanxiong Huang

Kaixuan Huang

Wenhao Huang

Xinglong Huang

X. T. Huang

Yongbo Huang

Jiaqi Hui

L. Huo

Wenju Huo

Cédric Huss

Safeer Hussain

Ara Ioannisian

Roberto Isocrate

Beatrice Jelmini

Ignacio Jeria

Xiaolu Ji

Huihui Jia

Junji Jia

Siyu Jian

Di Jiang

Wei Jiang

Xiaoshan Jiang

Xiang Jing

Cécile Jollet

L. Kalousis

Philipp Kampmann

Li Kang

Rebin Karaparambil

Narine Kazarian

Amina Khatun

Khanchai Khosonthongkee

Denis Korablev

K. Kouzakov

Alexey Krasnoperov

Nikolay Kutovskiy

Pasi Kuusiniemi

Tobias Lachenmaier

Cecilia Landini

Sébastien Leblanc

Victor Lebrin

F. Lefèvre

R. Lei

Rupert Leitner

Jason Leung

Daozheng Li

Demin Li

Fei Li

Fule Li

Gaosong Li

Huiling Li

Mengzhao Li

Min Li

Nan Li

Qingjiang Li

Ruhui Li

Rui Li

Shanfeng Li

Tao Li

Teng Li

Weidong Li

Wei-guo Li

Xiaomei Li

Xiao-Nan Li

Xinglong Li

Yi Li

Yichen Li

Zepeng Li

Zhaohan Li

Zhibing Li

Ziyuan Li

Zonghui Li

Hao Liang

Jiaming Yan

Ayut Limphirat

Gen Lin

Shengxin Lin

Tao Lin

Ivano Lippi

Haidong Liu

Hongbang Liu

Hongjuan Liu

Hongtao Liu

H. Liu

Jianglai Liu

Jinchang Liu

Min Liu

Qian Liu

Qi Liu

Runxuan Liu

Shubin Liu

Shulin Liu

Xiaowei Liu

Xiwen Liu

Yang Liu

Yunzhe Liu

Alexey Lokhov

Paolo Lombardi

Claudio Lombardo

K. Loo

Chuan Lu

Jingbin Lu

Junguang Lu

Shuxian Du

Bayarto Lubsandorzhiev

Sultim Lubsandorzhiev

Livia Ludhova

Arslan Lukanov

Daibin Luo

Feng Luo

Guang Luo

Shu Luo

Wu Luo

Xiaojie Luo

Vladimir Lyashuk

Biao Ma

Bing Ma

R. Q. Ma

Si Ma

Xiaoyan Ma

Xubo Ma

Jihane Maalmi

Jingyu Mai

Yury Malyshkin

Roberto Carlos Mandujano

Fabio Mantovani

Francesco Manzali

Xin Mao

Yajun Mao

S. Mari

F. Marini

Cristina Martellini

Gisèle Martin-chassard

Agnese Martini

Matthias Mayer

Davit Mayilyan

Ints Mednieks

Yu Meng

Anselmo Meregaglia

Emanuela Meroni

David J. Meyhofer

Mauro Mezzetto

Jonathan Andrew Miller

Lino Miramonti

Paolo Montini

Michele Montuschi

Axel Muller

Massimiliano Nastasi

D. Naumov

Elena Naumova

Diana Navas-Nicolas

Igor Nemchenok

Minh Thuan Nguyen Thi

Alexey Nikolaev

Feipeng Ning

Zhe Ning

Hiroshi Nunokawa

Lothar Oberauer

Juan Pedro Ochoa-Ricoux

Alexander Olshevskiy

Domizia Orestano

Fausto Ortica

Rainer Othegraven

Alessandro Paoloni

Sergio Parmeggiano

Y. P. Pei

Nicomede Pelliccia

Anguo Peng

Yuekun Heng

Z-R Peng

Frédéric Perrot

P. Petitjean

Fabrizio Petrucci

Oliver Pilarczyk

Luis Felipe Piñeres Rico

Artyom Popov

Pascal Poussot

Ezio Previtali

Fazhi Qi

M. Qi

Sen Qian

Xiangyang Qian

Zhen Qian

Hao-xue Qiao

Zhonghua Qin

Shoukang Qiu

Gioacchino Ranucci

Neill Raper

A. Re

Henning Rebber

Abdel Rebii

Mariia Redchuk

Bin Ren

Jie Ren

Barbara Ricci

Mariam Rifai

Mathieu Roche

Narongkiat Rodphai

Aldo M. Romani

Bedřich Roskovec

Xianhui Ruan

Arseniy Rybnikov

Andrey Sadovsky

Paolo Saggese

Simone Sanfilippo

Anut Sangka

Utane Sawangwit

Julia Sawatzki

Michaela Schever

Cédric Schwab

Konstantin Schweizer

Alexandr Selyunin

Andrea Serafini

Giulio Settanta

Mariangela Settimo

Zhuang Shao

Vladislav Sharov

Arina Shaydurova

Jingyan Shi

Yanan Shi

Vitaly Shutov

Andrey Sidorenkov

Fedor Šimkovic

Chiara Sirignano

Jaruchit Siripak

Monica Sisti

Maciej Slupecki

Mikhail Smirnov

Oleg Smirnov

Thiago Sogo-Bezerra

Sergey Sokolov

Julanan Songwadhana

Boonrucksar Soonthornthum

Albert Sotnikov

Ondvrej vSr'amek

Warintorn Sreethawong

A. Stahl

Luca Stanco

Konstantin Stankevich

Duvsan Vstef'anik

Hans Steiger

Jochen Steinmann

Tobias Sterr

M. Stock

Virginia Strati

Alexander Studenikin

Jun Su

Shifeng Sun

Xilei Sun

Yongjie Sun Sun

Yongzhao Sun

Zhengyang Sun

Narumon Suwonjandee

Michal Szelezniak

Jian Tang

Qiang Tang

Quan Tang

Xiao Tang

Alexander Tietzsch

Igor Tkachev

Tomas Tmej

M. Torri

K. Treskov

Andrea Triossi

Giancarlo Troni

Wladyslaw Trzaska

Cristina Tuve

Nikita Ushakov

Vadim Vedin

Giuseppe Verde

Maxim Vialkov

Benoit Viaud

Cornelius Moritz Vollbrecht

C. Volpe

Katharina von Sturm

Vit Vorobel

Dmitriy Voronin

Lucia Votano

Pablo Walker

Caishen Wang

Chung-Hsiang Wang

En Wang

Guoli Wang

Jian Wang

Jun Wang

Lucinda W. Wang

Meifen Wang

Meng Wang

Ruiguang Wang

Siguang Wang

Wei Wang

Wenshuai Wang

Xi Wang

Xiangyue Wang

Yangfu Wang

Yaoguang Wang

Yi Wang

Yifang Wang

Yong Wang

Yuman Wang

Zhe Wang

Z. Wang

Zhimin Wang

Zongyi Wang

Apimook Watcharangkool

Wei Wei

Wenlu Wei

Yadong Wei

K. Wen

Kaile Wen

Christopher Wiebusch

S. Wong

Bjoern Wonsak

Diru Wu

Qun Wu

Zhi Wu

Michael Wurm

Jacques Wurtz

Christian Wysotzki

Yufei Xi

Dongqin Xia

Xiang Xiao

Xiaochuan Xie

Yu-guang Xie

Zhangquan Xie

Z. P. Xie

Zhao-Liang Xin

Z. Xing

Benda D. Xu

Chengze Xu

Donglian Xu

Fanrong Xu

The physics potential of detecting 8B solar neutrinos will be exploited at the Jiangmen Underground Neutrino Observatory (JUNO), in a model-… (voir plus)independent manner by using three distinct channels of the charged current (CC), neutral current (NC), and elastic scattering (ES) interactions. Due to the largest-ever mass of 13C nuclei in the liquid scintillator detectors and the expected low background level, 8B solar neutrinos are observable in the CC and NC interactions on 13C for the first time. By virtue of optimized event selections and muon veto strategies, backgrounds from the accidental coincidence, muon-induced isotopes, and external backgrounds can be greatly suppressed. Excellent signal-to-background ratios can be achieved in the CC, NC, and ES channels to guarantee the observation of the 8B solar neutrinos. From the sensitivity studies performed in this work, we show that JUNO, with 10 yr of data, can reach the 1σ precision levels of 5%, 8%, and 20% for the 8B neutrino flux, sin 2 θ 12 , and Δ m 21 2 , respectively. Probing the details of both solar physics and neutrino physics would be unique and helpful. In addition, when combined with the Sudbury Neutrino Observatory measurement, the world's best precision of 3% is expected for the measurement of the 8B neutrino flux.

2024-04-11

The Astrophysical Journal (publié)

doi.org

arxiv.org

Towards Causal Deep Learning for Vulnerability Detection

Md Mahbubur Rahman

Ira Ceka

Chengzhi Mao

Saikat Chakraborty

Baishakhi Ray

Wei Le

Deep learning vulnerability detection has shown promising results in recent years. However, an important challenge that still blocks it from… (voir plus) being very useful in practice is that the model is not robust under perturbation and it cannot generalize well over the out-of-distribution (OOD) data, e.g., applying a trained model to unseen projects in real world. We hypothesize that this is because the model learned non-robust features, e.g., variable names, that have spurious correlations with labels. When the perturbed and OOD datasets no longer have the same spurious features, the model prediction fails. To address the challenge, in this paper, we introduced causality into deep learning vulnerability detection. Our approach CausalVul consists of two phases. First, we designed novel perturbations to discover spurious features that the model may use to make predictions. Second, we applied the causal learning algorithms, specifically, do-calculus, on top of existing deep learning models to systematically remove the use of spurious features and thus promote causal based prediction. Our results show that CausalVul consistently improved the model accuracy, robustness and OOD performance for all the state-of-the-art models and datasets we experimented. To the best of our knowledge, this is the first work that introduces do calculus based causal learning to software engineering models and shows it's indeed useful for improving the model accuracy, robustness and generalization. Our replication package is located at https://figshare.com/s/0ffda320dcb96c249ef2.

2024-04-11

Proceedings of the IEEE/ACM 46th International Conference on Software Engineering (publié)

doi.org

arxiv.org

Deep learning for high-resolution dose prediction in high dose rate brachytherapy for breast cancer treatment.

Sébastien Quetin

Boris Bahoric

Farhad Maleki

S. Enger

OBJECTIVE Monte Carlo (MC) simulations are the benchmark for accurate radiotherapy dose calculations, notably in patient-specific high dose … (voir plus)rate brachytherapy (HDR BT), in cases where considering tissue heterogeneities is critical. However, the lengthy computational time limits the practical application of MC simulations. Prior research used Deep Learning (DL) for dose prediction as an alternative to MC simulations. While accurate dose predictions akin to MC were attained, GPU limitations constrained these predictions to large voxels of 3mm × 3mm × 3mm. This study aimed to enable dose predictions as accurate as MC simulations in 1mm × 1mm × 1mm voxels within a clinically acceptable timeframe. Approach: Computed tomography scans of 98 breast cancer patients treated with Iridium-192-based HDR BT were used: 70 for training, 14 for validation, and 14 for testing. A new cropping strategy based on the distance to the seed was devised to reduce the volume size, enabling efficient training of 3D DL models using 1 mm × 1 mm × 1 mm dose grids. Additionally, novel DL architecture with layer-level fusion were proposed to predict MC simulated dose to medium-in-medium (Dm,m). These architectures fuse information from TG-43 dose to water-in-water (Dw,w) with patient tissue composition at the layer-level. Different inputs describing patient body composition were investigated. Main results: The proposed approach demonstrated state-of-the-art performance, on par with the MC Dm,m maps, but 300 times faster. The mean absolute percent error for dosimetric indices between the MC and DL-predicted complete treatment plans was 0.17%±0.15% for the planning target volume V100, 0.30%±0.32% for the skin D2cc, 0.82%±0.79% for the lung D2cc, 0.34%±0.29% for the chest wall D2cc and 1.08%±0.98% for the heart D2cc. Significance: Unlike the time-consuming MC simulations, the proposed novel strategy efficiently converts TG-43 Dw,w maps into precise Dm,m maps at high resolution, enabling clinical integration.

2024-04-10

Physics in Medicine and Biology (publié)

doi.org

Publications du Fellowship en politiques de l'IA

La plateforme Mila Ventures

Boussole des politiques en IA

Publications

Publications du Fellowship en politiques de l'IA

La plateforme Mila Ventures

Boussole des politiques en IA

Mots-clés populaires:

Publications