Publications

An Empirical Study of Pre-trained Model Selection for Out-of-Distribution Generalization and Calibration

Hiroki Naganuma

Ryuichiro Hataya

Kotaro Yoshida

Ioannis Mitliagkas

2025-01-01

Trans. Mach. Learn. Res. (publié)

openreview.net

An empirical study of testing machine learning in the wild

Moses Openja

Foutse Khomh

Armstrong Foundjem

Zhen Ming (Jack) Jiang

Mouna Abidi

Ahmed E. Hassan

Background: Recently, machine and deep learning (ML/DL) algorithms have been increasingly adopted in many software systems. Due to their in… (voir plus)ductive nature, ensuring the quality of these systems remains a significant challenge for the research community. Traditionally, software systems were constructed deductively, by writing explicit rules that govern the behavior of the system as program code. However, ML/DL systems infer rules from training data i.e., they are generated inductively). Recent research in ML/DL quality assurance has adapted concepts from traditional software testing, such as mutation testing, to improve reliability. However, it is unclear if these proposed testing techniques are adopted in practice, or if new testing strategies have emerged from real-world ML deployments. There is little empirical evidence about the testing strategies. Aims: To fill this gap, we perform the first fine-grained empirical study on ML testing in the wild to identify the ML properties being tested, the testing strategies, and their implementation throughout the ML workflow. Method: We conducted a mixed-methods study to understand ML software testing practices. We analyzed test files and cases from 11 open-source ML/DL projects on GitHub. Using open coding, we manually examined the testing strategies, tested ML properties, and implemented testing methods to understand their practical application in building and releasing ML/DL software systems. Results: Our findings reveal several key insights: 1.) The most common testing strategies, accounting for less than 40%, are Grey-box and White-box methods, such as Negative Testing , Oracle Approximation , and Statistical Testing . 2.) A wide range of

$17$ ML properties are tested, out of which only 20% to 30% are frequently tested, including Consistency , Correctness , and Efficiency . 3.) Bias and Fairness is more tested in Recommendation (6%) and CV (3.9%) systems, while Security & Privacy is tested in CV (2%), Application Platforms (0.9%), and NLP (0.5%). 4.) We identified 13 types of testing methods, such as Unit Testing , Input Testing , and Model Testing . Conclusions: This study sheds light on the current adoption of software testing techniques and highlights gaps and limitations in existing ML testing practices.

2025-01-01

ACM Trans. Softw. Eng. Methodol. (publié)

doi.org

arxiv.org

Evaluating machine learning-driven intrusion detection systems in IoT: Performance and energy consumption

Saeid Jamshidi

Kawser Wazed Nafi

Amin Nikanjam

Foutse Khomh

2025-01-01

Comput. Ind. Eng. (publié)

doi.org

arxiv.org

Evolution of High-Throughput Satellite Systems: A Vision of Programmable Regenerative Payload

Olfa Ben Yahia

Zineb Garroussi

Olivier Bélanger

Brunilde Sansò

Jean-François Frigon

Stéphane Martel

Antoine Lesage-Landry

Gunes Karabulut Kurt

High-throughput satellite (HTS), with its digital payload technology, is expected to play a key role as an enabler of the upcoming sixth-gen… (voir plus)eration (6G) networks. HTS is mainly designed to provide higher data rates and capacities. Fueled by technological advancements, including beamforming, advanced modulation techniques, reconfigurable phased array technologies, and electronically steerable antennas, HTS has emerged as a fundamental component for future network generations. This paper offers a comprehensive state-of-the-art on HTS systems, focusing on standardization, patents, channel multiple access techniques, routing, load balancing, and the role of software-defined networking (SDN). In addition, we provide a vision for next-generation satellite systems that we have named Extremely-HTS (EHTS) toward autonomous satellites supported by the main requirements and key technologies expected for these systems. The EHTS system will be designed to maximize spectrum reuse and data rates and to flexibly steer the capacity to satisfy user demand. We introduce a novel architecture for future programmable regenerative payloads as well.

2025-01-01

IEEE Commun. Surv. Tutorials (publié)

doi.org

A “fine-cuts” approach disentangling psychopathic, autistic and alexithymic traits in their associations with affective, cognitive and motor empathy

Julia Ayache

Nikki Stevenson

Elisha Patel

Alexander Sumich

Guillaume Dumas

Nadja Heym

2025-01-01

Personality and Individual Differences (publié)

doi.org

Generalization Limits of Graph Neural Networks in Identity Effects Learning

Giuseppe Alessio D’Inverno

Simone Brugiapaglia

Mirco Ravanelli

Graph Neural Networks (GNNs) have emerged as a powerful tool for data-driven learning on various graph domains. They are usually based on a … (voir plus)message-passing mechanism and have gained increasing popularity for their intuitive formulation, which is closely linked to the Weisfeiler-Lehman (WL) test for graph isomorphism to which they have been proven equivalent in terms of expressive power. In this work, we establish new generalization properties and fundamental limits of GNNs in the context of learning so-called identity effects, i.e., the task of determining whether an object is composed of two identical components or not. Our study is motivated by the need to understand the capabilities of GNNs when performing simple cognitive tasks, with potential applications in computational linguistics and chemistry. We analyze two case studies: (i) two-letters words, for which we show that GNNs trained via stochastic gradient descent are unable to generalize to unseen letters when utilizing orthogonal encodings like one-hot representations; (ii) dicyclic graphs, i.e., graphs composed of two cycles, for which we present positive existence results leveraging the connection between GNNs and the WL test. Our theoretical analysis is supported by an extensive numerical study.

2025-01-01

Neural Networks (publié)

doi.org

arxiv.org

Generating Complex Question Decompositions in the Face of Distribution Shifts.

Kelvin Han

Claire Gardent

Marah Ihab Abdin

Jyoti Aneja

Hany Hassan Awadalla

Ammar Ahmed Awadallah

Ahmad Awan

Nguyen Bach

Amit Bahree

Arash Bakhtiari

Jianmin Bao

Harkirat Singh Behl

Alon Benhaim

Misha Bilenko

Johan Bjorck

Sébastien Bubeck

Martin Cai

Qin Cai

Vishrav Chaudhary

Dong Chen … (voir 342 de plus)

Weizhu Chen

Yen-Chun Chen 0001

Yi-ling Chen

Hao Cheng

Parul Chopra

Xiyang Dai

Matthew Dixon

Ronen Eldan

Victor Fragoso

Jianfeng Gao

Mei Gao

Min Gao

Amit Garg

Allison Del Giorno

Abhishek Goswami

S. Gunasekar

Emman Haider

Jun-heng Hao

Russell J. Hewett

Wen-Wei Hu

Jamie Huynh

Dan Iter

Sam Ade Jacobs

Mojan Javaheripi

Xin Jin

Nikos Karampatziakis

Piero Kauffmann

Mahoud Khademi

Dongwoo Kim

Young Jin Kim

Lev Kurilenko

James R. Lee

Yin Tat Lee

Yuanzhi Li

Yunsheng Li

Chen Liang

Lars Lidén

Xihui

Zeqi Lin

Ce Lin

Liyuan Liu

Mengchen Liu

Liu Weishung

Xiaodong Liu

Chong Liu

Piyush Luo

Ali Madan

David Mahmoudzadeh

Matt Majercak

Caio Mazzola

César Teodoro

Arindam Mendes

Hardik Mitra

Anh Modi

Brandon Nguyen

Norick Barun

Daniel Patra

Thomas Perez-Becker

Portet Reid

Heyang Pryzant

Marko Qin

Liliang Radmilac

Gustavo Ren

Corby de Rosa

Sambudha Rosset

Roy Olatunji

Olli Ruwase

Amin Saarikivi

Adil Saied

Michael Salim

Shital Santacroce

Ning Shah

Shang Hiteshi

Yelong Sharma

Swadheen Shen

Xia Shukla

Masahiro Song

Andrea Tanaka

Praneetha Tupini

Michael Wu

Bin Wyatt

Can Xiao

Jiahang Xu

Weijiang Xu

Jilong Xu

Sonali Xue

Fan Yadav

Jianwei Yang

Yifan Yang

Ziyi Yang

Donghan Yang

Yu Lu

Chenruidong Yuan

Cyril Zhang

Jianwen Zhang

Zhang

Li Lyna

Yi Zhang

Yue Zhang

Yunan Zhang 0001

Zhang Xiren

Zhou

Phi-3

Priyanka Agrawal

Chris Alberti

Fantine Huot

Joshua Maynez

Ji Ma

Kuzman Ganchev

Viraat Aryabumi

John Dang

Dwarak Talupuru

Saurabh Dash

David Cairuz

Hangyu Lin

Bharat Venkitesh

Madeline Smith

Jon Ander Campos

Yi Chern Tan

Kelly Marchisio

Max Bartolo

Sebastian Ruder

Acyr F. Locatelli

Julia Kreutzer

Nick Frosst

Aidan Gomez

Phil Blunsom

Marzieh Fadaee

Tom B. Brown

Benjamin Mann

Nick Ryder

Melanie Subbiah

Jared Kaplan

Prafulla Dhariwal

Arvind Neelakantan

Pranav Shyam

Girish Sastry

Amanda Askell

Sandhini Agarwal

Ariel Herbert-Voss

Gretchen Krueger

T. Henighan

Rewon Child

Aditya Ramesh

Daniel M. Ziegler

Jeffrey Wu

Clemens Winter

Chris Hesse

Mark Chen

Eric Sigler

Ma-teusz Litwin

Scott Gray

Benjamin Chess

J. Clark

Christopher Berner

Sam McCandlish

Alec Radford

Ilya Sutskever

Dario Amodei Gemma Team

Morgane Rivière

Shreya Pathak Pier

Giuseppe Sessa

Cassidy Hardin

Surya Bhupati-raju

L'eonard Hussenot

Thomas Mesnard

Bobak Shahriari

Alexandre Ramé

Johan Ferret

Peter Liu

Pouya Dehghani Tafti

Abe Friesen

Michelle Casbon

Sabela Ramos

Ravin Kumar

Charline Le Lan

Sammy Jerome

Anton Tsitsulin

Nino Vieillard

Piotr Stańczyk

Sertan Girgin

Nikola Momchev

Matt Hoffman

Shantanu Thakoor

Jean-Bastien Grill

Behnam Neyshabur

Olivier Bachem

Alanna Wal-ton

Aliaksei Severyn

Alicia Parrish

Aliya Ah-mad

Allen Hutchison

Alvin Abdagic

Amanda Carl

Amy Shen

Andy Brock

Andy Coenen

Anthony Laforge

Antonia Paterson

Ben Bastian

Bilal Piot

Boxi Wu

Brandon Royal

Charlie Chen

Chintu Kumar

Chris Perry

Christoper A. Welty

Christopher A. Choquette-Choo

Danila Sinopalnikov

David Wein-berger

Dimple Vijaykumar

Dominika Rogozi´nska

D. Herbison

Elisa Bandy

Emma Wang

Eric Noland

Erica Moreira

Evan Senter

Evgenii Elty-shev

Francesco Visin

Gabriel Rasskin

Gary Wei

Glenn Cameron

Gus Martins

Hadi Hashemi

Hanna Klimczak-Pluci´nska

Harleen Batra

Harsh Dhand

Ivan Nardini

Jacinda Mein

Jack Zhou

James Svens-son

Jeff Stanway

Jetha Chan

J. Zhou

Joana Carrasqueira

Joana Iljazi

Jocelyn Becker

Joe Fer-nandez

Joost Van Amersfoort

Josh Gordon

Josh Lipschultz

Joshua Newlan

Junsong Ji

Kareem Mo-hamed

Kartikeya Badola

Kat Black

Katie Mil-lican

Keelin McDonell

Kelvin Nguyen

Kiranbir Sodhia

Kish Greene

Lars Lowe Sjoesund

Lauren Usui

Laurent Sifre

L. Heuermann

Leti-cia Lago

Lilly McNealus

Livio Baldini

Soares Logan

Lucas Kilpatrick

Luciano Dixon

Martins Machel

Manvinder Reid

Mark Singh

Martin Görner Iverson

Mateo Wirth Mat Velloso

Matt Davi-dow

Matt Miller

Matthew Rahtz

Matthew Watson

Meg Risdal

Mehran Kazemi

Michael Moynihan

Ming Zhang

Minsuk Kahng

Minwoo Park

Mofi Rahman

Mohit Khatwani

Natalie Dao

Nenshad Bardoliwalla

N. Devanathan

Neta Dumai

Nilay Chauhan

O. Wahltinez

Pankil Botarda

Parker Barnes

Paul R. Barham

Paul Michel

Peng-chong Jin

Petko Georgiev

Phil Culliton

Pradeep Kup-pala

Ramona Comanescu

Ramona Merhej

Reena Jana

R. Rokni

Rishabh Agarwal

Ryan Mullins

Samaneh Saadat

S. M. Carthy

Sarah Cogan

Sarah Perrin

S'ebastien M. R. Arnold

Se-bastian Krause

Shengyang Dai

S. Garg

Shruti Sheth

S. Ronstrom

Susan Chan

Timothy Jordan

Bing Yu

Tom Eccles

Tom Hennigan

Tomas Kocisky

Tulsee Doshi

Vihan Jain

Vikas Yadav

Vilobh Meshram

Vishal Dharmadhikari

Warren Barkley

Wei Wei

Wenming Ye

Woohyun Han

Woosuk Kwon

Xiang Xu

Zhe Shen

Zhitao Gong

Zichuan Wei

Victor Cotruta

Phoebe Kirk

Anand Rao

Minh Giang

Ludovic Peran

Tris Brian Warkentin

Eli Collins

Joelle Barral

Zoubin Ghahramani

Raia Hadsell

D. Sculley

Jeanine Banks

Anca Dragan

2025-01-01

NAACL (Long Papers) (publié)

doi.org

Graph Anomaly Detection in Time Series: A Survey.

Thi Kieu Khanh Ho

Ali Karami

Narges Armanfard

2025-01-01

IEEE Transactions on Pattern Analysis and Machine Intelligence (publié)

doi.org

arxiv.org

Hadamard product in deep learning: Introduction, Advances and Challenges.

Grigorios G Chrysos

Yongtao Wu

Razvan Pascanu

Philip Torr

Volkan Cevher

2025-01-01

IEEE Transactions on Pattern Analysis and Machine Intelligence (publié)

doi.org

arxiv.org

https://www.neuromodec.org/journal/4/2/NzBlvmDpUYspQQbvI4B Online Transcranial Random Noise Stimulation of the Right Temporoparietal Junction Acutely Modulates Human-Machine Social Interactions

Vincent Chamberland

Quentin Moreau

Lisane Moses

Gabriela Milanova

Guillaume Dumas

2025-01-01

Neuromodec Journal (publié)

doi.org

ICLR 2025 Workshop on Tackling Climate Change with Machine Learning: Data-Centric Approaches in ML for Climate Action

Konstantin Klemmer

Melissa Chapman

Lily Xu

Poon Kin Ho

Mélisande Teng

Patrick Emami

Yoshua Bengio

Climate change is one of the greatest problems society has ever faced, with increasingly severe consequences for humanity as natural disaste… (voir plus)rs multiply, sea levels rise, and ecosystems falter. While no silver bullet, machine learning can be an invaluable tool in fighting climate change via a wide array of applications and techniques, from designing smart electric grids to tracking greenhouse gas emissions through satellite imagery. These applications require algorithmic innovations in machine learning and close collaboration with diverse fields and practitioners. This workshop is intended as a forum for those in the global machine learning community who wish to help tackle climate change, and is further aimed to help foster cross-pollination between researchers in machine learning and experts in complementary climate-relevant fields. Building on our past workshops on this topic, this workshop particularly aims to explore data-centric ML approaches for climate action. Data-centric ML is not only a timely topic within the ICLR community, as analyzing and engineering (pre)training datasets becomes increasingly important, but holds specific challenges and opportunities in climate-related areas. We also want to take the opportunity of ICLR being hosted in Singapore to engage with local communities and shine a light on work that deploys, analyzes or critiques ML methods and their use for climate change adaptation and mitigation on the Asian continent.

2025-01-01

ICLR.cc/2025/Workshop_Proposals (publié)

openreview.net

An identification of models to help in the design of national strategies and policies to reduce greenhouse gas emissions.

Danielle Maia de Souza

Radhwane Boukelouha

Emma Frejinger

Catherine Morency

Normand Mousseau

Martin Trépanier

2025-01-01

Transportation Research Procedia (publié)

doi.org

Avantage IA

Mettre à profit l'IA pour un avenir durable

Bourse Mila en politiques de l'IA

Avantage IA

Mettre à profit l'IA pour un avenir durable

Publications

Avantage IA

Mettre à profit l'IA pour un avenir durable

Bourse Mila en politiques de l'IA

Avantage IA

Mettre à profit l'IA pour un avenir durable

Mots-clés populaires:

Publications