Publications

AILuminate: Introducing v1.0 of the AI Risk and Reliability Benchmark from MLCommons

Shaona Ghosh

Heather Frase

Adina Williams

Sarah Luger

Paul Rottger

Fazl Barez

Sean McGregor

Kenneth Fricklas

Mala Kumar

Quentin Feuillade--Montixi

Kurt Bollacker

Felix Friedrich

Ryan Tsang

Bertie Vidgen

Alicia Parrish

Chris Knotz

Eleonora Presani

Jonathan Bennion

Marisa Ferrara Boston

Mike Kuniavsky … (voir 81 de plus)

Wiebke Hutiri

James Ezick

Malek Ben Salem

Rajat Sahay

Sujata Goswami

Usman Gohar

Ben Huang

Supheakmungkol Sarin

Elie Alhajjar

Canyu Chen

Roman Eng

K. Manjusha

Virendra Mehta

Eileen Peters Long

Murali Krishna Emani

Natan Vidra

Benjamin Rukundo

Abolfazl Shahbazi

Kongtao Chen

Rajat Ghosh

Vithursan Thangarasa

Pierre Peign'e

Abhinav Singh

Max Bartolo

Satyapriya Krishna

Mubashara Akhtar

Rafael Gold

Cody Coleman

Luis Oala

Vassil Tashev

Joseph Marvin Imperial

Amy Russ

Sasidhar Kunapuli

Nicolas Miailhe

Julien Delaunay

Bhaktipriya Radharapu

Rajat Shinde

Tuesday

Debojyoti Dutta

D. Grabb

Ananya Gangavarapu

Saurav Sahay

Agasthya Gangavarapu

Patrick Schramowski

Stephen Singam

Tom David

Xudong Han

Priyanka Mary Mammen

Tarunima Prabhakar

Venelin Kovatchev

Ahmed M. Ahmed

Kelvin Manyeki

Sandeep Madireddy

Foutse Khomh

Fedor Zhdanov

Joachim Baumann

N. Vasan

Xianjun Yang

Carlos Mougn

Jibin Rajan Varghese

Hussain Chinoy

Seshakrishna Jitendar

Manil Maskey

Claire V. Hardgrove

Tianhao Li

Aakash Gupta

Emil Joswin

Yifan Mai

Shachi H. Kumar

Çigdem Patlak

Kevin Lu

Vincent Alessi

Sree Bhargavi Balija

Chenhe Gu

Robert Sullivan

James Gealy

Matt Lavrisa

James Goel

Peter Mattson

Percy Liang

Joaquin Vanschoren

2025-02-19

ArXiv (prépublication)

arxiv.org

AILuminate: Introducing v1.0 of the AI Risk and Reliability Benchmark from MLCommons

Shaona Ghosh

Heather Frase

Adina Williams

Sarah Luger

Paul Rottger

Fazl Barez

Sean McGregor

Kenneth Fricklas

Mala Kumar

Quentin Feuillade--Montixi

Kurt Bollacker

Felix Friedrich

Ryan Tsang

Bertie Vidgen

Alicia Parrish

Chris Knotz

Eleonora Presani

Jonathan Bennion

Marisa Ferrara Boston

Mike Kuniavsky … (voir 81 de plus)

Wiebke Hutiri

James Ezick

Malek Ben Salem

Rajat Sahay

Sujata Goswami

Usman Gohar

Ben Huang

Supheakmungkol Sarin

Elie Alhajjar

Canyu Chen

Roman Eng

K. Manjusha

Virendra Mehta

Eileen Peters Long

Murali Krishna Emani

Natan Vidra

Benjamin Rukundo

Abolfazl Shahbazi

Kongtao Chen

Rajat Ghosh

Vithursan Thangarasa

Pierre Peign'e

Abhinav Singh

Max Bartolo

Satyapriya Krishna

Mubashara Akhtar

Rafael Gold

Cody Coleman

Luis Oala

Vassil Tashev

Joseph Marvin Imperial

Amy Russ

Sasidhar Kunapuli

Nicolas Miailhe

Julien Delaunay

Bhaktipriya Radharapu

Rajat Shinde

Tuesday

Debojyoti Dutta

Declan Grabb

Ananya Gangavarapu

Saurav Sahay

Agasthya Gangavarapu

Patrick Schramowski

Stephen Singam

Tom David

Xudong Han

Priyanka Mary Mammen

Tarunima Prabhakar

Venelin Kovatchev

Ahmed M. Ahmed

Kelvin Manyeki

Sandeep Madireddy

Foutse Khomh

Fedor Zhdanov

Joachim Baumann

N. Vasan

Xianjun Yang

Carlos Mougn

Jibin Rajan Varghese

Hussain Chinoy

Seshakrishna Jitendar

Manil Maskey

Claire V. Hardgrove

Tianhao Li

Aakash Gupta

Emil Joswin

Yifan Mai

Shachi H. Kumar

Çigdem Patlak

Kevin Lu

Vincent Alessi

Sree Bhargavi Balija

Chenhe Gu

Robert Sullivan

James Gealy

Matt Lavrisa

James Goel

Peter Mattson

Percy Liang

Joaquin Vanschoren

2025-02-19

ArXiv (prépublication)

doi.org

arxiv.org

Dehumanizing Machines: Mitigating Anthropomorphic Behaviors in Text Generation Systems

Myra Cheng

Su Lin Blodgett

Alicia DeVrio

Lisa Egede

Alexandra Olteanu

As text generation systems' outputs are increasingly anthropomorphic -- perceived as human-like -- scholars have also raised increasing conc… (voir plus)erns about how such outputs can lead to harmful outcomes, such as users over-relying or developing emotional dependence on these systems. How to intervene on such system outputs to mitigate anthropomorphic behaviors and their attendant harmful outcomes, however, remains understudied. With this work, we aim to provide empirical and theoretical grounding for developing such interventions. To do so, we compile an inventory of interventions grounded both in prior literature and a crowdsourced study where participants edited system outputs to make them less human-like. Drawing on this inventory, we also develop a conceptual framework to help characterize the landscape of possible interventions, articulate distinctions between different types of interventions, and provide a theoretical basis for evaluating the effectiveness of different interventions.

2025-02-19

ArXiv (prépublication)

arxiv.org

Dehumanizing Machines: Mitigating Anthropomorphic Behaviors in Text Generation Systems

Myra Cheng

Su Lin Blodgett

Alicia DeVrio

Lisa Egede

Alexandra Olteanu

As text generation systems' outputs are increasingly anthropomorphic -- perceived as human-like -- scholars have also raised increasing conc… (voir plus)erns about how such outputs can lead to harmful outcomes, such as users over-relying or developing emotional dependence on these systems. How to intervene on such system outputs to mitigate anthropomorphic behaviors and their attendant harmful outcomes, however, remains understudied. With this work, we aim to provide empirical and theoretical grounding for developing such interventions. To do so, we compile an inventory of interventions grounded both in prior literature and a crowdsourced study where participants edited system outputs to make them less human-like. Drawing on this inventory, we also develop a conceptual framework to help characterize the landscape of possible interventions, articulate distinctions between different types of interventions, and provide a theoretical basis for evaluating the effectiveness of different interventions.

2025-02-19

ArXiv (prépublication)

doi.org

arxiv.org

Object-centric Binding in Contrastive Language-Image Pretraining

Rim Assouel

Pietro Astolfi

Florian Bordes

Michal Drozdzal

Adriana Romero Soriano

Recent advances in vision language models (VLM) have been driven by contrastive models such as CLIP, which learn to associate visual informa… (voir plus)tion with their corresponding text descriptions. However, these models have limitations in understanding complex compositional scenes involving multiple objects and their spatial relationships. To address these challenges, we propose a novel approach that diverges from commonly used strategies, which rely on the design of hard-negative augmentations. Instead, our work focuses on integrating inductive biases into pre-trained CLIP-like models to improve their compositional understanding without using any additional hard-negatives. To that end, we introduce a binding module that connects a scene graph, derived from a text description, with a slot-structured image representation, facilitating a structured similarity assessment between the two modalities. We also leverage relationships as text-conditioned visual constraints, thereby capturing the intricate interactions between objects and their contextual relationships more effectively. Our resulting model not only enhances the performance of CLIP-based models in multi-object compositional understanding but also paves the way towards more accurate and sample-efficient image-text matching of complex scenes.

2025-02-19

ArXiv (prépublication)

arxiv.org

Object-centric Binding in Contrastive Language-Image Pretraining

Rim Assouel

Pietro Astolfi

Florian Bordes

Michal Drozdzal

Adriana Romero Soriano

2025-02-19

ArXiv (prépublication)

doi.org

arxiv.org

Making the Write Connections: Linking Writing Support Tools with Writer's Needs

Zixin Zhao

Damien Masson

Young-Ho Kim

Gerald Penn

Fanny Chevalier

2025-02-18

ArXiv (prépublication)

doi.org

arxiv.org

Making the Write Connections: Linking Writing Support Tools with Writer's Needs

Zixin Zhao

Damien Masson

Young-Ho Kim

Gerald Penn

Fanny Chevalier

This work sheds light on whether and how creative writers' needs are met by existing research and commercial writing support tools (WST). We… (voir plus) conducted a need finding study to gain insight into the writers' process during creative writing through a qualitative analysis of the response from an online questionnaire and Reddit discussions on r/Writing. Using a systematic analysis of 115 tools and 67 research papers, we map out the landscape of how digital tools facilitate the writing process. Our triangulation of data reveals that research predominantly focuses on the writing activity and overlooks pre-writing activities and the importance of visualization. We distill 10 key takeaways to inform future research on WST and point to opportunities surrounding underexplored areas. Our work offers a holistic and up-to-date account of how tools have transformed the writing process, guiding the design of future tools that address writers' evolving and unmet needs.

2025-02-18

ArXiv (prépublication)

doi.org

arxiv.org

Random Forest Autoencoders for Guided Representation Learning

Adrien Aumon

Shuang Ni

Myriam Lizotte

Guy Wolf

Kevin R. Moon

Jake S. Rhodes

Decades of research have produced robust methods for unsupervised data visualization, yet supervised visualization…

2025-02-18

ArXiv (prépublication)

arxiv.org

Random Forest Autoencoders for Guided Representation Learning

Adrien Aumon

Shuang Ni

Myriam Lizotte

Guy Wolf

Kevin R. Moon

Jake S. Rhodes

Decades of research have produced robust methods for unsupervised data visualization, yet supervised visualization…

2025-02-18

ArXiv (prépublication)

doi.org

arxiv.org

Adversarial Alignment for LLMs Requires Simpler, Reproducible, and More Measurable Objectives

Leo Schwinn

Yan Scholten

Tom Wollschlager

Sophie Xhonneux

Stephen Casper

Stephan Günnemann

Gauthier Gidel

2025-02-17

ArXiv (prépublication)

doi.org

arxiv.org

Adversarial Alignment for LLMs Requires Simpler, Reproducible, and More Measurable Objectives

Leo Schwinn

Yan Scholten

Tom Wollschlager

Sophie Xhonneux

Stephen Casper

Stephan Günnemann

Gauthier Gidel

Misaligned research objectives have considerably hindered progress in adversarial robustness research over the past decade. For instance, an… (voir plus) extensive focus on optimizing target metrics, while neglecting rigorous standardized evaluation, has led researchers to pursue ad-hoc heuristic defenses that were seemingly effective. Yet, most of these were exposed as flawed by subsequent evaluations, ultimately contributing little measurable progress to the field. In this position paper, we illustrate that current research on the robustness of large language models (LLMs) risks repeating past patterns with potentially worsened real-world implications. To address this, we argue that realigned objectives are necessary for meaningful progress in adversarial alignment. To this end, we build on established cybersecurity taxonomy to formally define differences between past and emerging threat models that apply to LLMs. Using this framework, we illustrate that progress requires disentangling adversarial alignment into addressable sub-problems and returning to core academic principles, such as measureability, reproducibility, and comparability. Although the field presents significant challenges, the fresh start on adversarial robustness offers the unique opportunity to build on past experience while avoiding previous mistakes.

2025-02-17

ArXiv (prépublication)

arxiv.org

À la hauteur du moment

Perspectives sur l’IA pour les responsables des politiques

Mila Techaide 2025

Développement du groupe d'experts de l'ONU sur l'IA

Transition à la direction scientifique de Mila

À la hauteur du moment

Perspectives sur l’IA pour les responsables des politiques

Publications

À la hauteur du moment

Perspectives sur l’IA pour les responsables des politiques

Mila Techaide 2025

Développement du groupe d'experts de l'ONU sur l'IA

Transition à la direction scientifique de Mila

À la hauteur du moment

Perspectives sur l’IA pour les responsables des politiques

Mots-clés populaires:

Publications