Publications

Outsourced diffusion sampling: Efficient posterior inference in latent spaces of generative models

Siddarth Venkatraman

Mohsin Hasan

Minsu Kim

Luca Scimeca

Marcin Sendera

Yoshua Bengio

Glen Berseth

Nikolay Malkin

Any well-behaved generative model over a variable …

2025-05-01

ICML.cc/2025/Conference (poster)

Plasticity as the Mirror of Empowerment

David Abel

Michael Bowling

Andre Barreto

Will Dabney

Shi Dong

Steven Hansen

Anna Harutyunyan

Khimya Khetarpal

Clare Lyle

Razvan Pascanu

Georgios Piliouras

Doina Precup

Jonathan Richens

Mark Rowland

Tom Schaul

Satinder Singh

2025-05-01

arXiv (published)

PoisonBench: Assessing Language Model Vulnerability to Poisoned Preference Data

Tingchen Fu

Mrinank Sharma

Philip Torr

Shay B. Cohen

David Scott Krueger

Fazl Barez

Preference learning is a central component for aligning current LLMs, but this process can be vulnerable to data poisoning attacks. To addre… (see more)ss this concern, we introduce PoisonBench, a benchmark for evaluating large language models' susceptibility to data poisoning during preference learning. Data poisoning attacks can manipulate large language model responses to include hidden malicious content or biases, potentially causing the model to generate harmful or unintended outputs while appearing to function normally. We deploy two distinct attack types across eight realistic scenarios, assessing 22 widely-used models. Our findings reveal concerning trends: (1) Scaling up parameter size does not always enhance resilience against poisoning attacks and the influence on model resilience varies among different model suites. (2) There exists a log-linear relationship between the effects of the attack and the data poison ratio; (3) The effect of data poisoning can generalize to extrapolated triggers that are not included in the poisoned data. These results expose weaknesses in current preference learning techniques, highlighting the urgent need for more robust defenses against malicious models and data manipulation.

2025-05-01

ICML.cc/2025/Conference (poster)

Position: Probabilistic Modelling is Sufficient for Causal Inference

Bruno Mlodozeniec

David Scott Krueger

Richard E. Turner

2025-05-01

ICML.cc/2025/Position_Paper_Track (oral)

Proceedings of 1st Workshop on Advancing Artificial Intelligence through Theory of Mind

Mouad Abrini

Omri Abend

Dina M. Acklin

Henny Admoni

Gregor Aichinger

Nitay Alon

Zahra Ashktorab

Ashish Atreja

Moises Auron

Alexander Aufreiter

Raghav Awasthi

Soumya Banerjee

Joseph Barnby

Rhea Basappa

Severin Bergsmann

Djallel Bouneffouf

Patrick Callaghan

Marc Cavazza

Thierry Chaminade

Sonia Chernova … (see 88 more)

Mohamed Chetouan

Moumita Choudhury

Axel Cleeremans

J. Cywinski

Fabio Cuzzolin

Hokin Deng

N'yoma Diamond

C. D. Pasquasio

Guillaume Dumas

Max J. van Duijn

Mahapatra Dwarikanath

Qingying Gao

Ashok Goel

Rebecca R. Goldstein

Matthew C. Gombolay

Gabriel Enrique Gonzalez

Amar Halilovic

Tobias Halmdienst

Mahimul Islam

Julian Jara-Ettinger

Natalie Kastel

Renana Keydar

Ashish K. Khanna

Mahdi Khoramshahi

Jihyun Kim

Mihyeon Kim

Youngbin Kim

Senka Krivic

Nikita Krasnytskyi

Arun Kumar

Junehyoung Kwon

EunJu Lee

Shane Lee

Peter R. Lewis 0001

Xue Li

Yijiang Li

Michal Lewandowski

Nathan Lloyd

Matthew B. Luebbers

Dezhi Luo

Haiyun Lyu

Dwarikanath Mahapatra

Kamal Maheshwari

Mallika Mainali

P. Mathur

Patrick Mederitsch

Shuwa Miura

Manuel Preston de Miranda

Reuth Mirsky

Shreya Mishra

Nina M. Moorman

Katelyn Morrison

John Muchovej

Bernhard Nessler

Felix Nessler

Hieu Minh Jord Nguyen

Abby Ortego

F. Papay

Antoine Pasquali

Hamed Rahimi

C. Raghu

Amanda L. Royka

Stefan Sarkadi

Jaelle Scheuerman

Simon Schmid

Paul Schrater

Anik Sen

Zahra Sheikhbahaee

Ke Shi

Reid G. Simmons

Nishant Singh

Mason O. Smith

Ramira van der Meulen

Anthia Solaki

Haoran Sun

Viktor Szolga

Matthew E. Taylor

Travis Taylor

Sanne van Waveren

Juan David Vargas

R. Verbrugge

Eitan Wagner

Justin D. Weisz

Ximing Wen

William Yeoh

Wenlong Zhang

Michelle Zhao

Shlomo Zilberstein

2025-05-01

arXiv (published)

Putting the Value Back in RL: Better Test-Time Scaling by Unifying LLM Reasoners With Verifiers

Kusha Sareen

Morgane M Moss

Alessandro Sordoni

Rishabh Agarwal

Arian Hosseini

2025-05-01

arXiv (published)

Real-time fine finger motion decoding for transradial amputees with surface electromyography

Zihan Weng

Yang Xiao

Peiyang Li

Chanlin Yi

Pouya Bashivan

Hailin Ma

Guang Yao

Yuan Lin

Fali Li

Dezhong Yao 0001

Jingming Hou

Yangsong Zhang

Peng Xu

2025-05-01

Neural Networks (published)

REARANK: Reasoning Re-ranking Agent via Reinforcement Learning

Le Zhang

Bo Wang

Xipeng Qiu

Siva Reddy

Aishwarya Agrawal

We present REARANK, a large language model (LLM)-based listwise reasoning reranking agent. REARANK explicitly reasons before reranking, sign… (see more)ificantly improving both performance and interpretability. Leveraging reinforcement learning and data augmentation, REARANK achieves substantial improvements over baseline models across popular information retrieval benchmarks, notably requiring only 179 annotated samples. Built on top of Qwen2.5-7B, our REARANK-7B demonstrates performance comparable to GPT-4 on both in-domain and out-of-domain benchmarks and even surpasses GPT-4 on reasoning-intensive BRIGHT benchmarks. These results underscore the effectiveness of our approach and highlight how reinforcement learning can enhance LLM reasoning capabilities in reranking.

2025-05-01

arXiv (published)