Publications

Stimulus information guides the emergence of behavior-related signals in primary somatosensory cortex during learning.

Mariangela Panniello

Colleen J Gillon

Roberto Maffulli

Marco Celotto

Stefano Panzeri

Blake Richards

Michael M Kohl

2024-05-25

Cell Reports (published)

doi.org

Towards a Generic Representation of Combinatorial Problems for Learning-Based Approaches

Léo Boisvert

Hélène Verhaeghe

Quentin Cappart

2024-05-25

Integration of Constraint Programming, Artificial Intelligence, and Operations Research (published)

doi.org

arxiv.org

Managing AI Risks in an Era of Rapid Progress

Yoshua Bengio

Geoffrey Hinton

Andrew Yao

Dawn Song

Pieter Abbeel

Yuval Noah Harari

Trevor Darrell

Ya-Qin Zhang

Lan Xue

Shai Shalev-Shwartz

Gillian K. Hadfield

Jeff Clune

Tegan Maharaj

Frank Hutter

Atilim Güneş Baydin

Sheila McIlraith

Qiqi Gao

Ashwin Acharya

David Scott Krueger

Anca Dragan

Philip Torr

Stuart Russell

Daniel Kahneman

Jan Brauner

Sören Mindermann

2024-05-24

Science (published)

doi.org

arxiv.org

Assortment Optimization with Visibility Constraints

Théo Barré

Omar El Housni

Andrea Lodi

2024-05-22

Integer Programming and Combinatorial Optimization (published)

doi.org

arxiv.org

Towards Modular LLMs by Building and Reusing a Library of LoRAs

Oleksiy Ostapenko

Zhan Su

Edoardo Ponti

Laurent Charlin

Nicolas Le Roux

Matheus Pereira

Lucas Caccia

Alessandro Sordoni

The growing number of parameter-efficient adaptations of a base large language model (LLM) calls for studying whether we can reuse such trained adapters to improve performance for new tasks. We study how to best build a library of adapters given multi-task data and devise techniques for both zero-shot and supervised task generalization through routing in such library. We benchmark existing approaches to build this library and introduce model-based clustering, MBC, a method that groups tasks based on the similarity of their adapter parameters, indirectly optimizing for transfer across the multi-task dataset. To re-use the library, we present a novel zero-shot routing mechanism, Arrow, which enables dynamic selection of the most relevant adapters for new inputs without the need for retraining. We experiment with several LLMs, such as Phi-2 and Mistral, on a wide array of held-out tasks, verifying that MBC-based adapters and Arrow routing lead to superior generalization to new tasks. We make steps towards creating modular, adaptable LLMs that can match or outperform traditional joint training.

2024-05-18

ArXiv (preprint)

arxiv.org

GFETM: Genome Foundation-based Embedded Topic Model for scATAC-seq Modeling

Yimin Fan

Yu Li

Jun Ding

Yue Li

2024-05-17

Lecture Notes in Computer Science (published)

doi.org

What Mechanisms Does Knowledge Distillation Distill?

Cindy Wu

Ekdeep Singh Lubana

Bruno Mlodozeniec

Robert Kirk

David Scott Krueger

Knowledge distillation is a commonly-used compression method in ML due to the popularity of increasingly large-scale models, but it is unclear if all the information a teacher model contains is distilled into the smaller student model. We aim to formalize the concept of ‘knowledge’ to investigate how knowledge is transferred during distillation, focusing on shared invariant outputs to counterfactual changes of dataset latent variables (we call these latents mechanisms). We define a student model to be a good stand-in model for a teacher if it shares the teacher’s learned mechanisms, and find that Jacobian matching and contrastive representation learning are viable methods by which to train such models. While these methods do not result in perfect transfer of mechanisms, we show they often improve student fidelity or mitigate simplicity bias (as measured by the teacher-to-student KL divergence and accuracy on various out-of-distribution test datasets), especially on datasets with spurious statistical correlations.

2024-05-14

Proceedings of UniReps: the First Workshop on Unifying Representations in Neural Models (published)

proceedings.mlr.press

openreview.net

TEMPLATES: Characterization of a Merger in the Dusty Lensing SPT0418-47 System

Jared Cathey

Anthony H. Gonzalez

Sidney Lower

Kedar A. Phadke

Justin Spilker

Manuel Aravena

Matthew Bayliss

Jack E. Birkin

Simon Birrer

Scott Chapman

Håkon Dahle

Christopher C. Hayward

Yashar Hezaveh

Ryley Hill

Taylor A. Hutchison

Keunho J. Kim

Guillaume Mahler

Daniel P. Marrone

Desika Narayanan

Alexander Navarre

Cassie Reuter

Jane R Rigby

Keren Sharon

Manuel Solimano

Nikolaus Sulzenauer

Joaquin Vieira

David Vizgan

2024-05-13

The Astrophysical Journal (published)

doi.org

arxiv.org

An AI-Resilient Text Rendering Technique for Reading and Skimming Documents

Ziwei Gu

Ian Arawjo

Kenneth Li

Jonathan K. Kummerfeld

Elena L. Glassman

2024-05-11

Proceedings of the CHI Conference on Human Factors in Computing Systems (published)

doi.org

arxiv.org

ChainForge: A Visual Toolkit for Prompt Engineering and LLM Hypothesis Testing

Ian Arawjo

Chelse Swoopes

Priyan Vaithilingam

Martin Wattenberg

Elena L. Glassman

Evaluating outputs of large language models (LLMs) is challenging, requiring making -- and making sense of -- many responses. Yet tools that go beyond basic prompting tend to require knowledge of programming APIs, focus on narrow domains, or are closed-source. We present ChainForge, an open-source visual toolkit for prompt engineering and on-demand hypothesis testing of text generation LLMs. ChainForge provides a graphical interface for comparison of responses across models and prompt variations. Our system was designed to support three tasks: model selection, prompt template design, and hypothesis testing (e.g., auditing). We released ChainForge early in its development and iterated on its design with academics and online users. Through in-lab and interview studies, we find that a range of people could use ChainForge to investigate hypotheses that matter to them, including in real-world settings. We identify three modes of prompt engineering and LLM hypothesis testing: opportunistic exploration, limited evaluation, and iterative refinement.

2024-05-11

Proceedings of the CHI Conference on Human Factors in Computing Systems (published)

doi.org

arxiv.org

295. Rare Variant Genetic Architecture of the Human Cortical MRI Phenotypes in General Population

Kuldeep Kumar

Sayeh Kazem

Zhijie Liao

Jakub Kopal

Guillaume Huguet

Thomas Renne

Martineau Jean-Louis

Zhe Xie

Zohra Saci

Laura Almasy

David C. Glahn

Tomas Paus

Guillaume Dumas

Carrie Bearden

Paul Thompson

Richard A.I. Bethlehem

Varun Warrier

Sébastien Jacquemont

2024-05-01

Biological Psychiatry (published)

doi.org

All-in-one simulation-based inference

Manuel Gloeckler

Michael Deistler

Christian Dietrich Weilbach

Frank Wood

Jakob H. Macke

2024-05-01

ICML.cc/2024/Conference (oral)

doi.org

openreview.net
