Publications

Editorial: Special Issue on Software Engineering and AI for Data Quality

Andreas Metzger

Phu H. Nguyen

Sagar Sen

This editorial summarizes the content of the Special Issue on Software Engineering and AI for Data Quality of the Journal of Data and Inform… (voir plus)ation Quality (JDIQ).

2024-12-19

Journal of Data and Information Quality (publié)

doi.org

Editorial: Special Issue on Software Engineering and AI for Data Quality

Foutse Khomh

Andreas Metzger

Phu Nguyen

Sagar Sen

This editorial summarizes the content of the Special Issue on Software Engineering and AI for Data Quality of the Journal of Data and Inform… (voir plus)ation Quality (JDIQ).

2024-12-19

Journal of Data and Information Quality (publié)

doi.org

Scaling 4D Representations

João Carreira

Dilara Gokay

Michael King

Chuhan Zhang

Ignacio Rocco

Aravindh Mahendran

T. Keck

Joseph Heyward

Skanda Koppula

Etienne Pot

Goker Erdogan

Yana Hasson

Yi Yang

Klaus Greff

Guillaume Le Moing

Sjoerd van Steenkiste

Daniel Zoran

Drew A. Hudson

Pedro V'elez

Luisa F. Polan'ia … (voir 15 de plus)

Luke Friedman

Chris Duvarney

Ross Goroshin

Kelsey Allen

Jacob Walker

Rishabh Kabra

Eric Aboussouan

Jennifer Sun

Thomas Kipf

Carl Doersch

Viorica Puatruaucean

Dima Damen

Pauline Luc

Mehdi S. M. Sajjadi

Andrew Zisserman

Scaling has not yet been convincingly demonstrated for pure self-supervised learning from video. However, prior work has focused evaluations… (voir plus) on semantic-related tasks

2024-12-19

ArXiv (prépublication)

doi.org

arxiv.org

Delays in Care for Children With Low Anorectal Malformations in Southwestern Uganda.

Felix Oyania

Caroline Q. Stephens

Sarah Ullrich

Meera Kotagal

Daniel Kisitu

Francis Bajunirwe

Doruk Ozgediz

Dan Poenaru

2024-12-18

Journal of Surgical Research (publié)

doi.org

Delays in Care for Children With Low Anorectal Malformations in Southwestern Uganda.

Felix Oyania

Caroline Q. Stephens

Sarah Ullrich

Meera Kotagal

Daniel Kisitu

Francis Bajunirwe

Doruk Ozgediz

Dan Poenaru

2024-12-18

Journal of Surgical Research (publié)

doi.org

Delays in Care for Children With Low Anorectal Malformations in Southwestern Uganda.

Felix Oyania

Caroline Q. Stephens

Sarah Ullrich

Meera Kotagal

Daniel Kisitu

Francis Bajunirwe

Doruk Ozgediz

Dan Poenaru

2024-12-18

Journal of Surgical Research (publié)

doi.org

Embedding Cultural Diversity in Prototype-based Recommender Systems

Popularity bias in recommender systems can increase cultural overrepresentation by favoring norms from dominant cultures and marginalizing u… (voir plus)nderrepresented groups. This issue is critical for platforms offering cultural products, as they influence consumption patterns and human perceptions. In this work, we address popularity bias by identifying demographic biases within prototype-based matrix factorization methods. Using the country of origin as a proxy for cultural identity, we link this demographic attribute to popularity bias by refining the embedding space learning process. First, we propose filtering out irrelevant prototypes to improve representativity. Second, we introduce a regularization technique to enforce a uniform distribution of prototypes within the embedding space. Across four datasets, our results demonstrate a 27\% reduction in the average rank of long-tail items and a 2\% reduction in the average rank of items from underrepresented countries. Additionally, our model achieves a 2\% improvement in HitRatio@10 compared to the state-of-the-art, highlighting that fairness is enhanced without compromising recommendation quality. Moreover, the distribution of prototypes leads to more inclusive explanations by better aligning items with diverse prototypes.

2024-12-18

ArXiv (prépublication)

arxiv.org

Embedding Cultural Diversity in Prototype-based Recommender Systems

Popularity bias in recommender systems can increase cultural overrepresentation by favoring norms from dominant cultures and marginalizing u… (voir plus)nderrepresented groups. This issue is critical for platforms offering cultural products, as they influence consumption patterns and human perceptions. In this work, we address popularity bias by identifying demographic biases within prototype-based matrix factorization methods. Using the country of origin as a proxy for cultural identity, we link this demographic attribute to popularity bias by refining the embedding space learning process. First, we propose filtering out irrelevant prototypes to improve representativity. Second, we introduce a regularization technique to enforce a uniform distribution of prototypes within the embedding space. Across four datasets, our results demonstrate a 27\% reduction in the average rank of long-tail items and a 2\% reduction in the average rank of items from underrepresented countries. Additionally, our model achieves a 2\% improvement in HitRatio@10 compared to the state-of-the-art, highlighting that fairness is enhanced without compromising recommendation quality. Moreover, the distribution of prototypes leads to more inclusive explanations by better aligning items with diverse prototypes.

2024-12-18

ArXiv (prépublication)

doi.org

arxiv.org

Enabling Realtime Reinforcement Learning at Scale with Staggered Asynchronous Inference

Matthew Riemer

Gopeshh Raaj Subbaraj

Glen Berseth

Irina Rish

Realtime environments change even as agents perform action inference and learning, thus requiring high interaction frequencies to effectivel… (voir plus)y minimize regret. However, recent advances in machine learning involve larger neural networks with longer inference times, raising questions about their applicability in realtime systems where reaction time is crucial. We present an analysis of lower bounds on regret in realtime reinforcement learning (RL) environments to show that minimizing long-term regret is generally impossible within the typical sequential interaction and learning paradigm, but often becomes possible when sufficient asynchronous compute is available. We propose novel algorithms for staggering asynchronous inference processes to ensure that actions are taken at consistent time intervals, and demonstrate that use of models with high action inference times is only constrained by the environment's effective stochasticity over the inference horizon, and not by action frequency. Our analysis shows that the number of inference processes needed scales linearly with increasing inference times while enabling use of models that are multiple orders of magnitude larger than existing approaches when learning from a realtime simulation of Game Boy games such as Pok\'emon and Tetris.

2024-12-18

ArXiv (prépublication)

doi.org

arxiv.org

Enabling Realtime Reinforcement Learning at Scale with Staggered Asynchronous Inference

Matthew Riemer

Gopeshh Raaj Subbaraj

Glen Berseth

Irina Rish

Realtime environments change even as agents perform action inference and learning, thus requiring high interaction frequencies to effectivel… (voir plus)y minimize regret. However, recent advances in machine learning involve larger neural networks with longer inference times, raising questions about their applicability in realtime systems where reaction time is crucial. We present an analysis of lower bounds on regret in realtime reinforcement learning (RL) environments to show that minimizing long-term regret is generally impossible within the typical sequential interaction and learning paradigm, but often becomes possible when sufficient asynchronous compute is available. We propose novel algorithms for staggering asynchronous inference processes to ensure that actions are taken at consistent time intervals, and demonstrate that use of models with high action inference times is only constrained by the environment's effective stochasticity over the inference horizon, and not by action frequency. Our analysis shows that the number of inference processes needed scales linearly with increasing inference times while enabling use of models that are multiple orders of magnitude larger than existing approaches when learning from a realtime simulation of Game Boy games such as Pok\'emon and Tetris.

2024-12-18

ArXiv (prépublication)

arxiv.org

Inference-Aware Fine-Tuning for Best-of-N Sampling in Large Language Models

Yinlam Chow

Guy Tennenholtz

Izzeddin Gur

Vincent Zhuang

Bo Dai

Sridhar Thiagarajan

Craig Boutilier

Rishabh Agarwal

Aviral Kumar

Aleksandra Faust

Recent studies have indicated that effectively utilizing inference-time compute is crucial for attaining better performance from large langu… (voir plus)age models (LLMs). In this work, we propose a novel inference-aware fine-tuning paradigm, in which the model is fine-tuned in a manner that directly optimizes the performance of the inference-time strategy. We study this paradigm using the simple yet effective Best-of-N (BoN) inference strategy, in which a verifier selects the best out of a set of LLM-generated responses. We devise the first imitation learning and reinforcement learning~(RL) methods for BoN-aware fine-tuning, overcoming the challenging, non-differentiable argmax operator within BoN. We empirically demonstrate that our BoN-aware models implicitly learn a meta-strategy that interleaves best responses with more diverse responses that might be better suited to a test-time input -- a process reminiscent of the exploration-exploitation trade-off in RL. Our experiments demonstrate the effectiveness of BoN-aware fine-tuning in terms of improved performance and inference-time compute. In particular, we show that our methods improve the Bo32 performance of Gemma 2B on Hendrycks MATH from 26.8% to 30.8%, and pass@32 from 60.0% to 67.0%, as well as the pass@16 on HumanEval from 61.6% to 67.1%.

2024-12-18

ArXiv (prépublication)

doi.org

arxiv.org

MetaMorph: Multimodal Understanding and Generation via Instruction Tuning

Shengbang Tong

David Fan

Jiachen Zhu

Yunyang Xiong

Xinlei Chen

Koustuv Sinha

Michael Rabbat

Yann Lecun

Saining Xie

Zhuang Liu

In this work, we propose Visual-Predictive Instruction Tuning (VPiT) - a simple and effective extension to visual instruction tuning that en… (voir plus)ables a pretrained LLM to quickly morph into an unified autoregressive model capable of generating both text and visual tokens. VPiT teaches an LLM to predict discrete text tokens and continuous visual tokens from any input sequence of image and text data curated in an instruction-following format. Our empirical investigation reveals several intriguing properties of VPiT: (1) visual generation ability emerges as a natural byproduct of improved visual understanding, and can be unlocked efficiently with a small amount of generation data; (2) while we find understanding and generation to be mutually beneficial, understanding data contributes to both capabilities more effectively than generation data. Building upon these findings, we train our MetaMorph model and achieve competitive performance on both visual understanding and generation. In visual generation, MetaMorph can leverage the world knowledge and reasoning abilities gained from LLM pretraining, and overcome common failure modes exhibited by other generation models. Our results suggest that LLMs may have strong"prior"vision capabilities that can be efficiently adapted to both visual understanding and generation with a relatively simple instruction tuning process.

2024-12-18

ArXiv (prépublication)

doi.org

arxiv.org

Conférence sur les politiques de l'IA de Mila

À l’avant-garde d’une nouvelle ère

TRAIL : IA responsable pour les professionnels et les leaders

Publications

Conférence sur les politiques de l'IA de Mila

À l’avant-garde d’une nouvelle ère

TRAIL : IA responsable pour les professionnels et les leaders

Mots-clés populaires:

Publications