Publications

When does word order matter and when doesn't it?

Xuanda Chen

Timothy John O'donnell

Language models (LMs) may appear insensitive to word order changes in natural language understanding (NLU) tasks. In this paper, we propose … (voir plus)that linguistic redundancy can explain this phenomenon, whereby word order and other linguistic cues such as case markers provide overlapping and thus redundant information. Our hypothesis is that models exhibit insensitivity to word order when the order provides redundant information, and the degree of insensitivity varies across tasks. We quantify how informative word order is using mutual information (MI) between unscrambled and scrambled sentences. Our results show the effect that the less informative word order is, the more consistent the model's predictions are between unscrambled and scrambled sentences. We also find that the effect varies across tasks: for some tasks, like SST-2, LMs' prediction is almost always consistent with the original one even if the Pointwise-MI (PMI) changes, while for others, like RTE, the consistency is near random when the PMI gets lower, i.e., word order is really important.

2024-02-29

ArXiv (prépublication)

doi.org

arxiv.org

Acoustic tactile sensing for mobile robot wheels

Wilfred Mason

David Brenken

Falcon Z. Dai

Ricardo Gonzalo Cruz Castillo

Olivier St-Martin Cormier

Audrey Sedal

2024-02-28

ArXiv (prépublication)

doi.org

arxiv.org

ICE-SEARCH: A Language Model-Driven Feature Selection Approach

Tianze Yang

Tianyi Yang

Shaoshan Liu

Fuyuan Lyu

Xue (Steve) Liu

This study unveils the In-Context Evolutionary Search (ICE-SEARCH) method, the first work that melds language models (LMs) with evolutionary… (voir plus) algorithms for feature selection (FS) tasks and demonstrates its effectiveness in Medical Predictive Analytics (MPA) applications. ICE-SEARCH harnesses the crossover and mutation capabilities inherent in LMs within an evolutionary framework, significantly improving FS through the model's comprehensive world knowledge and its adaptability to a variety of roles. Our evaluation of this methodology spans three crucial MPA tasks: stroke, cardiovascular disease, and diabetes, where ICE-SEARCH outperforms traditional FS methods in pinpointing essential features for medical applications. ICE-SEARCH achieves State-of-the-Art (SOTA) performance in stroke prediction and diabetes prediction; the Decision-Randomized ICE-SEARCH ranks as SOTA in cardiovascular disease prediction. Our results not only demonstrate the efficacy of ICE-SEARCH in medical FS but also underscore the versatility, efficiency, and scalability of integrating LMs in FS tasks. The study emphasizes the critical role of incorporating domain-specific insights, illustrating ICE-SEARCH's robustness, generalizability, and swift convergence. This opens avenues for further research into comprehensive and intricate FS landscapes, marking a significant stride in the application of artificial intelligence in medical predictive analytics.

2024-02-28

ArXiv (prépublication)

doi.org

arxiv.org

On the Challenges and Opportunities in Generative AI

Laura Manduchi

Kushagra Pandey

Robert Bamler

Ryan Cotterell

Sina Daubener

Sophie Fellenz

Asja Fischer

Thomas Gartner

Matthias Kirchler

Marius Kloft

Yingzhen Li

Christoph Lippert

Gerard de Melo

Eric T. Nalisnick

Bjorn Ommer

Rajesh Ranganath

Maja Rudolph

Karen Ullrich

Guy Van den Broeck

Julia E Vogt … (voir 5 de plus)

Yixin Wang

Florian Wenzel

Frank Wood

Stephan Mandt

Vincent Fortuin

2024-02-28

ArXiv (prépublication)

doi.org

arxiv.org

A density estimation perspective on learning from pairwise human preferences

Vincent Dumoulin

Daniel D. Johnson

Pablo Samuel Castro

Hugo Larochelle

Yann Dauphin

Learning from human feedback (LHF) -- and in particular learning from pairwise preferences -- has recently become a crucial ingredient in tr… (voir plus)aining large language models (LLMs), and has been the subject of much research. Most recent works frame it as a reinforcement learning problem, where a reward function is learned from pairwise preference data and the LLM is treated as a policy which is adapted to maximize the rewards, often under additional regularization constraints. We propose an alternative interpretation which centers on the generative process for pairwise preferences and treats LHF as a density estimation problem. We provide theoretical and empirical results showing that for a family of generative processes defined via preference behavior distribution equations, training a reward function on pairwise preferences effectively models an annotator's implicit preference distribution. Finally, we discuss and present findings on"annotator misspecification"-- failure cases where wrong modeling assumptions are made about annotator behavior, resulting in poorly-adapted models -- suggesting that approaches that learn from pairwise human preferences could have trouble learning from a population of annotators with diverse viewpoints.

2024-02-27

TMLR (accepté)

doi.org

openreview.net

A Neural-Evolutionary Algorithm for Autonomous Transit Network Design

Andrew Holliday

Gregory Dudek

2024-02-27

ArXiv (prépublication)

doi.org

arxiv.org

RAMEN Unveils Clinical Variable Networks for COVID-19 Severity and Long COVID Using Absorbing Random Walks and Genetic Algorithms

Yiwei Xiong

Jingtao Wang

Xiaoxiao Shang

Tingting Chen

Douglas D. Fraser

Gregory Fonseca

Simon Rousseau

Jun Ding

The COVID-19 pandemic has significantly altered global socioeconomic structures and individual lives. Understanding the disease mechanisms a… (voir plus)nd facilitating diagnosis requires comprehending the complex interplay among clinical factors like demographics, symptoms, comorbidities, treatments, lab results, complications, and other metrics, and their relation to outcomes such as disease severity and long term outcomes (e.g., post-COVID-19 condition/long COVID). Conventional correlational methods struggle with indirect and directional connections among these factors, while standard graphical methods like Bayesian networks are computationally demanding for extensive clinical variables. In response, we introduced RAMEN, a methodology that integrates Genetic Algorithms with random walks for efficient Bayesian network inference, designed to map the intricate relationships among clinical variables. Applying RAMEN to the Biobanque québécoise de la COVID-19 (BQC19) dataset, we identified critical markers for long COVID and varying disease severity. The Bayesian Network, corroborated by existing literature and supported through multi-omics analyses, highlights significant clinical variables linked to COVID-19 outcomes. RAMEN’s ability to accurately map these connections contributes substantially to developing early and effective diagnostics for severe COVID-19 and long COVID.

2024-02-27

bioRxiv (prépublication)

doi.org

On the Societal Impact of Open Foundation Models

Sayash Kapoor

Rishi Bommasani

Kevin Klyman

Shayne Longpre

Ashwin Ramaswami

Peter Cihon

Aspen Hopkins

Kevin Bankston

Stella Biderman

Miranda Bogen

Rumman Chowdhury

Alex Engler

Peter Henderson

Yacine Jernite

Seth Lazar

Stefano Maffulli

Alondra Nelson

Joelle Pineau

Aviya Skowron

Dawn Song … (voir 5 de plus)

Victor Storchan

Daniel Zhang

Daniel E. Ho

Percy Liang

Arvind Narayanan

2024-02-27

ArXiv (prépublication)

doi.org

arxiv.org

Effective Latent Differential Equation Models via Attention and Multiple Shooting

Germán Abrevaya

Mahta Ramezanian-Panahi

Jean-Christophe Gagnon-Audet

Pablo Polosecki

Irina Rish

Silvina Ponce Dawson

Guillermo Cecchi

Guillaume Dumas

2024-02-26

TMLR (accepté)

openreview.net

SKILL: Similarity-aware Knowledge distILLation for Speech Self-Supervised Learning

Luca Zampierin

Ghouthi Boukli Hacene

Bac Nguyen

Mirco Ravanelli

2024-02-26

ArXiv (prépublication)

doi.org

arxiv.org

Correction to: Multi-agent reinforcement learning for fast-timescale demand response of residential loads

Vincent Mai

Philippe Maisonneuve

Tianyu Zhang

Hadi Nekoei

Liam Paull

Antoine Lesage-Landry

2024-02-23

Machine-mediated learning (publié)

doi.org

Intra-Host Evolution Analyses in an Immunosuppressed Patient Supports SARS-CoV-2 Viral Reservoir Hypothesis

Dominique Fournelle

Fatima Mostefai

Elsa Brunet-Ratnasingham

Raphael Poujol

Jean-Christophe Grenier

José Héctor Gálvez

Amélie Pagliuzza

Inès Levade

Sandrine Moreira

Mehdi Benlarbi

Guillaume Beaudoin-Bussières

Gabrielle Gendron-Lepage

Catherine Bourassa

Alexandra Tauzin

Simon Grandjean Lapierre

Nicolas Chomont

Andrés Finzi

Daniel E. Kaufmann

Morgan Craig

Julie Hussin

2024-02-23

Viruses (publié)

doi.org

Le traitement du langage naturel à l'ère de l'IA générative

Boussole des politiques en IA

Vie étudiante et ressources

Publications

Le traitement du langage naturel à l'ère de l'IA générative

Boussole des politiques en IA

Vie étudiante et ressources

Mots-clés populaires:

Publications