Publications

TapeAgents: a Holistic Framework for Agent Development and Optimization

Dzmitry Bahdanau

Nicolas Gontier

Gabriel Huang

Ehsan Kamalloo

Rafael Pardinas

Alexandre Piché

Torsten Scholak

Oleh Shliazhko

Jordan Prince Tremblay

Karam Ghanem

Soham Parikh

Mitul Tiwari

Quaizar Vohra

We present TapeAgents, an agent framework built around a granular, structured log tape of the agent session that also plays the role of the … (voir plus)session's resumable state. In TapeAgents we leverage tapes to facilitate all stages of the LLM Agent development lifecycle. The agent reasons by processing the tape and the LLM output to produce new thought and action steps and append them to the tape. The environment then reacts to the agent's actions by likewise appending observation steps to the tape. By virtue of this tape-centred design, TapeAgents can provide AI practitioners with holistic end-to-end support. At the development stage, tapes facilitate session persistence, agent auditing, and step-by-step debugging. Post-deployment, one can reuse tapes for evaluation, fine-tuning, and prompt-tuning; crucially, one can adapt tapes from other agents or use revised historical tapes. In this report, we explain the TapeAgents design in detail. We demonstrate possible applications of TapeAgents with several concrete examples of building monolithic agents and multi-agent teams, of optimizing agent prompts and finetuning the agent's LLM. We present tooling prototypes and report a case study where we use TapeAgents to finetune a Llama-3.1-8B form-filling assistant to perform as well as GPT-4o while being orders of magnitude cheaper. Lastly, our comparative analysis shows that TapeAgents's advantages over prior frameworks stem from our novel design of the LLM agent as a resumable, modular state machine with a structured configuration, that generates granular, structured logs and that can transform these logs into training text -- a unique combination of features absent in previous work.

2024-12-10

ArXiv (prépublication)

doi.org

arxiv.org

Longitudinal reproducibility of brain and spinal cord quantitative MRI biomarkers

Mathieu Boudreau

Agah Karakuzu

Arnaud Boré

Basile Pinsard

Kiril Zelenkovski

Eva Alonso-Ortiz

Julie Boyle

Lune Bellec

Julien Cohen-Adad

Quantitative MRI (qMRI) promises better specificity, accuracy, repeatability, and reproducibility relative to its clinically-used qualitativ… (voir plus)e MRI counterpart. Longitudinal reproducibility is particularly important in qMRI. The goal is to reliably quantify tissue properties that may be assessed in longitudinal clinical studies throughout disease progression or during treatment. In this work, we present the initial data release of the quantitative MRI portion of the Courtois project on neural modelling (CNeuroMod), where the brain and cervical spinal cord of six participants were scanned at regular intervals over the course of several years. This first release includes 3 years of data collection and up to 10 sessions per participant using quantitative MRI imaging protocols (T1, magnetization transfer (MTR, MTsat), and diffusion). In the brain, T1MP2RAGE, fractional anisotropy (FA), mean diffusivity (MD), and radial diffusivity (RD) all exhibited high longitudinal reproducibility (intraclass correlation coefficient – ICC ≃ 1 and within-subject coefficient of variations – wCV 1%). The spinal cord cross-sectional area (CSA) computed using T2w images and T1MTsatexhibited the best longitudinal reproducibility (ICC ≃ 1 and 0.7 respectively, and wCV 2.4% and 6.9%). Results from this work show the level of longitudinal reproducibility that can be expected from qMRI protocols in the brain and spinal cord in the absence of hardware and software upgrades, and could help in the design of future longitudinal clinical studies.

2024-12-09

Imaging Neuroscience (publié)

doi.org

BootsTAP: Bootstrapped Training for Tracking-Any-Point

Carl Doersch

Yi Yang

Dilara Gokay

Pauline Luc

Skanda Koppula

Ankush Gupta

Joseph Heyward

Ignacio Rocco

Ross Goroshin

João Carreira

Andrew Zisserman

To endow models with greater understanding of physics and motion, it is useful to enable them to perceive how solid surfaces move and deform… (voir plus) in real scenes. This can be formalized as Tracking-Any-Point (TAP), which requires the algorithm to track any point on solid surfaces in a video, potentially densely in space and time. Large-scale groundtruth training data for TAP is only available in simulation, which currently has a limited variety of objects and motion. In this work, we demonstrate how large-scale, unlabeled, uncurated real-world data can improve a TAP model with minimal architectural changes, using a selfsupervised student-teacher setup. We demonstrate state-of-the-art performance on the TAP-Vid benchmark surpassing previous results by a wide margin: for example, TAP-Vid-DAVIS performance improves from 61.3% to 67.4%, and TAP-Vid-Kinetics from 57.2% to 62.5%. For visualizations, see our project webpage at https://bootstap.github.io/

2024-12-07

Lecture Notes in Computer Science (publié)

doi.org

arxiv.org

The Responsible Foundation Model Development Cheatsheet: A Review of Tools&Resources

Shayne Longpre

Stella Biderman

Alon Albalak

Hailey Schoelkopf

Daniel McDuff

Sayash Kapoor

Kevin Klyman

Kyle Lo

Gabriel Ilharco

Nay San

Maribeth Rauh

Aviya Skowron

Bertie Vidgen

Laura Weidinger

Arvind Narayanan

Victor Sanh

David Ifeoluwa Adelani

Percy Liang

Rishi Bommasani

Peter Henderson … (voir 3 de plus)

Sasha Luccioni

Yacine Jernite

Luca Soldaini

2024-12-06

TMLR (accepté)

doi.org

openreview.net

Global MMLU: Understanding and Addressing Cultural and Linguistic Biases in Multilingual Evaluation

Shivalika Singh

Angelika Romanou

Cl'ementine Fourrier

David Ifeoluwa Adelani

Jian Gang Ngui

Daniel Vila-Suero

Peerat Limkonchotiwat

Kelly Marchisio

Wei Qi Leong

Yosephine Susanto

Raymond Ng

Shayne Longpre

Wei-Yin Ko

Madeline Smith

Antoine Bosselut

Alice Oh

André F. T. Martins

Leshem Choshen

Daphne Ippolito

Enzo Ferrante … (voir 3 de plus)

Marzieh Fadaee

Beyza Ermis

Sara Hooker

2024-12-03

ArXiv (prépublication)

doi.org

arxiv.org

Beta cells are essential drivers of pancreatic ductal adenocarcinoma development

Cathy C. Garcia

Aarthi Venkat

Daniel C. McQuaid

Sherry Agabiti

Alex Tong

Rebecca L. Cardone

Rebecca Starble

Akin Sogunro

Jeremy B. Jacox

Christian F. Ruiz

Richard G. Kibbey

Smita Krishnaswamy

Mandar Deepak Muzumdar

Pancreatic endocrine-exocrine crosstalk plays a key role in normal physiology and disease. For instance, endocrine islet beta (β) cell secr… (voir plus)etion of insulin or cholecystokinin (CCK) promotes progression of pancreatic adenocarcinoma (PDAC), an exocrine cell-derived tumor. However, the cellular and molecular mechanisms that govern endocrine-exocrine signaling in tumorigenesis remain incompletely understood. We find that β cell ablation impedes PDAC development in mice, arguing that the endocrine pancreas is critical for exocrine tumorigenesis. Conversely, obesity induces β cell hormone dysregulation, alters CCK-dependent peri-islet exocrine cell transcriptional states, and enhances islet proximal tumor formation. Single-cell RNA-sequencing, in silico latent-space archetypal and trajectory analysis, and genetic lineage tracing in vivo reveal that obesity stimulates postnatal immature β cell expansion and adaptation towards a pro-tumorigenic CCK+ state via JNK/cJun stress-responsive signaling. These results define endocrine-exocrine signaling as a driver of PDAC development and uncover new avenues to target the endocrine pancreas to subvert exocrine tumorigenesis.

2024-12-02

bioRxiv (prépublication)

doi.org

Improving Text-to-Image Consistency via Automatic Prompt Optimization

Oscar Mañas

Pietro Astolfi

Melissa Hall

Candace Ross

Jack Urbanek

Adina Williams

Aishwarya Agrawal

Adriana Romero-Soriano

Michal Drozdzal

Impressive advances in text-to-image (T2I) generative models have yielded a plethora of high performing models which are able to generate ae… (voir plus)sthetically appealing, photorealistic images. Despite the progress, these models still struggle to produce images that are consistent with the input prompt, oftentimes failing to capture object quantities, relations and attributes properly. Existing solutions to improve prompt-image consistency suffer from the following challenges: (1) they oftentimes require model fine-tuning, (2) they only focus on nearby prompt samples, and (3) they are affected by unfavorable trade-offs among image quality, representation diversity, and prompt-image consistency. In this paper, we address these challenges and introduce a T2I optimization-by-prompting framework, OPT2I, which leverages a large language model (LLM) to improve prompt-image consistency in T2I models. Our framework starts from a user prompt and iteratively generates revised prompts with the goal of maximizing a consistency score. Our extensive validation on two datasets, MSCOCO and PartiPrompts, shows that OPT2I can boost the initial consistency score by up to 24.9% in terms of DSG score while preserving the FID and increasing the recall between generated and real data. Our work paves the way toward building more reliable and robust T2I systems by harnessing the power of LLMs.

2024-12-02

TMLR (accepté)

doi.org

openreview.net

Insect Identification in the Wild: The AMI Dataset

Aditya Jain

F. Cunha

M. J. Bunsen

Juan Sebastián Cañas

L. Pasi

N. Pinoy

Flemming Helsing

JoAnne Russo

Marc Botham

Michael Sabourin

Jonathan Fréchette

Alexandre Anctil

Yacksecari Lopez

Eduardo Navarro

Filonila Perez Pimentel

Ana Cecilia Zamora

José Alejandro Ramirez Silva

Jonathan Gagnon

Tom August

K. Bjerge … (voir 8 de plus)

Alba Gomez Segura

Marc Bélisle

Yves Basset

K. P. McFarland

David Roy

Toke Thomas Høye

Maxim Larrivée

David Rolnick

Insects represent half of all global biodiversity, yet many of the world's insects are disappearing, with severe implications for ecosystems… (voir plus) and agriculture. Despite this crisis, data on insect diversity and abundance remain woefully inadequate, due to the scarcity of human experts and the lack of scalable tools for monitoring. Ecologists have started to adopt camera traps to record and study insects, and have proposed computer vision algorithms as an answer for scalable data processing. However, insect monitoring in the wild poses unique challenges that have not yet been addressed within computer vision, including the combination of long-tailed data, extremely similar classes, and significant distribution shifts. We provide the first large-scale machine learning benchmarks for fine-grained insect recognition, designed to match real-world tasks faced by ecologists. Our contributions include a curated dataset of images from citizen science platforms and museums, and an expert-annotated dataset drawn from automated camera traps across multiple continents, designed to test out-of-distribution generalization under field conditions. We train and evaluate a variety of baseline algorithms and introduce a combination of data augmentation techniques that enhance generalization across geographies and hardware setups.

2024-12-01

Lecture Notes in Computer Science (publié)

doi.org

arxiv.org

ProGRes: Prompted Generative Rescoring on ASR n-Best

Ada Defne Tur

Adel Moumen

Mirco Ravanaelli

2024-12-01

2024 IEEE Spoken Language Technology Workshop (SLT) (publié)

doi.org

arxiv.org

Combining supervised learning and local search for the multicommodity capacitated fixed-charge network design problem

Charly Robinson La Rocca

Jean-François Cordeau

Emma Frejinger

The multicommodity capacitated fixed-charge network design problem has been extensively studied in the literature due to its wide range of a… (voir plus)pplications. Despite the fact that many sophisticated solution methods exist today, finding high-quality solutions to large-scale instances remains challenging. In this paper, we explore how a data-driven approach can help improve upon the state of the art. By leveraging machine learning models, we attempt to reveal patterns hidden in the data that might be difficult to capture with traditional optimization methods. For scalability, we propose a prediction method where the machine learning model is called at the level of each arc of the graph. We take advantage of off-the-shelf models trained via supervised learning to predict near-optimal solutions. Our experimental results include an algorithm design analysis that compares various integration strategies of predictions within local search algorithms. We benchmark the ML-based approach with respect to the state-of-the-art heuristic for this problem. The findings indicate that our method can outperform the leading heuristic on sets of instances sampled from a uniform distribution.

2024-11-30

Transportation Research Part E: Logistics and Transportation Review (publié)

doi.org

arxiv.org

Decomposing the Brain in Autism: Linking Behavioral Domains to Neuroanatomical Variation and Genomic Underpinnings.

Hanna Seelemeyer

Caroline Gurr

Johanna Leyhausen

Lisa M. Berg

Charlotte M. Pretzsch

Tim Schäfer

Bassem Hermila

Christine M. Freitag

Eva Loth

Beth Oakley

Luke Mason

Jan K. Buitelaar

Christian Beckmann

Dorothea L. Floris

Tony Charman

Tobias Banaschewski

Emily Jones

Thomas Bourgeron

Jumana Ahmad

Sara Ambrosino … (voir 58 de plus)

Bonnie Auyeung

Simon Baron-Cohen

Sarah Baumeister

Sven Bölte

Carsten Bours

Michael Brammer

Daniel Brandeis

Claudia Brogna

Yvette de Bruijn

Bhismadev Chakrabarti

Ineke Cornelissen

Daisy Crawley

Flavio Dell’Acqua

Guillaume Dumas

Sarah Durston

Christine Ecker

Jessica Faulkner

Vincent Frouin

Pilar Garcés

David Goyard

Lindsay Ham

Hannah Hayward

Joerg F. Hipp

Rosemary Holt

Mark Johnson

Emily J. H. Jones

Prantik Kundu

Meng-Chuan Lai

Xavier Liogier D’ardhuy

Michael V. Lombardo

David J. Lythgoe

René Mandl

Andre Marquand

Maarten Mennes

Andreas Meyer-Lindenberg

Carolin Moessnang

Nico Bast

Larry O’Dwyer

Marianne Oldehinkel

Bob Oranje

Gahan Pandina

Antonio Persico

Barbara Ruggeri

Declan G.M. Murphy

Amber N. V. Ruigrok

Jessica Sabet

Roberto Sacco

Antonia San José Cáceres

Emily Simonoff

Will Spooren

Julian Tillmann

Roberto Toro

Heike Tost

Jack Waldman

Steve C. R. Williams

Caroline Wooldridge

Marcel P. Zwiers

Declan Murphy

2024-11-30

Biological Psychiatry: Cognitive Neuroscience and Neuroimaging (publié)

doi.org

Effects of gene dosage on cognitive ability: A function-based association study across brain and non-brain processes

Guillaume Huguet

Thomas Renne

Cécile Poulain

Alma Dubuc

Kuldeep Kumar

Sayeh Kazem

Worrawat Engchuan

Omar Shanta

Elise Douard

Catherine Proulx

Martineau Jean-Louis

Zohra Saci

Josephine Mollon

Laura M. Schultz

Emma E.M. Knowles

Simon R. Cox

David Porteous

Gail Davies

Paul Redmond

Sarah E. Harris … (voir 10 de plus)

Gunter Schumann

Guillaume Dumas

Aurélie Labbe

Zdenka Pausova

Tomáš Paus

Stephen W. Scherer

Jonathan Sebat

Laura Almasy

David C. Glahn

Sébastien Jacquemont

Copy-number variants (CNVs) that increase the risk for neurodevelopmental disorders also affect cognitive ability. However, such CNVs remain… (voir plus) challenging to study due to their scarcity, limiting our understanding of gene-dosage-sensitive biological processes linked to cognitive ability. We performed a genome-wide association study (GWAS) in 258,292 individuals, which identified—for the first time—a duplication at 2q12.3 associated with higher cognitive performance. We developed a functional-burden analysis, which tested the association between cognition and CNVs disrupting 6,502 gene sets biologically defined across tissues, cell types, and ontologies. Among those, 864 gene sets were associated with cognition, and effect sizes of deletion and duplication were negatively correlated. The latter suggested that functions across all biological processes were sensitive to either deletions (e.g., subcortical regions, postsynaptic) or duplications (e.g., cerebral cortex, presynaptic). Associations between non-brain tissues and cognition were driven partly by constrained genes, which may shed light on medical comorbidities in neurodevelopmental disorders.

2024-11-30

Cell Genomics (publié)

doi.org

Mila Techaide 2026

Désinformation 2.0 : quand l’IA brouille nos ondes

Avantage IA : productivité dans la fonction publique

Publications

Mila Techaide 2026

Désinformation 2.0 : quand l’IA brouille nos ondes

Avantage IA : productivité dans la fonction publique

Mots-clés populaires:

Publications