Publications

Search-Based Correction of Reasoning Chains for Language Models

Minsu Kim

Jean-Pierre R. Falet

Oliver E. Richardson

Xiaoyin Chen

Moksh J. Jain

Sungjin Ahn

Sungsoo Ahn

Yoshua Bengio

2025-05-17

ArXiv (prépublication)

Search-Based Correction of Reasoning Chains for Language Models

Minsu Kim

Jean-Pierre R. Falet

Oliver E. Richardson

Xiaoyin Chen

Moksh J. Jain

Sungjin Ahn

Sungsoo Ahn

Yoshua Bengio

2025-05-17

ArXiv (prépublication)

A Modular Approach for Clinical SLMs Driven by Synthetic Data with Pre-Instruction Tuning, Model Merging, and Clinical-Tasks Alignment

Jean-Philippe Corbeil

Amin Dada

Jean-Michel Attendu

Asma Ben Abacha

Alessandro Sordoni

Lucas Caccia

Franccois Beaulieu

Thomas Lin

Jens Kleesiek

Paul Vozila

2025-05-15

ArXiv (prépublication)

A Modular Approach for Clinical SLMs Driven by Synthetic Data with Pre-Instruction Tuning, Model Merging, and Clinical-Tasks Alignment

Jean-Philippe Corbeil

Amin Dada

Jean-Michel Attendu

Asma Ben Abacha

Alessandro Sordoni

Lucas Caccia

Franccois Beaulieu

Thomas Lin

Jens Kleesiek

Paul Vozila

High computation costs and latency of large language models such as GPT-4 have limited their deployment in clinical settings. Small language… (voir plus) models (SLMs) offer a cost-effective alternative, but their limited capacity requires biomedical domain adaptation, which remains challenging. An additional bottleneck is the unavailability and high sensitivity of clinical data. To address these challenges, we propose a novel framework for adapting SLMs into high-performing clinical models. We introduce the MediPhi collection of 3.8B-parameter SLMs developed with our novel framework: pre-instruction tuning of experts on relevant medical and clinical corpora (PMC, Medical Guideline, MedWiki, etc.), model merging, and clinical-tasks alignment. To cover most clinical tasks, we extended the CLUE benchmark to CLUE+, doubling its size. Our expert models deliver relative improvements on this benchmark over the base model without any task-specific fine-tuning: 64.3% on medical entities, 49.5% on radiology reports, and 44% on ICD-10 coding (outperforming GPT-4-0125 by 14%). We unify the expert models into MediPhi via model merging, preserving gains across benchmarks. Furthermore, we built the MediFlow collection, a synthetic dataset of 2.5 million high-quality instructions on 14 medical NLP tasks, 98 fine-grained document types, and JSON format support. Alignment of MediPhi using supervised fine-tuning and direct preference optimization achieves further gains of 18.9% on average.

2025-05-15

ArXiv (prépublication)

Persistent signs of poisoning after massive drug ingestion: move the ultrasound probe to the stomach.

N. Lautrou-cabasson

H. Pirollet

C. Lombois

Guillaume Dumas

2025-05-15

Intensive Care Medicine (publié)

Plasticity as the Mirror of Empowerment

David Abel

Michael Bowling

Andre Barreto

Will Dabney

Shi Dong

Steven Hansen

Anna Harutyunyan

Khimya Khetarpal

Clare Lyle

Razvan Pascanu

Georgios Piliouras

Jonathan Richens

Mark Rowland

Tom Schaul

Satinder Singh

2025-05-15

ArXiv (prépublication)

Plasticity as the Mirror of Empowerment

David Abel

Michael Bowling

Andre Barreto

Will Dabney

Shi Dong

Steven Hansen

Anna Harutyunyan

Khimya Khetarpal

Clare Lyle

Razvan Pascanu

Georgios Piliouras

Jonathan Richens

Mark Rowland

Tom Schaul

Satinder Singh

2025-05-15

ArXiv (prépublication)

Language Agents Mirror Human Causal Reasoning Biases. How Can We Help Them Think Like Scientists?

Anthony GX-Chen

Dongyan Lin

Mandana Samiei

Blake Richards

Rob Fergus

Kenneth Marino

Language model (LM) agents are increasingly used as autonomous decision-makers who need to actively gather information to guide their decisi… (voir plus)ons. A crucial cognitive skill for such agents is the efficient exploration and understanding of the causal structure of the world -- key to robust, scientifically grounded reasoning. Yet, it remains unclear whether LMs possess this capability or exhibit systematic biases leading to erroneous conclusions. In this work, we examine LMs' ability to explore and infer causal relationships, using the well-established"Blicket Test"paradigm from developmental psychology. We find that LMs reliably infer the common, intuitive disjunctive causal relationships but systematically struggle with the unusual, yet equally (or sometimes even more) evidenced conjunctive ones. This"disjunctive bias"persists across model families, sizes, and prompting strategies, and performance further declines as task complexity increases. Interestingly, an analogous bias appears in human adults, suggesting that LMs may have inherited deep-seated reasoning heuristics from their training data. To this end, we quantify similarities between LMs and humans, finding that LMs exhibit adult-like inference profiles (but not children-like). Finally, we propose a test-time sampling method which explicitly samples and eliminates hypotheses about causal relationships from the LM. This scalable approach significantly reduces the disjunctive bias and moves LMs closer to the goal of scientific, causally rigorous reasoning.

2025-05-14

ArXiv (prépublication)

Language Agents Mirror Human Causal Reasoning Biases. How Can We Help Them Think Like Scientists?

Anthony GX-Chen

Dongyan Lin

Mandana Samiei

Blake Richards

Rob Fergus

Kenneth Marino

2025-05-14

ArXiv (prépublication)

Laurence Perreault-Levasseur

The CASTOR mission

Patrick Côté

T. Woods

John Hutchings

J. Rhodes

R. Sánchez-Janssen

Alan D. Scott

J. Pazder

Melissa Amenouche

Michael Balogh

Simon Blouin

Alain Cournoyer

M. Drout

Nick Kuzmin

Katherine J. Mack

Laura Ferrarese

Wesley C. Fraser

Sarah C. Gallagher

Frederic J. Grandmont

Daryl Haggard

Paul Harrison … (voir 160 de plus)

Vincent Hénault-Brunet

J. Kavelaars

V. Khatu

J. Roediger

J. Rowe

Marcin Sawicki

Jesper Skottfelt

Matt Taylor

Ludo van Waerbeke

Laurie Amen

Dhananjhay Bansal

Martin Bergeron

Toby Brown

Greg Burley

Hum Chand

Isaac Cheng

Ryan Cloutier

N. Dickson

Oleg Djazovski

Ivana Damjanov

James Doherty

K. Finner

Macarena García Del Valle Espinosa

Jennifer Glover

A. I. Gómez de Castro

Or Graur

Tim Hardy

Michelle Kao

D A Leahy

Deborah Lokhorst

A. I. Malz

Allison Man

Madeline A. Marshall

Sean McGee

Ryan McKenzie

Kai Michaud

Surhud S. More

David Morris

Patrick W. Morris

T. Moutard

Wasi Naqvi

Matthew Nicholl

G. Noirot

M. S. Oey

C. Opitom

Samir Salim

Bryan R. Scott

Charles Shapiro

Daniel Stern

A. Subramaniam

David Thilke

I. Wevers

Dmitri Vorobiev

L. Y. Aaron Yung

Frédéric Zamkotsian

S. Aigrain

A. Alavi

Martin Barstow

Peter Bartosik

Hadleigh Bluhm

J. Bovy

Peter Cameron

R. Carlberg

Jessie L. Christiansen

Yuyang Chen

Paul Crowther

Kristen Dage

Aaron Dotter

Patrick Dufour

Jean Dupuis

B. Dryer

A. Duara

Gwendolyn M. Eadie

Marielle R. Eduardo

V. Estrada-Carpenter

Sébastien Fabbro

A. Faisst

N. M. Ford

Morgan Fraser

Boris T. Gaensicke

Shashkiran Ganesh

Poshak Gandhi

Melissa L. Graham

Rebecca Hamel

Martin Hellmich

John J. Hennessy

Kaitlyn Hessel

J. Heyl

Catherine Heymans

Yashar Hezaveh

Renée Hložek

Michael Hoenk

Andrew Holland

Eric Huff

Ian Hutchinson

Ikuru Iwata

April D. Jewell

Doug Johnstone

Maia Jones

Todd J. Jones

D. Lang

J. Lapington

Justin Larivière

C. Lawlor-Forsyth

Denis Laurin

Charles Lee

Ronan Legin

Ting S. Li

Sungsoon Lim

Bethany Ludwig

Matt Kozun

V. M

Robert Mann

Alan McConnachie

Evan McDonough

S. Metchev

David R. Miller

Takashi Moriya

Cameron Morgan

Julio F. Navarro

Y. Nazé

Shouleh Nikzad

Vivek Oad

N. N.-Q. Ouellette

E. Pass

Will J. Percival

Joe Postma

Nayyer Raza

G. T. Richards

Harvey Richer

Carmelle Robert

Erik Rosolowsky

J. Ruan

Sarah Rugheimer

S. Safi-Harb

Kanak Saha

Vicky Scowcroft

F. Sestito

Himanshu Sharma

James Sikora

G. Sivakoff

T. S. Sivarani

Patrick Smith

Warren Soh

R. Sorba

S. Subramanian

Hossen Teimoorinia

H. Teplitz

Shaylin Thadani

Shavon Thadani

Aaron Tohuvavohu

K. Venn

Nicholas Vieira

Jeremy J. Webb

P. Wiegert

Ryan Wierckx

Yanqin Wu

Jade Yeung

S. K. Yi

2025-05-14

Journal of Astronomical Telescopes Instruments and Systems (publié)

Laurence Perreault-Levasseur

The CASTOR mission

Patrick Côté

T. Woods

John Hutchings

J. Rhodes

R. Sánchez-Janssen

Alan D. Scott

J. Pazder

Melissa Amenouche

Michael Balogh

Simon Blouin

Alain Cournoyer

M. Drout

Nick Kuzmin

Katherine J. Mack

Laura Ferrarese

Wesley C. Fraser

Sarah C. Gallagher

Frederic J. Grandmont

Daryl Haggard

Paul Harrison … (voir 160 de plus)

Vincent Hénault-Brunet

J. Kavelaars

V. Khatu

J. Roediger

J. Rowe

Marcin Sawicki

Jesper Skottfelt

Matt Taylor

Ludo van Waerbeke

Laurie Amen

Dhananjhay Bansal

Martin Bergeron

Toby Brown

Greg Burley

Hum Chand

Isaac Cheng

Ryan Cloutier

N. Dickson

Oleg Djazovski

Ivana Damjanov

James Doherty

K. Finner

Macarena García Del Valle Espinosa

Jennifer Glover

A. I. Gómez de Castro

Or Graur

Tim Hardy

Michelle Kao

D A Leahy

Deborah Lokhorst

A. I. Malz

Allison Man

Madeline A. Marshall

Sean McGee

Ryan McKenzie

Kai Michaud

Surhud S. More

David Morris

Patrick W. Morris

T. Moutard

Wasi Naqvi

Matthew Nicholl

G. Noirot

M. S. Oey

C. Opitom

Samir Salim

Bryan R. Scott

Charles Shapiro

Daniel Stern

A. Subramaniam

David Thilke

I. Wevers

Dmitri Vorobiev

L. Y. Aaron Yung

Frédéric Zamkotsian

S. Aigrain

A. Alavi

Martin Barstow

Peter Bartosik

Hadleigh Bluhm

J. Bovy

Peter Cameron

R. Carlberg

Jessie L. Christiansen

Yuyang Chen

Paul Crowther

Kristen Dage

Aaron Dotter

Patrick Dufour

Jean Dupuis

B. Dryer

A. Duara

Gwendolyn M. Eadie

Marielle R. Eduardo

V. Estrada-Carpenter

Sébastien Fabbro

A. Faisst

N. M. Ford

Morgan Fraser

Boris T. Gaensicke

Shashkiran Ganesh

Poshak Gandhi

Melissa L. Graham

Rebecca Hamel

Martin Hellmich

John J. Hennessy

Kaitlyn Hessel

J. Heyl

Catherine Heymans

Yashar Hezaveh

Renée Hložek

Michael Hoenk

Andrew Holland

Eric Huff

Ian Hutchinson

Ikuru Iwata

April D. Jewell

Doug Johnstone

Maia Jones

Todd Jones

D. Lang

J. Lapington

Justin Larivière

C. Lawlor-Forsyth

Denis Laurin

Charles Lee

Ronan Legin

Ting S. Li

Sungsoon Lim

Bethany Ludwig

Matt Kozun

V. M

Robert Mann

Alan McConnachie

Evan McDonough

S. Metchev

David R. Miller

Takashi Moriya

Cameron Morgan

Julio F. Navarro

Y. Nazé

Shouleh Nikzad

Vivek Oad

N. N.-Q. Ouellette

E. Pass

Will J. Percival

Joe Postma

Nayyer Raza

G. T. Richards

Harvey Richer

Carmelle Robert

Erik Rosolowsky

J. Ruan

Sarah Rugheimer

S. Safi-Harb

Kanak Saha

Vicky Scowcroft

F. Sestito

Himanshu Sharma

James Sikora

G. Sivakoff

T. S. Sivarani

Patrick Smith

Warren Soh

R. Sorba

S. Subramanian

Hossen Teimoorinia

H. Teplitz

Shaylin Thadani

Shavon Thadani

Aaron Tohuvavohu

K. Venn

Nicholas Vieira

Jeremy J. Webb

P. Wiegert

Ryan Wierckx

Yanqin Wu

Jade Yeung

Sukyoung K. Yi

2025-05-14

Journal of Astronomical Telescopes Instruments and Systems (publié)