Search-Based Correction of Reasoning Chains for Language Models
Minsu Kim
Jean-Pierre R. Falet
Oliver E. Richardson
Xiaoyin Chen
Moksh J. Jain
Sungjin Ahn
Sungsoo Ahn
Search-Based Correction of Reasoning Chains for Language Models
Minsu Kim
Jean-Pierre R. Falet
Oliver E. Richardson
Xiaoyin Chen
Moksh J. Jain
Sungjin Ahn
Sungsoo Ahn
A Modular Approach for Clinical SLMs Driven by Synthetic Data with Pre-Instruction Tuning, Model Merging, and Clinical-Tasks Alignment
Jean-Philippe Corbeil
Amin Dada
Jean-Michel Attendu
Asma Ben Abacha
Lucas Caccia
Franccois Beaulieu
Thomas Lin
Jens Kleesiek
Paul Vozila
A Modular Approach for Clinical SLMs Driven by Synthetic Data with Pre-Instruction Tuning, Model Merging, and Clinical-Tasks Alignment
Jean-Philippe Corbeil
Amin Dada
Jean-Michel Attendu
Asma Ben Abacha
Lucas Caccia
Franccois Beaulieu
Thomas Lin
Jens Kleesiek
Paul Vozila
High computation costs and latency of large language models such as GPT-4 have limited their deployment in clinical settings. Small language… (see more) models (SLMs) offer a cost-effective alternative, but their limited capacity requires biomedical domain adaptation, which remains challenging. An additional bottleneck is the unavailability and high sensitivity of clinical data. To address these challenges, we propose a novel framework for adapting SLMs into high-performing clinical models. We introduce the MediPhi collection of 3.8B-parameter SLMs developed with our novel framework: pre-instruction tuning of experts on relevant medical and clinical corpora (PMC, Medical Guideline, MedWiki, etc.), model merging, and clinical-tasks alignment. To cover most clinical tasks, we extended the CLUE benchmark to CLUE+, doubling its size. Our expert models deliver relative improvements on this benchmark over the base model without any task-specific fine-tuning: 64.3% on medical entities, 49.5% on radiology reports, and 44% on ICD-10 coding (outperforming GPT-4-0125 by 14%). We unify the expert models into MediPhi via model merging, preserving gains across benchmarks. Furthermore, we built the MediFlow collection, a synthetic dataset of 2.5 million high-quality instructions on 14 medical NLP tasks, 98 fine-grained document types, and JSON format support. Alignment of MediPhi using supervised fine-tuning and direct preference optimization achieves further gains of 18.9% on average.
Persistent signs of poisoning after massive drug ingestion: move the ultrasound probe to the stomach.
N. Lautrou-cabasson
H. Pirollet
C. Lombois
Plasticity as the Mirror of Empowerment
David Abel
Michael Bowling
Andre Barreto
Will Dabney
Shi Dong
Steven Hansen
Anna Harutyunyan
Clare Lyle
Georgios Piliouras
Jonathan Richens
Mark Rowland
Tom Schaul
Satinder Singh
Plasticity as the Mirror of Empowerment
David Abel
Michael Bowling
Andre Barreto
Will Dabney
Shi Dong
Steven Hansen
Anna Harutyunyan
Clare Lyle
Georgios Piliouras
Jonathan Richens
Mark Rowland
Tom Schaul
Satinder Singh
Language Agents Mirror Human Causal Reasoning Biases. How Can We Help Them Think Like Scientists?
Anthony GX-Chen
Dongyan Lin
Mandana Samiei
Rob Fergus
Kenneth Marino
Language Agents Mirror Human Causal Reasoning Biases. How Can We Help Them Think Like Scientists?
Anthony GX-Chen
Dongyan Lin
Mandana Samiei
Rob Fergus
Kenneth Marino
Language model (LM) agents are increasingly used as autonomous decision-makers who need to actively gather information to guide their decisi… (see more)ons. A crucial cognitive skill for such agents is the efficient exploration and understanding of the causal structure of the world -- key to robust, scientifically grounded reasoning. Yet, it remains unclear whether LMs possess this capability or exhibit systematic biases leading to erroneous conclusions. In this work, we examine LMs' ability to explore and infer causal relationships, using the well-established"Blicket Test"paradigm from developmental psychology. We find that LMs reliably infer the common, intuitive disjunctive causal relationships but systematically struggle with the unusual, yet equally (or sometimes even more) evidenced conjunctive ones. This"disjunctive bias"persists across model families, sizes, and prompting strategies, and performance further declines as task complexity increases. Interestingly, an analogous bias appears in human adults, suggesting that LMs may have inherited deep-seated reasoning heuristics from their training data. To this end, we quantify similarities between LMs and humans, finding that LMs exhibit adult-like inference profiles (but not children-like). Finally, we propose a test-time sampling method which explicitly samples and eliminates hypotheses about causal relationships from the LM. This scalable approach significantly reduces the disjunctive bias and moves LMs closer to the goal of scientific, causally rigorous reasoning.
The CASTOR mission
Patrick Côté
T. Woods
John Hutchings
J. Rhodes
R. Sánchez-Janssen
Alan D. Scott
J. Pazder
Melissa Amenouche
Michael Balogh
Simon Blouin
Alain Cournoyer
M. Drout
Nick Kuzmin
Katherine J. Mack
Laura Ferrarese
Wesley C. Fraser
Sarah C. Gallagher
Frederic J. Grandmont
Daryl Haggard
Paul Harrison … (see 160 more)
Vincent Hénault-Brunet
J. Kavelaars
V. Khatu
J. Roediger
J. Rowe
Marcin Sawicki
Jesper Skottfelt
Matt Taylor
Ludo van Waerbeke
Laurie Amen
Dhananjhay Bansal
Martin Bergeron
Toby Brown
Greg Burley
Hum Chand
Isaac Cheng
Ryan Cloutier
N. Dickson
Oleg Djazovski
Ivana Damjanov
James Doherty
K. Finner
Macarena García Del Valle Espinosa
Jennifer Glover
A. I. Gómez de Castro
Or Graur
Tim Hardy
Michelle Kao
D A Leahy
Deborah Lokhorst
A. I. Malz
Allison Man
Madeline A. Marshall
Sean McGee
Ryan McKenzie
Kai Michaud
Surhud S. More
David Morris
Patrick W. Morris
T. Moutard
Wasi Naqvi
Matthew Nicholl
G. Noirot
M. S. Oey
C. Opitom
Samir Salim
Bryan R. Scott
Charles Shapiro
Daniel Stern
A. Subramaniam
David Thilke
I. Wevers
Dmitri Vorobiev
L. Y. Aaron Yung
Frédéric Zamkotsian
S. Aigrain
A. Alavi
Martin Barstow
Peter Bartosik
Hadleigh Bluhm
J. Bovy
Peter Cameron
R. Carlberg
Jessie L. Christiansen
Yuyang Chen
Paul Crowther
Kristen Dage
Aaron Dotter
Patrick Dufour
Jean Dupuis
B. Dryer
A. Duara
Gwendolyn M. Eadie
Marielle R. Eduardo
V. Estrada-Carpenter
Sébastien Fabbro
A. Faisst
N. M. Ford
Morgan Fraser
Boris T. Gaensicke
Shashkiran Ganesh
Poshak Gandhi
Melissa L. Graham
Rebecca Hamel
Martin Hellmich
John J. Hennessy
Kaitlyn Hessel
J. Heyl
Catherine Heymans
Renée Hložek
Michael Hoenk
Andrew Holland
Eric Huff
Ian Hutchinson
Ikuru Iwata
April D. Jewell
Doug Johnstone
Maia Jones
Todd J. Jones
D. Lang
J. Lapington
Justin Larivière
C. Lawlor-Forsyth
Denis Laurin
Charles Lee
Ronan Legin
Ting S. Li
Sungsoon Lim
B. Ludwig
Matt Kozun
V. M
Robert Mann
Alan McConnachie
Evan McDonough
S. Metchev
David R. Miller
Takashi Moriya
Cameron Morgan
Julio F. Navarro
Y. Nazé
Shouleh Nikzad
Vivek Oad
N. N.-Q. Ouellette
E. Pass
Will J. Percival
Joe Postma
Nayyer Raza
G. T. Richards
Harvey Richer
Carmelle Robert
Erik Rosolowsky
J. Ruan
Sarah Rugheimer
S. Safi-Harb
Kanak Saha
Vicky Scowcroft
F. Sestito
Himanshu Sharma
James Sikora
G. Sivakoff
T. S. Sivarani
Patrick Smith
Warren Soh
R. Sorba
S. Subramanian
Hossen Teimoorinia
H. Teplitz
Shaylin Thadani
Shavon Thadani
Aaron Tohuvavohu
K. Venn
Nicholas Vieira
Jeremy J. Webb
P. Wiegert
Ryan Wierckx
Yanqin Wu
Jade Yeung
S. K. Yi
The CASTOR mission
Patrick Côté
T. Woods
John Hutchings
J. Rhodes
R. Sánchez-Janssen
Alan D. Scott
J. Pazder
Melissa Amenouche
Michael Balogh
Simon Blouin
Alain Cournoyer
M. Drout
Nick Kuzmin
Katherine J. Mack
Laura Ferrarese
Wesley C. Fraser
Sarah C. Gallagher
Frederic J. Grandmont
Daryl Haggard
Paul Harrison … (see 160 more)
Vincent Hénault-Brunet
J. Kavelaars
V. Khatu
J. Roediger
J. Rowe
Marcin Sawicki
Jesper Skottfelt
Matt Taylor
Ludo van Waerbeke
Laurie Amen
Dhananjhay Bansal
Martin Bergeron
Toby Brown
Greg Burley
Hum Chand
Isaac Cheng
Ryan Cloutier
N. Dickson
Oleg Djazovski
Ivana Damjanov
James Doherty
K. Finner
Macarena García Del Valle Espinosa
Jennifer Glover
A. I. Gómez de Castro
Or Graur
Tim Hardy
Michelle Kao
D A Leahy
Deborah Lokhorst
A. I. Malz
Allison Man
Madeline A. Marshall
Sean McGee
Ryan McKenzie
Kai Michaud
Surhud S. More
David Morris
Patrick W. Morris
T. Moutard
Wasi Naqvi
Matthew Nicholl
G. Noirot
M. S. Oey
C. Opitom
Samir Salim
Bryan R. Scott
Charles Shapiro
Daniel Stern
A. Subramaniam
David Thilke
I. Wevers
Dmitri Vorobiev
L. Y. Aaron Yung
Frédéric Zamkotsian
S. Aigrain
A. Alavi
Martin Barstow
Peter Bartosik
Hadleigh Bluhm
J. Bovy
Peter Cameron
R. Carlberg
Jessie L. Christiansen
Yuyang Chen
Paul Crowther
Kristen Dage
Aaron Dotter
Patrick Dufour
Jean Dupuis
B. Dryer
A. Duara
Gwendolyn M. Eadie
Marielle R. Eduardo
V. Estrada-Carpenter
Sébastien Fabbro
A. Faisst
N. M. Ford
Morgan Fraser
Boris T. Gaensicke
Shashkiran Ganesh
Poshak Gandhi
Melissa L. Graham
Rebecca Hamel
Martin Hellmich
John J. Hennessy
Kaitlyn Hessel
J. Heyl
Catherine Heymans
Renée Hložek
Michael Hoenk
Andrew Holland
Eric Huff
Ian Hutchinson
Ikuru Iwata
April D. Jewell
Doug Johnstone
Maia Jones
Todd Jones
D. Lang
J. Lapington
Justin Larivière
C. Lawlor-Forsyth
Denis Laurin
Charles Lee
Ronan Legin
Ting S. Li
Sungsoon Lim
Bethany Ludwig
Matt Kozun
V. M
Robert Mann
Alan McConnachie
Evan McDonough
S. Metchev
David R. Miller
Takashi Moriya
Cameron Morgan
Julio F. Navarro
Y. Nazé
Shouleh Nikzad
Vivek Oad
N. N.-Q. Ouellette
E. Pass
Will J. Percival
Joe Postma
Nayyer Raza
G. T. Richards
Harvey Richer
Carmelle Robert
Erik Rosolowsky
J. Ruan
Sarah Rugheimer
S. Safi-Harb
Kanak Saha
Vicky Scowcroft
F. Sestito
Himanshu Sharma
James Sikora
G. Sivakoff
T. S. Sivarani
Patrick Smith
Warren Soh
R. Sorba
S. Subramanian
Hossen Teimoorinia
H. Teplitz
Shaylin Thadani
Shavon Thadani
Aaron Tohuvavohu
K. Venn
Nicholas Vieira
Jeremy J. Webb
P. Wiegert
Ryan Wierckx
Yanqin Wu
Jade Yeung
Sukyoung K. Yi
The CASTOR mission
Patrick Côté
T. Woods
John Hutchings
J. Rhodes
R. Sánchez-Janssen
Alan D. Scott
J. Pazder
Melissa Amenouche
Michael Balogh
Simon Blouin
Alain Cournoyer
M. Drout
Nick Kuzmin
Katherine J. Mack
Laura Ferrarese
Wesley C. Fraser
Sarah C. Gallagher
Frederic J. Grandmont
Daryl Haggard
Paul Harrison … (see 160 more)
Vincent Hénault-Brunet
J. Kavelaars
V. Khatu
J. Roediger
J. Rowe
Marcin Sawicki
Jesper Skottfelt
Matt Taylor
Ludo van Waerbeke
Laurie Amen
Dhananjhay Bansal
Martin Bergeron
Toby Brown
Greg Burley
Hum Chand
Isaac Cheng
Ryan Cloutier
N. Dickson
Oleg Djazovski
Ivana Damjanov
James Doherty
K. Finner
Macarena García Del Valle Espinosa
Jennifer Glover
A. I. Gómez de Castro
Or Graur
Tim Hardy
Michelle Kao
D A Leahy
Deborah Lokhorst
A. I. Malz
Allison Man
Madeline A. Marshall
Sean McGee
Ryan McKenzie
Kai Michaud
Surhud S. More
David Morris
Patrick W. Morris
T. Moutard
Wasi Naqvi
Matthew Nicholl
G. Noirot
M. S. Oey
C. Opitom
Samir Salim
Bryan R. Scott
Charles Shapiro
Daniel Stern
A. Subramaniam
David Thilke
I. Wevers
Dmitri Vorobiev
L. Y. Aaron Yung
Frédéric Zamkotsian
S. Aigrain
A. Alavi
Martin Barstow
Peter Bartosik
Hadleigh Bluhm
J. Bovy
Peter Cameron
R. Carlberg
Jessie L. Christiansen
Yuyang Chen
Paul Crowther
Kristen Dage
Aaron Dotter
Patrick Dufour
Jean Dupuis
B. Dryer
A. Duara
Gwendolyn M. Eadie
Marielle R. Eduardo
V. Estrada-Carpenter
Sébastien Fabbro
A. Faisst
N. M. Ford
Morgan Fraser
Boris T. Gaensicke
Shashkiran Ganesh
Poshak Gandhi
Melissa L. Graham
Rebecca Hamel
Martin Hellmich
John J. Hennessy
Kaitlyn Hessel
J. Heyl
Catherine Heymans
Renée Hložek
Michael Hoenk
Andrew Holland
Eric Huff
Ian Hutchinson
Ikuru Iwata
April D. Jewell
Doug Johnstone
Maia Jones
Todd J. Jones
D. Lang
J. Lapington
Justin Larivière
C. Lawlor-Forsyth
Denis Laurin
Charles Lee
Ronan Legin
Ting S. Li
Sungsoon Lim
Bethany Ludwig
Matt Kozun
V. M
Robert Mann
Alan McConnachie
Evan McDonough
S. Metchev
David R. Miller
Takashi Moriya
Cameron Morgan
Julio F. Navarro
Y. Nazé
Shouleh Nikzad
Vivek Oad
N. N.-Q. Ouellette
E. Pass
Will J. Percival
Joe Postma
Nayyer Raza
G. T. Richards
Harvey Richer
Carmelle Robert
Erik Rosolowsky
J. Ruan
Sarah Rugheimer
S. Safi-Harb
Kanak Saha
Vicky Scowcroft
F. Sestito
Himanshu Sharma
James Sikora
G. Sivakoff
T. S. Sivarani
Patrick Smith
Warren Soh
R. Sorba
S. Subramanian
Hossen Teimoorinia
H. Teplitz
Shaylin Thadani
Shavon Thadani
Aaron Tohuvavohu
K. Venn
Nicholas Vieira
Jeremy J. Webb
P. Wiegert
Ryan Wierckx
Yanqin Wu
Jade Yeung
S. K. Yi