Publications

Tiny Aya: Bridging Scale and Multilingual Depth
Alejandro Salamanca
Diana Abagyan
Daniel D'souza
Ammar Khairi
David Mora
Saurabh Dash
Viraat Aryabumi
Sara Rajaee
Ananya Sahu
Thomas Euyang
Brittawnya Prince
Madeline Smith
Hangyu Lin
Acyr Locatelli
Sara Hooker
Tom Kocmi
Aidan Gomez
Ivan Zhang
Phil Blunsom
Nick Frosst
Beyza Ermis
Ahmet Üstün
Marzieh Fadaee
Tiny Aya redefines what a small multilingual language model can achieve. Trained on 70 languages and refined through region-aware posttraining, it delivers state-of-the-art translation quality, strong multilingual understanding, and high-quality target-language generation, all with just 3.35B parameters. The release includes a pretrained foundation model, a globally balanced instruction-tuned variant, and three region-specialized models targeting languages from Africa, South Asia, Europe, Asia-Pacific, and West Asia. This report details the training strategy, data composition, and comprehensive evaluation framework behind Tiny Aya, and presents an alternative scaling path for multilingual AI: one centered on efficiency, balanced performance across languages, and practical deployment.
TRACE: Temporal Rule-Anchored Chain-of-Evidence on Knowledge Graphs for Interpretable Stock Movement Prediction
Luis Castejón Lozano
Miguel Conner
Juan Abia
Luis Gallego-Ledesma
Joshua Fellowes
Gerard Conangla Planes
Adam Elwood
We present TRACE (Temporal Rule-Anchored Chain-of-Evidence), a framework on knowledge graphs for interpretable stock movement prediction that unifies symbolic relational priors, dynamic graph exploration, and LLM-guided decision making in a single end-to-end pipeline. The approach performs rule-guided multi-hop exploration restricted to admissible relation sequences, grounds candidate reasoning chains in contemporaneous news, and aggregates fully grounded evidence into auditable UP/DOWN verdicts with human-readable paths connecting text and structure. On an S&P 500 benchmark, the method achieves 55.1% accuracy, 55.7% precision, 71.5% recall, and 60.8% F1, surpassing strong baselines and improving recall and F1 over the best graph baseline under identical evaluation. The gains stem from (i) rule-guided exploration that focuses search on economically meaningful motifs rather than arbitrary walks, and (ii) text-grounded consolidation that selectively aggregates high-confidence, fully grounded hypotheses instead of uniformly pooling weak signals. Together, these choices yield higher sensitivity without sacrificing selectivity, delivering predictive lift with faithful, auditable explanations.
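The "rule-guided multi-hop exploration restricted to admissible relation sequences" described in the abstract can be illustrated with a toy sketch. The graph, relation names, and motifs below are invented for illustration, not the paper's data or code; the idea shown is only the pruning of walks to admissible relation sequences.

```python
# Toy sketch of rule-guided multi-hop exploration on a knowledge graph.
# Triples, relation names, and admissible motifs are hypothetical.
TRIPLES = [
    ("AAPL", "supplier_of", "FOXCONN"),
    ("FOXCONN", "located_in", "TAIWAN"),
    ("AAPL", "competitor_of", "MSFT"),
    ("MSFT", "supplier_of", "NVDA"),
]

# Admissible relation sequences ("motifs"): only these chains may be completed.
ADMISSIBLE = {
    ("supplier_of", "located_in"),
    ("competitor_of", "supplier_of"),
}

def expand(graph, start, max_hops=2):
    """Return all paths from `start` whose relation sequence matches an
    admissible motif, pruning any walk that is not a motif prefix."""
    index = {}
    for h, r, t in graph:
        index.setdefault(h, []).append((r, t))
    frontier = [(start, ())]  # (current node, relation sequence so far)
    paths = []
    for _ in range(max_hops):
        nxt = []
        for node, rels in frontier:
            for r, t in index.get(node, []):
                seq = rels + (r,)
                # prune: seq must be a prefix of some admissible motif
                if any(m[: len(seq)] == seq for m in ADMISSIBLE):
                    nxt.append((t, seq))
                    if seq in ADMISSIBLE:
                        paths.append((start, seq, t))
        frontier = nxt
    return paths

chains = expand(TRIPLES, "AAPL")
```

In the full pipeline, each surviving chain would then be grounded in contemporaneous news before contributing to a verdict; this sketch covers only the exploration step.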
JEDI: Jointly Embedded Inference of Neural Dynamics
Animal brains flexibly and efficiently achieve many behavioral tasks with a single neural network. A core goal in modern neuroscience is to map the mechanisms of the brain's flexibility onto the dynamics underlying neural populations. However, identifying task-specific dynamical rules from limited, noisy, and high-dimensional experimental neural recordings remains a major challenge, as experimental data often provide only partial access to brain states and dynamical mechanisms. While recurrent neural networks (RNNs) directly constrained by neural data have been effective in inferring underlying dynamical mechanisms, they are typically limited to single-task domains and struggle to generalize across behavioral conditions. Here, we introduce JEDI, a hierarchical model that captures neural dynamics across tasks and contexts by learning a shared embedding space over RNN weights. This model recapitulates individual samples of neural dynamics while scaling to arbitrarily large and complex datasets, uncovering shared structure across conditions in a single, unified model. Using simulated RNN datasets, we demonstrate that JEDI accurately learns robust, generalizable, condition-specific embeddings. By reverse-engineering the weights learned by JEDI, we show that it recovers ground-truth fixed-point structures and unveils key features of the underlying neural dynamics in the eigenspectra. Finally, we apply JEDI to motor cortex recordings during monkey reaching to extract mechanistic insight into the neural dynamics of motor control. Our work shows that joint learning of contextual embeddings and recurrent weights provides scalable and generalizable inference of brain dynamics from recordings alone.
LLM2Vec-Gen: Generative Embeddings from Large Language Models
LLM-based text embedders typically encode the semantic content of their input. However, embedding tasks require mapping diverse inputs to similar outputs. This input-output gap is typically bridged by training embedding models on paired data with contrastive learning. In this work, we propose a novel self-supervised approach, LLM2Vec-Gen, which adopts a different paradigm: rather than encoding the input, we learn to represent the model's potential response. Specifically, we add trainable special tokens to the LLM's vocabulary, append them to the input, and optimize them to represent the LLM's response in a fixed-length sequence. Training is guided by the LLM's own completion for the query, along with an unsupervised embedding teacher that provides distillation targets. This formulation helps to bridge the input-output gap and transfers LLM capabilities such as safety alignment and reasoning to embedding tasks. Crucially, the LLM backbone remains frozen and training requires only unlabeled queries. LLM2Vec-Gen achieves state-of-the-art self-supervised performance on the Massive Text Embedding Benchmark (MTEB), improving by 9.3% over the best unsupervised embedding teacher. We also observe up to a 43.2% reduction in harmful content retrieval and a 29.3% improvement in reasoning capabilities for embedding tasks. Finally, the learned embeddings are interpretable and can be decoded into text to reveal their semantic content.
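The core mechanism (a handful of trainable special tokens appended to the input, whose states summarize a variable-length sequence into a fixed-length one) can be sketched with simple attention pooling. Everything below (dimensions, variable names, the frozen random "token states") is illustrative only; the paper trains these queries against the LLM's own completions and a distillation teacher, none of which is reproduced here.

```python
import numpy as np

rng = np.random.default_rng(0)
d, k = 8, 4          # hidden size; number of special "embedding" tokens

# Stand-in for frozen LLM hidden states of a variable-length input (T x d).
token_states = rng.normal(size=(11, d))

# Trainable special-token queries: the only learned parameters in this sketch.
special_queries = rng.normal(size=(k, d))

def pool(queries, states):
    """Each special token attends over the input states, yielding a
    fixed-length (k x d) representation regardless of input length."""
    scores = queries @ states.T / np.sqrt(states.shape[1])
    weights = np.exp(scores - scores.max(axis=1, keepdims=True))
    weights /= weights.sum(axis=1, keepdims=True)  # softmax over positions
    return weights @ states

emb = pool(special_queries, token_states)   # always shape (k, d)
```

Whatever the input length, the output has the same fixed shape, which is what makes the special-token states usable as an embedding.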
SPT-CL J0417–4748: A Deep Chandra Study of a Relaxed Galaxy Cluster without Central Star Formation
Taweewat Somboonpanyakul
A. Mantz
S. W. Allen
Anthony M. Flores
R. Glenn Morris
Haley R. Stueber
L. E. Bleem
B. Floyd
Keunho Kim
Abstract We present an in-depth Chandra X-ray analysis of the galaxy cluster SPT-CL J0417−4748 (hereafter SPT J0417) at z = 0.58, with a focus on its thermodynamic properties and the apparent absence of central star formation. Utilizing a total Chandra exposure of 103 ks, we find that the large-scale X-ray morphology is consistent with a dynamically relaxed cool-core system. The intracluster medium shows a central density of 0.08 ± 0.01 cm⁻³, a central pseudoentropy of 26 (+6/−5) keV cm², and a central cooling time of 515 (+96/−75) Myr, values typical of massive cool-core clusters. Despite these conditions, no evidence of recent or ongoing star formation is detected in the brightest cluster galaxy (BCG). Spectral energy di…
Covenant-72B: Pre-Training a 72B LLM with Trustless Peers Over-the-Internet
Joel Lidin
Amir Sarfi
Erfan Miahi
Quentin Anthony
Shivam Chauhan
Evangelos Pappas
Samuel Dare
Recently, there has been increased interest in globally distributed training, which promises to both reduce training costs and democratize participation in building large-scale foundation models. However, existing models trained in a globally distributed manner are relatively small in scale and have only been trained with whitelisted participants. Therefore, they do not yet realize the full promise of democratized participation. In this report, we describe Covenant-72B, an LLM produced by the largest collaborative globally distributed pre-training run (in terms of both compute and model scale), which simultaneously allowed open, permissionless participation supported by a live blockchain protocol. We utilized a state-of-the-art communication-efficient optimizer, SparseLoCo, supporting dynamic participation with peers joining and leaving freely. Our model, pre-trained on approximately 1.1T tokens, performs competitively with fully centralized models pre-trained on similar or higher compute budgets, demonstrating that fully democratized, non-whitelisted participation is not only feasible, but can be achieved at unprecedented scale for a globally distributed pre-training run.
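A standard ingredient of communication-efficient distributed optimizers in this family is top-k sparsification with error feedback: each peer transmits only the largest-magnitude entries of its update and carries the dropped remainder into the next step. The sketch below shows only that generic ingredient; SparseLoCo's actual algorithm has further components (e.g. its local-update and quantization scheme) not reproduced here, and the gradient values are made up.

```python
# Generic top-k sparsification with an error-feedback residual, a common
# building block of communication-efficient distributed optimizers.
def topk_compress(vec, k):
    """Keep the k largest-magnitude entries; return (sparse update, residual)."""
    idx = sorted(range(len(vec)), key=lambda i: abs(vec[i]), reverse=True)[:k]
    keep = set(idx)
    sparse = [v if i in keep else 0.0 for i, v in enumerate(vec)]
    residual = [v - s for v, s in zip(vec, sparse)]
    return sparse, residual

# Each peer sends only k values per step; the dropped mass (the residual)
# is added back into the next step's update instead of being lost.
grad = [0.9, -0.05, 0.02, -1.2, 0.3]
sparse, resid = topk_compress(grad, k=2)
```

The residual term is what keeps aggressive sparsification from biasing the optimizer: nothing is discarded, only deferred.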
GPT-based self-supervised anomaly detection in command lines
Miles Q. Li
Julien Keutchayan
François Charest
Benjamin C. M. Fung
Process Reward Models That Think
Muhammad Khalifa
Lajanugen Logeswaran
Jaekyeom Kim
Hao Peng
Moontae Lee
Honglak Lee
Lu Wang
Step-by-step verifiers—also known as process reward models (PRMs)—are a key ingredient for test-time scaling, but training them requires expensive step-level supervision. This work aims to build data-efficient PRMs as verbalized step-wise reward models that verify every step in the solution by generating a verification chain-of-thought (CoT). We propose ThinkPRM, a long CoT verifier fine-tuned on orders of magnitude fewer process labels than those required by discriminative PRMs. Our approach capitalizes on the inherent reasoning abilities of long CoT models, and outperforms LLM-as-a-Judge and discriminative verifiers—using only 1% of the process labels in PRM800K—across several challenging benchmarks. Specifically, ThinkPRM beats the baselines on ProcessBench, MATH-500, and AIME ’24 under best-of-N selection and reward-guided search. In an out-of-domain evaluation over subsets of GPQA-Diamond and LiveCodeBench, our PRM surpasses discriminative verifiers trained with the full PRM800K by 8% and 4.5%, respectively. Lastly, under the same token budget, ThinkPRM scales up verification compute more effectively compared to LLM-as-a-Judge, outperforming it by 7.2% on a subset of ProcessBench. This work highlights the value of generative, long CoT PRMs that can scale test-time compute for verification while requiring minimal supervision for training.
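The best-of-N selection mentioned above can be sketched in a few lines: a step-wise verifier scores every step of each candidate solution, and a common convention (assumed here, not taken from the paper) is to gate each solution by its weakest step. The toy scorer below is a stand-in; ThinkPRM itself produces step judgments by generating a verification chain-of-thought.

```python
# Minimal sketch of best-of-N selection with a step-wise verifier (PRM).
# Aggregating per-step scores by their minimum is one common convention.
def best_of_n(candidates, step_scorer):
    """Pick the candidate whose weakest verified step is strongest."""
    def solution_score(steps):
        return min(step_scorer(s) for s in steps)
    return max(candidates, key=solution_score)

# Hypothetical verifier: flags steps containing an (injected) arithmetic slip.
def toy_scorer(step):
    return 0.1 if "2 + 2 = 5" in step else 0.9

cands = [
    ["let x = 2", "then 2 + 2 = 5", "so x + x = 5"],   # contains a bad step
    ["let x = 2", "then 2 + 2 = 4", "so x + x = 4"],
]
best = best_of_n(cands, toy_scorer)   # selects the second candidate
```

The min-aggregation is what makes process-level (rather than outcome-level) scores useful: one unverified step sinks an otherwise plausible solution.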
Generalization in Online Reinforcement Learning for Mobile Agents
Zihuan Jiang
Zhixiang Chi
Huan Liu
Ziqiang Wang
Yuanhao Yu
Graphical user interface (GUI)-based mobile agents automate digital tasks on mobile devices by interpreting natural-language instructions and interacting with the screen. While recent methods apply reinforcement learning (RL) to train vision-language model (VLM) agents in interactive environments with a primary focus on performance, generalization remains underexplored due to the lack of standardized benchmarks and open-source RL systems. In this work, we formalize the problem as a Contextual Markov Decision Process (CMDP) and introduce AndroidWorld-Generalization, a benchmark with three increasingly challenging regimes for evaluating zero-shot generalization to unseen task instances, templates, and applications. We further propose an RL training system that integrates Group Relative Policy Optimization (GRPO) with a scalable rollout collection system, consisting of containerized infrastructure, asynchronous execution, and error recovery to support reliable and efficient training. Experiments on AndroidWorld-Generalization show that RL enables a 7B-parameter VLM agent to surpass supervised fine-tuning baselines, yielding a 26.1% improvement on unseen instances but only limited gains on unseen templates (15.7%) and apps (8.3%), underscoring the challenges of generalization. As a preliminary step, we demonstrate that few-shot adaptation at test time improves performance on unseen apps, motivating future research in this direction. To support reproducibility and fair comparison, we open-source the full RL training system, including the environment, task suite, models, prompt configurations, and the underlying infrastructure (https://github.com/zihuanjiang/AndroidWorld-Generalization).
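At the heart of GRPO is a group-relative advantage: rewards for a group of rollouts on the same task are normalized against that group's own mean and standard deviation, removing the need for a learned value function. A minimal sketch of that computation follows; the reward values and the epsilon are illustrative, not the paper's configuration.

```python
# Group-relative advantage as used in GRPO: normalize each rollout's reward
# against the mean and standard deviation of its own group.
def grpo_advantages(rewards, eps=1e-8):
    n = len(rewards)
    mean = sum(rewards) / n
    var = sum((r - mean) ** 2 for r in rewards) / n
    std = var ** 0.5
    return [(r - mean) / (std + eps) for r in rewards]

# Four hypothetical rollouts of one GUI task; 1.0 = completed, 0.0 = failed.
adv = grpo_advantages([1.0, 0.0, 0.0, 1.0])
```

Successful rollouts receive positive advantages and failed ones negative, and the advantages of each group sum to zero by construction.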
Evolutionarily conserved neural dynamics across mice, monkeys, and humans
Anton R Sobinov
Z. Jeffrey Chen
Junchol Park
Nicholas G. Hatsopoulos
Joshua T. Dudman
Juan Álvaro Gallego
Matthew G. Perich
Zihao Chen
On evolutionary timescales, brain circuits adapt to support survival in each species' ecological niche. While some anatomical aspects of neural circuitry are conserved across species with distant evolutionary origins, each species also exhibits specific circuit adaptations that enable its behavioral repertoire. It remains unclear whether homologous brain regions leverage analogous neural computations as different species perform common behaviors such as reaching and manipulating objects. Here, we directly assessed conservation of neural computations using intracortical recordings from mouse, monkey, and human motor cortex, a homologous region across many mammals, during motor behaviors crucial for survival. We hypothesized that, despite their phylogenetic distance, rodents and primates produce movements through conserved neural computations implemented by motor cortical population dynamics. Remarkably, we found that movement-related neural dynamics were highly conserved across species, while variations in behavioral output were uniquely captured in neural trajectory geometries. Strikingly, neural dynamics during movement across species were more conserved than those across brain regions in the same human and between motor preparation and execution in the same monkeys. Lastly, through manipulation of neural network models trained to perform reaching movements, we reinforce that conservation of neural dynamics across species likely stems from shared circuit constraints. We thus assert that evolution maintains neural computations across phylogeny even as behavioral repertoires expand.
A Latent Space Thermodynamic Model of Cell Differentiation
Ali Poursina
Arsham Mikaeili Namini
Alihossein Saberi
Hamed S. Najafabadi
Abstract Inferring the governing dynamics of differentiation that capture cell state evolution remains a central challenge in single-cell biology. We present Latent Space Dynamics (LSD), a thermodynamics-inspired framework that models cell differentiation as evolution on a learned Waddington landscape in latent space. LSD jointly infers a low-dimensional cell state, a differentiable potential function governing developmental flow, and a local entropy term that quantifies cellular plasticity. Using a neural ordinary differential equation, LSD reconstructs continuous differentiation trajectories from time-ordered single-cell data. Across diverse developmental systems, LSD accurately recovers lineage hierarchies, predicts fate commitment for unseen cell types, and outperforms existing trajectory inference approaches in directional accuracy. Moreover, in silico gene perturbations reveal how individual regulators reshape the landscape, and entropy provides a quantitative measure of plasticity in development and cancer.
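The picture of differentiation as flow on a potential landscape can be illustrated with a toy one-dimensional analogue. LSD learns its potential in latent space via a neural ODE; the fixed double-well polynomial below is purely illustrative, standing in for a Waddington landscape with two "fates" at its minima.

```python
# Toy gradient flow on a double-well potential V(x) = (x^2 - 1)^2.
# Cells initialized near the unstable ridge at x = 0 commit to one of two
# fates at x = -1 or x = +1 under the flow dx/dt = -V'(x).
def grad_V(x):
    return 4 * x * (x ** 2 - 1)   # derivative of (x^2 - 1)^2

def integrate(x0, dt=0.01, steps=2000):
    """Explicit Euler integration of the gradient flow from x0."""
    x = x0
    for _ in range(steps):
        x -= dt * grad_V(x)
    return x

fate_a = integrate(-0.1)   # settles in the x = -1 well
fate_b = integrate(+0.1)   # settles in the x = +1 well
```

A small difference in initial state on either side of the ridge determines which basin the trajectory commits to, which is the landscape intuition behind fate-commitment prediction.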
Populus tremuloides as a natural fire barrier in Canada’s boreal forest under a changing climate
Flavie Pelletier
Jeffrey A. Cardille
Joanne C. White
Aspen (Populus tremuloides) stands have historically been considered a barrier to wildfire progression across Canada. However, as the climate changes and negatively impacts fire weather conditions, the established relationship between aspen, weather, and wildfires may also be changing. We explored this relationship using annual maps of dominant tree species extent and wildfire occurrence for three recent active fire years (2021–2023) within four Canadian forested ecozones (275 Mha), where most interactions between aspen stands and wildfires take place. We compared the proportion of aspen at burned perimeters with the proportion of aspen within the burned perimeters and found that aspen was more than twice as common at fire perimeters (ratio of 2.42). Increasing aspen cover also decreased daily burned area, from a median of 717 ha/day to 646 ha/day when aspen cover increased from less than 10% to more than 25%. Our analysis indicated that the increase in daily burned area following a rise in the fire weather index was reduced when greater aspen cover was present. Additionally, comparison of burn severity in spruce- and pine-dominated stands showed that aspen burned at a significantly lower severity than spruce in the two ecozones where aspen presence is greater. Our results indicate that despite a warming climate and an increase in the number of days conducive to severe fires, aspen continues to function as a barrier to the progression of wildfire and mitigates increases in daily burned area under extreme weather conditions.
• Aspen acts as a fire barrier: it is twice as common at fire perimeters as inside them.
• Increasing aspen cover reduces daily burned area.
• Greater aspen cover moderates the increase in burned area caused by extreme fire weather.
• Aspen burn severity was lower than that of spruce and pine where aspen presence was greater.
• Evidence on the difference in fire activity between leaf-on and leafless aspen is mixed.