Publications

Combining Sampling Methods with Attractor Dynamics in Spiking Models of Head-Direction Systems

Vojko Pjanovic

Jacob Zavatone-Veth

Sander Keemink

Michele Nardin

Uncertainty is a fundamental aspect of the natural environment, requiring the brain to infer and integrate noisy signals to guide behavior e… (see more)ffectively. Sampling-based inference has been proposed as a mechanism for dealing with uncertainty, particularly in early sensory processing. However, it is unclear how to reconcile sampling-based methods with operational principles of higher-order brain areas, such as attractor dynamics of persistent neural representations. In this study, we present a spiking neural network model for the head-direction (HD) system that combines sampling-based inference with attractor dynamics. To achieve this, we derive the required spiking neural network dynamics and interactions to perform sampling from a large family of probability distributions—including variables encoded with Poisson noise. We then propose a method that allows the network to update its estimate of the current head direction by integrating angular velocity samples—derived from noisy inputs—with a pull towards a circular manifold, thereby maintaining consistent attractor dynamics. This model makes specific, testable predictions about the HD system that can be examined in future neurophysiological experiments: it predicts correlated subthreshold voltage fluctuations; distinctive short- and long-term firing correlations among neurons; and characteristic statistics of the movement of the neural activity “bump” representing the head direction. Overall, our approach extends previous theories on probabilistic sampling with spiking neurons, offers a novel perspective on the computations responsible for orientation and navigation, and supports the hypothesis that sampling-based methods can be combined with attractor dynamics to provide a viable framework for studying neural dynamics across the brain.

2025-02-26

bioRxiv (preprint)

doi.org

Considerations and recommendations from the <scp>ISMRM</scp> diffusion study group for preclinical diffusion <scp>MRI</scp>: Part 1: In vivo small‐animal imaging

Ileana O. Jelescu

Francesco Grussu

Andrada Ianus

Brian Hansen

Rachel L. C. Barrett

Manisha Aggarwal

Stijn Michielse

Fatima Nasrallah

Warda Syeda

Nian Wang

Jelle Veraart

Alard Roebroeck

Andrew F. Bagdasarian

Cornelius Eichner

Farshid Sepehrband

Jan Zimmermann

Lucas Soustelle

Christien Bowman

Benjamin C. Tendler

Andreea Hertanu … (see 37 more)

Ben Jeurissen

Marleen Verhoye

Lucio Frydman

Yohan van de Looij

David Hike

Jeff F. Dunn

Karla Miller

Bennett Landman

Noam Shemesh

Arthur Anderson

Emilie McKinnon

Shawna Farquharson

Flavio Dell’Acqua

Carlo Pierpaoli

Ivana Drobnjak

Alexander Leemans

Kevin D. Harkins

Maxime Descoteaux

Duan Xu

Hao Huang

Mathieu D. Santin

Samuel C. Grant

Andre Obenaus

Gene S. Kim

Dan Wu

Denis Le Bihan

Stephen J. Blackband

Luisa Ciobanu

Els Fieremans

Ruiliang Bai

Trygve B. Leergaard

Jiangyang Zhang

Tim B. Dyrby

G. Allan Johnson

Julien Cohen-Adad

Matthew D. Budde

Kurt G Schilling

2025-02-26

Magnetic Resonance in Medicine (published)

doi.org

Considerations and recommendations from the ISMRM diffusion study group for preclinical diffusion MRI: Part 1: In vivo small‐animal imaging

Ileana O. Jelescu

Francesco Grussu

Andrada Ianus

Brian Hansen

Rachel L. C. Barrett

Manisha Aggarwal

Fatima Nasrallah

Stijn Michielse

Warda Syeda

Nian Wang

Jelle Veraart

Alard Roebroeck

Andrew F. Bagdasarian

Cornelius Eichner

Farshid Sepehrband

Jan Zimmermann

Lucas Soustelle

Christien Bowman

Benjamin C. Tendler

Andreea Hertanu … (see 37 more)

Ben Jeurissen

Marleen Verhoye

Lucio Frydman

Yohan van de Looij

David Hike

Jeff F. Dunn

Karla Miller

Bennett Landman

Noam Shemesh

Arthur Anderson

Emilie McKinnon

Shawna Farquharson

Flavio Dell’Acqua

Carlo Pierpaoli

Mathieu D. Santin

Ivana Drobnjak

Samuel C. Grant

Andre Obenaus

Alexander Leemans

Kevin D. Harkins

Maxime Descoteaux

Duan Xu

Hao Huang

Gene S. Kim

Dan Wu

Denis Le Bihan

Stephen J. Blackband

Luisa Ciobanu

Els Fieremans

Matthew D. Budde

Ruiliang Bai

Trygve B. Leergaard

Jiangyang Zhang

Tim B. Dyrby

G. Allan Johnson

Julien Cohen-Adad

Kurt G Schilling

Small-animal diffusion MRI (dMRI) has been used for methodological development and validation, characterizing the biological basis of diffus… (see more)ion phenomena, and comparative anatomy. The steps from animal setup and monitoring, to acquisition, analysis, and interpretation are complex, with many decisions that may ultimately affect what questions can be answered using the resultant data. This work aims to present selected considerations and recommendations from the diffusion community on best practices for preclinical dMRI of in vivo animals. We describe the general considerations and foundational knowledge that must be considered when designing experiments. We briefly describe differences in animal species and disease models and discuss why some may be more or less appropriate for different studies. We, then, give recommendations for in vivo acquisition protocols, including decisions on hardware, animal preparation, and imaging sequences, followed by advice for data processing including preprocessing, model-fitting, and tractography. Finally, we provide an online resource that lists publicly available preclinical dMRI datasets and software packages to promote responsible and reproducible research. In each section, we attempt to provide guides and recommendations, but also highlight areas for which no guidelines exist (and why), and where future work should focus. Although we mainly cover the central nervous system (on which most preclinical dMRI studies are focused), we also provide, where possible and applicable, recommendations for other organs of interest. An overarching goal is to enhance the rigor and reproducibility of small animal dMRI acquisitions and analyses, and thereby advance biomedical knowledge.

2025-02-26

Magnetic Resonance in Medicine (published)

doi.org

arxiv.org

Considerations and recommendations from the ISMRM Diffusion Study Group for preclinical diffusion MRI: Part 3 -- Ex vivo imaging: data processing, comparisons with microscopy, and tractography

Kurt G Schilling

Amy F. D. Howard

Francesco Grussu

Andrada Ianus

Brian Hansen

Rachel L. C. Barrett

Manisha Aggarwal

Stijn Michielse

Fatima Nasrallah

Warda Syeda

Nian Wang

Jelle Veraart

Alard Roebroeck

Andrew F. Bagdasarian

Cornelius Eichner

Farshid Sepehrband

Jan Zimmermann

Lucas Soustelle

Christien Bowman

Benjamin C. Tendler … (see 38 more)

Andreea Hertanu

Ben Jeurissen

Marleen Verhoye

Lucio Frydman

Yohan van de Looij

David Hike

Jeff F. Dunn

Karla Miller

Bennett Landman

Noam Shemesh

Arthur Anderson

Emilie McKinnon

Shawna Farquharson

Flavio Dell’Acqua

Carlo Pierpaoli

Ivana Drobnjak

Alexander Leemans

Kevin D. Harkins

Maxime Descoteaux

Duan Xu

Hao Huang

Mathieu D. Santin

Samuel C. Grant

Andre Obenaus

Gene S. Kim

Dan Wu

Denis Le Bihan

Stephen J. Blackband

Luisa Ciobanu

Els Fieremans

Ruiliang Bai

Trygve B. Leergaard

Jiangyang Zhang

Tim B. Dyrby

G. Allan Johnson

Julien Cohen-Adad

Matthew D. Budde

Ileana O. Jelescu

2025-02-26

Magnetic Resonance in Medicine (published)

doi.org

arxiv.org

NeoBERT: A Next-Generation BERT

Lola Le Breton

Quentin Fournier

Mariam El Mezouar

Sarath Chandar

Recent innovations in architecture, pre-training, and fine-tuning have led to the remarkable in-context learning and reasoning abilities of … (see more)large auto-regressive language models such as LLaMA and DeepSeek. In contrast, encoders like BERT and RoBERTa have not seen the same level of progress despite being foundational for many downstream NLP applications. To bridge this gap, we introduce NeoBERT, a next-generation encoder that redefines the capabilities of bidirectional models by integrating state-of-the-art advancements in architecture, modern data, and optimized pre-training methodologies. NeoBERT is designed for seamless adoption: it serves as a plug-and-play replacement for existing base models, relies on an optimal depth-to-width ratio, and leverages an extended context length of 4,096 tokens. Despite its compact 250M parameter footprint, it achieves state-of-the-art results on the massive MTEB benchmark, outperforming BERT large, RoBERTa large, NomicBERT, and ModernBERT under identical fine-tuning conditions. In addition, we rigorously evaluate the impact of each modification on GLUE and design a uniform fine-tuning and evaluation framework for MTEB. We release all code, data, checkpoints, and training scripts to accelerate research and real-world adoption.

2025-02-26

ArXiv (preprint)

arxiv.org

Origin of Nonlinear Circular Photocurrent in 2D Semiconductor
<mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML" display="inline"><mml:mrow><mml:msub><mml:mrow><mml:mi>MoS</mml:mi></mml:mrow><mml:mn>2</mml:mn></mml:msub></mml:mrow></mml:math>

Yanchong Zhao

Fengyu Chen

Jing Liang

Mohammad Saeed Bahramy

Mingwei Yang

Yao Guang

Xiaomei Li

Zheng Wei

Jian Tang

Jiaojiao Zhao

Mengzhou Liao

Cheng Shen

Qinqin Wang

Rong Yang

Kenji Watanabe

Takashi Taniguchi

Zhiheng Huang

Dongxia Shi

Kaihui Liu

Zhipei Sun … (see 3 more)

Ji Feng

Luojun Du

Guangyu Zhang

2025-02-26

Physical Review Letters (published)

doi.org

Origin of Nonlinear Circular Photocurrent in 2D Semiconductor MoS_{2}.

Yanchong Zhao

Fengyu Chen

Jing Liang

Mohammad Saeed Bahramy

Mingwei Yang

Yao Guang

Xiaomei Li

Zheng Wei

Jian Tang

Jiaojiao Zhao

Mengzhou Liao

Cheng Shen

Qinqin Wang

Rong Yang

Kenji Watanabe

Takashi Taniguchi

Zhiheng Huang

Dongxia Shi

Kaihui Liu

Zhipei Sun … (see 3 more)

Ji Feng

Luojun Du

Guangyu Zhang

2025-02-26

Physical Review Letters (published)

doi.org

Origin of Nonlinear Circular Photocurrent in 2D Semiconductor MoS_{2}.

Yanchong Zhao

Fengyu Chen

Jing Liang

Mohammad Saeed Bahramy

Mingwei Yang

Yao Guang

Xiaomei Li

Zheng Wei

Jian Tang

Jiaojiao Zhao

Mengzhou Liao

Cheng Shen

Qinqin Wang

Rong Yang

Kenji Watanabe

Takashi Taniguchi

Zhiheng Huang

Dongxia Shi

Kaihui Liu

Zhipei Sun … (see 3 more)

Ji Feng

Luojun Du

Guangyu Zhang

2025-02-26

Physical Review Letters (published)

doi.org

The use of extended reality in anesthesiology education: a scoping review

Gianluca Bertolizio

Yu Tong Huang

Marta Garbin

Elena Guadagno

Dan Poenaru

2025-02-26

Canadian Journal of Anaesthesia-journal Canadien D Anesthesie (published)

doi.org

Learning Multi-agent Multi-machine Tending by Mobile Robots

Abdalwhab Abdalwhab

Giovanni Beltrame

Samira Ebrahimi Kahou

David St-Onge

Robotics can help address the growing worker shortage challenge of the manufacturing industry. As such, machine tending is a task collaborat… (see more)ive robots can tackle that can also highly boost productivity. Nevertheless, existing robotics systems deployed in that sector rely on a fixed single-arm setup, whereas mobile robots can provide more flexibility and scalability. In this work, we introduce a multi-agent multi-machine tending learning framework by mobile robots based on Multi-agent Reinforcement Learning (MARL) techniques with the design of a suitable observation and reward. Moreover, an attention-based encoding mechanism is developed and integrated into Multi-agent Proximal Policy Optimization (MAPPO) algorithm to boost its performance for machine tending scenarios. Our model (AB-MAPPO) outperformed MAPPO in this new challenging scenario in terms of task success, safety, and resources utilization. Furthermore, we provided an extensive ablation study to support our various design decisions.

2025-02-25

AAAI.org/2025/Workshop/MARW (published)

doi.org

openreview.net

Scalable Equilibrium Sampling with Sequential Boltzmann Generators

Charlie B. Tan

Joey Bose

Chen Lin

Leon Klein

Michael M. Bronstein

Alexander Tong

Scalable sampling of molecular states in thermodynamic equilibrium is a long-standing challenge in statistical physics. Boltzmann generators… (see more) tackle this problem by pairing powerful normalizing flows with importance sampling to obtain statistically independent samples under the target distribution. In this paper, we extend the Boltzmann generator framework and introduce Sequential Boltzmann generators (SBG) with two key improvements. The first is a highly efficient non-equivariant Transformer-based normalizing flow operating directly on all-atom Cartesian coordinates. In contrast to equivariant continuous flows of prior methods, we leverage exactly invertible non-equivariant architectures which are highly efficient both during sample generation and likelihood computation. As a result, this unlocks more sophisticated inference strategies beyond standard importance sampling. More precisely, as a second key improvement we perform inference-time scaling of flow samples using annealed Langevin dynamics which transports samples toward the target distribution leading to lower variance (annealed) importance weights which enable higher fidelity resampling with sequential Monte Carlo. SBG achieves state-of-the-art performance w.r.t. all metrics on molecular systems, demonstrating the first equilibrium sampling in Cartesian coordinates of tri, tetra, and hexapeptides that were so far intractable for prior Boltzmann generators.

2025-02-25

ArXiv (preprint)

doi.org

arxiv.org

Scalable Equilibrium Sampling with Sequential Boltzmann Generators

Charlie B. Tan

Joey Bose

Chen Lin

Leon Klein

Michael M. Bronstein

Alexander Tong

Scalable sampling of molecular states in thermodynamic equilibrium is a long-standing challenge in statistical physics. Boltzmann generators… (see more) tackle this problem by pairing powerful normalizing flows with importance sampling to obtain statistically independent samples under the target distribution. In this paper, we extend the Boltzmann generator framework and introduce Sequential Boltzmann generators (SBG) with two key improvements. The first is a highly efficient non-equivariant Transformer-based normalizing flow operating directly on all-atom Cartesian coordinates. In contrast to equivariant continuous flows of prior methods, we leverage exactly invertible non-equivariant architectures which are highly efficient both during sample generation and likelihood computation. As a result, this unlocks more sophisticated inference strategies beyond standard importance sampling. More precisely, as a second key improvement we perform inference-time scaling of flow samples using annealed Langevin dynamics which transports samples toward the target distribution leading to lower variance (annealed) importance weights which enable higher fidelity resampling with sequential Monte Carlo. SBG achieves state-of-the-art performance w.r.t. all metrics on molecular systems, demonstrating the first equilibrium sampling in Cartesian coordinates of tri, tetra, and hexapeptides that were so far intractable for prior Boltzmann generators.

2025-02-25

ArXiv (preprint)

arxiv.org

AI Advantage

Mila AI Policy Fellowship

Strategic Priorities

AI Advantage

Mila AI Policy Fellowship

Publications

AI Advantage

Mila AI Policy Fellowship

Strategic Priorities

AI Advantage

Mila AI Policy Fellowship

Popular keywords:

Publications