Publications

The Harmonic Exponential Filter for Nonparametric Estimation on Motion Groups

Miguel Saavedra-Ruiz

Steven A. Parkison

Bayesian estimation is a vital tool in robotics as it allows systems to update the robot state belief using incomplete information from nois… (voir plus)y sensors. To render the state estimation problem tractable, many systems assume that the motion and measurement noise, as well as the state distribution, are unimodal and Gaussian. However, there are numerous scenarios and systems that do not comply with these assumptions. Existing nonparametric filters that are used to model multimodal distributions have drawbacks that limit their ability to represent a diverse set of distributions. This paper introduces a novel approach to nonparametric Bayesian filtering on motion groups, designed to handle multimodal distributions using harmonic exponential distributions. This approach leverages two key insights of harmonic exponential distributions: a) the product of two distributions can be expressed as the element-wise addition of their log-likelihood Fourier coefficients, and b) the convolution of two distributions can be efficiently computed as the tensor product of their Fourier coefficients. These observations enable the development of an efficient and asymptotically exact solution to the Bayes filter up to the band limit of a Fourier transform. We demonstrate our filter's performance compared with established nonparametric filtering methods across simulated and real-world localization tasks.

2025-01-31

IEEE Robotics and Automation Letters (publié)

doi.org

arxiv.org

La communication financière à l’épreuve de la crise COVID : une gestion des impressions ?

Corinne Bessieux-Ollier

Grégoire Davrinche

Guillaume Dumas

Nous étudions l’impact de la crise du COVID-19 sur la gestion des impressions pratiquée par les entreprises françaises cotées. Cette c… (voir plus)rise ayant eu un impact fort sur l’activité des entreprises, nous observons si les dirigeants modifient la manière de présenter l’information liée aux résultats non-GAAP, à travers l’utilisation de stratégies d’obscurcissement. Les données sur la gestion des impressions ont été collectées manuellement dans les communiqués de résultats annuels des entreprises du SBF 120 sur la période 2018-2020. Nous constatons une diminution générale du niveau de gestion des impressions en période de crise, notamment pour les entreprises des secteurs ayant été les plus impactés par la crise COVID. Cette diminution est toutefois moins prononcée pour les entreprises ayant sous-performé par rapport à leur secteur d’activité et pour les entreprises dont la performance a le plus diminué (indépendamment du secteur auquel elles appartiennent). Nos résultats suggèrent que les entreprises dont la baisse de performance pourrait être attribuée à des causes internes (résultats très défavorables, résultats en deçà du secteur d’activité) demeurent soucieuses de l’image qu’elles renvoient et maintiennent leur niveau de gestion des impressions malgré la crise.

2025-01-28

Comptabilité Contrôle Audit (publié)

doi.org

TEARS: Text Representations for Scrutable Recommendations.

Traditional recommender systems rely on high-dimensional (latent) embeddings for modeling user-item interactions, often resulting in opaque … (voir plus)representations that lack interpretability. Moreover, these systems offer limited control to users over their recommendations. Inspired by recent work, we introduce TExtuAl Representations for Scrutable recommendations (TEARS) to address these challenges. Instead of representing a user's interests through a latent embedding, TEARS encodes them in natural text, providing transparency and allowing users to edit them. To do so, TEARS uses a modern LLM to generate user summaries based on user preferences. Using these summaries, we take a hybrid approach where we use an optimal transport procedure to align the summaries' representation with the learned representation of a standard VAE for collaborative filtering. We find this approach can surpass the performance of popular VAE models while providing user-controllable recommendations. We also analyze the controllability of TEARS through three simulated user tasks to evaluate the effectiveness of a user editing its summary. A more detailed version of this manuscript with more experiments, baselines and detail is provided on arXiv.

2025-01-28

ACM.org/TheWebConf/2025/Conference (poster)

doi.org

openreview.net

Causal Discovery in Astrophysics: Unraveling Supermassive Black Hole and Galaxy Coevolution

Zehao Jin

Mario Pasquato

Benjamin L. Davis

Tristan Deleu

Yu Luo

Changhyun Cho

Pablo Lemos

Laurence Perreault-Levasseur

Yoshua Bengio

Xi Kang

Andrea Valerio Maccio

Yashar Hezaveh

Correlation does not imply causation, but patterns of statistical association between variables can be exploited to infer a causal structure… (voir plus) (even with purely observational data) with the burgeoning field of causal discovery. As a purely observational science, astrophysics has much to gain by exploiting these new methods. The supermassive black hole (SMBH)--galaxy interaction has long been constrained by observed scaling relations, that is low-scatter correlations between variables such as SMBH mass and the central velocity dispersion of stars in a host galaxy's bulge. This study, using advanced causal discovery techniques and an up-to-date dataset, reveals a causal link between galaxy properties and dynamically-measured SMBH masses. We apply a score-based Bayesian framework to compute the exact conditional probabilities of every causal structure that could possibly describe our galaxy sample. With the exact posterior distribution, we determine the most likely causal structures and notice a probable causal reversal when separating galaxies by morphology. In elliptical galaxies, bulge properties (built from major mergers) tend to influence SMBH growth, while in spiral galaxies, SMBHs are seen to affect host galaxy properties, potentially through feedback in gas-rich environments. For spiral galaxies, SMBHs progressively quench star formation, whereas in elliptical galaxies, quenching is complete, and the causal connection has reversed. Our findings support theoretical models of hierarchical assembly of galaxies and active galactic nuclei feedback regulating galaxy evolution. Our study suggests the potentiality for further exploration of causal links in astrophysical and cosmological scaling relations, as well as any other observational science.

2025-01-27

The Astrophysical Journal (publié)

doi.org

arxiv.org

The Landscape of Causal Discovery Data: Grounding Causal Discovery in Real-World Applications

Philippe Brouillard

Chandler Squires

Jonas Wahl

Konrad Paul Kording

Karen Sachs

Alexandre Drouin

Dhanya Sridhar

Causal discovery aims to automatically uncover causal relationships from data, a capability with significant potential across many scientifi… (voir plus)c disciplines. However, its real-world applications remain limited. Current methods often rely on unrealistic assumptions and are evaluated only on simple synthetic toy datasets, often with inadequate evaluation metrics. In this paper, we substantiate these claims by performing a systematic review of the recent causal discovery literature. We present applications in biology, neuroscience, and Earth sciences - fields where causal discovery holds promise for addressing key challenges. We highlight available simulated and real-world datasets from these domains and discuss common assumption violations that have spurred the development of new methods. Our goal is to encourage the community to adopt better evaluation practices by utilizing realistic datasets and more adequate metrics.

2025-01-27

cclear.cc/CLeaR/2025/Conference (poster)

doi.org

proceedings.mlr.press

Associations between circulating amino acids and metabolic dysfunction‐associated steatotic liver disease in individuals living with severe obesity

Ina Maltais‐Payette

Jérôme Bourgault

Marie‐Frédérique Gauthier

Laurent Biertho

Simon Marceau

François Julien

Patricia L. Mitchell

Christian Couture

Francis Brière

J. Corbeil

Benoit J. Arsenault

André Tchernof

2025-01-26

Physiological Reports (publié)

doi.org

scGraphETM: Graph-Based Deep Learning Approach for Unraveling Cell Type-Specific Gene Regulatory Networks from Single-Cell Multi-Omics Data

Wenqi Dong

Manqi Zhou

Boyu Han

Yi Wang

Yuemei Li

2025-01-26

bioRxiv (prépublication)

doi.org

A multivariable prediction model for invasive pulmonary aspergillosis in immunocompromised patients with acute respiratory failure (IPA-GRRR-OH score).

Alice Friol

Guillaume Dumas

Frédéric Pène

Alexandre Demoule

Achille Kouatchet

Laurent Argaud

Naike Bigé

Anne-Sophie Moreau

François Barbier

Djamel Mokart

Virginie Lemiale

Elie Azoulay

2025-01-23

Intensive Care Medicine (publié)

doi.org

Systemizing Multiplicity: The Curious Case of Arbitrariness in Machine Learning

Prakhar Ganesh

Afaf Taïk

Golnoosh Farnadi

Algorithmic modeling relies on limited information in data to extrapolate outcomes for unseen scenarios, often embedding an element of arbit… (voir plus)rariness in its decisions. A perspective on this arbitrariness that has recently gained interest is multiplicity-the study of arbitrariness across a set of "good models", i.e., those likely to be deployed in practice. In this work, we systemize the literature on multiplicity by: (a) formalizing the terminology around model design choices and their contribution to arbitrariness, (b) expanding the definition of multiplicity to incorporate underrepresented forms beyond just predictions and explanations, (c) clarifying the distinction between multiplicity and other lenses of arbitrariness, i.e., uncertainty and variance, and (d) distilling the benefits and potential risks of multiplicity into overarching trends, situating it within the broader landscape of responsible AI. We conclude by identifying open research questions and highlighting emerging trends in this young but rapidly growing area of research.

2025-01-23

ArXiv (prépublication)

doi.org

arxiv.org

Automatic segmentation of spinal cord lesions in MS: A robust tool for axial T2-weighted MRI scans

Enamundram Naga Karthik

Julian McGinnis

Ricarda Wurm

Sebastian Ruehling

Robert Graf

Jan Valosek

Pierre-Louis Benveniste

Markus Lauerer

Jason Talbott

Rohit Bakshi

Shahamat Tauhid

Timothy Shepherd

Achim Berthele

Claus Zimmer

Bernhard Hemmer

Daniel Rueckert

Benedikt Wiestler

Jan S. Kirschke

Julien Cohen-Adad

Mark Mühlau

Deep learning models have achieved remarkable success in segmenting brain white matter lesions in multiple sclerosis (MS), becoming integral… (voir plus) to both research and clinical workflows. While brain lesions have gained significant attention in MS research, the involvement of spinal cord lesions in MS is relatively understudied. This is largely owed to the variability in spinal cord magnetic resonance imaging (MRI) acquisition protocols, high individual anatomical differences, the complex morphology and size of spinal cord lesions - and lastly, the scarcity of labeled datasets required to develop robust segmentation tools. As a result, automatic segmentation of spinal cord MS lesions remains a significant challenge. Although some segmentation tools exist for spinal cord lesions, most have been developed using sagittal T2-weighted (T2w) sequences primarily focusing on cervical spines. With the growing importance of spinal cord imaging in MS, axial T2w scans are becoming increasingly relevant due to their superior sensitivity in detecting lesions compared to sagittal acquisition protocols. However, most existing segmentation methods struggle to effectively generalize to axial sequences due to differences in image characteristics caused by the highly anisotropic spinal cord scans. To address these challenges, we developed a robust, open-source lesion segmentation tool tailored specifically for axial T2w scans covering the whole spinal cord. We investigated key factors influencing lesion segmentation, including the impact of stitching together individually acquired spinal regions, straightening the spinal cord, and comparing the effectiveness of 2D and 3D convolutional neural networks (CNNs). Drawing on these insights, we trained a multi-center model using an extensive dataset of 582 MS patients, resulting in a dataset comprising an entirety of 2,167 scans. We empirically evaluated the model’s segmentation performance across various spinal segments for lesions with varying sizes. Our model significantly outperforms the current state-of-the-art methods, providing consistent segmentation across cervical, thoracic and lumbar regions. To support the broader research community, we integrate our model into the widely-used Spinal Cord Toolbox (v7.0 and above), making it accessible via the command sct_deepseg lesion_ms_axial_t2 -i <path-to-image.nii.gz>.

2025-01-22

medRxiv (prépublication)

doi.org

Pitfalls of Evidence-Based AI Policy

Stephen Casper

David M. Krueger

Dylan Hadfield-Menell

Nations across the world are working to govern AI. However, from a technical perspective, the best way to do this is not yet clear. Meanwhil… (voir plus)e, recent debates over AI regulation have led to calls for “evidence-based AI policy” which emphasize holding regulatory action to a high evidentiary standard. Evidence is of irreplaceable value to policymaking. However, holding regulatory action to too high an evidentiary standard can lead to systematic neglect of certain risks. In historical policy debates (e.g., over tobacco ca. 1965 and fossil fuels ca. 1990) “evidence-based policy” rhetoric is also a well-precedented strategy to downplay the urgency of action, delay regulation, and protect industry interests. Here, we argue that if the goal is evidence-based AI policy, the first regulatory objective must be to actively facilitate the process of identifying, studying, and deliberating about AI risks. We discuss a set of 16 regulatory goals to facilitate this and show that the EU, UK, USA, Brazil, Canada, and China all have substantial opportunities to adopt further evidence-seeking policies.

2025-01-22

ICLR.cc/2025/BlogPosts (accepté)

openreview.net

Accelerating Inference of Retrieval-Augmented Generation via Sparse Context Selection

Yun Zhu

Jia-Chen Gu

Caitlin Sikora

Ho Ko

Yinxiao Liu

Chu-Cheng Lin

Lei Shu

Liangchen Luo

Lei Meng

Bang Liu

Jindong Chen

Large language models (LLMs) augmented with retrieval exhibit robust performance and extensive versatility by incorporating external context… (voir plus)s. However, the input length grows linearly in the number of retrieved documents, causing a dramatic increase in latency. In this paper, we propose a novel paradigm named Sparse RAG, which seeks to cut computation costs through sparsity. Speciﬁcally, Sparse RAG encodes retrieved documents in parallel, which eliminates latency introduced by long-range attention of retrieved documents. Then, LLMs selectively decode the output by only attending to highly relevant caches auto-regressively, which are chosen via prompting LLMs with special control tokens. It is notable that Sparse RAG combines the assessment of each individual document and the generation of the response into a single process. The designed sparse mechanism in a RAG system can facilitate the reduction of the number of documents loaded during decoding for accelerating the inference of the RAG system. Additionally, ﬁltering out undesirable contexts enhances the model’s focus on relevant context, inherently improving its generation quality. Evaluation results on four datasets show that Sparse RAG can be used to strike an optimal balance between generation quality and computational efﬁciency, demonstrating its generalizability across tasks.

2025-01-21

ICLR.cc/2025/Conference (poster)

openreview.net

TRAIL : IA responsable pour les professionnels et les leaders

Fondateur en résidence Mila Ventures

Avantage IA : productivité dans la fonction publique

Publications

TRAIL : IA responsable pour les professionnels et les leaders

Fondateur en résidence Mila Ventures

Avantage IA : productivité dans la fonction publique

Mots-clés populaires:

Publications