Publications

Understanding Social Appropriateness Perceptions in Secondary Users of Domestic Robots

Seol Han

A new generation of robots are being developed to enter our homes in a matter of months. But has the industry appropriately accounted for th… (voir plus)e complexities of the social environment that we call home? We conducted an exploratory design workshop to examine what secondary users—those who are not expected to be owners but nonetheless daily users—deem to be socially appropriate behavior of a domestic robot. A total of 90 students from Mexico participated in the study. By analyzing they define and reason about appropriateness of robot behaviors in the home, we show why deployment of domestic robots require much more thoughtful considerations than implementation of simplified social rules; judgments of what is appropriate depend on context, roles, relationships, and individual boundaries, and can differ between primary and secondary users. We call on Human-Robot Interaction (HRI) practitioners to treat social appropriateness as a fluid, gradient factor at design time rather than a binary concept (appropriate/inappropriate).

2026-03-15

International Conference on Human-Robot Interaction (publié)

doi.org

Power-Law Spectrum of the Random Feature Model

Elliot Paquette

Ke Liang Xiao

Yizhe Zhu

Scaling laws for neural networks, in which the loss decays as a power-law in the number of parameters, data, and compute, depend fundamental… (voir plus)ly on the spectral structure of the data covariance, with power-law eigenvalue decay appearing ubiquitously in vision and language tasks. A central question is whether this spectral structure is preserved or destroyed when data passes through the basic building block of a neural network: a random linear projection followed by a nonlinear activation. We study this question for the random feature model: given data

2026-03-14

arXiv (prépublication)

doi.org

arxiv.org

Tactile Modality Fusion for Vision-Language-Action Models

Charlotte Morissette

Wei-Di Chang

We propose TacFiLM, a lightweight modality-fusion approach that integrates visual-tactile signals into vision-language-action (VLA) models. … (voir plus)While recent advances in VLA models have introduced robot policies that are both generalizable and semantically grounded, these models mainly rely on vision-based perception. Vision alone, however, cannot capture the complex interaction dynamics that occur during contact-rich manipulation, including contact forces, surface friction, compliance, and shear. While recent attempts to integrate tactile signals into VLA models often increase complexity through token concatenation or large-scale pretraining, the heavy computational demands of behavioural models necessitate more lightweight fusion strategies. To address these challenges, TacFiLM outlines a post-training finetuning approach that conditions intermediate visual features on pretrained tactile representations using feature-wise linear modulation (FiLM). Experimental results on insertion tasks demonstrate consistent improvements in success rate, direct insertion performance, completion time, and force stability across both in-distribution and out-of-distribution tasks. Together, these results support our method as an effective approach to integrating tactile signals into VLA models, improving contact-rich manipulation behaviours.

2026-03-14

arXiv (prépublication)

doi.org

arxiv.org

Toward Self-Driven Microscopy Exploration for the Characterization of Functional Materials

Claudia M. Bazán

Kirsty Ellis

Faisal Mohamed

Ramzi Zidani

Maxime Goulet

Jean-Nicolas Deraspe

Jeanine Looman

Glen Berseth

Delphine Bouilly

Audrey Laventure

2026-03-14

AI4X @ Neural Information Processing Systems (poster)

openreview.net

BATIS: Bayesian Approaches for Targeted Improvement of Species Distribution Models

Catherine Villeneuve

Benjamin Akera

Mélisande Teng

David Rolnick

Species distribution models (SDMs), which aim to predict species occurrence based on environmental variables, are widely used to monitor and… (voir plus) respond to biodiversity change. Recent deep learning advances for SDMs have been shown to perform well on complex and heterogeneous datasets, but their effectiveness remains limited by spatial biases in the data. In this paper, we revisit deep SDMs from a Bayesian perspective and introduce BATIS, a novel and practical framework wherein prior predictions are updated iteratively using limited observational data. Models must appropriately capture both aleatoric and epistemic uncertainty to effectively combine fine-grained local insights with broader ecological patterns. We benchmark an extensive set of uncertainty quantification approaches on a novel dataset including citizen science observations from the eBird platform. Our empirical study shows how Bayesian deep learning approaches can greatly improve the reliability of SDMs in data-scarce locations, which can contribute to ecological understanding and conservation efforts.

2026-03-13

AAAI Conference on Artificial Intelligence (publié)

doi.org

arxiv.org

Identifying and Analyzing Performance-Critical Tokens in Large Language Models

Yu Bai

Heyan Huang

Cesare Spinoso-Di Piano

Sanxing Chen

Marc-Antoine Rondeau

Yang Gao

Jackie Chi Kit Cheung

In-context learning (ICL) has emerged as an effective solution for few-shot learning with large language models (LLMs). However, how LLMs le… (voir plus)verage demonstrations to specify a task and learn a corresponding computational function through ICL is underexplored. Drawing from the way humans learn from content-label mappings in demonstrations, we categorize the tokens in an ICL prompt into content, stopword, and template tokens. Our goal is to identify the types of tokens whose representations directly influence LLM's performance, a property we refer to as being performance-critical. By ablating representations from the attention of the test example, we find that the representations of informative content tokens have less influence on performance compared to template and stopword tokens, which contrasts with the human attention to informative words. We give evidence that the representations of performance-critical tokens aggregate information from the content tokens. Moreover, we demonstrate experimentally that lexical meaning, repetition, and structural cues are the main distinguishing characteristics of these tokens. Our work sheds light on how LLMs learn to perform tasks from demonstrations and deepens our understanding of the roles different types of tokens play in LLMs.

2026-03-13

AAAI Conference on Artificial Intelligence (publié)

doi.org

arxiv.org

PlantTraitNet: An Uncertainty-Aware Multimodal Framework for Global-Scale Plant Trait Inference from Citizen Science Data

Ayushi Sharma

Johanna Trost

Daniel Lusk

Johannes Dollinger

Julian Schrader

Christian Rossi

Javier Lopatin

Étienne Laliberté

Simon Haberstroh

Jana Eichel

Daniel Mederer

Jose Miguel Cerda-Paredes

Shyam S. Phartyal

Lisa-Maricia Schwarz

Anja Linstädter

Maria Conceição Caldeira

Teja Kattenborn

Global plant maps of plant traits, such as leaf nitrogen or plant height, are essential for understanding ecosystem processes, including the… (voir plus) carbon and energy cycles of the Earth system. However, existing trait maps remain limited by the high cost and sparse geographic coverage of field-based measurements. Citizen science initiatives offer a largely untapped resource to overcome these limitations, with over 50 million geotagged plant photographs worldwide capturing valuable visual information on plant morphology and physiology. In this study, we introduce PlantTraitNet, a multi-modal, multi-task uncertainty-aware deep learning framework that predicts four key plant traits (plant height, leaf area, specific leaf area, and nitrogen content) from citizen science photos using weak supervision. By aggregating individual trait predictions across space, we generate global maps of trait distributions. We validate these maps against independent vegetation survey data (sPlotOpen) and benchmark them against leading global trait products. Our results show that PlantTraitNet consistently outperforms existing trait maps across all evaluated traits, demonstrating that citizen science imagery, when integrated with computer vision and geospatial AI, enables not only scalable but also more accurate global trait mapping. This approach offers a powerful new pathway for ecological research and Earth system modeling.

2026-03-13

AAAI Conference on Artificial Intelligence (publié)

doi.org

arxiv.org

Unbiased characterization of COVID-19 endotypes leads to prognostication of high-risk individuals using routine blood tests

William Ma

Antoine Soulé

Catherine Allard

Madeleine Durand

Karine Tremblay

Simon Rousseau

Amin Emad

Unsupervised proteomic analysis identified biologically coherent endotypes that advance understanding of acute lung injury in COVID‑19 and… (voir plus) support improved diagnostic and prognostic strategies.

2026-03-13

Communications Medicine (publié)

doi.org

What to Ask Next? Probing the Imaginative Reasoning of LLMs with TurtleSoup Puzzles

Mi Zhou

Sifan Wu

H. Zhang

Qi Sima

Bang Liu

We investigate the capacity of Large Language Models (LLMs) for imaginative reasoning—the proactive construction, testing, and revision of… (voir plus) hypotheses in information-sparse environments. Existing benchmarks, often static or focused on social deduction, fail to capture the dynamic, exploratory nature of this reasoning process. To address this gap, we introduce a comprehensive research framework based on the classic "Turtle Soup" game, integrating a benchmark, an agent, and an evaluation protocol. We present TurtleSoup-Bench, the first large-scale, bilingual, interactive benchmark for imaginative reasoning, comprising 800 turtle soup stories sourced from both the Internet and expert authors. We also propose Mosaic-Agent, a novel agent designed to assess LLMs' performance in this setting. To evaluate reasoning quality, we develop a multi-dimensional protocol measuring logical consistency, detail completion, and conclusion alignment. Experiments with leading LLMs reveal clear capability limits, common failure patterns, and a significant performance gap compared to humans. Our work offers new insights into LLMs' imaginative reasoning and establishes a foundation for future research on exploratory agent behavior.

2026-03-13

AAAI Conference on Artificial Intelligence (publié)

doi.org

arxiv.org

MANSION: Multi-floor lANguage-to-3D Scene generatIOn for loNg-horizon tasks

Lirong Che

Shuo Wen

Shan Huang

Chuang Wang

Yuzhe Yang

Gregory Dudek

Xueqian Wang

Jian Su

Real-world robotic tasks are long-horizon and often span multiple floors, demanding rich spatial reasoning. However, existing embodied bench… (voir plus)marks are largely confined to single-floor in-house environments, failing to reflect the complexity of real-world tasks. We introduce MANSION, the first language-driven framework for generating building-scale, multi-floor 3D environments. Being aware of vertical structural constraints, MANSION generates realistic, navigable whole-building structures with diverse, human-friendly scenes, enabling the development and evaluation of cross-floor long-horizon tasks. Building on this framework, we release MansionWorld, a dataset of over 1,000 diverse buildings ranging from hospitals to offices, alongside a Task-Semantic Scene Editing Agent that customizes these environments using open-vocabulary commands to meet specific user needs. Benchmarking reveals that state-of-the-art agents degrade sharply in our settings, establishing MANSION as a critical testbed for the next generation of spatial reasoning and planning.

2026-03-11

arXiv (prépublication)

doi.org

arxiv.org

Overcoming the Modality Gap in Context-Aided Forecasting

Vincent Zhihao Zheng

Étienne Marcotte

Arjun Ashok

Andrew Robert Williams

Lijun Sun

Alexandre Drouin

Valentina Zantedeschi

Context-aided forecasting (CAF) holds promise for integrating domain knowledge and forward-looking information, enabling AI systems to surpa… (voir plus)ss traditional statistical methods. However, recent empirical studies reveal a puzzling gap: multimodal models often fail to outperform their unimodal counterparts. We hypothesize that this underperformance stems from poor context quality in existing datasets, as verification is challenging. To address these limitations, we introduce a semi-synthetic data augmentation method that generates contexts both descriptive of temporal dynamics and verifiably complementary to numerical histories. This approach enables massive-scale dataset creation, resulting in CAF-7M, a corpus of 7 million context-augmented time series windows, including a rigorously verified test set. We demonstrate that semi-synthetic pre-training transfers effectively to real-world evaluation, and show clear evidence of context utilization. Our results suggest that dataset quality, rather than architectural limitations, has been the primary bottleneck in context-aided forecasting.

2026-03-11

arXiv (prépublication)

doi.org

openreview.net

Theta Dual-Brain Stimulation of rTPJ Shapes Joint Agency

Yuto Kurihara

Ayaka Tsuchiya

Guillaume Dumas

Rieko Osu

Summary Joint agency, the shared feeling of “we are doing this together”, has been linked to inter-brain synchrony, but its causal role … (voir plus)in shaping this experience remains unclear. We applied dual transcranial alternating current stimulation (dual-tACS) over the right temporo-parietal junction (rTPJ) to 13 dyads performing an alternating tapping task (target ITI = 0.5 s; 180 deg. relative phase), manipulating in- and anti-phase coupling at theta (6 Hz), alpha (10 Hz), and beta (20 Hz). As a result, tapping in the theta anti-phase condition was significantly slower than the memorized reference tempo, whereas the other stimulation conditions did not influence the inter-tap interval. Meanwhile, the relative phase remained close to 180 deg. across all conditions. In the theta condition, anti-phase stimulation produced significantly lower joint agency than in-phase stimulation. Furthermore, mediation analysis suggested that the inter-tap interval may partially account for the effect of theta dual-brain stimulation on joint agency, although this indirect pathway did not reach statistical significance. These findings suggest that anti-phase theta stimulation over the rTPJ lowers joint agency, possibly by reducing coordination efficiency while preserving the overall 180 deg. alternation structure.

2026-03-11

bioRxiv (prépublication)

doi.org

Mila Techaide 2026

Propulsion d'entrepreneurs scientifiques

Avantage IA : productivité dans la fonction publique

Publications

Mila Techaide 2026

Propulsion d'entrepreneurs scientifiques

Avantage IA : productivité dans la fonction publique

Mots-clés populaires:

Publications