Publications

Understanding Social Appropriateness Perceptions in Secondary Users of Domestic Robots
A new generation of robots are being developed to enter our homes in a matter of months. But has the industry appropriately accounted for th… (see more)e complexities of the social environment that we call home? We conducted an exploratory design workshop to examine what secondary users—those who are not expected to be owners but nonetheless daily users—deem to be socially appropriate behavior of a domestic robot. A total of 90 students from Mexico participated in the study. By analyzing they define and reason about appropriateness of robot behaviors in the home, we show why deployment of domestic robots require much more thoughtful considerations than implementation of simplified social rules; judgments of what is appropriate depend on context, roles, relationships, and individual boundaries, and can differ between primary and secondary users. We call on Human-Robot Interaction (HRI) practitioners to treat social appropriateness as a fluid, gradient factor at design time rather than a binary concept (appropriate/inappropriate).
Power-Law Spectrum of the Random Feature Model
Ke Liang Xiao
Yizhe Zhu
Scaling laws for neural networks, in which the loss decays as a power-law in the number of parameters, data, and compute, depend fundamental… (see more)ly on the spectral structure of the data covariance, with power-law eigenvalue decay appearing ubiquitously in vision and language tasks. A central question is whether this spectral structure is preserved or destroyed when data passes through the basic building block of a neural network: a random linear projection followed by a nonlinear activation. We study this question for the random feature model: given data
Tactile Modality Fusion for Vision-Language-Action Models
We propose TacFiLM, a lightweight modality-fusion approach that integrates visual-tactile signals into vision-language-action (VLA) models. … (see more)While recent advances in VLA models have introduced robot policies that are both generalizable and semantically grounded, these models mainly rely on vision-based perception. Vision alone, however, cannot capture the complex interaction dynamics that occur during contact-rich manipulation, including contact forces, surface friction, compliance, and shear. While recent attempts to integrate tactile signals into VLA models often increase complexity through token concatenation or large-scale pretraining, the heavy computational demands of behavioural models necessitate more lightweight fusion strategies. To address these challenges, TacFiLM outlines a post-training finetuning approach that conditions intermediate visual features on pretrained tactile representations using feature-wise linear modulation (FiLM). Experimental results on insertion tasks demonstrate consistent improvements in success rate, direct insertion performance, completion time, and force stability across both in-distribution and out-of-distribution tasks. Together, these results support our method as an effective approach to integrating tactile signals into VLA models, improving contact-rich manipulation behaviours.
Toward Self-Driven Microscopy Exploration for the Characterization of Functional Materials
Claudia M. Bazán
Ramzi Zidani
Maxime Goulet
Jean-Nicolas Deraspe
Jeanine Looman
Delphine Bouilly
Audrey Laventure
BATIS: Bayesian Approaches for Targeted Improvement of Species Distribution Models
Benjamin Akera
Mélisande Teng
Species distribution models (SDMs), which aim to predict species occurrence based on environmental variables, are widely used to monitor and… (see more) respond to biodiversity change. Recent deep learning advances for SDMs have been shown to perform well on complex and heterogeneous datasets, but their effectiveness remains limited by spatial biases in the data. In this paper, we revisit deep SDMs from a Bayesian perspective and introduce BATIS, a novel and practical framework wherein prior predictions are updated iteratively using limited observational data. Models must appropriately capture both aleatoric and epistemic uncertainty to effectively combine fine-grained local insights with broader ecological patterns. We benchmark an extensive set of uncertainty quantification approaches on a novel dataset including citizen science observations from the eBird platform. Our empirical study shows how Bayesian deep learning approaches can greatly improve the reliability of SDMs in data-scarce locations, which can contribute to ecological understanding and conservation efforts.
Identifying and Analyzing Performance-Critical Tokens in Large Language Models
Heyan Huang
Sanxing Chen
Marc-Antoine Rondeau
Yang Gao
Jackie Chi Kit Cheung
In-context learning (ICL) has emerged as an effective solution for few-shot learning with large language models (LLMs). However, how LLMs le… (see more)verage demonstrations to specify a task and learn a corresponding computational function through ICL is underexplored. Drawing from the way humans learn from content-label mappings in demonstrations, we categorize the tokens in an ICL prompt into content, stopword, and template tokens. Our goal is to identify the types of tokens whose representations directly influence LLM's performance, a property we refer to as being performance-critical. By ablating representations from the attention of the test example, we find that the representations of informative content tokens have less influence on performance compared to template and stopword tokens, which contrasts with the human attention to informative words. We give evidence that the representations of performance-critical tokens aggregate information from the content tokens. Moreover, we demonstrate experimentally that lexical meaning, repetition, and structural cues are the main distinguishing characteristics of these tokens. Our work sheds light on how LLMs learn to perform tasks from demonstrations and deepens our understanding of the roles different types of tokens play in LLMs.
PlantTraitNet: An Uncertainty-Aware Multimodal Framework for Global-Scale Plant Trait Inference from Citizen Science Data
Ayushi Sharma
Johanna Trost
Daniel Lusk
Johannes Dollinger
Julian Schrader
Christian Rossi
Javier Lopatin
Simon Haberstroh
Jana Eichel
Daniel Mederer
Jose Miguel Cerda-Paredes
Shyam S. Phartyal
Lisa-Maricia Schwarz
Anja Linstädter
Maria Conceição Caldeira
Teja Kattenborn
Global plant maps of plant traits, such as leaf nitrogen or plant height, are essential for understanding ecosystem processes, including the… (see more) carbon and energy cycles of the Earth system. However, existing trait maps remain limited by the high cost and sparse geographic coverage of field-based measurements. Citizen science initiatives offer a largely untapped resource to overcome these limitations, with over 50 million geotagged plant photographs worldwide capturing valuable visual information on plant morphology and physiology. In this study, we introduce PlantTraitNet, a multi-modal, multi-task uncertainty-aware deep learning framework that predicts four key plant traits (plant height, leaf area, specific leaf area, and nitrogen content) from citizen science photos using weak supervision. By aggregating individual trait predictions across space, we generate global maps of trait distributions. We validate these maps against independent vegetation survey data (sPlotOpen) and benchmark them against leading global trait products. Our results show that PlantTraitNet consistently outperforms existing trait maps across all evaluated traits, demonstrating that citizen science imagery, when integrated with computer vision and geospatial AI, enables not only scalable but also more accurate global trait mapping. This approach offers a powerful new pathway for ecological research and Earth system modeling.
Unbiased characterization of COVID-19 endotypes leads to prognostication of high-risk individuals using routine blood tests
Catherine Allard
Madeleine Durand
Karine Tremblay
Simon Rousseau
Unsupervised proteomic analysis identified biologically coherent endotypes that advance understanding of acute lung injury in COVID‑19 and… (see more) support improved diagnostic and prognostic strategies.
What to Ask Next? Probing the Imaginative Reasoning of LLMs with TurtleSoup Puzzles
Mi Zhou
H. Zhang
Qi Sima
We investigate the capacity of Large Language Models (LLMs) for imaginative reasoning—the proactive construction, testing, and revision of… (see more) hypotheses in information-sparse environments. Existing benchmarks, often static or focused on social deduction, fail to capture the dynamic, exploratory nature of this reasoning process. To address this gap, we introduce a comprehensive research framework based on the classic "Turtle Soup" game, integrating a benchmark, an agent, and an evaluation protocol. We present TurtleSoup-Bench, the first large-scale, bilingual, interactive benchmark for imaginative reasoning, comprising 800 turtle soup stories sourced from both the Internet and expert authors. We also propose Mosaic-Agent, a novel agent designed to assess LLMs' performance in this setting. To evaluate reasoning quality, we develop a multi-dimensional protocol measuring logical consistency, detail completion, and conclusion alignment. Experiments with leading LLMs reveal clear capability limits, common failure patterns, and a significant performance gap compared to humans. Our work offers new insights into LLMs' imaginative reasoning and establishes a foundation for future research on exploratory agent behavior.
MANSION: Multi-floor lANguage-to-3D Scene generatIOn for loNg-horizon tasks
Lirong Che
Shuo Wen
Shan Huang
Chuang Wang
Yuzhe Yang
Xueqian Wang
Jian Su
Real-world robotic tasks are long-horizon and often span multiple floors, demanding rich spatial reasoning. However, existing embodied bench… (see more)marks are largely confined to single-floor in-house environments, failing to reflect the complexity of real-world tasks. We introduce MANSION, the first language-driven framework for generating building-scale, multi-floor 3D environments. Being aware of vertical structural constraints, MANSION generates realistic, navigable whole-building structures with diverse, human-friendly scenes, enabling the development and evaluation of cross-floor long-horizon tasks. Building on this framework, we release MansionWorld, a dataset of over 1,000 diverse buildings ranging from hospitals to offices, alongside a Task-Semantic Scene Editing Agent that customizes these environments using open-vocabulary commands to meet specific user needs. Benchmarking reveals that state-of-the-art agents degrade sharply in our settings, establishing MANSION as a critical testbed for the next generation of spatial reasoning and planning.
Overcoming the Modality Gap in Context-Aided Forecasting
Vincent Zhihao Zheng
Étienne Marcotte
Andrew Robert Williams
Lijun Sun
Valentina Zantedeschi
Context-aided forecasting (CAF) holds promise for integrating domain knowledge and forward-looking information, enabling AI systems to surpa… (see more)ss traditional statistical methods. However, recent empirical studies reveal a puzzling gap: multimodal models often fail to outperform their unimodal counterparts. We hypothesize that this underperformance stems from poor context quality in existing datasets, as verification is challenging. To address these limitations, we introduce a semi-synthetic data augmentation method that generates contexts both descriptive of temporal dynamics and verifiably complementary to numerical histories. This approach enables massive-scale dataset creation, resulting in CAF-7M, a corpus of 7 million context-augmented time series windows, including a rigorously verified test set. We demonstrate that semi-synthetic pre-training transfers effectively to real-world evaluation, and show clear evidence of context utilization. Our results suggest that dataset quality, rather than architectural limitations, has been the primary bottleneck in context-aided forecasting.
Theta Dual-Brain Stimulation of rTPJ Shapes Joint Agency
Yuto Kurihara
Ayaka Tsuchiya
Rieko Osu
Summary Joint agency, the shared feeling of “we are doing this together”, has been linked to inter-brain synchrony, but its causal role … (see more)in shaping this experience remains unclear. We applied dual transcranial alternating current stimulation (dual-tACS) over the right temporo-parietal junction (rTPJ) to 13 dyads performing an alternating tapping task (target ITI = 0.5 s; 180 deg. relative phase), manipulating in- and anti-phase coupling at theta (6 Hz), alpha (10 Hz), and beta (20 Hz). As a result, tapping in the theta anti-phase condition was significantly slower than the memorized reference tempo, whereas the other stimulation conditions did not influence the inter-tap interval. Meanwhile, the relative phase remained close to 180 deg. across all conditions. In the theta condition, anti-phase stimulation produced significantly lower joint agency than in-phase stimulation. Furthermore, mediation analysis suggested that the inter-tap interval may partially account for the effect of theta dual-brain stimulation on joint agency, although this indirect pathway did not reach statistical significance. These findings suggest that anti-phase theta stimulation over the rTPJ lowers joint agency, possibly by reducing coordination efficiency while preserving the overall 180 deg. alternation structure.