Publications

Responsible Humanoids: A Contradiction in Terms?

Séverin Lemaignan

AJung Moon

Simon Coghlan

Emily C. Collins

Vanessa Evers

Nico Hochgeschwender

Sara Ljungblad

Michael Milford

Sarah Moth-Lund Christensen

Francisco J. Rodríguez Lera

Pericle Salvini

Yi Yang

In this paper, we critically examine the current "humanoid hype" in robotics, questioning its alignment with responsible robotics principles… (see more). While technical challenges drive internal fascination, the pervasive public image of humanoids demands deeper HRI engagement. We explore how responsible robotics concepts, such as privacy, dignity, and trust, are uniquely challenged or overlooked in the pursuit of anthropomorphic robot forms. By dissecting this hype, and mapping the main findings of the recently-published Roadmap for Responsible Robotics to the humanoids field, we aim to move beyond technical form-factor obsessions to understand the true societal implications and identify potential blind spots for the HRI community.

2026-03-15

IEEE/ACM Conference on Human-Robot Interaction (published)

doi.org

Understanding Social Appropriateness Perceptions in Secondary Users of Domestic Robots

Rahatul Amin Ananto

Seol Han

Rachel Ruddy

AJung Moon

A new generation of robots are being developed to enter our homes in a matter of months. But has the industry appropriately accounted for th… (see more)e complexities of the social environment that we call home? We conducted an exploratory design workshop to examine what secondary users—those who are not expected to be owners but nonetheless daily users—deem to be socially appropriate behavior of a domestic robot. A total of 90 students from Mexico participated in the study. By analyzing they define and reason about appropriateness of robot behaviors in the home, we show why deployment of domestic robots require much more thoughtful considerations than implementation of simplified social rules; judgments of what is appropriate depend on context, roles, relationships, and individual boundaries, and can differ between primary and secondary users. We call on Human-Robot Interaction (HRI) practitioners to treat social appropriateness as a fluid, gradient factor at design time rather than a binary concept (appropriate/inappropriate).

2026-03-15

International Conference on Human-Robot Interaction (published)

doi.org

Power-Law Spectrum of the Random Feature Model

Elliot Paquette

Ke Liang Xiao

Yizhe Zhu

Scaling laws for neural networks, in which the loss decays as a power-law in the number of parameters, data, and compute, depend fundamental… (see more)ly on the spectral structure of the data covariance, with power-law eigenvalue decay appearing ubiquitously in vision and language tasks. A central question is whether this spectral structure is preserved or destroyed when data passes through the basic building block of a neural network: a random linear projection followed by a nonlinear activation. We study this question for the random feature model: given data

2026-03-14

arXiv (preprint)

doi.org

arxiv.org

Tactile Modality Fusion for Vision-Language-Action Models

Charlotte Morissette

Wei-Di Chang

We propose TacFiLM, a lightweight modality-fusion approach that integrates visual-tactile signals into vision-language-action (VLA) models. … (see more)While recent advances in VLA models have introduced robot policies that are both generalizable and semantically grounded, these models mainly rely on vision-based perception. Vision alone, however, cannot capture the complex interaction dynamics that occur during contact-rich manipulation, including contact forces, surface friction, compliance, and shear. While recent attempts to integrate tactile signals into VLA models often increase complexity through token concatenation or large-scale pretraining, the heavy computational demands of behavioural models necessitate more lightweight fusion strategies. To address these challenges, TacFiLM outlines a post-training finetuning approach that conditions intermediate visual features on pretrained tactile representations using feature-wise linear modulation (FiLM). Experimental results on insertion tasks demonstrate consistent improvements in success rate, direct insertion performance, completion time, and force stability across both in-distribution and out-of-distribution tasks. Together, these results support our method as an effective approach to integrating tactile signals into VLA models, improving contact-rich manipulation behaviours.

2026-03-14

arXiv (preprint)

doi.org

arxiv.org

Toward Self-Driven Microscopy Exploration for the Characterization of Functional Materials

Claudia M. Bazán

Kirsty Ellis

Faisal Mohamed

Ramzi Zidani

Maxime Goulet

Jean-Nicolas Deraspe

Jeanine Looman

Glen Berseth

Delphine Bouilly

Audrey Laventure

2026-03-14

AI4X @ Neural Information Processing Systems (poster)

openreview.net

BATIS: Bayesian Approaches for Targeted Improvement of Species Distribution Models

Catherine Villeneuve

Benjamin Akera

Mélisande Teng

David Rolnick

Species distribution models (SDMs), which aim to predict species occurrence based on environmental variables, are widely used to monitor and… (see more) respond to biodiversity change. Recent deep learning advances for SDMs have been shown to perform well on complex and heterogeneous datasets, but their effectiveness remains limited by spatial biases in the data. In this paper, we revisit deep SDMs from a Bayesian perspective and introduce BATIS, a novel and practical framework wherein prior predictions are updated iteratively using limited observational data. Models must appropriately capture both aleatoric and epistemic uncertainty to effectively combine fine-grained local insights with broader ecological patterns. We benchmark an extensive set of uncertainty quantification approaches on a novel dataset including citizen science observations from the eBird platform. Our empirical study shows how Bayesian deep learning approaches can greatly improve the reliability of SDMs in data-scarce locations, which can contribute to ecological understanding and conservation efforts.

2026-03-13

AAAI Conference on Artificial Intelligence (published)

doi.org

arxiv.org

Identifying and Analyzing Performance-Critical Tokens in Large Language Models

Yu Bai

Heyan Huang

Cesare Spinoso-Di Piano

Sanxing Chen

Marc-Antoine Rondeau

Yang Gao

Jackie Chi Kit Cheung

In-context learning (ICL) has emerged as an effective solution for few-shot learning with large language models (LLMs). However, how LLMs le… (see more)verage demonstrations to specify a task and learn a corresponding computational function through ICL is underexplored. Drawing from the way humans learn from content-label mappings in demonstrations, we categorize the tokens in an ICL prompt into content, stopword, and template tokens. Our goal is to identify the types of tokens whose representations directly influence LLM's performance, a property we refer to as being performance-critical. By ablating representations from the attention of the test example, we find that the representations of informative content tokens have less influence on performance compared to template and stopword tokens, which contrasts with the human attention to informative words. We give evidence that the representations of performance-critical tokens aggregate information from the content tokens. Moreover, we demonstrate experimentally that lexical meaning, repetition, and structural cues are the main distinguishing characteristics of these tokens. Our work sheds light on how LLMs learn to perform tasks from demonstrations and deepens our understanding of the roles different types of tokens play in LLMs.

2026-03-13

AAAI Conference on Artificial Intelligence (published)

doi.org

arxiv.org

PlantTraitNet: An Uncertainty-Aware Multimodal Framework for Global-Scale Plant Trait Inference from Citizen Science Data

Ayushi Sharma

Johanna Trost

Daniel Lusk

Johannes Dollinger

Julian Schrader

Christian Rossi

Javier Lopatin

Étienne Laliberté

Simon Haberstroh

Jana Eichel

Daniel Mederer

Jose Miguel Cerda-Paredes

Shyam S. Phartyal

Lisa-Maricia Schwarz

Anja Linstädter

Maria Conceição Caldeira

Teja Kattenborn

Global plant maps of plant traits, such as leaf nitrogen or plant height, are essential for understanding ecosystem processes, including the… (see more) carbon and energy cycles of the Earth system. However, existing trait maps remain limited by the high cost and sparse geographic coverage of field-based measurements. Citizen science initiatives offer a largely untapped resource to overcome these limitations, with over 50 million geotagged plant photographs worldwide capturing valuable visual information on plant morphology and physiology. In this study, we introduce PlantTraitNet, a multi-modal, multi-task uncertainty-aware deep learning framework that predicts four key plant traits (plant height, leaf area, specific leaf area, and nitrogen content) from citizen science photos using weak supervision. By aggregating individual trait predictions across space, we generate global maps of trait distributions. We validate these maps against independent vegetation survey data (sPlotOpen) and benchmark them against leading global trait products. Our results show that PlantTraitNet consistently outperforms existing trait maps across all evaluated traits, demonstrating that citizen science imagery, when integrated with computer vision and geospatial AI, enables not only scalable but also more accurate global trait mapping. This approach offers a powerful new pathway for ecological research and Earth system modeling.

2026-03-13

AAAI Conference on Artificial Intelligence (published)

doi.org

arxiv.org

Profiling Pre-service Teachers’ Computational Thinking

Tanya Chichekian

Maria Cutumisu

Annie Savard

Yi-Mei Zhang

Computational thinking (CT) is a vital skill set for pre-service teachers who will need to foster computational literacy in K–12 classroom… (see more)s, yet the factors influencing their CT skills remain less understood than those for K–12 students or in-service teachers. This study leverages multimodal data to investigate how pre-service teachers (n=128) differ in CT skills, the predictive role of metacognitive strategies and prior coding experience, and variations in online behaviours. Using latent profile analysis, we identified three profiles based on digital literacy, problem-solving, and coding comfort (Novice, Developing, and Proficient), revealing heterogeneity in CT, and supporting non-linear skill acquisition. Linear discriminant analysis revealed that metacognitive strategies and prior coding experience significantly predict profile membership, validating the interplay of technical and cognitive factors in the development of CT skills. Behavioural data from an interactive problem-solving task showed that, compared to Novices and Developing learners, Proficient learners were more task efficient and perceived fewer challenges during task completion. Implications for designing a learning analytics dashboard to visualize profiles and behavioural metrics to support adaptive, equitable, and personalized teacher training are discussed, thereby enhancing pre-service teachers’ readiness to integrate CT into K–12 education.

2026-03-13

Journal of Learning Analytics (published)

doi.org

Unbiased characterization of COVID-19 endotypes leads to prognostication of high-risk individuals using routine blood tests

William Ma

Antoine Soulé

Catherine Allard

Madeleine Durand

Karine Tremblay

Simon Rousseau

Amin Emad

Unsupervised proteomic analysis identified biologically coherent endotypes that advance understanding of acute lung injury in COVID‑19 and… (see more) support improved diagnostic and prognostic strategies.

2026-03-13

Communications Medicine (published)

doi.org

What to Ask Next? Probing the Imaginative Reasoning of LLMs with TurtleSoup Puzzles

Mi Zhou

Sifan Wu

H. Zhang

Qi Sima

Bang Liu

We investigate the capacity of Large Language Models (LLMs) for imaginative reasoning—the proactive construction, testing, and revision of… (see more) hypotheses in information-sparse environments. Existing benchmarks, often static or focused on social deduction, fail to capture the dynamic, exploratory nature of this reasoning process. To address this gap, we introduce a comprehensive research framework based on the classic "Turtle Soup" game, integrating a benchmark, an agent, and an evaluation protocol. We present TurtleSoup-Bench, the first large-scale, bilingual, interactive benchmark for imaginative reasoning, comprising 800 turtle soup stories sourced from both the Internet and expert authors. We also propose Mosaic-Agent, a novel agent designed to assess LLMs' performance in this setting. To evaluate reasoning quality, we develop a multi-dimensional protocol measuring logical consistency, detail completion, and conclusion alignment. Experiments with leading LLMs reveal clear capability limits, common failure patterns, and a significant performance gap compared to humans. Our work offers new insights into LLMs' imaginative reasoning and establishes a foundation for future research on exploratory agent behavior.

2026-03-13

AAAI Conference on Artificial Intelligence (published)

doi.org

arxiv.org

MANSION: Multi-floor lANguage-to-3D Scene generatIOn for loNg-horizon tasks

Lirong Che

Shuo Wen

Shan Huang

Chuang Wang

Yuzhe Yang

Gregory Dudek

Xueqian Wang

Jian Su

Real-world robotic tasks are long-horizon and often span multiple floors, demanding rich spatial reasoning. However, existing embodied bench… (see more)marks are largely confined to single-floor in-house environments, failing to reflect the complexity of real-world tasks. We introduce MANSION, the first language-driven framework for generating building-scale, multi-floor 3D environments. Being aware of vertical structural constraints, MANSION generates realistic, navigable whole-building structures with diverse, human-friendly scenes, enabling the development and evaluation of cross-floor long-horizon tasks. Building on this framework, we release MansionWorld, a dataset of over 1,000 diverse buildings ranging from hospitals to offices, alongside a Task-Semantic Scene Editing Agent that customizes these environments using open-vocabulary commands to meet specific user needs. Benchmarking reveals that state-of-the-art agents degrade sharply in our settings, establishing MANSION as a critical testbed for the next generation of spatial reasoning and planning.

2026-03-11

arXiv (preprint)

doi.org

arxiv.org

Mila on Udemy

AI Policy Fellowship Publications

Mila Ventures Launchpad

Publications

Mila on Udemy

AI Policy Fellowship Publications

Mila Ventures Launchpad

Popular keywords:

Publications