Publications

Learning Decision Trees as Amortized Structure Inference

Mohammed Mahfoud

Ghait Boukachab

Michał Koziarski

Alex Hernandez-Garcia

Stefan Bauer

Yoshua Bengio

Nikolay Malkin

2025-03-10

ArXiv (preprint)

arxiv.org

Relative biological effectiveness of 31 meV thermal neutrons in peripheral blood lymphocytes

Laura C Paterson

Fawaz Ali

Mohsen Naseri

David Perez Loureiro

Amy Festarini

Marilyne Stuart

Chad Boyer

Ronald Rogge

Christie Costello

Norma Ybarra

John Kildea

Richard B Richardson

2025-03-10

Radiation Protection Dosimetry (published)

doi.org

SemEval-2025 Task 11: Bridging the Gap in Text-Based Emotion Detection

Shamsuddeen Hassan Muhammad

Nedjma Ousidhoum

Idris Abdulmumin

Seid Muhie Yimam

Jan Philip Wahle

Terry Lima Ruas

Meriem Beloucif

Christine de Kock

Tadesse Belay

Ibrahim Ahmad

Nirmal Surange

Daniela Teodorescu

David Ifeoluwa Adelani

Alham Fikri Aji

Felermino Ali

Vladimir Araujo

Abinew Ayele

Oana Ignat

Alexander Panchenko

Yi Zhou … (see 1 more)

Saif M. Mohammad

2025-03-10

ArXiv (preprint)

arxiv.org

Understanding the impact of IoT security patterns on CPU usage and energy consumption: a dynamic approach for selecting patterns with deep reinforcement learning

Saeid Jamshidi

Amin Nikanjam

Kawser Wazed Nafi

Foutse Khomh

2025-03-10

International Journal of Information Security (published)

doi.org

Spectral State Space Model for Rotation-Invariant Visual Representation Learning

Sahar Dastani

Ali Bahri

Moslem Yazdanpanah

Mehrdad Noori

David Osowiechi

Gustavo Adolfo Vargas Hakim

Farzad Beizaee

Milad Cheraghalikhani

Arnab Kumar Mondal

Hervé Lombaert

Christian Desrosiers

2025-03-09

ArXiv (preprint)

arxiv.org

Unveiling Inefficiencies in LLM-Generated Code: Toward a Comprehensive Taxonomy

Altaf Allah Abbassi

Leuson Da Silva

Amin Nikanjam

Foutse Khomh

2025-03-08

ArXiv (preprint)

arxiv.org

NNetNav: Unsupervised Learning of Browser Agents Through Environment Interaction in the Wild

Shikhar Murty

Hao Zhu

Dzmitry Bahdanau

Christopher D Manning

We introduce NNetNav, a method for unsupervised interaction with websites that generates synthetic demonstrations for training browser agents. Given any website, NNetNav produces these demonstrations by retroactively labeling action sequences from an exploration policy. Most work on training browser agents has relied on expensive human supervision, and the limited prior work on such interaction-based techniques has failed to provide effective search through the exponentially large space of exploration. In contrast, NNetNav exploits the hierarchical structure of language instructions to make this search more tractable: complex instructions are typically decomposable into simpler sub-tasks, allowing NNetNav to automatically prune interaction episodes when an intermediate trajectory cannot be annotated with a meaningful sub-task. Llama-3.1-8b finetuned on 10k NNetNav self-generated demonstrations obtains over 16% success rate on WebArena, and 35% on WebVoyager, an improvement of 15 points and 31 points respectively over zero-shot Llama-3.1-8b, outperforming zero-shot GPT-4 and reaching the state-of-the-art among unsupervised methods, for both benchmarks.

2025-03-07

ICLR.cc/2025/Workshop/SSI-FM (poster)

openreview.net

Towards Graph Foundation Models: A Study on the Generalization of Positional and Structural Encodings

Billy Joe Franks

Moshe Eliasof

Semih Cantürk

Guy Wolf

Carola-Bibiane Schönlieb

Sophie Fellenz

Marius Kloft

Recent advances in integrating positional and structural encodings (PSEs) into graph neural networks (GNNs) have significantly enhanced their performance across various graph learning tasks. However, the general applicability of these encodings and their potential to serve as foundational representations for graphs remain uncertain. This paper investigates the fine-tuning efficiency, scalability with sample size, and generalization capability of learnable PSEs across diverse graph datasets. Specifically, we evaluate their potential as universal pre-trained models that can be easily adapted to new tasks with minimal fine-tuning and limited data. Furthermore, we assess the expressivity of the learned representations, particularly when used to augment downstream GNNs. We demonstrate through extensive benchmarking and empirical analysis that PSEs generally enhance downstream models. However, some datasets may require specific PSE-augmentations to achieve optimal performance. Nevertheless, our findings highlight their significant potential to become integral components of future graph foundation models. We provide new insights into the strengths and limitations of PSEs, contributing to the broader discourse on foundation models in graph learning.

2025-03-07

TMLR (accepted)

openreview.net

Tractable Representations for Convergent Approximation of Distributional HJB Equations

Julie Alhosh

Harley Wiltzer

David Meger

2025-03-07

ArXiv (preprint)

arxiv.org

Attention-based Class-Conditioned Alignment for Multi-Source Domain Adaptation of Object Detectors

Atif Belal

Akhil Meethal

Francisco Perdigon Romero

Marco Pedersoli

Eric Granger

Domain adaptation methods for object detection (OD) strive to mitigate the impact of distribution shifts by promoting feature alignment across source and target domains. Multi-source domain adaptation (MSDA) allows leveraging multiple annotated source datasets and unlabeled target data to improve the accuracy and robustness of the detection model. Most state-of-the-art MSDA methods for OD perform feature alignment in a class-agnostic manner. This is challenging since the objects have unique modality information due to variations in object appearance across domains. A recent prototype-based approach proposed a class-wise alignment, yet it suffers from error accumulation caused by noisy pseudo-labels that can negatively affect adaptation with imbalanced data. To overcome these limitations, we propose an attention-based class-conditioned alignment method for MSDA, designed to align instances of each object category across domains. In particular, an attention module combined with an adversarial domain classifier allows learning domain-invariant and class-specific instance representations. Experimental results on multiple benchmarking MSDA datasets indicate that our method outperforms state-of-the-art methods and exhibits robustness to class imbalance, achieved through a conceptually simple class-conditioning strategy. Our code is available at: https://github.com/imatif17/ACIA.

2025-03-06

2025 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) (published)

doi.org

arxiv.org

Continual Pre-training of MoEs: How robust is your router?

Benjamin Thérien

Charles-Étienne Joseph

Zain Sarwar

Ashwinee Panda

Anirban Das

Shi-Xiong Zhang

Stephen Rawls

Sambit Sahu

Eugene Belilovsky

Irina Rish

2025-03-06

ArXiv (preprint)

arxiv.org
