Publications

A Layer Selection Approach to Test Time Adaptation

Sabyasachi Sahoo

Mostafa ElAraby

Jonas Ngnawe

Yann Batiste Pequignot

Frederic Precioso

Test Time Adaptation (TTA) addresses the problem of distribution shift by adapting a pretrained model to a new domain during inference. When faced with challenging shifts, most methods collapse and perform worse than the original pretrained model. In this paper, we find that not all layers are equally receptive to the adaptation, and the layers with the most misaligned gradients often cause performance degradation. To address this, we propose GALA, a novel layer selection criterion to identify the most beneficial updates to perform during test time adaptation. This criterion can also filter out unreliable samples with noisy gradients. Its simplicity allows seamless integration with existing TTA loss functions, thereby preventing degradation and focusing adaptation on the most trainable layers. This approach also helps to regularize adaptation to preserve the pretrained features, which are crucial for handling unseen domains. Through extensive experiments, we demonstrate that the proposed layer selection framework improves the performance of existing TTA approaches across multiple datasets, domain shifts, model architectures, and TTA losses.

2025-04-11

Proceedings of the AAAI Conference on Artificial Intelligence (published)

doi.org

openreview.net

StarVector: Generating Scalable Vector Graphics Code from Images and Text

Juan A. Rodriguez

Abhay Puri

Shubham Agarwal

Issam Hadj Laradji

Pau Rodriguez

Sai Rajeswar

David Vazquez

Chris Pal

Marco Pedersoli

2025-04-11

Proceedings of the AAAI Conference on Artificial Intelligence (published)

doi.org

arxiv.org

AgentAda: Skill-Adaptive Data Analytics for Tailored Insight Discovery

Amirhossein Abaskohi

Amrutha Varshini Ramesh

Shailesh Nanisetty

Chirag Goel

David Vazquez

Chris Pal

Spandana Gella

Giuseppe Carenini

Issam Hadj Laradji

2025-04-10

ArXiv (preprint)

arxiv.org

Min-Max Optimisation for Nonconvex-Nonconcave Functions Using a Random Zeroth-Order Extragradient Algorithm

Amir Ali Farzin

Yuen-Man Pun

Philipp Braun

Antoine Lesage-Landry

Youssef Diouane

Iman Shames

2025-04-10

ArXiv (preprint)

arxiv.org

Leveraging Machine Learning Techniques in Intrusion Detection Systems for Internet of Things

Saeid Jamshidi

Amin Nikanjam

Nafi Kawser Wazed

Foutse Khomh

2025-04-09

ArXiv (preprint)

arxiv.org

Lugha-Llama: Adapting Large Language Models for African Languages

Happy Buzaaba

Alexander Wettig

David Ifeoluwa Adelani

Christiane Fellbaum

2025-04-09

ArXiv (preprint)

arxiv.org

Echoes in the Noise: Posterior Samples of Faint Galaxy Surface Brightness Profiles with Score-Based Likelihoods and Priors

Alexandre Adam

Connor Stone

Connor Bottrell

Ronan Legin

Yashar Hezaveh

Laurence Perreault-Levasseur

Examining the detailed structure of galaxy populations provides valuable insights into their formation and evolution mechanisms. Significant barriers to such analysis are the non-trivial noise properties of real astronomical images and the point spread function (PSF) which blurs structure. Here we present a framework which combines recent advances in score-based likelihood characterization and diffusion model priors to perform a Bayesian analysis of image deconvolution. The method, when applied to minimally processed Hubble Space Telescope (HST) data, recovers structures which have otherwise only become visible in next-generation James Webb Space Telescope (JWST) imaging.

2025-04-08

The Astronomical Journal (published)

doi.org

arxiv.org

InfoGain Wavelets: Furthering the Design of Diffusion Wavelets for Graph-Structured Data

David R. Johnson

Smita Krishnaswamy

Michael Perlmutter

2025-04-08

ArXiv (preprint)

arxiv.org