Gintare Karolina Dziugaite

Continual Learning in Vision-Language Models via Aligned Model Merging

Ghada Sokar

Anurag Arnab

Ahmet Iscen

Pablo Samuel Castro

Cordelia Schmid

Continual learning is conventionally tackled through sequential fine-tuning, a process that, while enabling adaptation, inherently favors plasticity over the stability needed to retain prior knowledge. While existing approaches attempt to mitigate catastrophic forgetting, a bias towards recent tasks persists as they build upon this sequential nature. In this work we present a new perspective based on model merging to maintain stability while still retaining plasticity. Rather than just sequentially updating the model weights, we propose merging newly trained task parameters with previously learned ones, promoting a better balance. To maximize the effectiveness of the merging process, we propose a simple mechanism that promotes learning aligned weights with previous ones, thereby avoiding interference when merging. We evaluate this approach on large Vision-Language Models (VLMs), and demonstrate its effectiveness in reducing forgetting, increasing robustness to various task orders and similarities, and improving generalization.

2025-05-30

ArXiv (preprint)

From Dormant to Deleted: Tamper-Resistant Unlearning Through Weight-Space Regularization

Shoaib Ahmed Siddiqui

Adrian Weller

David Scott Krueger

Michael Curtis Mozer

Eleni Triantafillou

Recent unlearning methods for LLMs are vulnerable to relearning attacks: knowledge believed-to-be-unlearned re-emerges by fine-tuning on a small set of (even seemingly-unrelated) examples. We study this phenomenon in a controlled setting for example-level unlearning in vision classifiers. We make the surprising discovery that forget-set accuracy can recover from around 50% post-unlearning to nearly 100% with fine-tuning on just the retain set -- i.e., zero examples of the forget set. We observe this effect across a wide variety of unlearning methods, whereas for a model retrained from scratch excluding the forget set (gold standard), the accuracy remains at 50%. We observe that resistance to relearning attacks can be predicted by weight-space properties, specifically,

2025-05-28

ArXiv (preprint)

Leveraging Per-Instance Privacy for Machine Unlearning

Nazanin Mohammadi Sepahvand

Anvith Thudi

Berivan Isik

Ashmita Bhattacharyya

Nicolas Papernot

Eleni Triantafillou

Daniel M. Roy

2025-05-01

ICML.cc/2025/Conference (poster)

openreview.net

Mechanistic Unlearning: Robust Knowledge Unlearning and Editing via Mechanistic Localization

Phillip Huang Guo

Aaquib Syed

Abhay Sheshadri

Aidan Ewart

2025-05-01

ICML.cc/2025/Conference (poster)

openreview.net

On the Dichotomy Between Privacy and Traceability in $\ell_p$ Stochastic Convex Optimization

Sasha Voitovych

Mahdi Haghifam

Idan Attias

Roi Livni

Daniel M. Roy

In this paper, we investigate the necessity of memorization in stochastic convex optimization (SCO) under …

2025-02-24

ArXiv (preprint)

On Traceability in $\ell_p$ Stochastic Convex Optimization

Sasha Voitovych

Mahdi Haghifam

Idan Attias

Roi Livni

Daniel M. Roy

In this paper, we investigate the necessity of traceability for accurate learning in stochastic convex optimization (SCO) under …

2025-02-24

ArXiv (preprint)

Selective Unlearning via Representation Erasure Using Domain Adversarial Training

Nazanin Mohammadi Sepahvand

Eleni Triantafillou

Hugo Larochelle

Doina Precup

James J. Clark

Daniel M. Roy