Pablo Piantanida

Celine Hudelot

Ismail Ben Ayed

We tackle the Few-Shot Open-Set Recognition (FSOSR) problem, i.e. classifying instances among a set of classes for which we only have a few … (see more)labeled samples, while simultaneously detecting instances that do not belong to any known class. We explore the popular transductive setting, which leverages the unlabelled query instances at inference. Motivated by the observation that existing transductive methods perform poorly in open-set scenarios, we propose a generalization of the maximum likelihood principle, in which latent scores down-weighing the influence of potential outliers are introduced alongside the usual parametric model. Our formulation embeds supervision constraints from the support set and additional penalties discouraging overconfident predictions on the query set. We proceed with a block-coordinate descent, with the latent scores and parametric model co-optimized alternately, thereby benefiting from each other. We call our resulting formulation Open-Set Likelihood Optimization (OSLO). OSLO is interpretable and fully modular; it can be applied on top of any pre-trained model seamlessly. Through extensive experiments, we show that our method surpasses existing inductive and transductive methods on both aspects of open-set recognition, namely inlier classification and outlier detection. Code is available at https://github.com/ebennequin/few-shot-open-set.

2023-06-17

2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (published)

A Functional Data Perspective and Baseline On Multi-Layer Out-of-Distribution Detection

Eduardo Dadalto Câmara Gomes

Pierre Colombo

Guillaume Staerman

Nathan Noiry

2023-06-06

ArXiv (preprint)

On the incompatibility of accuracy and equal opportunity

Carlos Pinzón

Catuscia Palamidessi

Frank Valencia

2023-05-02

Machine-mediated learning (published)

A Halfspace-Mass Depth-Based Method for Adversarial Attack Detection

Marine Picot

Federica Granese

Guillaume Staerman

Marco Romanelli

Francisco Messina

Pierre Colombo

2023-03-28

TMLR (accepted)

openreview.net

Unsupervised Layer-wise Score Aggregation for Textual OOD Detection

Maxime Darrin

Guillaume Staerman

Eduardo Dadalto Câmara Gomes

Jackie Ck Cheung

Pierre Colombo

2023-02-20

ArXiv (preprint)

A Minimax Approach Against Multi-Armed Adversarial Attacks Detection

Federica Granese

Marco Romanelli

Siddharth Garg

2023-02-04

ArXiv (preprint)

On the (Im)Possibility of Estimating Various Notions of Differential Privacy (short paper)

Daniele Gorla

Louis Jalouzot

Federica Granese

Catuscia Palamidessi

We analyze to what extent final users can infer information about the level of protection of their data when the data obfuscation mechanism … (see more)is a priori unknown to them (the so-called “black-box" scenario). In particular, we delve into the investigation of two notions of local differential privacy (LDP), namely 𝜀 -LDP and Rényi LDP. On one hand, we prove that, without any assumption on the underlying distributions, it is not possible to have an algorithm able to infer the level of data protection with provable guarantees. On the other hand, we demonstrate that, under reasonable assumptions (namely, Lipschitzness of the involved densities on a closed interval), such guarantees exist and can be achieved by a simple histogram-based estimator.

2023-01-01

ICTCS (published)

dblp.uni-trier.de

Optimal Transport for Unsupervised Hallucination Detection in Neural Machine Translation

Nuno Miguel Guerreiro

Pierre Colombo

André Martins

Neural machine translation (NMT) has become the de-facto standard in real-world machine translation applications. However, NMT models can un… (see more)predictably produce severely pathological translations, known as hallucinations, that seriously undermine user trust. It becomes thus crucial to implement effective preventive strategies to guarantee their proper functioning. In this paper, we address the problem of hallucination detection in NMT by following a simple intuition: as hallucinations are detached from the source content, they exhibit encoder-decoder attention patterns that are statistically different from those of good quality translations. We frame this problem with an optimal transport formulation and propose a fully unsupervised, plug-in detector that can be used with any attention-based NMT model. Experimental results show that our detector not only outperforms all previous model-based detectors, but is also competitive with detectors that employ external models trained on millions of samples for related tasks such as quality estimation and cross-lingual sentence similarity.

2022-12-19

ArXiv (preprint)

Beyond Mahalanobis-Based Scores for Textual OOD Detection

Pierre Colombo

Eduardo Dadalto Câmara Gomes

Guillaume Staerman

Nathan Noiry

2022-11-24

ArXiv (preprint)

Beyond Mahalanobis Distance for Textual OOD Detection

Pierre Colombo

Eduardo Dadalto Câmara Gomes

Guillaume Staerman

Nathan Noiry

openreview.net

KNIFE: Kernelized-Neural Differential Entropy Estimation

Georg Pichler

Pierre Colombo

Malik Boudiaf

Gunther Koliander

Mutual Information (MI) has been widely used as a loss regularizer for training neural networks. This has been particularly effective when l… (see more)earn dis-entangled or compressed representations of high dimensional data. However, differential entropy (DE), another fundamental measure of information, has not found widespread use in neural network training. Although DE offers a potentially wider range of applications than MI, off-the-shelf DE estimators are either non differentiable, computationally intractable or fail to adapt to changes in the underlying distribution. These drawbacks prevent them from being used as regularizers in neural networks training. To address shortcomings in previously proposed estimators for DE, here we introduce K NIFE , a fully parameterized, differentiable kernel-based estimator of DE. The ﬂexibility of our approach also allows us to construct K NIFE -based estimators for conditional (on either discrete or continuous variables) DE, as well as MI. We empirically validate our method on high-dimensional synthetic data and further apply it to guide the training of neural networks for real-world tasks. Our experiments on a large variety of tasks, including visual domain adaptation, textual fair classiﬁcation, and textual ﬁne-tuning demonstrate the effectiveness of K NIFE - based estimation. Code can be found at https: //github.com/g-pichler/knife .

2022-01-01

arXiv.org (preprint)

dblp.uni-trier.de

Realistic Evaluation of Transductive Few-Shot Learning - Supplementary Material

Olivier Veilleux

Éts Montréal

Malik Boudiaf