Portrait de Ulrich Aivodji

Ulrich Aivodji

Membre académique associé
Professeur associé, École de technologie supérieure (ETS), Département de génie logiciel et des TI
École de technologie supérieure
Sujets de recherche
Apprentissage de représentations
Apprentissage profond
Exploration des données
Optimisation

Biographie

Ulrich Aivodji est professeur associé d'informatique au Département de génie logiciel et des technologies de l'information de l’École de technologie supérieure de Montréal (ÉTS).

Il dirige le Trustworthy Information Systems Lab (TISL). Ses domaines de recherche sont la sécurité informatique, la confidentialité des données, l'optimisation et l'apprentissage automatique. Ses travaux actuels portent sur plusieurs aspects de l'apprentissage automatique digne de confiance, tels que l'équité, l'apprentissage automatique préservant la vie privée et l'explicabilité.

Avant d'occuper son poste actuel, il était chercheur postdoctoral à l'Université du Québec à Montréal (UQAM), où il travaillait avec Sébastien Gambs sur l'éthique de l'apprentissage automatique et la protection de la vie privée. Il a obtenu un doctorat en informatique à l'Université Paul-Sabatier, sous la direction de Marie-José Huguet et Marc-Olivier Killijian. Pendant son doctorat, il a été affilié au Laboratoire de recherche spécialisé dans l’analyse et l’architecture des systèmes du Centre national de la recherche scientifique (LAAS-CNRS) en tant que membre des groupes de recherche Informatique fiable et tolérance aux fautes et Recherche opérationnelle, optimisation combinatoire et contraintes.

Étudiants actuels

Doctorat - École de technologie suprérieure
Maîtrise recherche - École de technologie suprérieure
Postdoctorat - École de technologie suprérieure
Maîtrise recherche - École de technologie suprérieure
Doctorat - École de technologie suprérieure
Co-superviseur⋅e :
Collaborateur·rice de recherche - Simon Fraser University
Doctorat - École de technologie suprérieure
Co-superviseur⋅e :
Visiteur de recherche indépendant - UQAM
Doctorat - École de technologie suprérieure
Collaborateur·rice de recherche - University of Waterloo
Stagiaire de recherche - École de technologie suprérieure (ÉTS)

Publications

IDP-Bench: Benchmarking ability of LLMs to protect personal information in interdependent privacy contexts
Nicholas Vincent
Héber Hwang Arcolezi
Large language models (LLMs) are becoming widely deployed as personal AI assistants with access to sensitive user data, making privacy a maj… (voir plus)or challenge for their design and evaluation. Prior work focuses mainly on individual-level risks, overlooking \textbf{interdependent privacy (IDP)}--where one person's data may be revealed by others without their knowledge or consent. We address this gap by introducing \textbf{IDP-Bench}: the first LLM benchmark for IDP scenarios, grounded in the Contextual Integrity (CI) framework. We evaluate eight open-source LLMs on their understanding of IDP scenarios across three levels of IDP reasoning using two LLM judges. Results show strong co-ownership recognition (6/8 models exceed 90\%) but persistent weaknesses in identifying CI parameters (information attribute, primary subject) and IDP-specific parameters such as secondary subjects, where 7/8 models score below 74\%. Models also struggle to judge sharing appropriateness (5/8 scoring below 77\%). While the ability to judge the appropriateness of sharing improves with scale, performance tends to decline in smaller models, and prompt sensitivity remains high on IDP-specific questions--highlighting the need for more targeted study of IDP in LLM privacy research. Data \& code available \href{https://github.com/tisl-lab/Interdependent_Privacy_Bench}{here}.
Model Stealing Through the Lens of Model Multiplicity
Model stealing attacks, where adversaries create high-fidelity surrogate models, are a significant threat to the intellectual property of ma… (voir plus)chine learning services. Conventional wisdom suggests these surrogates could provide adversaries with economic leverage comparable to the original service providers. This paper challenges this assumption by evaluating model stealing attacks beyond mere fidelity to the target model. Because query-based extraction provides only partial supervision of the target's input-output behavior, the surrogate is not uniquely identified: many near-optimal surrogates can achieve comparable fidelity while differing in deployment-relevant properties. Instead of performing a classic learning-based model stealing attack, we compute the Rashomon Set (i.e., the set of almost-equally-accurate models) of surrogate models, and evaluate its diversity using multiplicity metrics (ambiguity, discrepancy and rashomon capcity) and group fairness metrics. Our experiments on real-world datasets reveal that despite exhibiting similar fidelity to the target model, surrogate models can display significant variances in other critical performance metrics. These findings cast doubt on the presumed equivalence between high-fidelity surrogates and the target model in practical deployment scenarios.
Challenges in Using LLM Agents to Validate Agent Governance
Héber Hwang Arcolezi
The increasing deployment of Large Language Models (LLMs) as autonomous agents has intensified the need for credible and trustworthy methods… (voir plus) to evaluate governance interventions. Motivated by recent research, this work considers the use of LLM and agent-based simulations to evaluate AI agent governance mechanisms before real-world deployment. While conceptually appealing, this approach introduces various challenges. We examine three such problems: (1) obtaining ground truth for validation, (2) determining whether observed behaviors represent actual agent operations or simulation artifacts, and (3) obtaining consent for data use, and addressing ethical concerns about computational surrogates replacing real users. We also outline considerations based on documented limitations, aiming to catalyze workshop discussion on trustworthy and reliable evaluation methods for agent governance.
Fairwashing: the risk of rationalization
Hiromi Arai
Olivier Fortineau
Sébastien Gambs
Satoshi Hara
Black-box explanation is the problem of explaining how a machine learning model -- whose internal logic is hidden to the auditor and general… (voir plus)ly complex -- produces its outcomes. Current approaches for solving this problem include model explanation, outcome explanation as well as model inspection. While these techniques can be beneficial by providing interpretability, they can be used in a negative manner to perform fairwashing, which we define as promoting the false perception that a machine learning model respects some ethical values. In particular, we demonstrate that it is possible to systematically rationalize decisions taken by an unfair black-box model using the model explanation as well as the outcome explanation approaches with a given fairness metric. Our solution, LaundryML, is based on a regularized rule list enumeration algorithm whose objective is to search for fair rule lists approximating an unfair black-box model. We empirically evaluate our rationalization technique on black-box models trained on real-world datasets and show that one can obtain rule lists with high fidelity to the black-box model while being considerably less unfair at the same time.