Luca Scimeca

Postdoctorate - Université de Montréal

Supervisor

Yoshua Bengio

Research Topics

Bayesian Inference

Causality

Computational Biology

Deep Learning

Generative Models

Probabilistic Models

Representation Learning

Website

Google Scholar

GitHub

Blog Posts

A tab representation of the Pre trained CLIP Vision Transformer

October 8, 2025

Why AI Models Hallucinate and How to Fix Them

Praneet Suresh

Jack Stanley

Danilo Bzdok

Sonia Joseph

Luca Scimeca

Read the article

Publications

Mitigating Biases with Diverse Ensembles and Diffusion Models

Luca Scimeca

Alexander Rubinstein

Damien Teney

Seong Joon Oh

Armand Mihai Nicolicioiu

Yoshua Bengio

Spurious correlations in the data, where multiple cues are predictive of the target labels, often lead to a phenomenon known as shortcut lea… (see more)rning, where a model relies on erroneous, easy-to-learn cues while ignoring reliable ones. In this work, we propose an ensemble diversification framework exploiting Diffusion Probabilistic Models (DPMs) to mitigate this form of bias. We show that at particular training intervals, DPMs can generate images with novel feature combinations, even when trained on samples displaying correlated input features. We leverage this crucial property to generate synthetic counterfactuals to increase model diversity via ensemble disagreement. We show that DPM-guided diversification is sufficient to remove dependence on primary shortcut cues, without a need for additional supervised signals. We further empirically quantify its efficacy on several diversification objectives, and finally show improved generalization and diversification performance on par with prior work that relies on auxiliary data collection.

2023-11-23

ArXiv (preprint)

arxiv.org

Leveraging Diffusion Disentangled Representations to Mitigate Shortcuts in Underspecified Visual Tasks

Luca Scimeca

Alexander Rubinstein

Armand Mihai Nicolicioiu

Damien Teney

Yoshua Bengio

Spurious correlations in the data, where multiple cues are predictive of the target labels, often lead to shortcut learning phenomena, where… (see more) a model may rely on erroneous, easy-to-learn, cues while ignoring reliable ones. In this work, we propose an ensemble diversification framework exploiting the generation of synthetic counterfactuals using Diffusion Probabilistic Models (DPMs). We discover that DPMs have the inherent capability to represent multiple visual cues independently, even when they are largely correlated in the training data. We leverage this characteristic to encourage model diversity and empirically show the efficacy of the approach with respect to several diversification objectives. We show that diffusion-guided diversification can lead models to avert attention from shortcut cues, achieving ensemble diversity performance comparable to previous methods requiring additional data collection.

2023-10-03

ArXiv (preprint)

doi.org

arxiv.org

Hackathon | Building safer AI for youth mental health

Indigenous Pathfinders in AI

AI Advantage

Luca Scimeca

Blog Posts

Publications

Hackathon | Building safer AI for youth mental health

Indigenous Pathfinders in AI

AI Advantage

Popular keywords:

Luca Scimeca

Blog Posts

Publications