Gaétan Marceau Caron

gaetan.marceau.caron@mila.quebec

Senior Director, Applied Machine Learning Research

Biography

Gaétan Marceau Caron is the Senior Director of the Applied Machine Learning Research team at Mila (Quebec Artificial Intelligence Institute). His goal is to foster a team of researchers who use artificial intelligence and work jointly with industry on difficult scientific problems whose solutions will bring high-value benefits to Canadian society.

He has more than 12 years of experience in transferring artificial intelligence knowledge through collaborative applied research projects. An engineering graduate of Polytechnique Montréal, ENSTA Paris (Institut Polytechnique de Paris), and Université Pierre-et-Marie-Curie (Sorbonne Universités), and the holder of a doctorate in scientific research from Université Paris-Saclay, he combines expertise in analyzing industrial systems with expertise in creating innovative solutions that meet increasingly complex societal needs.

Over the past 9 years at Mila, he has served as a scientific consultant on more than 25 industry projects and as an instructor in 6 editions of the deep learning school co-organized with IVADO.

Publications

OpenFake: An Open Dataset and Platform Toward Real-World Deepfake Detection

Akshatha Arodi

Gaétan Marceau Caron

Jean-François Godbout

Reihaneh Rabbany

Deepfakes, synthetic media created using advanced AI techniques, pose a growing threat to information integrity, particularly in politically sensitive contexts. This challenge is amplified by the increasing realism of modern generative models, which our human perception study confirms are often indistinguishable from real images. Yet, existing deepfake detection benchmarks rely on outdated generators or narrowly scoped datasets (e.g., single-face imagery), limiting their utility for real-world detection. To address these gaps, we present OpenFake, a large politically grounded dataset specifically crafted for benchmarking against modern generative models with high realism, and designed to remain extensible through an innovative crowdsourced adversarial platform that continually integrates new hard examples. OpenFake comprises nearly four million total images: three million real images paired with descriptive captions and almost one million synthetic counterparts from state-of-the-art proprietary and open-source models. Detectors trained on OpenFake achieve near-perfect in-distribution performance, strong generalization to unseen generators, and high accuracy on a curated in-the-wild social media test set, significantly outperforming models trained on existing datasets. Overall, we demonstrate that with high-quality and continually updated benchmarks, automatic deepfake detection is both feasible and effective in real-world settings.

2024-12-31

arXiv.org (preprint)

doi.org

arxiv.org

Predicting Infectiousness for Proactive Contact Tracing

Prateek Gupta

Nasim Rahaman

Pierre-Luc St-Charles

Hannah Alsdurf

Olexa Bilaniuk

David Buckeridge

Gaétan Marceau Caron

Pierre-Luc Carrier

Joumana Ghosn

Satya Ortiz-Gagné

Chris Pal

Irina Rish

Bernhard Schölkopf

Abhinav Sharma

Jian Tang

Andrew Williams

The COVID-19 pandemic has spread rapidly worldwide, overwhelming manual contact tracing in many countries and resulting in widespread lockdowns for emergency containment. Large-scale digital contact tracing (DCT) has emerged as a potential solution to resume economic and social activity while minimizing spread of the virus. Various DCT methods have been proposed, each making trade-offs between privacy, mobility restrictions, and public health. The most common approach, binary contact tracing (BCT), models infection as a binary event, informed only by an individual's test results, with corresponding binary recommendations that either all or none of the individual's contacts quarantine. BCT ignores the inherent uncertainty in contacts and the infection process, which could be used to tailor messaging to high-risk individuals, and prompt proactive testing or earlier warnings. It also does not make use of observations such as symptoms or pre-existing medical conditions, which could be used to make more accurate infectiousness predictions. In this paper, we use a recently-proposed COVID-19 epidemiological simulator to develop and test methods that can be deployed to a smartphone to locally and proactively predict an individual's infectiousness (risk of infecting others) based on their contact history and other information, while respecting strong privacy constraints. Predictions are used to provide personalized recommendations to the individual via an app, as well as to send anonymized messages to the individual's contacts, who use this information to better predict their own infectiousness, an approach we call proactive contact tracing (PCT). We find a deep-learning based PCT method which improves over BCT for equivalent average mobility, suggesting PCT could help in safe re-opening and second-wave prevention.

2021-05-02

International Conference on Learning Representations (Spotlight)

doi.org

openreview.net

COVI-AgentSim: an Agent-based Model for Evaluating Methods of Digital Contact Tracing

Prateek Gupta

Tegan Maharaj

Martin Weiss

Nasim Rahaman

Hannah Alsdurf

Abhinav Sharma

Nanor Minoyan

Soren Harnois Leblanc

Victor Schmidt

Pierre-Luc St-Charles

Akshay Patel

David Buckeridge

Joumana Ghosn

Yang Zhang

Bernhard Schölkopf

Jian Tang

Irina Rish

Christopher Pal

Joanna Merckx

Eilif B. Muller

Yoshua Bengio

The rapid global spread of COVID-19 has led to an unprecedented demand for effective methods to mitigate the spread of the disease, and various digital contact tracing (DCT) methods have emerged as a component of the solution. In order to make informed public health choices, there is a need for tools which allow evaluation and comparison of DCT methods. We introduce an agent-based compartmental simulator we call COVI-AgentSim, integrating detailed consideration of virology, disease progression, social contact networks, and mobility patterns, based on parameters derived from empirical research. We verify by comparing to real data that COVI-AgentSim is able to reproduce realistic COVID-19 spread dynamics, and perform a sensitivity analysis to verify that the relative performance of contact tracing methods is consistent across a range of settings. We use COVI-AgentSim to perform cost-benefit analyses comparing no DCT to: 1) standard binary contact tracing (BCT) that assigns binary recommendations based on binary test results; and 2) a rule-based method for feature-based contact tracing (FCT) that assigns a graded level of recommendation based on diverse individual features. We find all DCT methods consistently reduce the spread of the disease, and that the advantage of FCT over BCT is maintained over a wide range of adoption rates. Feature-based methods of contact tracing avert more disability-adjusted life years (DALYs) per socioeconomic cost (measured by productive hours lost). Our results suggest any DCT method can help save lives, support re-opening of economies, and prevent second-wave outbreaks, and that FCT methods are a promising direction for enriching BCT using self-reported symptoms, yielding earlier warning signals and a significantly reduced spread of the virus per socioeconomic cost.

2020-10-01

OpenReview.net/Anonymous_Preprint (unknown)

doi.org

openreview.net
