Yoshua Bengio

Membre académique principal

Chaire en IA Canada-CIFAR

Professeur titulaire, Université de Montréal, Département d'informatique et de recherche opérationnelle

Directeur scientifique, Équipe de direction

Observateur, Conseil d'administration, Mila

Site web

Google Scholar

Biographie

*Pour toute demande média, veuillez écrire à medias@mila.quebec.

Pour plus d’information, contactez Julie Mongeau, adjointe de direction à julie.mongeau@mila.quebec.

Reconnu comme une sommité mondiale en intelligence artificielle, Yoshua Bengio s’est surtout distingué par son rôle de pionnier en apprentissage profond, ce qui lui a valu le prix A. M. Turing 2018, le « prix Nobel de l’informatique », avec Geoffrey Hinton et Yann LeCun. Il est professeur titulaire à l’Université de Montréal, fondateur et directeur scientifique de Mila – Institut québécois d’intelligence artificielle, et codirige en tant que senior fellow le programme Apprentissage automatique, apprentissage biologique de l'Institut canadien de recherches avancées (CIFAR). Il occupe également la fonction de directeur scientifique d’IVADO.

En 2018, il a été l’informaticien qui a recueilli le plus grand nombre de nouvelles citations au monde. En 2019, il s’est vu décerner le prestigieux prix Killam. Depuis 2022, il détient le plus grand facteur d’impact (h-index) en informatique à l’échelle mondiale. Il est fellow de la Royal Society de Londres et de la Société royale du Canada, et officier de l’Ordre du Canada.

Soucieux des répercussions sociales de l’IA et de l’objectif que l’IA bénéficie à tous, il a contribué activement à la Déclaration de Montréal pour un développement responsable de l’intelligence artificielle.

Étudiants actuels

Aayush Bajaj

Maîtrise professionnelle - Université de Montréal

Co-superviseur⋅e :

Samira Ebrahimi Kahou

aayush.bajaj@mila.quebec

Ahmad Ghawanmeh

Maîtrise professionnelle - Université de Montréal

ahmad.ghawanmeh@mila.quebec

Akram Erraqabi

Doctorat - Université de Montréal

akram.erraqabi@mila.quebec

Alex Hernandez-Garcia

Postdoctorat - Université de Montréal

Co-superviseur⋅e :

Postdoctorat - Université de Montréal

alexander.tong@mila.quebec

Sasha Volokhova

Doctorat - Université de Montréal

alexandra.volokhova@mila.quebec

Alexandre Duval

Collaborateur·rice de recherche - Université Paris-Saclay

Superviseur⋅e principal⋅e :

David Rolnick

alexandre.duval@mila.quebec

Site web

Aman Dalmia

Maîtrise professionnelle - Université de Montréal

aman.dalmia@mila.quebec

Andrés Campero

Visiteur de recherche indépendant - MIT

andres.campero@mila.quebec

Site web

Aniket Didolkar

Doctorat - Université de Montréal

aniket.didolkar@mila.quebec

Site web

Github

Google Scholar

Anja Surina

Doctorat - École Polytechnique Montréal Fédérale de Lausanne

anja.surina@mila.quebec

Ayoub Atanane

Stagiaire de recherche - Université du Québec à Rimouski

ayoub.atanane@mila.quebec

Github

Basile Terver

Collaborateur·rice de recherche

Superviseur⋅e principal⋅e :

David Rolnick

basile.terver@mila.quebec

Camille Rochefort-Boulanger

Doctorat - Université de Montréal

Superviseur⋅e principal⋅e :

Julie Hussin

rochefoc@mila.quebec

Github

Chen Chen

Postdoctorat - Université de Montréal

Co-superviseur⋅e :

Blake Richards

chen.sun@mila.quebec

Chenghao Liu

Collaborateur·rice alumni

Maîtrise professionnelle - Université de Montréal

clemence.granade@mila.quebec

Site web

Github

Cristian Meo

Collaborateur·rice alumni

cristian.meo@mila.quebec

Github

Google Scholar

Cristian Dragos Manta

Doctorat - Université de Montréal

Co-superviseur⋅e :

Dhanya Sridhar

cristian-dragos.manta@mila.quebec

Github

Damiano Fornasiere

Doctorat - Barcelona University

damiano.fornasiere@mila.quebec

Dan Assouline

Collaborateur·rice alumni

dan.assouline@mila.quebec

Github

Google Scholar

Dinghuai Zhang

Doctorat - Université de Montréal

Superviseur⋅e principal⋅e :

Aaron Courville

dinghuai.zhang@mila.quebec

Site web

Divya Sharma

Collaborateur·rice alumni

divya.sharma@mila.quebec

Site web

Github

Donna Vakalis

Postdoctorat - Université de Montréal

Co-superviseur⋅e :

David Rolnick

donna.vakalis@mila.quebec

Github

Google Scholar

Dragos Secrieru

Maîtrise recherche - Université de Montréal

dragos.secrieru@mila.quebec

Edward Hu

Doctorat - Université de Montréal

edward.hu@mila.quebec

Elmimouni Zakaria

Stagiaire de recherche - Université de Montréal

zakarya.elmimouni@mila.quebec

Github

Eric Elmoznino

Doctorat - Université de Montréal

Co-superviseur⋅e :

Guillaume Lajoie

eric.elmoznino@mila.quebec

Stagiaire de recherche - UQAR

ghait.boukachab@mila.quebec

Github

Hae-Beom Lee

Collaborateur·rice alumni

hae-beom.lee@mila.quebec

Jessie Richter-Powell

Visiteur de recherche indépendant - Université de Montréal

jack.richter-powell@mila.quebec

Google Scholar

Jama Mohamud

Doctorat - Université de Montréal

Superviseur⋅e principal⋅e :

Mirco Ravanelli

hussein-mohamu.jama@mila.quebec

Stagiaire de recherche - McGill University

jamal.abouhaibeh@mila.quebec

Github

Google Scholar

James Requeima

Visiteur de recherche indépendant - Université de Montréal

james.requeima@mila.quebec

Jarrid Rector-Brooks

Doctorat - Université de Montréal

Co-superviseur⋅e :

Sarath Chandar Anbil Parthipan

jarrid.rector-brooks@mila.quebec

Jean-pierre Falet

Doctorat - Université de Montréal

Co-superviseur⋅e :

Guillaume Lajoie

jean-pierre.falet@mila.quebec

Maîtrise professionnelle - Université de Montréal

jerome.francis@mila.quebec

Site web

Github

George Jiangyan Ma

Stagiaire de recherche - Université de Montréal

jiangyan.ma@mila.quebec

Doctorat - Université de Montréal

madankan@mila.quebec

Katie Everett

Doctorat - Massachusetts Institute of Technology

katie-elizabeth.everett@mila.quebec

Léna Ezzine

Doctorat - Université de Montréal

lena-nehale.ezzine@mila.quebec

Github

Leo Feng

Doctorat - Université de Montréal

leo.feng@mila.quebec

Site web

Google Scholar

Leon Hetzel

Visiteur de recherche indépendant - Technical University Munich (TUM)

leon.hetzel@mila.quebec

Site web

Github

Google Scholar

Ling Pan

Visiteur de recherche indépendant - Hong Kong University of Science and Technology (HKUST)

ling.pan@mila.quebec

Github

Loic Mandine

DESS - Université de Montréal

loic.mandine@mila.quebec

Loubna Benabbou

Visiteur de recherche indépendant - UQAR

loubna.benabbou@mila.quebec

Site web

Google Scholar

Luca Scimeca

Postdoctorat - Université de Montréal

luca.scimeca@mila.quebec

Doctorat - Université de Montréal

korablym@mila.quebec

Marcin Sendera

Stagiaire de recherche - Université de Montréal

marcin.sendera@mila.quebec

Github

Google Scholar

Marco STOCK

Visiteur de recherche indépendant - Technical University of Munich

marco.stock@mila.quebec

Github

Matt MacDermott

Stagiaire de recherche - Imperial College London

matthew.macdermott@mila.quebec

Site web

Github

Google Scholar

Mélisande Astrid Crystal Teng

Doctorat - Université de Montréal

Co-superviseur⋅e :

Postdoctorat - Université de Montréal

michal.koziarski@mila.quebec

Harry Zhao

Doctorat - McGill University

Superviseur⋅e principal⋅e :

Mingze Li

Maîtrise professionnelle - Université de Montréal

mingze2.li@mila.quebec

Minsu Kim

Collaborateur·rice de recherche - Université de Montréal

minsu.kim@mila.quebec

Stagiaire de recherche - Université de Montréal

mohammed.abukalam@mila.quebec

Github

Mohammed Mahfoud

Stagiaire de recherche - Université de Montréal

mohammed.mahfoud@mila.quebec

Mohsin Hasan

Doctorat - Université de Montréal

mohsin.hasan@mila.quebec

Site web

Github

Google Scholar

Moksh Jain

Doctorat - Université de Montréal

moksh.jain@mila.quebec

Doctorat - Max-Planck-Institute for Intelligent Systems

rahamann@mila.quebec

Nicole Zhang

Doctorat - McGill University

Superviseur⋅e principal⋅e :

Collaborateur·rice alumni - Université de Montréal

nikolay.malkin@mila.quebec

Doctorat - Université de Montréal

oussama.boussif@mila.quebec

Site web

Google Scholar

Param Raval

Maîtrise professionnelle - Université de Montréal

param.raval@mila.quebec

Paul Bertin

Doctorat - Université de Montréal

bertinpa@mila.quebec

Phong Nguyen

Visiteur de recherche indépendant - Université de Montréal

nguyenph@mila.quebec

Pierre-Paul De Breuck

Collaborateur·rice alumni - Université de Montréal

pierre-paul.de-breuck@mila.quebec

Collaborateur·rice de recherche

pietro.greiner@mila.quebec

Priya Nama Venkatesh

Maîtrise professionnelle - Université de Montréal

priya.nama@mila.quebec

Prudencio Tossou

Collaborateur·rice de recherche - Valence

Superviseur⋅e principal⋅e :

Dominique Beaini

prudencio.tossou@mila.quebec

Rim Assouel

Doctorat - Université de Montréal

assouelr@mila.quebec

Ruixiang Zhang

Doctorat - Université de Montréal

Superviseur⋅e principal⋅e :

Doctorat - Université de Montréal

lahlosal@mila.quebec

Sarthak Mittal

Doctorat - Université de Montréal

Superviseur⋅e principal⋅e :

Seanie Lee

Stagiaire de recherche - Université de Montréal

seanie.lee@mila.quebec

Maîtrise professionnelle

aasheesh.singh@mila.quebec

Collaborateur·rice de recherche - Université de Montréal

soren.mindermann@mila.quebec

Site web

Google Scholar

Stefan Bauer

Visiteur de recherche indépendant

Co-superviseur⋅e :

Guillaume Lajoie

stefan.bauer@mila.quebec

Google Scholar

Stefano Massaroli

Postdoctorat - Université de Montréal

stefano.massaroli@mila.quebec

Stephen Lu

Stagiaire de recherche - McGill University

stephen.lu@mila.quebec

Maîtrise professionnelle - Université de Montréal

subhrajyoti.dasgupta@mila.quebec

Site web

Github

Google Scholar

Theo Saulus

Collaborateur·rice de recherche

Superviseur⋅e principal⋅e :

David Rolnick

theo.saulus@mila.quebec

Thomas Jiralerspong

Maîtrise recherche - Université de Montréal

Co-superviseur⋅e :

Doina Precup

thomas.jiralerspong@mila.quebec

Doctorat - Université de Montréal

tianyu.zhang@mila.quebec

Doctorat - Université de Montréal

Vedant Shah

Maîtrise recherche - Université de Montréal

vedant.shah@mila.quebec

Doctorat - Université de Montréal

Todosijevic Viktor Todosijevic

Collaborateur·rice de recherche - RWTH Aachen University (Rheinisch-Westfälische Technische Hochschule Aachen)

Superviseur⋅e principal⋅e :

David Rolnick

viktor.todosijevic@mila.quebec

Github

Vincent Quirion

Baccalauréat - Université de Montréal

vincent.quirion@mila.quebec

Xiaoyin Chen

Doctorat - Université de Montréal

xiaoyin.chen@mila.quebec

Yashaswi Pupneja

Maîtrise professionnelle - Université de Montréal

yashaswi.pupneja@mila.quebec

Github

Yizhao Wang

Maîtrise professionnelle - Université de Montréal

yizhao.wang@mila.quebec

Younesse Kaddar

Stagiaire de recherche - Université de Montréal

younesse.kaddar@mila.quebec

Site web

Github

Zhen Liu

Doctorat - Université de Montréal

Superviseur⋅e principal⋅e :

Liam Paull

liuzhen@mila.quebec

Zibo Shang

Maîtrise professionnelle - Université de Montréal

zibo.shang@mila.quebec

Zichao Yan

Postdoctorat - Université de Montréal

yanzicha@mila.quebec

Billets de blogue

Generic thumbnail for Mila Blog articles.

22 février 2024

Skipper : combiner l’abstraction spatiale et temporelle afin d’améliorer la généralisation

par

Mingde Harry Zhao

Safa Alver

Harm van Seijen

Romain Laroche

Doina Precup

Yoshua Bengio

Lire l'article

Scaling in the service of reasoning & model-based ML

4 avril 2023

Mise à l’échelle au service du raisonnement et de l’apprentissage automatique basé sur un modèle

par

Yoshua Bengio

Edward J. Hu

Lire l'article

A collaboration between Mila and Relation Therapeutics to discover novel synergistic combinations of drugs in vitro

23 mars 2022

Une collaboration entre Mila et Relation Therapeutics pour découvrir in vitro de nouvelles associations médicamenteuses synergiques

par

Paul Bertin

Jake P. Taylor-King

Yoshua Bengio

Lire l'article

15 mars 2022

Les réseaux de flot génératifs

par

Yoshua Bengio

Lire l'article

Publications

hBERT + BiasCorp - Fighting Racism on the Web

Olawale Moses Onabola

Zhuang Ma

Xie Yang

Benjamin Akera

Ibraheem Abdulrahman

Jia Xue

Dianbo Liu

Yoshua Bengio

Subtle and overt racism is still present both in physical and online communities today and has impacted many lives in different segments of … (voir plus)the society. In this short piece of work, we present how we’re tackling this societal issue with Natural Language Processing. We are releasing BiasCorp, a dataset containing 139,090 comments and news segment from three specific sources - Fox News, BreitbartNews and YouTube. The first batch (45,000 manually annotated) is ready for publication. We are currently in the final phase of manually labeling the remaining dataset using Amazon Mechanical Turk. BERT has been used widely in several downstream tasks. In this work, we present hBERT, where we modify certain layers of the pretrained BERT model with the new Hopfield Layer. hBert generalizes well across different distributions with the added advantage of a reduced model complexity. We are also releasing a JavaScript library 3 and a Chrome Extension Application, to help developers make use of our trained model in web applications (say chat application) and for users to identify and report racially biased contents on the web respectively

2021-04-19

OpenReview.net/Archive (publié)

openreview.net

Neural Function Modules with Sparse Arguments: A Dynamic Approach to Integrating Information across Layers

Alex Lamb

Anirudh Goyal

A. Slowik

Michael Curtis Mozer

Philippe Beaudoin

Yoshua Bengio

Feed-forward neural networks consist of a sequence of layers, in which each layer performs some processing on the information from the previ… (voir plus)ous layer. A downside to this approach is that each layer (or module, as multiple modules can operate in parallel) is tasked with processing the entire hidden state, rather than a particular part of the state which is most relevant for that module. Methods which only operate on a small number of input variables are an essential part of most programming languages, and they allow for improved modularity and code re-usability. Our proposed method, Neural Function Modules (NFM), aims to introduce the same structural capability into deep learning. Most of the work in the context of feed-forward networks combining top-down and bottom-up feedback is limited to classification problems. The key contribution of our work is to combine attention, sparsity, top-down and bottom-up feedback, in a flexible algorithm which, as we show, improves the results in standard classification, out-of-domain generalization, generative modeling, and learning representations in the context of reinforcement learning.

2021-03-18

Proceedings of The 24th International Conference on Artificial Intelligence and Statistics (publié)

proceedings.mlr.press

arxiv.org

Predicting Infectiousness for Proactive Contact Tracing

Yoshua Bengio

Prateek Gupta

Tegan Maharaj

Nasim Rahaman

Martin Weiss

Tristan Deleu

Eilif Benjamin Muller

Meng Qu

Victor Schmidt

Pierre-Luc St-Charles

Hannah Alsdurf

Olexa Bilaniuk

David Buckeridge

gaetan caron

pierre luc carrier

Joumana Ghosn

satya ortiz gagne

Chris Pal

Irina Rish

Bernhard Schölkopf … (voir 3 de plus)

abhinav sharma

Jian Tang

andrew williams

The COVID-19 pandemic has spread rapidly worldwide, overwhelming manual contact tracing in many countries and resulting in widespread lockdo… (voir plus)wns for emergency containment. Large-scale digital contact tracing (DCT) has emerged as a potential solution to resume economic and social activity while minimizing spread of the virus. Various DCT methods have been proposed, each making trade-offs between privacy, mobility restrictions, and public health. The most common approach, binary contact tracing (BCT), models infection as a binary event, informed only by an individual's test results, with corresponding binary recommendations that either all or none of the individual's contacts quarantine. BCT ignores the inherent uncertainty in contacts and the infection process, which could be used to tailor messaging to high-risk individuals, and prompt proactive testing or earlier warnings. It also does not make use of observations such as symptoms or pre-existing medical conditions, which could be used to make more accurate infectiousness predictions. In this paper, we use a recently-proposed COVID-19 epidemiological simulator to develop and test methods that can be deployed to a smartphone to locally and proactively predict an individual's infectiousness (risk of infecting others) based on their contact history and other information, while respecting strong privacy constraints. Predictions are used to provide personalized recommendations to the individual via an app, as well as to send anonymized messages to the individual's contacts, who use this information to better predict their own infectiousness, an approach we call proactive contact tracing (PCT). We find a deep-learning based PCT method which improves over BCT for equivalent average mobility, suggesting PCT could help in safe re-opening and second-wave prevention.

2021-01-12

ICLR.cc/2021/Conference (spotlight)

openreview.net

An Analysis of the Adaptation Speed of Causal Models

Rémi LE PRIOL

Reza Babanezhad Harikandeh

Yoshua Bengio

Simon Lacoste-Julien

2021-01-01

AISTATS (publié)

proceedings.mlr.press

arxiv.org

Invariance Principle Meets Information Bottleneck for Out-of-Distribution Generalization

Kartik Ahuja

Ethan Caballero

Dinghuai Zhang

Jean-Christophe Gagnon-Audet

Yoshua Bengio

Ioannis Mitliagkas

Irina Rish

The invariance principle from causality is at the heart of notable approaches such as invariant risk minimization (IRM) that seek to address… (voir plus) out-of-distribution (OOD) generalization failures. Despite the promising theory, invariance principle-based approaches fail in common classification tasks, where invariant (causal) features capture all the information about the label. Are these failures due to the methods failing to capture the invariance? Or is the invariance principle itself insufficient? To answer these questions, we revisit the fundamental assumptions in linear regression tasks, where invariance-based approaches were shown to provably generalize OOD. In contrast to the linear regression tasks, we show that for linear classification tasks we need much stronger restrictions on the distribution shifts, or otherwise OOD generalization is impossible. Furthermore, even with appropriate restrictions on distribution shifts in place, we show that the invariance principle alone is insufficient. We prove that a form of the information bottleneck constraint along with invariance helps address key failures when invariant features capture all the information about the label and also retains the existing success when they do not. We propose an approach that incorporates both of these principles and demonstrate its effectiveness in several experiments.

openreview.net

Inductive biases for deep learning of higher-level cognition

Anirudh Goyal

Yoshua Bengio

A fascinating hypothesis is that human and animal intelligence could be explained by a few principles (rather than an encyclopaedic list of … (voir plus)heuristics). If that hypothesis was correct, we could more easily both understand our own intelligence and build intelligent machines. Just like in physics, the principles themselves would not be sufficient to predict the behaviour of complex systems like brains, and substantial computation might be needed to simulate human-like intelligence. This hypothesis would suggest that studying the kind of inductive biases that humans and animals exploit could help both clarify these principles and provide inspiration for AI research and neuroscience theories. Deep learning already exploits several key inductive biases, and this work considers a larger list, focusing on those which concern mostly higher-level and sequential conscious processing. The objective of clarifying these particular principles is that they could potentially help us build AI systems benefiting from humans’ abilities in terms of flexible out-of-distribution and systematic generalization, which is currently an area where a large gap exists between state-of-the-art machine learning and human intelligence.

2020-11-30

ArXiv (preprint)

doi.org

arxiv.org

Revisiting Fundamentals of Experience Replay

William Fedus

Prajit Ramachandran

Rishabh Agarwal

Yoshua Bengio

Hugo Larochelle

Mark Rowland

Will Dabney

Experience replay is central to off-policy algorithms in deep reinforcement learning (RL), but there remain significant gaps in our understa… (voir plus)nding. We therefore present a systematic and extensive analysis of experience replay in Q-learning methods, focusing on two fundamental properties: the replay capacity and the ratio of learning updates to experience collected (replay ratio). Our additive and ablative studies upend conventional wisdom around experience replay -- greater capacity is found to substantially increase the performance of certain algorithms, while leaving others unaffected. Counterintuitively we show that theoretically ungrounded, uncorrected n-step returns are uniquely beneficial while other techniques confer limited benefit for sifting through larger memory. Separately, by directly controlling the replay ratio we contextualize previous observations in the literature and empirically measure its importance across a variety of deep RL algorithms. Finally, we conclude by testing a set of hypotheses on the nature of these performance benefits.

2020-11-21

Proceedings of the 37th International Conference on Machine Learning (publié)

proceedings.mlr.press

arxiv.org

Neural Function Modules with Sparse Arguments: A Dynamic Approach to Integrating Information across Layers

Alex Lamb

Anirudh Goyal

A. Slowik

Michael Curtis Mozer

Philippe Beaudoin

Yoshua Bengio

2020-10-15

ArXiv (preprint)

arxiv.org

COVI-AgentSim: an Agent-based Model for Evaluating Methods of Digital Contact Tracing

Prateek Gupta

Tegan Maharaj

Martin Weiss

Nasim Rahaman

Hannah Alsdurf

abhinav sharma

Nanor Minoyan

Soren Harnois-Leblanc

Victor Schmidt

Pierre-Luc St-Charles

Tristan Deleu

andrew williams

Akshay Patel

Meng Qu

Olexa Bilaniuk

gaetan caron

pierre luc carrier

satya ortiz gagne

Marc-Andre Rousseau

David Buckeridge … (voir 9 de plus)

Joumana Ghosn

Yang Zhang

Bernhard Schölkopf

Jian Tang

Irina Rish

Chris Pal

Joanna Merckx

Eilif Benjamin Muller

Yoshua Bengio

2020-10-02

OpenReview.net/Anonymous_Preprint (inconnu)

openreview.net

A Large-Scale, Open-Domain, Mixed-Interface Dialogue-Based ITS for STEM

Iulian V. Serban

Varun Gupta

Ekaterina Kochmar

Dung D. Vu

Robert Belfer

2020-06-10

Artificial Intelligence in Education (publié)

doi.org

arxiv.org

An Analysis of the Adaptation Speed of Causal Models

Rémi LE PRIOL

Reza Babanezhad Harikandeh

Yoshua Bengio

Simon Lacoste-Julien

We consider the problem of discovering the causal process that generated a collection of datasets. We assume that all these datasets were ge… (voir plus)nerated by unknown sparse interventions on a structural causal model (SCM)

2020-05-18

ArXiv (preprint)

arxiv.org

COVI White Paper

Hannah Alsdurf

Yoshua Bengio

Tristan Deleu

Prateek Gupta

Daphne Ippolito

Richard Janda

Max Jarvie

Tyler J. Kolody

Sekoul Krastev

Tegan Maharaj

Robert Obryk

Dan Pilat

Valerie Pisano

Benjamin Prud'homme

Meng Qu

Nasim Rahaman

Irina Rish

Jean-franois Rousseau

abhinav sharma

Brooke Struck … (voir 3 de plus)

Jian Tang

Martin Weiss

Yun William Yu

2020-05-18

ArXiv (prépublication)

arxiv.org

La recherche en IA au service du monde réel

Boussole des politiques en IA

Vie étudiante et ressources

Yoshua Bengio

Biographie

Étudiants actuels

Billets de blogue

Publications

La recherche en IA au service du monde réel

Boussole des politiques en IA

Vie étudiante et ressources

Mots-clés populaires:

Yoshua Bengio

Biographie

Étudiants actuels

Billets de blogue

Publications