Yoshua Bengio

ahmad.ghawanmeh@mila.quebec

Biography

*For media requests, please write to medias@mila.quebec.

For more information please contact Julie Mongeau, executive assistant at julie.mongeau@mila.quebec.

Yoshua Bengio is recognized worldwide as a leading expert in AI. He is most known for his pioneering work in deep learning, which earned him the 2018 A.M. Turing Award, “the Nobel Prize of computing,” with Geoffrey Hinton and Yann LeCun.

Bengio is a full professor at Université de Montréal, and the founder and scientific director of Mila – Quebec Artificial Intelligence Institute. He is also a senior fellow at CIFAR and co-directs its Learning in Machines & Brains program, serves as scientific director of IVADO, and holds a Canada CIFAR AI Chair.

In 2019, Bengio was awarded the prestigious Killam Prize and in 2022, he was the most cited computer scientist in the world by h-index. He is a Fellow of the Royal Society of London, Fellow of the Royal Society of Canada, Knight of the Legion of Honor of France and Officer of the Order of Canada. In 2023, he was appointed to the UN’s Scientific Advisory Board for Independent Advice on Breakthroughs in Science and Technology.

Concerned about the social impact of AI, Bengio helped draft the Montréal Declaration for the Responsible Development of Artificial Intelligence and continues to raise awareness about the importance of mitigating the potentially catastrophic risks associated with future AI systems.

Current Students

Aayush Bajaj

Professional Master's - Université de Montréal

Co-supervisor :

Samira Ebrahimi Kahou

aayush.bajaj@mila.quebec

Ahmad Ghawanmeh

Professional Master's - Université de Montréal

Akram Erraqabi

PhD - Université de Montréal

akram.erraqabi@mila.quebec

Alex Hernandez-Garcia

Postdoctorate - Université de Montréal

Co-supervisor :

Postdoctorate - Université de Montréal

alexander.tong@mila.quebec

Sasha Volokhova

PhD - Université de Montréal

alexandra.volokhova@mila.quebec

Alexandre Duval

Collaborating researcher - Université Paris-Saclay

Principal supervisor :

alexandre.duval@mila.quebec

andres.campero@mila.quebec

Aman Dalmia

Professional Master's - Université de Montréal

aman.dalmia@mila.quebec

Andrés Campero

Independent visiting researcher - MIT

aniket.didolkar@mila.quebec

Aniket Didolkar

PhD - Université de Montréal

ayoub.atanane@mila.quebec

Anja Surina

PhD - École Polytechnique Montréal Fédérale de Lausanne

anja.surina@mila.quebec

Ayoub Atanane

Research Intern - Université du Québec à Rimouski

Basile Terver

Collaborating researcher

Principal supervisor :

basile.terver@mila.quebec

Camille Rochefort-Boulanger

PhD - Université de Montréal

Principal supervisor :

Julie Hussin

rochefoc@mila.quebec

clemence.granade@mila.quebec

Chen Chen

Postdoctorate - Université de Montréal

Co-supervisor :

Collaborating Alumni

Professional Master's - Université de Montréal

Cristian Meo

Collaborating Alumni

cristian.meo@mila.quebec

cristian-dragos.manta@mila.quebec

Cristian Dragos Manta

PhD - Université de Montréal

Co-supervisor :

Dhanya Sridhar

damiano.fornasiere@mila.quebec

Damiano Fornasiere

PhD - Barcelona University

Dan Assouline

Collaborating Alumni

dan.assouline@mila.quebec

dinghuai.zhang@mila.quebec

Dinghuai Zhang

PhD - Université de Montréal

Principal supervisor :

Aaron Courville

Divya Sharma

Collaborating Alumni

divya.sharma@mila.quebec

Donna Vakalis

Postdoctorate - Université de Montréal

Co-supervisor :

donna.vakalis@mila.quebec

dragos.secrieru@mila.quebec

Dragos Secrieru

Master's Research - Université de Montréal

Edward Hu

PhD - Université de Montréal

edward.hu@mila.quebec

Elmimouni Zakaria

Research Intern - Université de Montréal

zakarya.elmimouni@mila.quebec

eric.elmoznino@mila.quebec

Eric Elmoznino

PhD - Université de Montréal

Co-supervisor :

Guillaume Lajoie

Research Intern - UQAR

ghait.boukachab@mila.quebec

jack.richter-powell@mila.quebec

Hae-Beom Lee

Collaborating Alumni

hae-beom.lee@mila.quebec

Jessie Richter-Powell

Independent visiting researcher - Université de Montréal

hussein-mohamu.jama@mila.quebec

Jama Mohamud

PhD - Université de Montréal

Principal supervisor :

Mirco Ravanelli

Research Intern - McGill University

jamal.abouhaibeh@mila.quebec

james.requeima@mila.quebec

James Requeima

Independent visiting researcher - Université de Montréal

Jarrid Rector-Brooks

PhD - Université de Montréal

Co-supervisor :

Sarath Chandar Anbil Parthipan

jarrid.rector-brooks@mila.quebec

Jean-pierre Falet

PhD - Université de Montréal

Co-supervisor :

Guillaume Lajoie

jean-pierre.falet@mila.quebec

Professional Master's - Université de Montréal

jerome.francis@mila.quebec

katie-elizabeth.everett@mila.quebec

George Jiangyan Ma

Research Intern - Université de Montréal

jiangyan.ma@mila.quebec

PhD - Université de Montréal

madankan@mila.quebec

Katie Everett

PhD - Massachusetts Institute of Technology

Léna Ezzine

PhD - Université de Montréal

lena-nehale.ezzine@mila.quebec

Leo Feng

PhD - Université de Montréal

leo.feng@mila.quebec

Leon Hetzel

Independent visiting researcher - Technical University Munich (TUM)

leon.hetzel@mila.quebec

Ling Pan

Independent visiting researcher - Hong Kong University of Science and Technology (HKUST)

ling.pan@mila.quebec

loubna.benabbou@mila.quebec

Loic Mandine

DESS - Université de Montréal

loic.mandine@mila.quebec

Loubna Benabbou

Independent visiting researcher - UQAR

marcin.sendera@mila.quebec

Luca Scimeca

Postdoctorate - Université de Montréal

luca.scimeca@mila.quebec

PhD - Université de Montréal

korablym@mila.quebec

Marcin Sendera

Research Intern - Université de Montréal

Marco STOCK

Independent visiting researcher - Technical University of Munich

marco.stock@mila.quebec

matthew.macdermott@mila.quebec

Matt MacDermott

Research Intern - Imperial College London

Mélisande Astrid Crystal Teng

PhD - Université de Montréal

Co-supervisor :

Postdoctorate - Université de Montréal

michal.koziarski@mila.quebec

Harry Zhao

PhD - McGill University

Principal supervisor :

Mingze Li

Professional Master's - Université de Montréal

mingze2.li@mila.quebec

Minsu Kim

Collaborating researcher - Université de Montréal

minsu.kim@mila.quebec

Research Intern - Université de Montréal

mohammed.mahfoud@mila.quebec

Mohammed Abukalam

Research Intern - Université de Montréal

mohammed.abukalam@mila.quebec

Mohsin Hasan

PhD - Université de Montréal

mohsin.hasan@mila.quebec

nikolay.malkin@mila.quebec

Moksh Jain

PhD - Université de Montréal

moksh.jain@mila.quebec

PhD - Max-Planck-Institute for Intelligent Systems

rahamann@mila.quebec

Nicole Zhang

PhD - McGill University

Principal supervisor :

Collaborating Alumni - Université de Montréal

PhD - Université de Montréal

oussama.boussif@mila.quebec

pierre-paul.de-breuck@mila.quebec

Param Raval

Professional Master's - Université de Montréal

param.raval@mila.quebec

Paul Bertin

PhD - Université de Montréal

bertinpa@mila.quebec

Phong Nguyen

Independent visiting researcher - Université de Montréal

nguyenph@mila.quebec

Pierre-Paul De Breuck

Collaborating Alumni - Université de Montréal

Collaborating researcher

pietro.greiner@mila.quebec

Priya Nama Venkatesh

Professional Master's - Université de Montréal

priya.nama@mila.quebec

Prudencio Tossou

Collaborating researcher - Valence

Principal supervisor :

Dominique Beaini

prudencio.tossou@mila.quebec

Rim Assouel

PhD - Université de Montréal

assouelr@mila.quebec

Ruixiang Zhang

PhD - Université de Montréal

Principal supervisor :

PhD - Université de Montréal

lahlosal@mila.quebec

Sarthak Mittal

PhD - Université de Montréal

Principal supervisor :

Seanie Lee

Research Intern - Université de Montréal

seanie.lee@mila.quebec

Professional Master's

aasheesh.singh@mila.quebec

Collaborating researcher - Université de Montréal

soren.mindermann@mila.quebec

Stefan Bauer

Independent visiting researcher

Co-supervisor :

Guillaume Lajoie

stefan.bauer@mila.quebec

stefano.massaroli@mila.quebec

Stefano Massaroli

Postdoctorate - Université de Montréal

Stephen Lu

Research Intern - McGill University

stephen.lu@mila.quebec

Professional Master's - Université de Montréal

subhrajyoti.dasgupta@mila.quebec

Theo Saulus

Collaborating researcher

Principal supervisor :

thomas.jiralerspong@mila.quebec

theo.saulus@mila.quebec

Thomas Jiralerspong

Master's Research - Université de Montréal

Co-supervisor :

Doina Precup

PhD - Université de Montréal

tianyu.zhang@mila.quebec

PhD - Université de Montréal

Vedant Shah

Master's Research - Université de Montréal

vedant.shah@mila.quebec

PhD - Université de Montréal

Todosijevic Viktor Todosijevic

Collaborating researcher - RWTH Aachen University (Rheinisch-Westfälische Technische Hochschule Aachen)

Principal supervisor :

viktor.todosijevic@mila.quebec

vincent.quirion@mila.quebec

Vincent Quirion

Undergraduate - Université de Montréal

Xiaoyin Chen

PhD - Université de Montréal

xiaoyin.chen@mila.quebec

Yashaswi Pupneja

Professional Master's - Université de Montréal

yashaswi.pupneja@mila.quebec

younesse.kaddar@mila.quebec

Yizhao Wang

Professional Master's - Université de Montréal

yizhao.wang@mila.quebec

Younesse Kaddar

Research Intern - Université de Montréal

Skipper: Combining Spatial and Temporal Abstraction for Better Generalization

Zhen Liu

PhD - Université de Montréal

Principal supervisor :

Liam Paull

liuzhen@mila.quebec

Zibo Shang

Professional Master's - Université de Montréal

zibo.shang@mila.quebec

Zichao Yan

Postdoctorate - Université de Montréal

yanzicha@mila.quebec

Blog Posts

Generic thumbnail for Mila Blog articles.

February 22, 2024

Mingde Harry Zhao

Safa Alver

Harm van Seijen

Romain Laroche

Doina Precup

Yoshua Bengio

Scaling in the Service of Reasoning & Model-Based ML

April 4, 2023

Yoshua Bengio

Edward J. Hu

A collaboration between Mila and Relation Therapeutics to discover novel synergistic combinations of drugs in vitro

March 23, 2022

Paul Bertin

Jake P. Taylor-King

Yoshua Bengio

March 15, 2022

Generative Flow Networks

Yoshua Bengio

Publications

Unlearning via Sparse Representations

Vedant Shah

Frederik Träuble

Ashish Malik

Hugo Larochelle

Michael Curtis Mozer

Sanjeev Arora

Anirudh Goyal

2023-11-26

ArXiv (preprint)

Mitigating Biases with Diverse Ensembles and Diffusion Models

Luca Scimeca

Alexander Rubinstein

Damien Teney

Seong Joon Oh

Armand Nicolicioiu

Spurious correlations in the data, where multiple cues are predictive of the target labels, often lead to a phenomenon known as shortcut lea… (see more)rning, where a model relies on erroneous, easy-to-learn cues while ignoring reliable ones. In this work, we propose an ensemble diversification framework exploiting Diffusion Probabilistic Models (DPMs) to mitigate this form of bias. We show that at particular training intervals, DPMs can generate images with novel feature combinations, even when trained on samples displaying correlated input features. We leverage this crucial property to generate synthetic counterfactuals to increase model diversity via ensemble disagreement. We show that DPM-guided diversification is sufficient to remove dependence on primary shortcut cues, without a need for additional supervised signals. We further empirically quantify its efficacy on several diversification objectives, and finally show improved generalization and diversification performance on par with prior work that relies on auxiliary data collection.

2023-11-23

ArXiv (preprint)

arxiv.org

Learning from unexpected events in the neocortical microcircuit

Colleen J Gillon

Jason E. Pina

Jérôme A. Lecoq

Ruweida Ahmed

Yazan N. Billeh

Shiella Caldejon

Peter Groblewski

Timothy M. Henley

India Kato

Eric Lee

Jennifer Luviano

Kyla Mace

Chelsea Nayan

Thuyanh V. Nguyen

Kat North

Jed Perkins

Sam Seid

Matthew T. Valley

Ali Williford

Yoshua Bengio … (see 3 more)

Timothy P. Lillicrap

Blake Richards

Joel Zylberberg

2023-11-21

Journal of Neuroscience (published)

Responses to Pattern-Violating Visual Stimuli Evolve Differently Over Days in Somata and Distal Apical Dendrites

Colleen J Gillon

Jason E. Pina

Jérôme A. Lecoq

Ruweida Ahmed

Yazan N. Billeh

Shiella Caldejon

Peter Groblewski

Timothy M. Henley

India Kato

Eric Lee

Jennifer Luviano

Kyla Mace

Chelsea Nayan

Thuyanh V. Nguyen

Kat North

Jed Perkins

Sam Seid

Matthew T. Valley

Ali Williford

Yoshua Bengio … (see 3 more)

Timothy P. Lillicrap

Blake Richards

Joel Zylberberg

Scientists have long conjectured that the neocortex learns patterns in sensory data to generate top-down predictions of upcoming stimuli. In… (see more) line with this conjecture, different responses to pattern-matching vs pattern-violating visual stimuli have been observed in both spiking and somatic calcium imaging data. However, it remains unknown whether these pattern-violation signals are different between the distal apical dendrites, which are heavily targeted by top-down signals, and the somata, where bottom-up information is primarily integrated. Furthermore, it is unknown how responses to pattern-violating stimuli evolve over time as an animal gains more experience with them. Here, we address these unanswered questions by analyzing responses of individual somata and dendritic branches of layer 2/3 and layer 5 pyramidal neurons tracked over multiple days in primary visual cortex of awake, behaving female and male mice. We use sequences of Gabor patches with patterns in their orientations to create pattern-matching and pattern-violating stimuli, and two-photon calcium imaging to record neuronal responses. Many neurons in both layers show large differences between their responses to pattern-matching and pattern-violating stimuli. Interestingly, these responses evolve in opposite directions in the somata and distal apical dendrites, with somata becoming less sensitive to pattern-violating stimuli and distal apical dendrites more sensitive. These differences between the somata and distal apical dendrites may be important for hierarchical computation of sensory predictions and learning, since these two compartments tend to receive bottom-up and top-down information, respectively.

2023-11-21

Journal of Neuroscience (published)

SatBird: Bird Species Distribution Modeling with Remote Sensing and Citizen Science Data

Mélisande Teng

Amna Elmustafa

Benjamin Akera

Hager Radi

Hugo Larochelle

Biodiversity is declining at an unprecedented rate, impacting ecosystem services necessary to ensure food, water, and human health and well-… (see more)being. Understanding the distribution of species and their habitats is crucial for conservation policy planning. However, traditional methods in ecology for species distribution models (SDMs) generally focus either on narrow sets of species or narrow geographical areas and there remain significant knowledge gaps about the distribution of species. A major reason for this is the limited availability of data traditionally used, due to the prohibitive amount of effort and expertise required for traditional field monitoring. The wide availability of remote sensing data and the growing adoption of citizen science tools to collect species observations data at low cost offer an opportunity for improving biodiversity monitoring and enabling the modelling of complex ecosystems. We introduce a novel task for mapping bird species to their habitats by predicting species encounter rates from satellite images, and present SatBird, a satellite dataset of locations in the USA with labels derived from presence-absence observation data from the citizen science database eBird, considering summer (breeding) and winter seasons. We also provide a dataset in Kenya representing low-data regimes. We additionally provide environmental data and species range maps for each location. We benchmark a set of baselines on our dataset, including SOTA models for remote sensing tasks. SatBird opens up possibilities for scalably modelling properties of ecosystems worldwide.

2023-11-02

ArXiv (preprint)

arxiv.org

Generative AI models should include detection mechanisms as a condition for public release

Alistair Knott

Dino Pedreschi

Raja Chatila

Tapabrata Chakraborti

Susan Leavy

Ricardo Baeza-Yates

D. Eyers

Andrew Trotman

Paul D. Teal

Przemyslaw Biecek

Stuart Russell

2023-10-28

Ethics and Information Technology (published)

OC-NMN: Object-centric Compositional Neural Module Network for Generative Visual Analogical Reasoning

Rim Assouel

Pau Rodriguez

Perouz Taslakian

David Vazquez

2023-10-28

ArXiv (preprint)

arxiv.org

Attention Schema in Neural Agents

Dianbo Liu

Samuele Bolotta

Mike He Zhu

Zahra Sheikhbahaee

Guillaume Dumas

Attention has become a common ingredient in deep learning architectures. It adds a dynamical selection of information on top of the static s… (see more)election of information supported by weights. In the same way, we can imagine a higher-order informational filter built on top of attention: an Attention Schema (AS), namely, a descriptive and predictive model of attention. In cognitive neuroscience, Attention Schema Theory (AST) supports this idea of distinguishing attention from AS. A strong prediction of this theory is that an agent can use its own AS to also infer the states of other agents' attention and consequently enhance coordination with other agents. As such, multi-agent reinforcement learning would be an ideal setting to experimentally test the validity of AST. We explore different ways in which attention and AS interact with each other. Our preliminary results indicate that agents that implement the AS as a recurrent internal control achieve the best performance. In general, these exploratory experiments suggest that equipping artificial agents with a model of attention can enhance their social intelligence.

2023-10-27

NeurIPS.cc/2023/Workshop/InfoCog (poster)

Baking Symmetry into GFlowNets

George Ma

Emmanuel Bengio

Dinghuai Zhang

GFlowNets have exhibited promising performance in generating diverse candidates with high rewards. These networks generate objects increment… (see more)ally and aim to learn a policy that assigns probability of sampling objects in proportion to rewards. However, the current training pipelines of GFlowNets do not consider the presence of isomorphic actions, which are actions resulting in symmetric or isomorphic states. This lack of symmetry increases the amount of samples required for training GFlowNets and can result in inefficient and potentially incorrect flow functions. As a consequence, the reward and diversity of the generated objects decrease. In this study, our objective is to integrate symmetries into GFlowNets by identifying equivalent actions during the generation process. Experimental results using synthetic data demonstrate the promising performance of our proposed approaches.

2023-10-27

NeurIPS.cc/2023/Workshop/AI4Science (oral)

Baking Symmetry into GFlowNets

George Ma

Emmanuel Bengio

Dinghuai Zhang

2023-10-27

NeurIPS.cc/2023/Workshop/AI4Science (oral)

Causal Discovery in Gene Regulatory Networks with GFlowNet: Towards Scalability in Large Systems

Trang Nguyen

Alexander Tong

Kanika Madan

Dianbo Liu

Understanding causal relationships within Gene Regulatory Networks (GRNs) is essential for unraveling the gene interactions in cellular proc… (see more)esses. However, causal discovery in GRNs is a challenging problem for multiple reasons including the existence of cyclic feedback loops and uncertainty that yields diverse possible causal structures. Previous works in this area either ignore cyclic dynamics (assume acyclic structure) or struggle with scalability. We introduce Swift-DynGFN as a novel framework that enhances causal structure learning in GRNs while addressing scalability concerns. Specifically, Swift-DynGFN exploits gene-wise independence to boost parallelization and to lower computational cost. Experiments on real single-cell RNA velocity and synthetic GRN datasets showcase the advancement in learning causal structure in GRNs and scalability in larger systems.

2023-10-27

NeurIPS.cc/2023/Workshop/GenBio (poster)

Crystal-GFN: sampling materials with desirable properties and constraints

Mistal

Alex Hernandez-Garcia

Alexandra Volokhova

Alexandre AGM Duval

Divya Sharma

pierre luc carrier

Michał Koziarski

Victor Schmidt

2023-10-27

NeurIPS.cc/2023/Workshop/AI4Mat (spotlight)