Yoshua Bengio

Core Academic Member

Canada CIFAR AI Chair

Full Professor, Université de Montréal, Department of Computer Science and Operations Research

Scientific Director, Leadership Team

Observer, Board of Directors, Mila

Website

Google Scholar

Biography

*For media requests, please write to medias@mila.quebec.

For more information, contact Julie Mongeau, executive assistant, at julie.mongeau@mila.quebec.

Recognized worldwide as a leading expert in artificial intelligence, Yoshua Bengio is best known for his pioneering work in deep learning, which earned him the 2018 A.M. Turing Award, often called the "Nobel Prize of computing," with Geoffrey Hinton and Yann LeCun. He is a Full Professor at Université de Montréal, the founder and scientific director of Mila – Quebec Artificial Intelligence Institute, and co-directs, as a Senior Fellow, the Learning in Machines & Brains program of the Canadian Institute for Advanced Research (CIFAR). He also serves as scientific director of IVADO.

In 2018, he was the computer scientist who collected the largest number of new citations worldwide. In 2019, he was awarded the prestigious Killam Prize. Since 2022, he has had the highest h-index of any computer scientist in the world. He is a Fellow of the Royal Society of London and of the Royal Society of Canada, and an Officer of the Order of Canada.

Concerned about the social impact of AI and committed to the goal that AI benefit everyone, he actively contributed to the Montreal Declaration for the Responsible Development of Artificial Intelligence.

Current Students

Singh Aasheesh

Professional Master's

aasheesh.singh@mila.quebec

Research Intern - McGill

jamal.abouhaibeh@mila.quebec

GitHub

Google Scholar

Mohammed Abukalam

Research Intern - UdeM

mohammed.abukalam@mila.quebec

GitHub

Rim Assouel

PhD - UdeM

assouelr@mila.quebec

Dan Assouline

Alumni Collaborator

dan.assouline@mila.quebec

GitHub

Google Scholar

Ayoub Atanane

Research Intern - Université du Québec à Rimouski

ayoub.atanane@mila.quebec

GitHub

Aayush Bajaj

Professional Master's - UdeM

aayush.bajaj@mila.quebec

Stefan Bauer

Independent Visiting Researcher

Co-supervisor:

Guillaume Lajoie

stefan.bauer@mila.quebec

Google Scholar

Loubna Benabbou

Independent Visiting Researcher - UQAR

loubna.benabbou@mila.quebec

Website

Google Scholar

Paul Bertin

PhD - UdeM

bertinpa@mila.quebec

Ghait Boukachab

Research Intern - UQAR

ghait.boukachab@mila.quebec

GitHub

Oussama Boussif

PhD - UdeM

oussama.boussif@mila.quebec

Independent Visiting Researcher - MIT

andres.campero@mila.quebec

Website

Xiaoyin Chen

PhD - UdeM

xiaoyin.chen@mila.quebec

Chen Chen

Postdoctorate - UdeM

Co-supervisor:

Blake Richards

chen.sun@mila.quebec

Aman Dalmia

Professional Master's - UdeM

aman.dalmia@mila.quebec

Subhrajyoti Dasgupta

Professional Master's - UdeM

subhrajyoti.dasgupta@mila.quebec

Website

GitHub

Google Scholar

Pierre-Paul De Breuck

Alumni Collaborator - UdeM

pierre-paul.de-breuck@mila.quebec

PhD - UdeM

PhD - UdeM

aniket.didolkar@mila.quebec

Research Collaborator - Université Paris-Saclay

Principal supervisor:

David Rolnick

alexandre.duval@mila.quebec

Website

Eric Elmoznino

PhD - UdeM

Co-supervisor:

Guillaume Lajoie

eric.elmoznino@mila.quebec

PhD - UdeM

akram.erraqabi@mila.quebec

Katie Everett

PhD - Massachusetts Institute of Technology

katie-elizabeth.everett@mila.quebec

Léna Nehale Ezzine

PhD - UdeM

lena-nehale.ezzine@mila.quebec

Jean-Pierre Falet

PhD - UdeM

Co-supervisor:

Guillaume Lajoie

jean-pierre.falet@mila.quebec

Website

GitHub

Google Scholar

Leo Feng

PhD - UdeM

PhD - Barcelona University

damiano.fornasiere@mila.quebec

Jerome Francis

Professional Master's - UdeM

jerome.francis@mila.quebec

Website

GitHub

Piotr Gainski

Research Intern - UdeM

piotr.gainski@mila.quebec

GitHub

Ahmad Ghawanmeh

Professional Master's - UdeM

ahmad.ghawanmeh@mila.quebec

Clemence Granade

Professional Master's - UdeM

clemence.granade@mila.quebec

Website

GitHub

Pietro Greiner

Research Collaborator

pietro.greiner@mila.quebec

Mohsin Hasan

PhD - UdeM

mohsin.hasan@mila.quebec

Website

GitHub

Google Scholar

Alex Hernandez-Garcia

Postdoctorate - UdeM

Co-supervisor:

Leon Hetzel

Independent Visiting Researcher - Technical University Munich (TUM)

leon.hetzel@mila.quebec

Website

GitHub

Google Scholar

Edward Hu

PhD - UdeM

edward.hu@mila.quebec

Moksh Jain

PhD - UdeM

moksh.jain@mila.quebec

Research Intern - UdeM

jiangyan.ma@mila.quebec

Research Master's - UdeM

Co-supervisor:

Doina Precup

thomas.jiralerspong@mila.quebec

Research Intern - UdeM

younesse.kaddar@mila.quebec

Website

GitHub

Minsu Kim

Research Collaborator - UdeM

minsu.kim@mila.quebec

PhD - UdeM

Postdoctorate - UdeM

michal.koziarski@mila.quebec

Salem Lahlou

PhD - UdeM

lahlosal@mila.quebec

Hae-Beom Lee

Alumni Collaborator

hae-beom.lee@mila.quebec

Seanie Lee

Research Intern - UdeM

seanie.lee@mila.quebec

Website

GitHub

Google Scholar

Mingze Li

Professional Master's - UdeM

mingze2.li@mila.quebec

Chenghao Liu

Alumni Collaborator

Zhen Liu

PhD - UdeM

Principal supervisor:

Liam Paull

liuzhen@mila.quebec

Stephen Lu

Research Intern - McGill

stephen.lu@mila.quebec

Research Intern - Imperial College London

matthew.macdermott@mila.quebec

PhD - UdeM

Research Intern - UdeM

mohammed.mahfoud@mila.quebec

Nikolay Malkin

Alumni Collaborator - UdeM

nikolay.malkin@mila.quebec

Graduate Diploma (DESS) - UdeM

loic.mandine@mila.quebec

Cristian Dragos Manta

PhD - UdeM

Co-supervisor:

Dhanya Sridhar

cristian-dragos.manta@mila.quebec

GitHub

Stefano Massaroli

Postdoctorate - UdeM

stefano.massaroli@mila.quebec

Cristian Meo

Alumni Collaborator

cristian.meo@mila.quebec

GitHub

Google Scholar

Sören Mindermann

Research Collaborator - UdeM

soren.mindermann@mila.quebec

Website

Google Scholar

Sarthak Mittal

PhD - UdeM

Principal supervisor:

PhD - UdeM

Principal supervisor:

Mirco Ravanelli

hussein-mohamu.jama@mila.quebec

Professional Master's - UdeM

priya.nama@mila.quebec

Phong Nguyen

Independent Visiting Researcher - UdeM

nguyenph@mila.quebec

Ling Pan

Independent Visiting Researcher - Hong Kong University of Science and Technology (HKUST)

ling.pan@mila.quebec

GitHub

Ali Parviz

Research Collaborator - Ying Wu College of Computing

ali.parviz@mila.quebec

Google Scholar

Yashaswi Pupneja

Professional Master's - UdeM

yashaswi.pupneja@mila.quebec

GitHub

Vincent Quirion

Bachelor's - UdeM

vincent.quirion@mila.quebec

Nassim Rahaman

PhD - Max-Planck-Institute for Intelligent Systems

rahamann@mila.quebec

Param Raval

Professional Master's - UdeM

param.raval@mila.quebec

Jarrid Rector-Brooks

PhD - UdeM

Co-supervisor:

Sarath Chandar Anbil Parthipan

jarrid.rector-brooks@mila.quebec

James Requeima

Independent Visiting Researcher - UdeM

james.requeima@mila.quebec

Jessie Richter-Powell

Independent Visiting Researcher - UdeM

jack.richter-powell@mila.quebec

Google Scholar

Camille Rochefort-Boulanger

PhD - UdeM

Principal supervisor:

Julie Hussin

rochefoc@mila.quebec

GitHub

Theo Saulus

Research Collaborator

Principal supervisor:

David Rolnick

theo.saulus@mila.quebec

PhD - UdeM

Postdoctorate - UdeM

luca.scimeca@mila.quebec

Research Master's - UdeM

dragos.secrieru@mila.quebec

Marcin Sendera

Research Intern - UdeM

marcin.sendera@mila.quebec

GitHub

Google Scholar

Vedant Shah

Research Master's - UdeM

vedant.shah@mila.quebec

Website

GitHub

Google Scholar

Zibo Shang

Professional Master's - UdeM

zibo.shang@mila.quebec

Divya Sharma

Alumni Collaborator

divya.sharma@mila.quebec

Website

GitHub

Marco Stock

Independent Visiting Researcher - Technical University of Munich

marco.stock@mila.quebec

GitHub

Anja Surina

PhD - École Polytechnique Fédérale de Lausanne

anja.surina@mila.quebec

Mélisande Astrid Crystal Teng

PhD - UdeM

Co-supervisor:

Research Collaborator

Principal supervisor:

David Rolnick

basile.terver@mila.quebec

Alexander Tong

Postdoctorate - UdeM

alexander.tong@mila.quebec

Prudencio Tossou

Research Collaborator - Valence

Principal supervisor:

Dominique Beaini

prudencio.tossou@mila.quebec

Donna Vakalis

Postdoctorate - UdeM

Co-supervisor:

David Rolnick

donna.vakalis@mila.quebec

GitHub

Google Scholar

Viktor Todosijevic

Research Collaborator - RWTH Aachen University (Rheinisch-Westfälische Technische Hochschule Aachen)

Principal supervisor:

David Rolnick

viktor.todosijevic@mila.quebec

GitHub

Sasha Volokhova

PhD - UdeM

alexandra.volokhova@mila.quebec

Yizhao Wang

Professional Master's - UdeM

yizhao.wang@mila.quebec

Zichao Yan

Alumni Collaborator - UdeM

yanzicha@mila.quebec

Elmimouni Zakaria

Research Intern - UdeM

zakarya.elmimouni@mila.quebec

GitHub

Nicole Zhang

PhD - McGill

Principal supervisor:

PhD - UdeM

Principal supervisor:

Aaron Courville

dinghuai.zhang@mila.quebec

Website

Ruixiang Zhang

PhD - UdeM

Principal supervisor:

PhD - UdeM

tianyu.zhang@mila.quebec

Website

GitHub

Google Scholar

Harry Zhao

PhD - McGill

Principal supervisor:

Blog Posts

February 22, 2024

Skipper: Combining spatial and temporal abstraction to improve generalization

by

Mingde Harry Zhao

Safa Alver

Harm van Seijen

Romain Laroche

Doina Precup

Yoshua Bengio

Read the article

April 4, 2023

Scaling in the service of reasoning & model-based ML

by

Yoshua Bengio

Edward J. Hu

Read the article

March 23, 2022

A collaboration between Mila and Relation Therapeutics to discover novel synergistic combinations of drugs in vitro

by

Paul Bertin

Jake P. Taylor-King

Yoshua Bengio

Read the article

March 15, 2022

Generative Flow Networks

by

Yoshua Bengio

Read the article

Publications

Managing AI Risks in an Era of Rapid Progress

Yoshua Bengio

Geoffrey Hinton

Andrew Yao

Dawn Song

Pieter Abbeel

Yuval Noah Harari

Trevor Darrell

Ya-Qin Zhang

Lan Xue

Shai Shalev-Shwartz

Gillian K. Hadfield

Jeff Clune

Tegan Maharaj

Frank Hutter

Atilim Güneş Baydin

Sheila McIlraith

Qiqi Gao

Ashwin Acharya

David Scott Krueger

Anca Dragan

Philip Torr

Stuart Russell

Daniel Kahneman

Jan Brauner

Sören Mindermann

2024-05-24

Science (published)

doi.org

arxiv.org

Iterated Denoising Energy Matching for Sampling from Boltzmann Densities

Tara Akhound-Sadegh

Jarrid Rector-Brooks

Joey Bose

Sarthak Mittal

Pablo Lemos

Cheng-Hao Liu

Marcin Sendera

Siamak Ravanbakhsh

Gauthier Gidel

Yoshua Bengio

Nikolay Malkin

Alexander Tong

Efficiently generating statistically independent samples from an unnormalized probability distribution, such as equilibrium samples of many-body systems, is a foundational problem in science. In this paper, we propose Iterated Denoising Energy Matching (iDEM), an iterative algorithm that uses a novel stochastic score matching objective leveraging solely the energy function and its gradient, and no data samples, to train a diffusion-based sampler. Specifically, iDEM alternates between (I) sampling regions of high model density from a diffusion-based sampler and (II) using these samples in our stochastic matching objective to further improve the sampler. iDEM is scalable to high dimensions as the inner matching objective, is *simulation-free*, and requires no MCMC samples. Moreover, by leveraging the fast mode mixing behavior of diffusion, iDEM smooths out the energy landscape enabling efficient exploration and learning of an amortized sampler. We evaluate iDEM on a suite of tasks ranging from standard synthetic energy functions to invariant

2024-05-01

ICML.cc/2024/Conference (poster)

doi.org

openreview.net

Learning to Scale Logits for Temperature-Conditional GFlowNets

Minsu Kim

Joohwan Ko

Dinghuai Zhang

Ling Pan

Taeyoung Yun

Woo Chang Kim

Jinkyoo Park

Emmanuel Bengio

Yoshua Bengio

GFlowNets are probabilistic models that sequentially generate compositional structures through a stochastic policy. Among GFlowNets, temperature-conditional GFlowNets can introduce temperature-based controllability for exploration and exploitation. We propose *Logit-scaling GFlowNets* (Logit-GFN), a novel architectural design that greatly accelerates the training of temperature-conditional GFlowNets. It is based on the idea that previously proposed approaches introduced numerical challenges in the deep network training, since different temperatures may give rise to very different gradient profiles as well as magnitudes of the policy's logits. We find that the challenge is greatly reduced if a learned function of the temperature is used to scale the policy's logits directly. Also, using Logit-GFN, GFlowNets can be improved by having better generalization capabilities in offline learning and mode discovery capabilities in online learning, which is empirically verified in various biological and chemical tasks. Our code is available at https://github.com/dbsxodud-11/logit-gfn

2024-05-01

ICML.cc/2024/Conference (poster)

doi.org

openreview.net

Memory Efficient Neural Processes via Constant Memory Attention Block

Leo Feng

Frederick Tung

Hossein Hajimirsadeghi

Yoshua Bengio

Mohamed Osama Ahmed

Neural Processes (NPs) are popular meta-learning methods for efficiently modelling predictive uncertainty. Recent state-of-the-art methods, however, leverage expensive attention mechanisms, limiting their applications, particularly in low-resource settings. In this work, we propose Constant Memory Attention Block (CMAB), a novel general-purpose attention block that (1) is permutation invariant, (2) computes its output in constant memory, and (3) performs updates in constant computation. Building on CMAB, we propose Constant Memory Attentive Neural Processes (CMANPs), an NP variant which only requires *constant* memory. Empirically, we show CMANPs achieve state-of-the-art results on popular NP benchmarks (meta-regression and image completion) while being significantly more memory efficient than prior methods.

2024-05-01

ICML.cc/2024/Conference (poster)

openreview.net

Discrete Probabilistic Inference as Control in Multi-path Environments

Tristan Deleu

Padideh Nouri

Nikolay Malkin

Doina Precup

Yoshua Bengio

We consider the problem of sampling from a discrete and structured distribution as a sequential decision problem, where the objective is to find a stochastic policy such that objects are sampled at the end of this sequential process proportionally to some predefined reward. While we could use maximum entropy Reinforcement Learning (MaxEnt RL) to solve this problem for some distributions, it has been shown that in general, the distribution over states induced by the optimal policy may be biased in cases where there are multiple ways to generate the same object. To address this issue, Generative Flow Networks (GFlowNets) learn a stochastic policy that samples objects proportionally to their reward by approximately enforcing a conservation of flows across the whole Markov Decision Process (MDP). In this paper, we extend recent methods correcting the reward in order to guarantee that the marginal distribution induced by the optimal MaxEnt RL policy is proportional to the original reward, regardless of the structure of the underlying MDP. We also prove that some flow-matching objectives found in the GFlowNet literature are in fact equivalent to well-established MaxEnt RL algorithms with a corrected reward. Finally, we study empirically the performance of multiple MaxEnt RL and GFlowNet algorithms on multiple problems involving sampling from discrete distributions.

2024-04-26

auai.org/UAI/2024/Conference (poster)

doi.org

openreview.net

Foundational Challenges in Assuring Alignment and Safety of Large Language Models

Usman Anwar

Abulhair Saparov

Javier Rando

Daniel Paleka

Miles Turpin

Peter Hase

Ekdeep Singh Lubana

Erik Jenner

Stephen Casper

Oliver Sourbut

Benjamin L. Edelman

Zhaowei Zhang

Mario Gunther

Anton Korinek

Jose Hernandez-Orallo

Lewis Hammond

Eric J Bigelow

Alexander Pan

Lauro Langosco

Tomasz Korbak

Heidi Zhang

Ruiqi Zhong

Seán Ó hÉigeartaigh

Gabriel Recchia

Giulio Corsi

Alan Chan

Markus Anderljung

Lilian Edwards

Yoshua Bengio

Danqi Chen

Samuel Albanie

Tegan Maharaj

Jakob Nicolaus Foerster

Florian Tramèr

He He

Atoosa Kasirzadeh

Yejin Choi

David Scott Krueger

This work identifies 18 foundational challenges in assuring the alignment and safety of large language models (LLMs). These challenges are organized into three different categories: scientific understanding of LLMs, development and deployment methods, and sociotechnical challenges. Based on the identified challenges, we pose

2024-04-15

arXiv (preprint)

doi.org

arxiv.org

Government Interventions to Avert Future Catastrophic AI Risks

Yoshua Bengio

2024-04-15

Special Issue 5: Grappling With the Generative AI Revolution (published)

doi.org

Regulating advanced artificial agents

Michael K. Cohen

Noam Kolt

Yoshua Bengio

Gillian K. Hadfield

Stuart Russell

2024-04-05

Science (published)

doi.org

Language Models Can Reduce Asymmetry in Information Markets

Nasim Rahaman

Martin Weiss

Manuel Wüthrich

Yoshua Bengio

Erran L. Li

Chris Pal

Bernhard Schölkopf

2024-03-21

arXiv (preprint)

doi.org

arxiv.org

Ant Colony Sampling with GFlowNets for Combinatorial Optimization

Minsu Kim

Sanghyeok Choi

Jiwoo Son

Hyeon-Seob Kim

Jinkyoo Park

Yoshua Bengio

2024-03-11

arXiv (preprint)

doi.org

arxiv.org

Improving and Generalizing Flow-Based Generative Models with Minibatch Optimal Transport

Alexander Tong

Nikolay Malkin

Guillaume Huguet

Yanlei Zhang

Jarrid Rector-Brooks

Kilian Fatras

Guy Wolf

Yoshua Bengio

Continuous normalizing flows (CNFs) are an attractive generative modeling technique, but they have been held back by limitations in their simulation-based maximum likelihood training. We introduce the generalized *conditional flow matching* (CFM) technique, a family of simulation-free training objectives for CNFs. CFM features a stable regression objective like that used to train the stochastic flow in diffusion models but enjoys the efficient inference of deterministic flow models. In contrast to both diffusion models and prior CNF training algorithms, CFM does not require the source distribution to be Gaussian or require evaluation of its density. A variant of our objective is optimal transport CFM (OT-CFM), which creates simpler flows that are more stable to train and lead to faster inference, as evaluated in our experiments. Furthermore, OT-CFM is the first method to compute dynamic OT in a simulation-free way. Training CNFs with CFM improves results on a variety of conditional and unconditional generation tasks, such as inferring single cell dynamics, unsupervised image translation, and Schrödinger bridge inference.

2024-03-11

TMLR (accepted)

openreview.net

Integrating Generative and Experimental Platforms for Biomolecular Design

Cheng-Hao Liu

Jarrid Rector-Brooks

Jason Yim

Soojung Yang

Sidney Lisanza

Francesca-Zhoufan Li

Pranam Chatterjee

Tommi Jaakkola

Regina Barzilay

David Baker

Frances H. Arnold

Yoshua Bengio

2024-03-08

ICLR.cc/2024/Workshop_Proposals (published)

openreview.net
