Publications

Current State and Future Directions for Learning in Biological Recurrent Neural Networks: A Perspective Piece

Luke Y. Prince

Roy Henha Eyono

Ellen Boven

Arna Ghosh

Joseph Pemberton

Franz Scherr

Claudia Clopath

Rui Ponte Costa

Wolfgang Maass

Blake Richards

Cristina Savin

Katharina Wilmes

We provide a brief review of the common assumptions about biological learning with findings from experimental neuroscience and contrast them… (see more) with the efficiency of gradient-based learning in recurrent neural networks. The key issues discussed in this review include: synaptic plasticity, neural circuits, theory-experiment divide, and objective functions. We conclude with recommendations for both theoretical and experimental neuroscientists when designing new studies that could help bring clarity to these issues.

2022-04-27

Neurons, Behavior, Data analysis, and Theory (published)

doi.org

arxiv.org

Don't Freeze Your Embedding: Lessons from Policy Finetuning in Environment Transfer

Victoria Dean

Daniel Toyama

Doina Precup

A common occurrence in reinforcement learning (RL) research is making use of a pretrained vision stack that converts image observations to l… (see more)atent vectors. Using a visual embedding in this way leaves open questions, though: should the vision stack be updated with the policy? In this work, we evaluate the effectiveness of such decisions in RL transfer settings. We introduce policy update formulations for use after pretraining in a different environment and analyze the performance of such formulations. Through this evaluation, we also detail emergent metrics of benchmark suites and present results on Atari and AndroidEnv.

2022-04-27

ICLR.cc/2022/Workshop/GPL (poster)

openreview.net

A Probabilistic Perspective on Reinforcement Learning via Supervised Learning

Alexandre Piché

Rafael Pardinas

David Vazquez

Chris Pal

2022-04-27

ICLR.cc/2022/Workshop/GPL (poster)

openreview.net

Accepted Tutorials at The Web Conference 2022

Riccardo Tommasini

Senjuti Basu Roy

Xuan Wang

Hongwei Wang

Heng Ji

Jiawei Han

Preslav Nakov

Giovanni Da San Martino

Firoj Alam

Markus Schedl

Elisabeth Lex

Akash Bharadwaj

Graham Cormode

Milan Dojchinovski

Jan Forberg

Johannes Frey

Pieter Bonte

Marco Balduini

Matteo Belcao

Emanuele Della Valle … (see 53 more)

Junliang Yu

Hongzhi Yin

Tong Chen

Haochen Liu

Yiqi Wang

Wenqi Fan

Xiaorui Liu

Jamell Dacon

Lingjuan Lye

Jiliang Tang

Aristides Gionis

Stefan Neumann

Bruno Ordozgoiti

Simon Razniewski

Hiba Arnaout

Shrestha Ghosh

Fabian Suchanek

Lingfei Wu

Yu Chen

Yunyao Li

Bang Liu

Filip Ilievski

Daniel Garijo

Hans Chalupsky

Pedro Szekely

Ilias Kanellos

Dimitris Sacharidis

Thanasis Vergoulis

Nurendra Choudhary

Nikhil Rao

Karthik Subbian

Srinivasan Sengamedu

Chandan K. Reddy

Friedhelm Victor

Bernhard Haslhofer

George Katsogiannis- Meimarakis

Georgia Koutrika

Shengmin Jin

Danai Koutra

Reza Zafarani

Yulia Tsvetkov

Vidhisha Balachandran

Sachin Kumar

Xiangyu Zhao

Bo Chen

Huifeng Guo

Yejing Wang

Ruiming Tang

Yang Zhang

Wenjie Wang

Peng Wu

Fuli Feng

Xiangnan He

This paper summarizes the content of the 20 tutorials that have been given at The Web Conference 2022: 85% of these tutorials are lecture st… (see more)yle, and 15% of these are hands on.

2022-04-25

The Web Conference (published)

doi.org

I NTRODUCING C OORDINATION IN C ONCURRENT R EIN - FORCEMENT L EARNING

Adrien Ali Taiga

Aaron Courville

Marc Gendron-Bellemare

Google Brain

Research on exploration in reinforcement learning has mostly focused on problems with a single agent interacting with an environment. Howeve… (see more)r many problems are better addressed by the concurrent reinforcement learning paradigm, where multiple agents operate in a common environment. Recent work has tackled the challenge of exploration in this particular setting (Dimakopoulou & Van Roy, 2018; Dimakopoulou et al., 2018). Nonetheless, they do not completely leverage the characteristics of this framework and agents end up behaving independently from each other. In this work we argue that coordination among concurrent agents is crucial for efficient exploration. We introduce coordination in Thompson Sampling based methods by drawing correlated samples from an agent’s posterior. We apply this idea to extend existing exploration schemes such as randomized least squares value iteration (RLSVI). Empirical results on simple toy tasks emphasize the merits of our approach and call attention to coordination as a key objective for efficient exploration in concurrent reinforcement learning.

2022-04-25

ICLR.cc/2022/Workshop/GMS (published)

openreview.net

Offline Retrieval Evaluation Without Evaluation Metrics

Fernando Diaz

Andres Ferraro

Offline evaluation of information retrieval and recommendation has traditionally focused on distilling the quality of a ranking into a scala… (see more)r metric such as average precision or normalized discounted cumulative gain. We can use this metric to compare the performance of multiple systems for the same request. Although evaluation metrics provide a convenient summary of system performance, they also collapse subtle differences across users into a single number and can carry assumptions about user behavior and utility not supported across retrieval scenarios. We propose recall-paired preference (RPP), a metric-free evaluation method based on directly computing a preference between ranked lists. RPP simulates multiple user subpopulations per query and compares systems across these pseudo-populations. Our results across multiple search and recommendation tasks demonstrate that RPP substantially improves discriminative power while correlating well with existing metrics and being equally robust to incomplete data.

2022-04-25

ArXiv (preprint)

doi.org

arxiv.org

QEN: Applicable Taxonomy Completion via Evaluating Full Taxonomic Relations

Suyuchen Wang

Ruihui Zhao

Yefeng Zheng

Bang Liu

Taxonomy is a fundamental type of knowledge graph for a wide range of web applications like searching and recommendation systems. To keep a … (see more)taxonomy automatically updated with the latest concepts, the taxonomy completion task matches a pair of proper hypernym and hyponym in the original taxonomy with the new concept as its parent and child. Previous solutions utilize term embeddings as input and only evaluate the parent-child relations between the new concept and the hypernym-hyponym pair. Such methods ignore the important sibling relations, and are not applicable in reality since term embeddings are not available for the latest concepts. They also suffer from the relational noise of the “pseudo-leaf” node, which is a null node acting as a node’s hyponym to enable the new concept to be a leaf node. To tackle the above drawbacks, we propose the Quadruple Evaluation Network (QEN), a novel taxonomy completion framework that utilizes easily accessible term descriptions as input, and applies pretrained language model and code attention for accurate inference while reducing online computation. QEN evaluates both parent-child and sibling relations to both enhance the accuracy and reduce the noise brought by pseudo-leaf. Extensive experiments on three real-world datasets in different domains with different sizes and term description sources prove the effectiveness and robustness of QEN on overall performance and especially the performance for adding non-leaf nodes, which largely surpasses previous methods and achieves the new state-of-the-art of the task.1

2022-04-25

The Web Conference (published)

doi.org

QEN: Applicable Taxonomy Completion via Evaluating Full Taxonomic Relations

Suyuchen Wang

Ruihui Zhao

Yefeng Zheng

Bang Liu

Taxonomy is a fundamental type of knowledge graph for a wide range of web applications like searching and recommendation systems. To keep a … (see more)taxonomy automatically updated with the latest concepts, the taxonomy completion task matches a pair of proper hypernym and hyponym in the original taxonomy with the new concept as its parent and child. Previous solutions utilize term embeddings as input and only evaluate the parent-child relations between the new concept and the hypernym-hyponym pair. Such methods ignore the important sibling relations, and are not applicable in reality since term embeddings are not available for the latest concepts. They also suffer from the relational noise of the “pseudo-leaf” node, which is a null node acting as a node’s hyponym to enable the new concept to be a leaf node. To tackle the above drawbacks, we propose the Quadruple Evaluation Network (QEN), a novel taxonomy completion framework that utilizes easily accessible term descriptions as input, and applies pretrained language model and code attention for accurate inference while reducing online computation. QEN evaluates both parent-child and sibling relations to both enhance the accuracy and reduce the noise brought by pseudo-leaf. Extensive experiments on three real-world datasets in different domains with different sizes and term description sources prove the effectiveness and robustness of QEN on overall performance and especially the performance for adding non-leaf nodes, which largely surpasses previous methods and achieves the new state-of-the-art of the task.1

2022-04-25

The Web Conference (published)

doi.org

Rare CNVs and phenome-wide profiling: a tale of brain-structural divergence and phenotypical convergence

J. Kopal

Kuldeep Kumar

Karin Saltoun

Claudia Modenato

Clara A. Moreau

Sandra Martin-Brevet

Guillaume Huguet

Martineau Jean-Louis

C.O. Martin

Zohra Saci

Nadine Younis

Petra Tamer

Elise Douard

Anne M. Maillard

Borja Rodriguez-Herreros

Aurélie Pain

Sonia Richetin

Leila Kushan

Ana I. Silva

Marianne B.M. van den Bree … (see 12 more)

David E.J. Linden

M. J. Owen

Jeremy Hall

Sarah Lippé

Bogdan Draganski

Ida E. Sønderby

Ole A. Andreassen

David C. Glahn

Paul M. Thompson

Carrie E. Bearden

Sébastien Jacquemont

Danilo Bzdok

Copy number variations (CNVs) are rare genomic deletions and duplications that can exert profound effects on brain and behavior. Previous re… (see more)ports of pleiotropy in CNVs imply that they converge on shared mechanisms at some level of pathway cascades, from genes to large-scale neural circuits to the phenome. However, studies to date have primarily examined single CNV loci in small clinical cohorts. It remains unknown how distinct CNVs escalate the risk for the same developmental and psychiatric disorders. Here, we quantitatively dissect the impact on brain organization and behavioral differentiation across eight key CNVs. In 534 clinical CNV carriers from multiple sites, we explored CNV-specific brain morphology patterns. We extensively annotated these CNV-associated patterns with deep phenotyping assays through the UK Biobank resource. Although the eight CNVs cause disparate brain changes, they are tied to similar phenotypic profiles across ∼1000 lifestyle indicators. Our population-level investigation established brain structural divergences and phenotypical convergences of CNVs, with direct relevance to major brain disorders.

2022-04-25

bioRxiv (preprint)

doi.org

Shared and unique brain network features predict cognitive, personality, and mental health scores in the ABCD study

Jianzhong Chen

Angela Tam

Valeria Kebets

Csaba Orban

L.Q.R. Ooi

Christopher L Asplund

Scott A. Marek

N. Dosenbach

Simon B. Eickhoff

Danilo Bzdok

Avram J. Holmes

B.T. Thomas Yeo

2022-04-25

Nature Communications (published)

doi.org

Shared and unique brain network features predict cognitive, personality, and mental health scores in the ABCD study

Jianzhong Chen

Angela Tam

Valeria Kebets

Csaba Orban

L.Q.R. Ooi

Leon Qi Rong Ooi

Christopher L. Asplund

Scott Marek

Nico Dosenbach

Simon B. Eickhoff

Danilo Bzdok

Avram J. Holmes

B.T. Thomas Yeo

2022-04-25

Nature Communications (published)

doi.org

Staged independent learning: Towards decentralized cooperative multi-agent Reinforcement Learning

Hadi Nekoei

Akilesh Badrinaaraayanan

Amit Sinha

Mohammad Amini

Janarthanan Rajendran

Aditya Mahajan

Sarath Chandar

We empirically show that classic ideas from two-time scale stochastic approximation \citep{borkar1997stochastic} can be combined with sequen… (see more)tial iterative best response (SIBR) to solve complex cooperative multi-agent reinforcement learning (MARL) problems. We first start with giving a multi-agent estimation problem as a motivating example where SIBR converges while parallel iterative best response (PIBR) does not. Then we present a general implementation of staged multi-agent RL algorithms based on SIBR and multi-time scale stochastic approximation, and show that our new methods which we call Staged Independent Proximal Policy Optimization (SIPPO) and Staged Independent Q-learning (SIQL) outperform state-of-the-art independent learning on almost all the tasks in the epymarl \citep{papoudakis2020benchmarking} benchmark. This can be seen as a first step towards more decentralized MARL methods based on SIBR and multi-time scale learning.

2022-04-25

ICLR.cc/2022/Workshop/GMS (published)

openreview.net

AI Advantage

Leveraging AI for a Sustainable Future

Mila AI Policy Fellowship

AI Advantage

Leveraging AI for a Sustainable Future

Publications

AI Advantage

Leveraging AI for a Sustainable Future

Mila AI Policy Fellowship

AI Advantage

Leveraging AI for a Sustainable Future

Popular keywords:

Publications