Publications

Attention Schema in Neural Agents

Dianbo Liu

Samuele Bolotta

Mike He Zhu

Guillaume Dumas

Attention has become a common ingredient in deep learning architectures. It adds a dynamical selection of information on top of the static s… (see more)election of information supported by weights. In the same way, we can imagine a higher-order informational filter built on top of attention: an Attention Schema (AS), namely, a descriptive and predictive model of attention. In cognitive neuroscience, Attention Schema Theory (AST) supports this idea of distinguishing attention from AS. A strong prediction of this theory is that an agent can use its own AS to also infer the states of other agents' attention and consequently enhance coordination with other agents. As such, multi-agent reinforcement learning would be an ideal setting to experimentally test the validity of AST. We explore different ways in which attention and AS interact with each other. Our preliminary results indicate that agents that implement the AS as a recurrent internal control achieve the best performance. In general, these exploratory experiments suggest that equipping artificial agents with a model of attention can enhance their social intelligence.

2023-05-27

ArXiv (preprint)

Let the Flows Tell: Solving Graph Combinatorial Optimization Problems with GFlowNets

Dinghuai Zhang

Hanjun Dai

Nikolay Malkin

Aaron Courville

Ling Pan

Combinatorial optimization (CO) problems are often NP-hard and thus out of reach for exact algorithms, making them a tempting domain to appl… (see more)y machine learning methods. The highly structured constraints in these problems can hinder either optimization or sampling directly in the solution space. On the other hand, GFlowNets have recently emerged as a powerful machinery to efficiently sample from composite unnormalized densities sequentially and have the potential to amortize such solution-searching processes in CO, as well as generate diverse solution candidates. In this paper, we design Markov decision processes (MDPs) for different combinatorial problems and propose to train conditional GFlowNets to sample from the solution space. Efficient training techniques are also developed to benefit long-range credit assignment. Through extensive experiments on a variety of different CO tasks with synthetic and realistic data, we demonstrate that GFlowNet policies can efficiently find high-quality solutions. Our implementation is open-sourced at https://github.com/zdhNarsil/GFlowNet-CombOpt.

2023-05-26

ArXiv (preprint)

Motor cortex latent dynamics encode arm movement direction and urgency independently

Andrea Colins Rodriguez

Matt Perich

Lee Miller

Mark D. Humphries

2023-05-26

bioRxiv (preprint)

Testing Feedforward Neural Networks Training Programs

Houssem Ben Braiek

Foutse Khomh

2023-05-26

ACM Transactions on Software Engineering and Methodology (published)

An Examination of the Robustness of Reference-Free Image Captioning Evaluation Metrics

Saba Ahmadi

Aishwarya Agrawal

2023-05-24

ArXiv (preprint)

Model evaluation for extreme risks

Toby Shevlane

Sebastian Farquhar

Ben Garfinkel

Mary Phuong

Jess Whittlestone

Jade Leung

Daniel Kokotajlo

Nahema A. Marchal

Markus Anderljung

Noam Kolt

Lewis Ho

Divya Siddarth

Shahar Avin

W. Hawkins

Been Kim

Iason Gabriel

Vijay Bolina

Jack Clark

Paul F. Christiano … (see 1 more)

Allan Dafoe

Current approaches to building general-purpose AI systems tend to produce systems with both beneficial and harmful capabilities. Further pro… (see more)gress in AI development could lead to capabilities that pose extreme risks, such as offensive cyber capabilities or strong manipulation skills. We explain why model evaluation is critical for addressing extreme risks. Developers must be able to identify dangerous capabilities (through"dangerous capability evaluations") and the propensity of models to apply their capabilities for harm (through"alignment evaluations"). These evaluations will become critical for keeping policymakers and other stakeholders informed, and for making responsible decisions about model training, deployment, and security.

2023-05-24

ArXiv (preprint)

Model evaluation for extreme risks

Toby Shevlane

Sebastian Farquhar

Ben Garfinkel

Mary Phuong

Jess Whittlestone

Jade Leung

Daniel Kokotajlo

Nahema A. Marchal

Markus Anderljung

Noam Kolt

Lewis Ho

Divya Siddarth

Shahar Avin

W. Hawkins

Been Kim

Iason Gabriel

Vijay Bolina

Jack Clark

Paul F. Christiano … (see 1 more)

Allan Dafoe

Current approaches to building general-purpose AI systems tend to produce systems with both beneficial and harmful capabilities. Further pro… (see more)gress in AI development could lead to capabilities that pose extreme risks, such as offensive cyber capabilities or strong manipulation skills. We explain why model evaluation is critical for addressing extreme risks. Developers must be able to identify dangerous capabilities (through"dangerous capability evaluations") and the propensity of models to apply their capabilities for harm (through"alignment evaluations"). These evaluations will become critical for keeping policymakers and other stakeholders informed, and for making responsible decisions about model training, deployment, and security.

2023-05-24

ArXiv (preprint)

Model evaluation for extreme risks

Toby Shevlane

Sebastian Farquhar

Ben Garfinkel

Mary Phuong

Jess Whittlestone

Jade Leung

Daniel Kokotajlo

Nahema A. Marchal

Markus Anderljung

Noam Kolt

Lewis Ho

Divya Siddarth

Shahar Avin

W. Hawkins

Been Kim

Iason Gabriel

Vijay Bolina

Jack Clark

Paul F. Christiano … (see 1 more)

Allan Dafoe

2023-05-24

ArXiv (preprint)

Model evaluation for extreme risks

Toby Shevlane

Sebastian Farquhar

Ben Garfinkel

Mary Phuong

Jess Whittlestone

Jade Leung

Daniel Kokotajlo

Nahema A. Marchal

Markus Anderljung

Noam Kolt

Lewis Ho

Divya Siddarth

Shahar Avin

W. Hawkins

Been Kim

Iason Gabriel

Vijay Bolina

Jack Clark

Paul F. Christiano … (see 1 more)

Allan Dafoe

Current approaches to building general-purpose AI systems tend to produce systems with both beneficial and harmful capabilities. Further pro… (see more)gress in AI development could lead to capabilities that pose extreme risks, such as offensive cyber capabilities or strong manipulation skills. We explain why model evaluation is critical for addressing extreme risks. Developers must be able to identify dangerous capabilities (through"dangerous capability evaluations") and the propensity of models to apply their capabilities for harm (through"alignment evaluations"). These evaluations will become critical for keeping policymakers and other stakeholders informed, and for making responsible decisions about model training, deployment, and security.

2023-05-24

ArXiv (preprint)

Model evaluation for extreme risks

Toby Shevlane

Sebastian Farquhar

Ben Garfinkel

Mary Phuong

Jess Whittlestone

Jade Leung

Daniel Kokotajlo

Nahema A. Marchal

Markus Anderljung

Noam Kolt

Lewis Ho

Divya Siddarth

Shahar Avin

W. Hawkins

Been Kim

Iason Gabriel

Vijay Bolina

Jack Clark

Paul F. Christiano … (see 1 more)

Allan Dafoe

Current approaches to building general-purpose AI systems tend to produce systems with both beneficial and harmful capabilities. Further pro… (see more)gress in AI development could lead to capabilities that pose extreme risks, such as offensive cyber capabilities or strong manipulation skills. We explain why model evaluation is critical for addressing extreme risks. Developers must be able to identify dangerous capabilities (through"dangerous capability evaluations") and the propensity of models to apply their capabilities for harm (through"alignment evaluations"). These evaluations will become critical for keeping policymakers and other stakeholders informed, and for making responsible decisions about model training, deployment, and security.

2023-05-24

ArXiv (preprint)

De novo motor learning creates structure in neural activity space that shapes adaptation

Joanna C. Chang

Matt Perich

Lee Miller

Juan A. Gallego

Claudia Clopath

2023-05-24

bioRxiv (preprint)