Dzmitry Bahdanau

Understanding by Understanding Not: Modeling Negation in Language Models

Negation is a core construction in natural language. Despite being very successful on many tasks, state-of-the-art pre-trained language models often handle negation incorrectly. To improve language models in this regard, we propose to augment the language modeling objective with an unlikelihood objective that is based on negated generic sentences from a raw text corpus. By training BERT with the resulting combined objective we reduce the mean top 1 error rate to 4% on the negated LAMA dataset. We also see some improvements on the negated NLI benchmarks.

2021-06-01

Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (published)

doi.org

BabyAI 1.1

David Y. T. Hui

Maxime Chevalier-Boisvert

The BabyAI platform is designed to measure the sample efficiency of training an agent to follow grounded-language instructions. BabyAI 1.0 presents baseline results of an agent trained by deep imitation or reinforcement learning. BabyAI 1.1 improves the agent's architecture in three minor ways. This increases reinforcement learning sample efficiency by up to 3× and improves imitation learning performance on the hardest level from 77% to 90.4%. We hope that these improvements increase the computational efficiency of BabyAI experiments and help users design better agents.

2021-01-01

(published)

www.semanticscholar.org

BabyAI 1.1

David Y. T. Hui

Maxime Chevalier-Boisvert

The BabyAI platform is designed to measure the sample efficiency of training an agent to follow grounded-language instructions. BabyAI 1.0 presents baseline results of an agent trained by deep imitation or reinforcement learning. BabyAI 1.1 improves the agent's architecture in three minor ways. This increases reinforcement learning sample efficiency by up to 3 times and improves imitation learning performance on the hardest level from 77% to 90.4%. We hope that these improvements increase the computational efficiency of BabyAI experiments and help users design better agents.

2020-07-24

ArXiv (preprint)

Combating False Negatives in Adversarial Imitation Learning (Student Abstract)

Konrad Żołna

Chitwan Saharia

Léonard Boussioux

David Y. T. Hui

Maxime Chevalier-Boisvert

2020-04-03

AAAI Conference on Artificial Intelligence (published)

doi.org

CLOSURE: Assessing Systematic Generalization of CLEVR Models

Harm de Vries

Timothy O'Donnell

Shikhar Murty

Philippe Beaudoin

Aaron Courville

2019-12-12

ArXiv (preprint)

Automated curriculum generation for Policy Gradients from Demonstrations

Anirudh Srinivasan

Maxime Chevalier-Boisvert

2019-12-01

ArXiv (preprint)