Publications

Workflow Discovery from Dialogues in the Low Data Regime

Amine El hattami

Stefania Raimondo

Issam Hadj Laradji

Pau Rodríguez

Christopher Pal

Text-based dialogues are now widely used to solve real-world problems. In cases where solution strategies are already known, they can someti… (see more)mes be codified into workflows and used to guide humans or artificial agents through the task of helping clients. We introduce a new problem formulation that we call Workflow Discovery (WD) in which we are interested in the situation where a formal workflow may not yet exist. Still, we wish to discover the set of actions that have been taken to resolve a particular problem. We also examine a sequence-to-sequence (Seq2Seq) approach for this novel task. We present experiments where we extract workflows from dialogues in the Action-Based Conversations Dataset (ABCD). Since the ABCD dialogues follow known workflows to guide agents, we can evaluate our ability to extract such workflows using ground truth sequences of actions. We propose and evaluate an approach that conditions models on the set of possible actions, and we show that using this strategy, we can improve WD performance. Our conditioning approach also improves zero-shot and few-shot WD performance when transferring learned models to unseen domains within and across datasets. Further, on ABCD a modified variant of our Seq2Seq method achieves state-of-the-art performance on related but different problems of Action State Tracking (AST) and Cascading Dialogue Success (CDS) across many evaluation metrics.

2022-12-31

Trans. Mach. Learn. Res. (published)

doi.org

openreview.net

"Your child needs surgery": A survey-based evaluation of simulated expert consent conversations by key stakeholders.

Zoe Atsaidis

Stephan Robitaille

Elena Guadagno

Jeffrey Wiseman

Sherif Emil

Dan Poenaru

2022-12-31

Journal of Pediatric Surgery (published)

doi.org

Offline Policy Optimization in RL with Variance Regularizaton

Riashat Islam

Samarth Sinha

Homanga Bharadhwaj

Samin Yeasar Arnob

Zhuoran Yang

Animesh Garg

Zhaoran Wang

Lihong Li

Doina Precup

2022-12-28

ArXiv (preprint)

doi.org

arxiv.org

Simplicity and learning to distinguish arguments from modifiers

Leon Bergen

E. Gibson

Timothy J. O'Donnell

2022-12-27

Journal of Language Modelling (published)

doi.org

How programmers find online learning resources

Deeksha M. Arya

Jin L.C. Guo

Martin P. Robillard

2022-12-23

Empirical Software Engineering (published)

doi.org

FaithDial: A Faithful Benchmark for Information-Seeking Dialogue

Nouha Dziri

Ehsan Kamalloo

Sivan Milton

Osmar Zaiane

Mo Yu

Edoardo M. Ponti

Siva Reddy

The goal of information-seeking dialogue is to respond to seeker queries with natural language utterances that are grounded on knowledge sou… (see more)rces. However, dialogue systems often produce unsupported utterances, a phenomenon known as hallucination. To mitigate this behavior, we adopt a data-centric solution and create FaithDial, a new benchmark for hallucination-free dialogues, by editing hallucinated responses in the Wizard of Wikipedia (WoW) benchmark. We observe that FaithDial is more faithful than WoW while also maintaining engaging conversations. We show that FaithDial can serve as training signal for: i) a hallucination critic, which discriminates whether an utterance is faithful or not, and boosts the performance by 12.8 F1 score on the BEGIN benchmark compared to existing datasets for dialogue coherence; ii) high-quality dialogue generation. We benchmark a series of state-of-the-art models and propose an auxiliary contrastive objective that achieves the highest level of faithfulness and abstractiveness based on several automated metrics. Further, we find that the benefits of FaithDial generalize to zero-shot transfer on other datasets, such as CMU-Dog and TopicalChat. Finally, human evaluation reveals that responses generated by models trained on FaithDial are perceived as more interpretable, cooperative, and engaging.

2022-12-22

Transactions of the Association for Computational Linguistics (published)

doi.org

arxiv.org

Post-hoc Interpretability for Neural NLP: A Survey

Andreas Madsen

Siva Reddy

A. Chandar

2022-12-22

ACM Computing Surveys (published)

doi.org

arxiv.org

Towards Continual Reinforcement Learning: A Review and Perspectives

Khimya Khetarpal

Matthew D Riemer

Irina Rish

Doina Precup

2022-12-21

Journal of Artificial Intelligence Research (published)

doi.org

arxiv.org

Meta-topologies define distinct anatomical classes of brain tumours linked to histology and survival

Julius M. Kernbach

Daniel Delev

Georg Neuloh

Hans Clusmann

Danilo Bzdok

Simon B. Eickhoff

Victor E. Staartjes

Flavio Vasella

Michael Weller

Luca Regli

Carlo Serra

Niklaus Krayenbühl

Kevin Akeret

The current World Health Organization classification integrates histological and molecular features of brain tumours. The aim of this study … (see more)was to identify generalizable topological patterns with the potential to add an anatomical dimension to the classification of brain tumours. We applied non-negative matrix factorization as an unsupervised pattern discovery strategy to the fine-grained topographic tumour profiles of 936 patients with neuroepithelial tumours and brain metastases. From the anatomical features alone, this machine learning algorithm enabled the extraction of latent topological tumour patterns, termed meta-topologies. The optimal part-based representation was automatically determined in 10 000 split-half iterations. We further characterized each meta-topology’s unique histopathologic profile and survival probability, thus linking important biological and clinical information to the underlying anatomical patterns. In neuroepithelial tumours, six meta-topologies were extracted, each detailing a transpallial pattern with distinct parenchymal and ventricular compositions. We identified one infratentorial, one allopallial, three neopallial (parieto-occipital, frontal, temporal) and one unisegmental meta-topology. Each meta-topology mapped to distinct histopathologic and molecular profiles. The unisegmental meta-topology showed the strongest anatomical–clinical link demonstrating a survival advantage in histologically identical tumours. Brain metastases separated to an infra- and supratentorial meta-topology with anatomical patterns highlighting their affinity to the cortico-subcortical boundary of arterial watershed areas.Using a novel data-driven approach, we identified generalizable topological patterns in both neuroepithelial tumours and brain metastases. Differences in the histopathologic profiles and prognosis of these anatomical tumour classes provide insights into the heterogeneity of tumour biology and might add to personalized clinical decision-making.

2022-12-19

Brain Communications (published)

doi.org

BLOOM+1: Adding Language Support to BLOOM for Zero-Shot Prompting

Zheng Xin Yong

Hailey Schoelkopf

Niklas Muennighoff

Alham Fikri Aji

David Ifeoluwa Adelani

Khalid Almubarak

M. Saiful Bari

Lintang A. Sutawika

Jungo Kasai

Ahmed Baruwa

Genta Indra Winata

Stella Biderman

Dragomir R. Radev

Vassilina Nikoulina

The BLOOM model is a large publicly available multilingual language model, but its pretraining was limited to 46 languages. To extend the be… (see more)nefits of BLOOM to other languages without incurring prohibitively large costs, it is desirable to adapt BLOOM to new languages not seen during pretraining. In this work, we apply existing language adaptation strategies to BLOOM and benchmark its zero-shot prompting performance on eight new languages in a resource-constrained setting. We find language adaptation to be effective at improving zero-shot performance in new languages. Surprisingly, we find that adapter-based finetuning is more effective than continued pretraining for large models. In addition, we discover that prompting performance is not significantly affected by language specifics, such as the writing system. It is primarily determined by the size of the language adaptation data. We also add new languages to BLOOMZ, which is a multitask finetuned version of BLOOM capable of following task instructions zero-shot. We find including a new language in the multitask fine-tuning mixture to be the most effective method to teach BLOOMZ a new language. We conclude that with sufficient training data language adaptation can generalize well to diverse languages. Our code is available at https://github.com/bigscience-workshop/multilingual-modeling.

2022-12-18

ArXiv (preprint)

doi.org

arxiv.org

Biomedical image analysis competitions: The state of current participation practice

Matthias Eisenmann

Annika Reinke

Vivienn Weru

Minu Dietlinde Tizabi

Fabian Isensee

T. Adler

PATRICK GODAU

Veronika Cheplygina

Michal Kozubek

Sharib Ali

Anubha Gupta

Jan. Kybic

Alison Professor Noble

Carlos Ortiz de Sol'orzano

Samiksha Pachade

Caroline Petitjean

Daniel Sage

Donglai Wei

Elizabeth Wilden

Deepak Alapatt … (see 334 more)

Vincent Andrearczyk

Ujjwal Baid

Spyridon Bakas

Niranjan Balu

Sophia Bano

Vivek Singh Bawa

Jorge Bernal

Sebastian Bodenstedt

Alessandro Casella

Jinwook Choi

Olivier Commowick

M. Daum

Adrien Depeursinge

Reuben Dorent

J. Egger

H. Eichhorn

Sandy Engelhardt

Melanie Ganz

Gabriel Girard

Lasse Donovan Hansen

Mattias Paul Heinrich

Nicholas Heller

Alessa Hering

Arnaud Huaulm'e

Hyunjeong Kim

Bennett Landman

Hongwei Bran Li

Jianning Li

Junfang Ma

Anne L. Martel

Carlos Mart'in-Isla

Bjoern Menze

Chinedu Innocent Nwoye

Valentin Oreiller

Nicolas Padoy

Sarthak Pati

Kelly Payette

Carole H. Sudre

K. V. Wijnen

Armine Vardazaryan

Tom Kamiel Magda Vercauteren

Martin Wagner

Chuanbo Wang

Moi Hoon Yap

Zeyun Yu

Chuner Yuan

Maximilian Zenk

Aneeq Zia

David Zimmerer

Rina Bao

Chanyeol Choi

Andrew Cohen

Oleh Dzyubachyk

Adrian Galdran

Tianyuan Gan

Tianqi Guo

Pradyumna Gupta

M. Haithami

Edward Ho

Ikbeom Jang

Zhili Li

Zheng Luo

Filip Lux

Sokratis Makrogiannis

Dominikus Muller

Young-Tack Oh

Subeen Pang

Constantin Pape

Gorkem Polat

Charlotte Rosalie Reed

Kanghyun Ryu

Tim Scherr

Vajira L. Thambawita

Haoyu Wang

Xinliang Wang

Kele Xu

H.-I. Yeh

Doyeob Yeo

Yi Yuan

Yan Zeng

Xingwen Zhao

Julian Ronald Abbing

Jannes Adam

Nagesh Adluru

Niklas Agethen

S. Ahmed

Yasmina Al Khalil

Mireia Alenya

Esa J. Alhoniemi

C. An

Talha E Anwar

Tewodros Arega

Netanell Avisdris

D. Aydogan

Yi-Shi Bai

Maria Baldeon Calisto

Berke Doga Basaran

Marcel Beetz

Cheng Bian

Hao-xuan Bian

Kevin Blansit

Louise Bloch

Robert Bohnsack

Sara Bosticardo

J. Breen

Mikael Brudfors

Raphael Brungel

Mariano Cabezas

Alberto Cacciola

Zhiwei Chen

Yucong Chen

Dan Chen

Minjeong Cho

Min-Kook Choi

Chuantao Xie Chuantao Xie

Dana Cobzas

Julien Cohen-Adad

Jorge Corral Acero

Sujit Kumar Das

Marcela de Oliveira

Hanqiu Deng

Guiming Dong

Lars Doorenbos

Cory Efird

Di Fan

Mehdi Fatan Serj

Alexandre Fenneteau

Lucas Fidon

Patryk Filipiak

Ren'e Finzel

Nuno Renato Freitas

C. Friedrich

Mitchell J. Fulton

Finn Gaida

Francesco Galati

Christoforos Galazis

Changna Gan

Zheyao Gao

Sheng Gao

Matej Gazda

Beerend G. A. Gerats

Neil Getty

Adam Gibicar

Ryan J. Gifford

Sajan Gohil

Maria Grammatikopoulou

Daniel Grzech

Orhun Guley

Timo Gunnemann

Chun-Hai Guo

Sylvain Guy

Heonjin Ha

Luyi Han

Ilseok Han

Ali Hatamizadeh

Tianhai He

Ji-Wu Heo

Sebastian Hitziger

SeulGi Hong

Seungbum Hong

Rian Huang

Zi-You Huang

Markus Huellebrand

Stephan Huschauer

M. Hussain

Tomoo Inubushi

Ece Isik Polat

Mojtaba Jafaritadi

Seonghun Jeong

Bailiang Jian

Yu Jiang

Zhifan Jiang

Yu Jin

Smriti Joshi

A. Kadkhodamohammadi

R. A. Kamraoui

Inhak Kang

Jun-Su Kang

Davood Karimi

April Ellahe Khademi

Muhammad Irfan Khan

Suleiman A. Khan

Rishab Khantwal

Kwang-Ju Kim

Timothy Lee Kline

Satoshi Kondo

Elina Kontio

Adrian Krenzer

Artem Kroviakov

Hugo J. Kuijf

Satyadwyoom Kumar

Francesco La Rosa

Abhishek Lad

Doohee Lee

Minho Lee

Chiara Lena

Hao Li

Ling Li

Xingyu Li

F. Liao

Kuan-Ya Liao

Arlindo L. Oliveira

Chaonan Lin

Shanhai Lin

Akis Linardos

M. Linguraru

Han Liu

Tao Liu

Dian Liu

Yanling Liu

Joao Lourencco-Silva

Jing Lu

Jia Lu

Imanol Luengo

Christina Bach Lund

Huan Minh Luu

Yingqi Lv

Uzay Macar

Leon Maechler

L. SinaMansour

Kenji Marshall

Moona Mazher

Richard McKinley

Alfonso Medela

Felix Meissen

Mingyuan Meng

Dylan Bradley Miller

S. Mirjahanmardi

Arnab Kumar Mishra

Samir Mitha

Hassan Mohy-ud-Din

Tony C. W. Mok

Gowtham Krishnan Murugesan

Enamundram Naga Karthik

Sahil Nalawade

Jakub Nalepa

M. Naser

Ramin Nateghi

Hammad Naveed

Quang-Minh Nguyen

Cuong Nguyen Quoc

Brennan Nichyporuk

Bruno Oliveira

David Owen

Jimut Bahan Pal

Junwen Pan

Wei-Dong Pan

Winnie Pang

Bogyu Park

Vivek G. Pawar

Kamlesh Pawar

Michael Peven

Lena Philipp

Tomasz Pieciak

Szymon S Płotka

Marcel Plutat

Fattane Pourakpour

Domen Prelovznik

K. Punithakumar

Abdul Qayyum

Sandro Queir'os

Arman Rahmim

Salar Razavi

Jintao Ren

Mina Rezaei

Jonathan Adam Rico

ZunHyan Rieu

Markus Rink

Johannes Roth

Yusely Ruiz-gonzalez

Numan Saeed

Anindo Saha

Mostafa M. Sami Salem

Ricardo Sanchez-matilla

Kurt G Schilling

Weizhen Shao

Zhiqiang Shen

Ruize Shi

Pengcheng Shi

Daniel Sobotka

Th'eodore Soulier

Bella Specktor Fadida

D. Stoyanov

Timothy Sum Hon Mun

Xiao-Fu Sun

Rong Tao

Franz Thaler

Antoine Th'eberge

Felix Thielke

Helena R. Torres

K. Wahid

Jiacheng Wang

Yifei Wang

Wei David Wang

Xiong Jun Wang

Jianhui Wen

Ning Wen

Marek Wodziński

Yehong Wu

Fangfang Xia

Tianqi Xiang

Cheng Xiaofei

Lizhang Xu

Tingting Xue

Yu‐Xia Yang

Lingxian Yang

Kai Yao

Huifeng Yao

Amirsaeed Yazdani

Michael Yip

Hwa-Seong Yoo

Fereshteh Yousefirizi

Shu-Fen Yu

Lei Yu

Jonathan Zamora

Ramy A. Zeineldin

Dewen Zeng

Jianpeng Zhang

Bokai Zhang

Jiapeng Zhang

Fangxi Zhang

Huahong Zhang

Zhongchen Zhao

Zixuan Zhao

Jia Zhao

Can Zhao

Qiuyue Zheng

Yuheng Zhi

Ziqi Zhou

Baosheng Zou

Klaus Maier-Hein

PAUL F. JÄGER

Annette Kopp-Schneider

Lena Maier-Hein

2022-12-15

ArXiv (preprint)

doi.org

arxiv.org

Dynamic Consolidation for Continual Learning

Hang Li

Chen Ma

X. T. Chen

Xue Liu

Abstract Training deep learning models from a stream of nonstationary data is a critical problem to be solved to achieve general artificial … (see more)intelligence. As a promising solution, the continual learning (CL) technique aims to build intelligent systems that have the plasticity to learn from new information without forgetting the previously obtained knowledge. Unfortunately, existing CL methods face two nontrivial limitations. First, when updating a model with new data, existing CL methods usually constrain the model parameters within the vicinity of the parameters optimized for old data, limiting the exploration ability of the model; second, the important strength of each parameter (used to consolidate the previously learned knowledge) is fixed and thus is suboptimal for the dynamic parameter updates. To address these limitations, we first relax the vicinity constraints with a global definition of the important strength, which allows us to explore the full parameter space. Specifically, we define the important strength as the sensitivity of the global loss function to the model parameters. Moreover, we propose adjusting the important strength adaptively to align it with the dynamic parameter updates. Through extensive experiments on popular data sets, we demonstrate that our proposed method outperforms the strong baselines by up to 24% in terms of average accuracy.

2022-12-13

Neural Computation (published)

doi.org

Mila Techaide 2026

Venture Scientist Bootcamp

AI Advantage: Productivity in Public Service

Publications

Mila Techaide 2026

Venture Scientist Bootcamp

AI Advantage: Productivity in Public Service

Popular keywords:

Publications