Publications

Scattered Mixture-of-Experts Implementation

Rameswar Panda

ScatterMoE is an implementation of Sparse Mixture-of-Experts (SMoE) on GPUs. ScatterMoE builds upon techniques in existing implementations, … (see more)and overcoming some of the current limitations to improve batched inference, training speed, and memory footprint. This implementation achieves this by avoiding padding and making excessive copies of the input. We also fuse expert linear transforms and reordering operations with ParallelLinear, a module that can be used to extend the concept of SMoEs. We benchmark our implementation against Megablocks, and show that it enables a higher throughput and lower memory footprint. We also show how ParallelLinear enables extension of the Mixture-of-Experts concept by demonstrating with an implementation of Mixture-of-Attention.

2024-07-09

colmweb.org/COLM/2024/Conference (accepted)

doi.org

openreview.net

Seeking Interpretability and Explainability in Binary Activated Neural Networks

Benjamin Leblanc

Pascal Germain

2024-07-09

Communications in Computer and Information Science (published)

doi.org

arxiv.org

Should We Attend More or Less? Modulating Attention for Fairness

Abdelrahman Zayed

Goncalo Mordido

Samira Shabanian

A. Chandar

2024-07-09

colmweb.org/COLM/2024/Conference (accepted)

doi.org

openreview.net

A Survey on Deep Learning for Theorem Proving

Zhaoyu Li

Jialiang Sun

Logan Murphy

Qidong Su

Zenan Li

Xian Zhang

Kaiyu Yang

Xujie Si

2024-07-09

colmweb.org/COLM/2024/Conference (accepted)

doi.org

openreview.net

The black box of the relationship between breast cancer patients and accompanying patients: the accompanied patients' point of view

Marie-Pascale Pomey

Monica Iliescu Nelea

Cécile Vialaron

Louise Normandin

Marie-Andrée Côté

Mado Desforges

Pénélope Pomey-Carpentier

Nesrine Adjtoutah

Israël Fortin

Isabelle Ganache

Catherine Régis

Zeev Rosberger

Danielle Charpentier

Lynda Bélanger

Michel Dorval

Djahanchah P. Ghadiri

Mélanie Lavoie-Tremblay

Antoine Boivin

Jean-François Pelletier

Nicolas Fernandez … (see 2 more)

Alain M. Danino

Michèle de Guise

The PAROLE-Onco program was introduced in the province of Quebec, Canada in 2019. It integrates accompanying patients (APs), i.e., people wh… (see more)o have been affected by cancer, into the clinical team as full members. These APs use their experiential knowledge with people undergoing treatment and with clinical teams. The aim of this paper is to evaluate, within the framework of two university medical centers, the perceptions of breast cancer patients who receive support from APs, particularly in terms of their active involvement in their care trajectory. A qualitative study based on semi-structured interviews with accompanied patients was performed. Fourteen individual interviews were conducted between July and September 2021 with women presenting different profiles in terms of age, education, professional status, type of treatment, family situation, and clinical background. The data were analyzed using thematic analysis, focusing on patients’ perceptions of APs’ contributions and suggested improvements for accessing AP support. Three themes emerged from the semi-structured interviews: communication modalities used to connect patients with their APs, the characteristics of the support provided by APs, and the perceived effects of this support on the patients. Patients expressed a preference for telephone communication, highlighting its convenience and accessibility. The support provided by APs included emotional and informational support, neutrality, and adaptability. This relationship improved patient communication, reduced anxiety, helped regain control, and enhanced overall quality of life. The results emphasized the added value of APs in complementing the support offered by healthcare professionals. Patients noted the critical role of APs in helping them navigate the healthcare system, better understand their treatment processes, and manage their emotions. The ability of APs to provide practical advice and emotional reassurance was particularly valued. Overall, the findings underscored the significant impact of AP support on patients’ experiences and highlighted areas for enhancing this service. This study highlights, during the care trajectory of people affected by breast cancer, APs’ contribution to patients’ emotional well-being because they improve, in particular, the management of emotions and communication with health professionals. The online version contains supplementary material available at 10.1186/s12885-024-12585-z.

2024-07-09

BMC Cancer (published)

doi.org

Trust No Bot: Discovering Personal Disclosures in Human-LLM Conversations in the Wild

Niloofar Mireshghallah

Maria Antoniak

Yash More

Yejin Choi

Golnoosh Farnadi

Measuring personal disclosures made in human-chatbot interactions can provide a better understanding of users' AI literacy and facilitate pr… (see more)ivacy research for large language models (LLMs). We run an extensive, fine-grained analysis on the personal disclosures made by real users to commercial GPT models, investigating the leakage of personally identifiable and sensitive information. To understand the contexts in which users disclose to chatbots, we develop a taxonomy of tasks and sensitive topics, based on qualitative and quantitative analysis of naturally occurring conversations. We discuss these potential privacy harms and observe that: (1) personally identifiable information (PII) appears in unexpected contexts such as in translation or code editing (48% and 16% of the time, respectively) and (2) PII detection alone is insufficient to capture the sensitive topics that are common in human-chatbot interactions, such as detailed sexual preferences or specific drug use habits. We believe that these high disclosure rates are of significant importance for researchers and data curators, and we call for the design of appropriate nudging mechanisms to help users moderate their interactions.

2024-07-09

colmweb.org/COLM/2024/Conference (accepted)

doi.org

openreview.net

V-STaR: Training Verifiers for Self-Taught Reasoners

Arian Hosseini

Xingdi Yuan

Nikolay Malkin

Aaron Courville

Alessandro Sordoni

Rishabh Agarwal

Common self-improvement approaches for large language models (LLMs), such as STaR, iteratively fine-tune LLMs on self-generated solutions to… (see more) improve their problem-solving ability. However, these approaches discard the large amounts of incorrect solutions generated during this process, potentially neglecting valuable information in such solutions. To address this shortcoming, we propose V-STaR that utilizes both the correct and incorrect solutions generated during the self-improvement process to train a verifier using DPO that judges correctness of model-generated solutions. This verifier is used at inference time to select one solution among many candidate solutions. Running V-STaR for multiple iterations results in progressively better reasoners and verifiers, delivering a 4% to 17% test accuracy improvement over existing self-improvement and verification approaches on common code generation and math reasoning benchmarks with LLaMA2 models.

2024-07-09

colmweb.org/COLM/2024/Conference (accepted)

doi.org

openreview.net

Web Retrieval Agents for Evidence-Based Misinformation Detection

Jacob-Junqi Tian

Hao Yu

Yury Orlovskiy

Tyler Vergho

Mauricio Rivera

Mayank Goel

Zachary Yang

Jean-François Godbout

Reihaneh Rabbany

Kellin Pelrine

2024-07-09

colmweb.org/COLM/2024/Conference (accepted)

doi.org

openreview.net

What makes a good metric? Evaluating automatic metrics for text-to-image consistency

Candace Ross

Melissa Hall

Adriana Romero

Adina Williams

2024-07-09

colmweb.org/COLM/2024/Conference (accepted)

doi.org

openreview.net

Automated River Substrate Mapping From Sonar Imagery With Machine Learning

C. S. Bodine

D. Buscombe

Toby Dylan Hocking

2024-07-08

Journal of Geophysical Research: Machine Learning and Computation (published)

doi.org

Canada’s approach to SARS-CoV-2 sero-surveillance: Lessons learned for routine surveillance and future pandemics

Sheila F. O’Brien

Michael Asamoah-Boaheng

Brian Grunau

Mel Krajden

David L Buckeridge

David M. Goldfarb

Maureen Anderson

Marc Germain

Patrick Brown

Derek R. Stein

Kami Kandola

Graham Tipples

Philip Awadalla

Amanda Lang

Lesley Behl

Tiffany Fitzpatrick

Steven J. Drews

2024-07-08

Canadian Journal of Public Health (published)

doi.org

Learn-To-Design: Reinforcement Learning-Assisted Chemical Process Optimization

Eslam G. Al-Sakkari

Ahmed Ragab

Mohamed Ali

Hanane Dagdougui

Daria C. Boffito

Mouloud Amazouz

This paper proposes an AI-assisted approach aimed at accelerating chemical process design through causal incremental reinforcement learning … (see more)(CIRL) where an intelligent agent is interacting iteratively with a process simulation environment (e.g., Aspen HYSYS, DWSIM, etc.). The proposed approach is based on an incremental learnable optimizer capable of guiding multi-objective optimization towards optimal design variable configurations, depending on several factors including the problem complexity, selected RL algorithm and hyperparameters tuning. One advantage of this approach is that the agent-simulator interaction significantly reduces the vast search space of design variables, leading to an accelerated and optimized design process. This is a generic causal approach that enables the exploration of new process configurations and provides actionable insights to designers to improve not only the process design but also the design process across various applications. The approach was validated on industrial processes including an absorption-based carbon capture, considering the economic and technological uncertainties of different capture processes, such as energy price, production cost, and storage capacity. It achieved a cost reduction of up to 5.5% for the designed capture process, after a few iterations, while also providing the designer with actionable insights. From a broader perspective, the proposed approach paves the way for accelerating the adoption of decarbonization technologies (CCUS value chains, clean fuel production, etc.) at a larger scale, thus catalyzing climate change mitigation.

2024-07-08

Systems and Control Transactions (published)

doi.org

AI Policy Fellowship Publications

Mila Ventures Launchpad

AI Policy Compass

Publications

AI Policy Fellowship Publications

Mila Ventures Launchpad

AI Policy Compass

Popular keywords:

Publications