Publications

Towards a standardized framework for AI-assisted, image-based monitoring of nocturnal insects

D. B. Roy

David Roy

J. Alison

Tom August

M. Bélisle

K. Bjerge

J. J. Bowden

M. J. Bunsen

F. Cunha

Q. Geissmann

K. Goldmann

Alba Gomez-Segura

A. Jain

C. Huijbers

M. Larrivée

J. L. Lawson

H. M. Mann

M. J. Mazerolle

K. P. McFarland

L. Pasi … (voir 8 de plus)

S. Peters

N. Pinoy

David Rolnick

G. L. Skinner

O. T. Strickson

A. Svenning

S. Teagle

Toke Thomas Høye

Automated sensors have potential to standardize and expand the monitoring of insects across the globe. As one of the most scalable and faste… (voir plus)st developing sensor technologies, we describe a framework for automated, image-based monitoring of nocturnal insects—from sensor development and field deployment to workflows for data processing and publishing. Sensors comprise a light to attract insects, a camera for collecting images and a computer for scheduling, data storage and processing. Metadata is important to describe sampling schedules that balance the capture of relevant ecological information against power and data storage limitations. Large data volumes of images from automated systems necessitate scalable and effective data processing. We describe computer vision approaches for the detection, tracking and classification of insects, including models built from existing aggregations of labelled insect images. Data from automated camera systems necessitate approaches that account for inherent biases. We advocate models that explicitly correct for bias in species occurrence or abundance estimates resulting from the imperfect detection of species or individuals present during sampling occasions. We propose ten priorities towards a step-change in automated monitoring of nocturnal insects, a vital task in the face of rapid biodiversity loss from global threats. This article is part of the theme issue ‘Towards a toolkit for global insect biodiversity monitoring’.

2024-06-24

Philosophical Transactions of the Royal Society B: Biological Sciences (publié)

doi.org

Efficient Evolutionary Search Over Chemical Space with Large Language Models

Haorui Wang

Marta Skreta

Cher Tian Ser

Wenhao Gao

Lingkai Kong

Felix Streith-Kalthoff

Chenru Duan

Yuchen Zhuang

Yue Yu

Yanqiao Zhu 0001

Yuanqi Du

Alan Aspuru-Guzik

Kirill Neklyudov

Chao Zhang

Molecular discovery, when formulated as an optimization problem, presents significant computational challenges because optimization objectiv… (voir plus)es can be non-differentiable. Evolutionary Algorithms (EAs), often used to optimize black-box objectives in molecular discovery, traverse chemical space by performing random mutations and crossovers, leading to a large number of expensive objective evaluations. In this work, we ameliorate this shortcoming by incorporating chemistry-aware Large Language Models (LLMs) into EAs. Namely, we redesign crossover and mutation operations in EAs using LLMs trained on large corpora of chemical information. We perform extensive empirical studies on both commercial and open-source models on multiple tasks involving property optimization, molecular rediscovery, and structure-based drug design, demonstrating that the joint usage of LLMs with EAs yields superior performance over all baseline models across single- and multi-objective settings. We demonstrate that our algorithm improves both the quality of the final solution and convergence speed, thereby reducing the number of required objective evaluations. Our code is available at http://github.com/zoom-wang112358/MOLLEO

2024-06-23

ArXiv (prépublication)

doi.org

arxiv.org

Existing Digital Health Technology Index Summary Report for Older Adults Living with Neurocognitive Disorders (Mild and Major) and Their Informal Caregivers: An Environmental Scan

Ambily Jose

Maxime Sasseville

Ellen Gorus

Anik Giguère

Anne Bourbonnais

Samira Abbasgholizadeh-Rahimi

Clémence Balley

Ronald Buyl

Marie-Pierre Gagnon

Digital health has added numerous promising solutions to enhance the health and wellness of people with neurocognitive disorders (NCDs) and … (voir plus)their informal caregivers. (1) Background: It is important to obtain a comprehensive view of currently available technologies, their outcomes, and conditions of success to inform recommendations regarding digital health solutions for people with NCDs and their caregivers. This environmental scan was performed to identify the features of existing digital health solutions relevant to the targeted population. This work reviews currently available digital health solutions and their related characteristics to develop a decision support tool for older adults living with mild or major neurocognitive disorders and their informal caregivers. This knowledge will aid the development of a decision support tool to assist older adults and their informal caregivers in their search for adequate digital health solutions according to their needs and preferences based on trustable information. (2) Methods: We conducted an environmental scan to identify digital health solutions from a systematic review and targeted searches in the grey literature covering the regions of Canada and Europe. Technological tools were scanned based on a preformatted extraction grid. We assessed their relevance based on selected attributes and summarized the findings. (3) Results: We identified 100 available digital health solutions. The majority (56%) were not specific to NCDs. Only 28% provided scientific evidence of their effectiveness. Remote patient care, movement tracking, and cognitive exercises were the most common purposes of digital health solutions. Most solutions were presented as decision aid tools, pill dispensers, apps, web, or a combination of these platforms. (4) Conclusions: This environmental scan allowed for identifying current digital health solutions for older adults with mild or major neurocognitive disorders and their informal caregivers. Findings from the environmental scan highlight the need for additional approaches to strengthen digital health interventions for the well-being of older adults with mild and major NCDs and their informal and formal healthcare providers.

2024-06-22

Geriatrics (publié)

doi.org

DASB -- Discrete Audio and Speech Benchmark

Jarod Duret

Discrete audio tokens have recently gained considerable attention for their potential to connect audio and language processing, enabling the… (voir plus) creation of modern multimodal large language models. Ideal audio tokens must effectively preserve phonetic and semantic content along with paralinguistic information, speaker identity, and other details. While several types of audio tokens have been recently proposed, identifying the optimal tokenizer for various tasks is challenging due to the inconsistent evaluation settings in existing studies. To address this gap, we release the Discrete Audio and Speech Benchmark (DASB), a comprehensive leaderboard for benchmarking discrete audio tokens across a wide range of discriminative tasks, including speech recognition, speaker identification and verification, emotion recognition, keyword spotting, and intent classification, as well as generative tasks such as speech enhancement, separation, and text-to-speech. Our results show that, on average, semantic tokens outperform compression tokens across most discriminative and generative tasks. However, the performance gap between semantic tokens and standard continuous representations remains substantial, highlighting the need for further research in this field.

2024-06-20

ArXiv (prépublication)

doi.org

openreview.net

Language Model-In-The-Loop: Data Optimal Approach to Recommend Actions in Text Games

Arjun V Sudhakar

Prasanna Parthasarathi

Janarthanan Rajendran

Sarath Chandar

Large Language Models (LLMs) have demonstrated superior performance in language understanding benchmarks. A recent use case for LLMs involve… (voir plus)s training decision-making agents over textual information. The existing approach leverages LLM's linguistic priors for action candidate recommendations in text games, i.e., to operate without environment-provided actions. However, adapting LLMs to specific games/tasks requires a massive amount of annotated human gameplay. Moreover, in the existing approach, the language model was kept frozen during an agent's training process, which limits learning from in-game knowledge about the world. Hence, we explore strategies to adapt the language model for candidate recommendation with in-game transition in an online learning fashion to mitigate reliance on human-annotated gameplays, which are costly to acquire. In this paper, we propose in-game transition selection methods to adapt the LLM in the loop, reducing the dependency on using human-annotated gameplays while improving performance and convergence. Our method demonstrates a 53% relative improvement in average game score over the previous state-of-the-art model, achieving more than twice the convergence rate in a full-annotated dataset setting. Furthermore, even with only 10% of human annotation, we surpassed the 100\% state-of-the-art performance benchmark.

2024-06-20

rl-conference.cc/RLC/2024/Workshop/TAFM (publié)

openreview.net

APPL: A Prompt Programming Language for Harmonious Integration of Programs and Large Language Model Prompts

Honghua Dong

Qidong Su

Yubo Gao

Zhaoyu Li

Yangjun Ruan

Gennady G. Pekhimenko

Chris J. Maddison

Xujie Si

Large Language Models (LLMs) have become increasingly capable of handling diverse tasks with the aid of well-crafted prompts and integration… (voir plus) of external tools, but as task complexity rises, the workflow involving LLMs can be complicated and thus challenging to implement and maintain. To address this challenge, we propose APPL, A Prompt Programming Language that acts as a bridge between computer programs and LLMs, allowing seamless embedding of prompts into Python functions, and vice versa. APPL provides an intuitive and Python-native syntax, an efficient parallelized runtime with asynchronous semantics, and a tracing module supporting effective failure diagnosis and replaying without extra costs. We demonstrate that APPL programs are intuitive, concise, and efficient through three representative scenarios: Chain-of-Thought with self-consistency (CoT-SC), ReAct tool use agent, and multi-agent chat. Experiments on three parallelizable workflows further show that APPL can effectively parallelize independent LLM calls, with a significant speedup ratio that almost matches the estimation.

2024-06-19

ArXiv (prépublication)

doi.org

arxiv.org

Functional Acceleration for Policy Mirror Descent

Veronica Chelu

Doina Precup

2024-06-19

ICML.cc/2024/Workshop/ARLET (poster)

doi.org

openreview.net

Functional Acceleration for Policy Mirror Descent

Veronica Chelu

Doina Precup

We apply functional acceleration to the Policy Mirror Descent (PMD) general family of algorithms, which cover a wide range of novel and fund… (voir plus)amental methods in Reinforcement Learning (RL). Leveraging duality, we propose a momentum-based PMD update. By taking the functional route, our approach is independent of the policy parametrization and applicable to large-scale optimization, covering previous applications of momentum at the level of policy parameters as a special case. We theoretically analyze several properties of this approach and complement with a numerical ablation study, which serves to illustrate the policy optimization dynamics on the value polytope, relative to different algorithmic design choices in this space. We further characterize numerically several features of the problem setting relevant for functional acceleration, and lastly, we investigate the impact of approximation on their learning mechanics.

2024-06-19

ICML.cc/2024/Workshop/ARLET (poster)

doi.org

openreview.net

GAPS phase II: development and pilot results of the global assessment in pediatric surgery, an evidence-based pediatric surgical capacity assessment tool for low-resource settings.

Yasmine Yousef

Sarah Cairo

Etienne St-Louis

Laura F. Goodman

Doulia M. Hamad

Robert Baird

Emily R. Smith

Sherif Emil

Jean Martin Laberge

Mohamed Abdelmalak

Zipporah Gathuy

Faye Evans

Maryam Ghavami Adel

Ki K. Bertille

Milind Chitnis

Leecarlo Millano

Peter Nthumba

Sergio d’Agostino

Bruno Cigliano

Luis Enrique Zea-Salazar … (voir 4 de plus)

Emmanuel Ameh

Doruk Ozgediz

Elena Guadagno

Dan Poenaru

2024-06-19

Pediatric Surgery International (publié)

doi.org

Handling Delay in Reinforcement Learning Caused by Parallel Computations of Neurons

Ivan Anokhin

Rishav

Stephen Chung

Irina Rish

Samira Ebrahimi Kahou

Biological neural networks operate in parallel, a feature that sets them apart from artificial neural networks and can significantly enhance… (voir plus) inference speed. However, this parallelism introduces challenges: when each neuron operates asynchronously with a fixed execution time, an

2024-06-19

ICML.cc/2024/Workshop/ARLET (poster)

openreview.net

Realtime Reinforcement Learning: Towards Rapid Asynchronous Deployment of Large Models

Matthew D Riemer

Gopeshh Subbaraj

Glen Berseth

Irina Rish

Realtime environments change even as agents perform action inference and learning, thus requiring high interaction frequencies to effectivel… (voir plus)y minimize long-term regret. However, recent advances in machine learning involve larger neural networks with longer inference times, raising questions about their applicability in realtime systems where reaction time is crucial. We present an analysis of lower bounds on regret in realtime environments to show that minimizing long-term regret is generally impossible within the typical sequential interaction and learning paradigm, but often becomes possible when sufficient asynchronous compute is available. We propose novel algorithms for staggering asynchronous inference processes to ensure that actions are taken at consistent time intervals, and demonstrate that use of models with high action inference times is only constrained by the environment's effective stochasticity over the inference horizon, and not by action frequency. Our analysis shows that the number of inference processes needed scales linearly with increasing inference times while enabling use of models that are multiple orders of magnitude larger than existing approaches when learning from a realtime simulation of Game Boy games such as Pokemon and Tetris.

2024-06-19

ICML.cc/2024/Workshop/ARLET (poster)

openreview.net

A deeper look at depth pruning of LLMs

Shoaib Ahmed Siddiqui

Xin Dong

Greg Heinrich

Thomas Breuel

Jan Kautz

David Scott Krueger

Pavlo Molchanov

Large Language Models (LLMs) are not only resource-intensive to train but even more costly to deploy in production. Therefore, recent work h… (voir plus)as attempted to prune blocks of LLMs based on cheap proxies for estimating block importance, effectively removing 10% of blocks in well-trained LLaMa-2 and Mistral 7b models without any significant degradation of downstream metrics. In this paper, we explore different block importance metrics by considering adaptive metrics such as Shapley value in addition to static ones explored in prior work. We show that *adaptive metrics exhibit a trade-off in performance between tasks i.e., improvement on one task may degrade performance on the other due to differences in the computed block influences*. Furthermore, we extend this analysis from a complete block to individual self-attention and feed-forward layers, highlighting the propensity of the self-attention layers to be more amendable to pruning, even allowing ***removal of upto 33% of the self-attention layers without incurring any performance degradation on MMLU for Mistral 7b*** (significant reduction in costly maintenance of KV-cache). Finally, we look at simple performance recovery techniques to emulate the pruned layers by training lightweight additive bias or low-rank linear adapters. *Performance recovery using emulated updates avoids performance degradation for the initial blocks (up to 5% absolute improvement on MMLU)*, which is either competitive or superior to the learning-based technique.

2024-06-18

ICML.cc/2024/Workshop/TF2M (poster)

openreview.net

Conférence d'ouverture | Créer une IA plus sécuritaire pour la santé mentale des jeunes

TRAIL : IA responsable pour les professionnels et les leaders

Fondateur en résidence Mila Ventures

Éclaireurs autochtones en IA

Publications

Conférence d'ouverture | Créer une IA plus sécuritaire pour la santé mentale des jeunes

TRAIL : IA responsable pour les professionnels et les leaders

Fondateur en résidence Mila Ventures

Éclaireurs autochtones en IA

Mots-clés populaires:

Publications