Learn how to leverage generative AI to support and improve your productivity at work. The next cohort will take place online on April 28 and 30, 2026, in French.
We use cookies to analyze the browsing and usage of our website and to personalize your experience. You can disable these technologies at any time, but this may limit certain functionalities of the site. Read our Privacy Policy for more information.
Setting cookies
You can enable and disable the types of cookies you wish to accept. However certain choices you make could affect the services offered on our sites (e.g. suggestions, personalised ads, etc.).
Essential cookies
These cookies are necessary for the operation of the site and cannot be deactivated. (Still active)
Analytics cookies
Do you accept the use of cookies to measure the audience of our sites?
Multimedia Player
Do you accept the use of cookies to display and allow you to watch the video content hosted by our partners (YouTube, etc.)?
Publications
Options of Interest: Temporal Abstraction with Interest Functions
Temporal abstraction refers to the ability of an agent to use behaviours of controllers which act for a limited, variable amount of time. Th… (see more)e options framework describes such behaviours as consisting of a subset of states in which they can initiate, an internal policy and a stochastic termination condition. However, much of the subsequent work on option discovery has ignored the initiation set, because of difficulty in learning it from data. We provide a generalization of initiation sets suitable for general function approximation, by defining an interest function associated with an option. We derive a gradient-based learning algorithm for interest functions, leading to a new interest-option-critic architecture. We investigate how interest functions can be leveraged to learn interpretable and reusable temporal abstractions. We demonstrate the efficacy of the proposed approach through quantitative and qualitative results, in both discrete and continuous environments.
2020-04-02
Proceedings of the AAAI Conference on Artificial Intelligence (published)
Typical methods for unsupervised text style transfer often rely on two key ingredients: 1) seeking the explicit disentanglement of the conte… (see more)nt and the attributes, and 2) troublesome adversarial learning. In this paper, we show that neither of these components is indispensable. We propose a new framework that utilizes the gradients to revise the sentence in a continuous space during inference to achieve text style transfer. Our method consists of three key components: a variational auto-encoder (VAE), some attribute predictors (one for each attribute), and a content predictor. The VAE and the two types of predictors enable us to perform gradient-based optimization in the continuous space, which is mapped from sentences in a discrete space, to find the representation of a target sentence with the desired attributes and preserved content. Moreover, the proposed method naturally has the ability to simultaneously manipulate multiple fine-grained attributes, such as sentence length and the presence of specific words, when performing text style transfer tasks. Compared with previous adversarial learning based methods, the proposed method is more interpretable, controllable and easier to train. Extensive experimental studies on three popular text style transfer tasks show that the proposed method significantly outperforms five state-of-the-art methods.
2020-04-02
Proceedings of the AAAI Conference on Artificial Intelligence (published)
Emergent communication in artificial agents has been studied to understand language evolution, as well as to develop artificial systems that… (see more) learn to communicate with humans. We show that agents performing a cooperative navigation task in various gridworld environments learn an interpretable communication protocol that enables them to efficiently, and in many cases, optimally, solve the task. An analysis of the agents' policies reveals that emergent signals spatially cluster the state space, with signals referring to specific locations and spatial directions such as "left", "up", or "upper left room". Using populations of agents, we show that the emergent protocol has basic compositional structure, thus exhibiting a core property of natural language.
GitHub Repositories with Links to Academic Papers: Open Access, Traceability, and Evolution
Supatsara Wattanakriengkrai
Bodin Chinthanet
Hideaki Hata
Raula Gaikovina Kula
Christoph Treude
Jin L.C. Guo
Ken-ichi Matsumoto
Traceability between published scientific breakthroughs and their implementation is essential, especially in the case of Open Source Softwar… (see more)e implements bleeding edge science into its code. However, aligning the link between GitHub repositories and academic papers can prove difficult, and the link impact remains unknown. This paper investigates the role of academic paper references contained in these repositories. We conducted a large-scale study of 20 thousand GitHub repositories to establish prevalence of references to academic papers. We use a mixed-methods approach to identify Open Access (OA), traceability and evolutionary aspects of the links. Although referencing a paper is not typical, we find that a vast majority of referenced academic papers are OA. In terms of traceability, our analysis revealed that machine learning is the most prevalent topic of repositories. These repositories tend to be affiliated with academic communities. More than half of the papers do not link back to any repository. A case study of referenced arXiv paper shows that most of these papers are high-impact and influential and do align with academia, referenced by repositories written in different programming languages. From the evolutionary aspect, we find very few changes of papers being referenced and links to them.
Admission to hospital provides the opportunity to review patient medications; however, the extent to which the safety of drug regimens chang… (see more)es after hospitalization is unclear.
To estimate the number of potentially inappropriate medications (PIMs) prescribed to patients at hospital discharge and their association with the risk of adverse events 30 days after discharge.
Prospective cohort study.
Tertiary care hospitals within the McGill University Health Centre Network in Montreal, Quebec, Canada.
Patients from internal medicine, cardiac, and thoracic surgery, aged 65 years and older, admitted between October 2014 and November 2016.
Abstracted chart data were linked to provincial health databases. PIMs were identified using AGS (American Geriatrics Society) Beers Criteria®, STOPP, and Choosing Wisely statements. Multivariable logistic regression and Cox models were used to assess the association between PIMs and adverse events.
Of 2,402 included patients, 1,381 (57%) were male; median age was 76 years (interquartile range [IQR] = 70‐82 years); and eight discharge medications were prescribed (IQR = 2‐8). A total of 1,576 (66%) patients were prescribed at least one PIM at discharge; 1,176 (49%) continued a PIM from prior to admission, and 755 (31%) were prescribed at least one new PIM. In the 30 days after discharge, 218 (9%) experienced an adverse drug event (ADE) and 862 (36%) visited the emergency department (ED), were rehospitalized, or died. After adjustment, each additional new PIM and continued community PIM were respectively associated with a 21% (odds ratio [OR] = 1.21; 95% confidence interval [CI] = 1.01‐1.45) and a 10% (OR = 1.10; 95% CI = 1.01‐1.21) increased odds of ADEs. They were also respectively associated with a 13% (hazard ratio [HR] = 1.13; 95% CI = 1.03‐1.26) and a 5% (HR = 1.05; 95% CI = 1.00‐1.10) increased risk of ED visits, rehospitalization, and death.
Two in three hospitalized patients were prescribed a PIM at discharge, and increasing numbers of PIMs were associated with an increased risk of ADEs and all‐cause adverse events. Improving hospital prescribing practices may reduce the frequency of PIMs and associated adverse events. J Am Geriatr Soc 68:1184–1192, 2020.
2020-03-30
Journal of the American Geriatrics Society (published)
Synaptic plasticity is believed to be a key physiological mechanism for learning. It is well-established that it depends on pre and postsyna… (see more)ptic activity. However, models that rely solely on pre and postsynaptic activity for synaptic changes have, to date, not been able to account for learning complex tasks that demand credit assignment in hierarchical networks. Here, we show that if synaptic plasticity is regulated by high-frequency bursts of spikes, then neurons higher in a hierarchical circuit can coordinate the plasticity of lower-level connections. Using simulations and mathematical analyses, we demonstrate that, when paired with short-term synaptic dynamics, regenerative activity in the apical dendrites, and synaptic plasticity in feedback pathways, a burst-dependent learning rule can solve challenging tasks that require deep network architectures. Our results demonstrate that well-known properties of dendrites, synapses, and synaptic plasticity are sufficient to enable sophisticated learning in hierarchical circuits.
Unpaired Image-to-Image Translation (I2IT) tasks often suffer from lack of data, a problem which self-supervised learning (SSL) has recently… (see more) been very popular and successful at tackling. Leveraging auxiliary tasks such as rotation prediction or generative colorization, SSL can produce better and more robust representations in a low data regime. Training such tasks along an I2IT task is however computationally intractable as model size and the number of task grow. On the other hand, learning sequentially could incur catastrophic forgetting of previously learned tasks. To alleviate this, we introduce Lifelong Self-Supervision (LiSS) as a way to pre-train an I2IT model (e.g., CycleGAN) on a set of self-supervised auxiliary tasks. By keeping an exponential moving average of past encoders and distilling the accumulated knowledge, we are able to maintain the network's validation performance on a number of tasks without any form of replay, parameter isolation or retraining techniques typically used in continual learning. We show that models trained with LiSS perform better on past tasks, while also being more robust than the CycleGAN baseline to color bias and entity entanglement (when two entities are very close).
Planning as Inference in Epidemiological Dynamics Models
Frank Wood
Andrew Warrington
Saeid Naderiparizi
Christian Weilbach
Vaden Masrani
William Harvey
Adam Ścibior
Boyan Beronov
Ali Nasseri
In this work we demonstrate how to automate parts of the infectious disease-control policy-making process via performing inference in existi… (see more)ng epidemiological models. The kind of inference tasks undertaken include computing the posterior distribution over controllable, via direct policy-making choices, simulation model parameters that give rise to acceptable disease progression outcomes. Among other things, we illustrate the use of a probabilistic programming language that automates inference in existing simulators. Neither the full capabilities of this tool for automating inference nor its utility for planning is widely disseminated at the current time. Timely gains in understanding about how such simulation-based models and inference automation tools applied in support of policymaking could lead to less economically damaging policy prescriptions, particularly during the current COVID-19 pandemic.