Publications

Pharmacists' perceptions of a machine learning model for the identification of atypical medication orders

Sophie-Camille Hogue

Flora Chen

Geneviève Brassard

Denis Lebel

Jean-François Bussières

Audrey Durand

Maxime Thibault

2021-05-06

J. Am. Medical Informatics Assoc. (published)

doi.org

Impact of individual rater style on deep learning uncertainty in medical imaging segmentation

Olivier Vincent

Charley Gros

Julien Cohen-Adad

While multiple studies have explored the relation between inter-rater variability and deep learning model uncertainty in medical segmentatio… (see more)n tasks, little is known about the impact of individual rater style. This study quantifies rater style in the form of bias and consistency and explores their impacts when used to train deep learning models. Two multi-rater public datasets were used, consisting of brain multiple sclerosis lesion and spinal cord grey matter segmentation. On both datasets, results show a correlation (

2021-05-05

ArXiv (preprint)

arxiv.org

PNS-GAN: Conditional Generation of Peripheral Nerve Signals in the Wavelet Domain via Adversarial Networks

Olivier Tessier-Larivière

Luke Y. Prince

Pascal Fortier-Poisson

Lorenz Wernisch

Oliver Armitage

Emil Hewage

Guillaume Lajoie

Blake Richards

Simulated datasets of neural recordings are a crucial tool in neural engineering for testing the ability of decoding algorithms to recover k… (see more)nown ground-truth. In this work, we introduce PNS-GAN, a generative adversarial network capable of producing realistic nerve recordings conditioned on physiological biomarkers. PNS-GAN operates in the wavelet domain to preserve both the timing and frequency of neural events with high resolution. PNS-GAN generates sequences of scaleograms from noise using a recurrent neural network and 2D transposed convolution layers. PNS-GAN discriminates over stacks of scaleograms with a network of 3D convolution layers. We find that our generated signal reproduces a number of characteristics of the real signal, including similarity in a canonical time-series feature-space, and contains physiologically related neural events including respiration modulation and similar distributions of afferent and efferent signalling.

2021-05-04

International IEEE/EMBS Conference on Neural Engineering (published)

doi.org

Toward Causal Representation Learning

Bernhard Schölkopf

Francesco Locatello

Stefan Bauer

Nan Rosemary Ke

Nal Kalchbrenner

Anirudh Goyal

Yoshua Bengio

The two fields of machine learning and graphical causality arose and are developed separately. However, there is, now, cross-pollination and… (see more) increasing interest in both fields to benefit from the advances of the other. In this article, we review fundamental concepts of causal inference and relate them to crucial open problems of machine learning, including transfer and generalization, thereby assaying how causality can contribute to modern machine learning research. This also applies in the opposite direction: we note that most work in causality starts from the premise that the causal variables are given. A central problem for AI and causality is, thus, causal representation learning, that is, the discovery of high-level causal variables from low-level observations. Finally, we delineate some implications of causality for machine learning and propose key research areas at the intersection of both communities.

2021-05-01

Proceedings of the IEEE (published)

doi.org

Design and Implementation of Smooth Renewable Power in Cloud Data Centers

Xinxin Liu

Yu Hua

Xue (Steve) Liu

Ling Yang

Yuanyuan Sun

The renewable power has been widely used in modern cloud data centers, which also produce large electricity bills and the negative impacts o… (see more)n environments. However, frequent fluctuation and intermittency of renewable power often cause the challenges in terms of the stability of both electricity grid and data centers, as well as decreasing the utilization of renewable power. Existing schemes fail to alleviate the renewable power fluctuation, which is caused by the essential properties of renewable power. In order to address this problem, we propose an efficient and easy-to-use smooth renewable power-aware scheme, called Smoother, which consists of Flexible Smoothing (FS) and Active Delay (AD). First, in order to smooth the fluctuation of renewable power, FS carries out the optimized charge/discharge operation via computing the minimum variance of the renewable power that is supplied to data centers per interval. Second, AD improves the utilization of renewable power via actively adjusting the execution time of deferrable workloads. Extensive experimental results via examining the traces of real-world data centers demonstrate that Smoother significantly reduces the negative impact of renewable power fluctuations on data centers and improves the utilization of renewable power by 250.88 percent on average. We have released the source codes for public use.

2021-04-30

IEEE Transactions on Cloud Computing (published)

doi.org

Editorial: Social Interaction in Neuropsychiatry

Victoria Leong

Danilo Bzdok

Frieder M. Paulus

Kevin Pelphrey

Elizabeth Redcay

Leonhard Schilbach

2021-04-29

Frontiers in Psychiatry (published)

doi.org

What is Going on Inside Recurrent Meta Reinforcement Learning Agents?

Safa Alver

Doina Precup

Recurrent meta reinforcement learning (meta-RL) agents are agents that employ a recurrent neural network (RNN) for the purpose of"learning a… (see more) learning algorithm". After being trained on a pre-specified task distribution, the learned weights of the agent's RNN are said to implement an efficient learning algorithm through their activity dynamics, which allows the agent to quickly solve new tasks sampled from the same distribution. However, due to the black-box nature of these agents, the way in which they work is not yet fully understood. In this study, we shed light on the internal working mechanisms of these agents by reformulating the meta-RL problem using the Partially Observable Markov Decision Process (POMDP) framework. We hypothesize that the learned activity dynamics is acting as belief states for such agents. Several illustrative experiments suggest that this hypothesis is true, and that recurrent meta-RL agents can be viewed as agents that learn to act optimally in partially observable environments consisting of multiple related tasks. This view helps in understanding their failure cases and some interesting model-based results reported in the literature.

2021-04-29

ArXiv (preprint)

arxiv.org

Tracking white and grey matter degeneration along the spinal cord axis in degenerative cervical myelopathy

Kevin Vallotton

Gergely David

Markus Hupp

Nikolai Pfender

Julien Cohen-Adad

Michael Fehlings

Rebecca S. Samson

Claudia A. M. Gandini Wheeler-Kingshott

Armin Curt

Patrick Freund

Maryam Seif

Objective: To determine tissue-specific neurodegeneration across the spinal cord in patients with mild-moderate degenerative cervical myelop… (see more)athy (DCM). Methods: Twenty-four mild-moderate DCM and 24 healthy subjects were recruited. In patients, a T2-weighted scan was acquired at the compression site, while in all participants a T2*-weighted and diffusion-weighted scan was acquired at the cervical level (C2-C3) and in the lumbar enlargement (i.e. rostral and caudal to the site of compression). We quantified intramedullary signal changes, maximal canal and cord compression, white (WM) and grey matter (GM) atrophy, and microstructural indices from diffusion-weighted scans. All patients underwent clinical (modified Japanese Orthopaedic Association (mJOA)) and electrophysiological assessments. Regression analysis assessed associations between MRI readouts and electrophysiological and clinical outcomes. Results: Twenty patients were classified with mild and four with moderate DCM using the mJOA scale. The most frequent site of compression was at C5-C6 level with maximum cord compression of 4.68{+/-}0.83 mm. Ten patients showed imaging evidence of cervical myelopathy. In the cervical cord, WM and GM atrophy and WM microstructural changes were evident, while in the lumbar cord only WM showed atrophy and microstructural changes. Remote cervical cord WM microstructural changes were pronounced in patients with radiological myelopathy and associated with impaired electrophysiology. Lumbar cord WM atrophy was associated with lower limb sensory impairments. Conclusion: Tissue-specific neurodegeneration revealed by quantitative MRI, already apparent across the spinal cord in mild-moderate DCM prior to the onset of severe clinical impairments. WM microstructural changes are particularly sensitive to remote pathologically and clinically eloquent changes in DCM.

2021-04-22

medRxiv (preprint)

doi.org

Gradient Masked Federated Optimization

Irene Tenison

Sreya Francis

Irina Rish

2021-04-21

ArXiv (preprint)

arxiv.org

hBERT + BiasCorp - Fighting Racism on the Web

Olawale Moses Onabola

Zhuang Ma

Xie Yang

Benjamin Akera

Ibraheem Abdulrahman

Jia Xue

Dianbo Liu

Yoshua Bengio

Subtle and overt racism is still present both in physical and online communities today and has impacted many lives in different segments of … (see more)the society. In this short piece of work, we present how we’re tackling this societal issue with Natural Language Processing. We are releasing BiasCorp, a dataset containing 139,090 comments and news segment from three specific sources - Fox News, BreitbartNews and YouTube. The first batch (45,000 manually annotated) is ready for publication. We are currently in the final phase of manually labeling the remaining dataset using Amazon Mechanical Turk. BERT has been used widely in several downstream tasks. In this work, we present hBERT, where we modify certain layers of the pretrained BERT model with the new Hopfield Layer. hBert generalizes well across different distributions with the added advantage of a reduced model complexity. We are also releasing a JavaScript library 3 and a Chrome Extension Application, to help developers make use of our trained model in web applications (say chat application) and for users to identify and report racially biased contents on the web respectively

2021-04-19

OpenReview.net/Archive (published)

openreview.net

INFOSHIELD: Generalizable Information-Theoretic Human-Trafficking Detection

Meng-Chieh Lee

Catalina Vajiac

Aayushi Kulshrestha

Sacha Lévy

Namyong Park

Cara Jones

Reihaneh Rabbany

Christos Faloutsos

Given a million escort advertisements, how can we spot near-duplicates? Such micro-clusters of ads are usually signals of human trafficking.… (see more) How can we summarize them, visually, to convince law enforcement to act? Can we build a general tool that works for different languages? Spotting micro-clusters of near-duplicate documents is useful in multiple, additional settings, including spam-bot detection in Twitter ads, plagiarism, and more.We present INFOSHIELD, which makes the following contributions: (a) Practical, being scalable and effective on real data, (b) Parameter-free and Principled, requiring no user-defined parameters, (c) Interpretable, finding a document to be the cluster representative, highlighting all the common phrases, and automatically detecting "slots", i.e. phrases that differ in every document; and (d) Generalizable, beating or matching domain-specific methods in Twitter bot detection and human trafficking detection respectively, as well as being language-independent finding clusters in Spanish, Italian, and Japanese. Interpretability is particularly important for the anti human-trafficking domain, where law enforcement must visually inspect ads.Our experiments on real data show that INFOSHIELD correctly identifies Twitter bots with an F1 score over 90% and detects human-trafficking ads with 84% precision. Moreover, it is scalable, requiring about 8 hours for 4 million documents on a stock laptop.

2021-04-19

2021 IEEE 37th International Conference on Data Engineering (ICDE) (published)

doi.org

Characterizing Idioms: Conventionality and Contingency

Michaela Socolof

Jackie Cheung

Michael Wagner

Timothy O'Donnell

Idioms are unlike most phrases in two important ways. First, words in an idiom have non-canonical meanings. Second, the non-canonical meanin… (see more)gs of words in an idiom are contingent on the presence of other words in the idiom. Linguistic theories differ on whether these properties depend on one another, as well as whether special theoretical machinery is needed to accommodate idioms. We define two measures that correspond to the properties above, and we show that idioms fall at the expected intersection of the two dimensions, but that the dimensions themselves are not correlated. Our results suggest that introducing special machinery to handle idioms may not be warranted.

2021-04-17

ArXiv (preprint)

doi.org

arxiv.org

Speed Science

Leading in a New Era

Supervision Requests

Publications

Speed Science

Leading in a New Era

Supervision Requests

Popular keywords:

Publications