Publications

Combining Modular Skills in Multitask Learning
Multiscale PHATE identifies multimodal signatures of COVID-19
Manik Kuchroo
Je-chun Huang
Patrick W. Wong
Jean-Christophe Grenier
Dennis L. Shung
Alexander Tong
C. Lucas
J. Klein
Daniel B. Burkhardt
Scott Gigante
Abhinav Godavarthi
Bastian Rieck
Benjamin Israelow
Michael Simonov
Tianyang Mao
Ji Eun Oh
Julio Silva
Takehiro Takahashi
C. Odio
Arnau Casanovas‐massana … (see 10 more)
John Byrne Fournier
Shelli F. Farhadian
C. D. Dela Cruz
A. Ko
Matthew Hirn
F. Wilson
Akiko Iwasaki
Smita Krishnaswamy
Multiscale PHATE identifies multimodal signatures of COVID-19
Manik Kuchroo
Je-chun Huang
Patrick Wong
Jean-Christophe Grenier
Dennis Shung
Alexander Tong
Carolina Lucas
Jon Klein
Daniel B. Burkhardt
Scott Gigante
Abhinav Godavarthi
Bastian Rieck
Benjamin Israelow
Michael Simonov
Tianyang Mao
Ji Eun Oh
Julio Silva
Takehiro Takahashi
Camila D. Odio
Arnau Casanovas-Massana … (see 10 more)
John Fournier
Shelli Farhadian
Charles S. Dela Cruz
Albert I. Ko
Matthew Hirn
F. Perry Wilson
Akiko Iwasaki
Smita Krishnaswamy
Personalized Prediction of Future Lesion Activity and Treatment Effect in Multiple Sclerosis from Baseline MRI
Joshua D. Durso-Finley
Jean-Pierre R. Falet
Brennan Nichyporuk
Douglas Arnold
Precision medicine for chronic diseases such as multiple sclerosis (MS) involves choosing a treatment which best balances efficacy and side … (see more)effects/preferences for individual patients. Making this choice as early as possible is important, as delays in finding an effective therapy can lead to irreversible disability accrual. To this end, we present the first deep neural network model for individualized treatment decisions from baseline magnetic resonance imaging (MRI) (with clinical information if available) for MS patients which (a) predicts future new and enlarging T2 weighted (NE-T2) lesion counts on follow-up MRI on multiple treatments and (b) estimates the conditional average treatment effect (CATE), as defined by the predicted future suppression of NE-T2 lesions, between different treatment options relative to placebo. Our model is validated on a proprietary federated dataset of 1817 multi-sequence MRIs acquired from MS patients during four multi-centre randomized clinical trials. Our framework achieves high average precision in the binarized regression of future NE-T2 lesions on five different treatments, identifies heterogeneous treatment effects, and provides a personalized treatment recommendation that accounts for treatment-associated risk (side effects, patient preference, administration difficulties).
Subgraph Retrieval Enhanced Model for Multi-hop Knowledge Base Question Answering
Jing Zhang
Xiaokang Zhang
Jifan Yu
Jie Tang
Cuiping Li
Hong Chen
Recent works on knowledge base question answering (KBQA) retrieve subgraphs for easier reasoning. The desired subgraph is crucial as a small… (see more) one may exclude the answer but a large one might introduce more noises. However, the existing retrieval is either heuristic or interwoven with the reasoning, causing reasoning on the partial subgraphs, which increases the reasoning bias when the intermediate supervision is missing. This paper proposes a trainable subgraph retriever (SR) decoupled from the subsequent reasoning process, which enables a plug-and-play framework to enhance any subgraph-oriented KBQA model. Extensive experiments demonstrate SR achieves significantly better retrieval and QA performance than existing retrieval methods. Via weakly supervised pre-training as well as the end-to-end fine-tuning, SR achieves new state-of-the-art performance when combined with NSM (He et al., 2021), a subgraph-oriented reasoner, for embedding-based KBQA methods. Codes and datasets are available online (https://github.com/RUCKBReasoning/SubgraphRetrievalKBQA)
Healthsheet: Development of a Transparency Artifact for Health Datasets
Diana Mincu
Subhrajit Roy
Andrew J Smart
Lauren Wilcox
Mahima Pushkarna
Jessica Schrouff
Razvan Amironesei
Nyalleng Moorosi
Katherine Heller
Machine learning (ML) approaches have demonstrated promising results in a wide range of healthcare applications. Data plays a crucial role i… (see more)n developing ML-based healthcare systems that directly affect people’s lives. Many of the ethical issues surrounding the use of ML in healthcare stem from structural inequalities underlying the way we collect, use, and handle data. Developing guidelines to improve documentation practices regarding the creation, use, and maintenance of ML healthcare datasets is therefore of critical importance. In this work, we introduce Healthsheet, a contextualized adaptation of the original datasheet questionnaire [22] for health-specific applications. Through a series of semi-structured interviews, we adapt the datasheets for healthcare data documentation. As part of the Healthsheet development process and to understand the obstacles researchers face in creating datasheets, we worked with three publicly-available healthcare datasets as our case studies, each with different types of structured data: Electronic health Records (EHR), clinical trial study data, and smartphone-based performance outcome measures. Our findings from the interviewee study and case studies show 1) that datasheets should be contextualized for healthcare, 2) that despite incentives to adopt accountability practices such as datasheets, there is a lack of consistency in the broader use of these practices 3) how the ML for health community views datasheets and particularly Healthsheets as diagnostic tool to surface the limitations and strength of datasets and 4) the relative importance of different fields in the datasheet to healthcare concerns.
More Than Meets the Eye: Art Engages the Social Brain
Janneke E. P. van Leeuwen
Jeroen Boomgaard
Sebastian J. Crutch
Jason D. Warren
More Than Meets the Eye: Art Engages the Social Brain
Janneke E. P. van Leeuwen
Jeroen Boomgaard
S. Crutch
J. Warren
Here we present the viewpoint that art essentially engages the social brain, by demonstrating how art processing maps onto the social brain … (see more)connectome—the most comprehensive diagram of the neural dynamics that regulate human social cognition to date. We start with a brief history of the rise of neuroaesthetics as the scientific study of art perception and appreciation, in relation to developments in contemporary art practice and theory during the same period. Building further on a growing awareness of the importance of social context in art production and appreciation, we then set out how art engages the social brain and outline candidate components of the “artistic brain connectome.” We explain how our functional model for art as a social brain phenomenon may operate when engaging with artworks. We call for closer collaborations between the burgeoning field of neuroaesthetics and arts professionals, cultural institutions and diverse audiences in order to fully delineate and contextualize this model. Complementary to the unquestionable value of art for art’s sake, we argue that its neural grounding in the social brain raises important practical implications for mental health, and the care of people living with dementia and other neurological conditions.
Quantitative electrophysiological assessments as predictive markers of lower limb motor recovery after spinal cord injury: a pilot study with an adaptive trial design
Yin Nan Huang
El-Mehdi Meftah
Charlotte H. Pion
Jean-Marc Mac-Thiong
Dorothy Barthélemy
Patterns of connectome variability in autism across five functional activation tasks: findings from the LEAP project
Tristan Looden
Dorothea L. Floris
Alberto Llera
R. Chauvin
Jumana Sara Bonnie Tobias Simon Sarah Christian F. Sven T Ahmad Ambrosino Auyeung Banaschewski Baron-Cohen B
Jumana Ahmad
Sara Ambrosino
Bonnie Auyeung
Tobias Banaschewski
Simon Baron-Cohen
Sarah Baumeister
Christian Beckmann
Sven Bölte
Thomas Bourgeron
Carsten Bours
Michael Brammer
Daniel Brandeis
Claudia Brogna
Yvette de Bruijn
Jan K. Buitelaar … (see 55 more)
Bhismadev Chakrabarti
Tony Charman
Ineke Cornelissen
Daisy Crawley
F. D. Acqua
Sarah Durston
Christine Ecker
Jessica Faulkner
Vincent Frouin
Pilar Garcés
David Goyard
Lindsay Ham
Hannah Hayward
Joerg F. Hipp
Rosemary Holt
Mark Johnson
Emily J. H. Jones
Prantik Kundu
Meng-Chuan Lai
Xavier Liogier D’ardhuy
Michael V. Lombardo
Eva Loth
David J. Lythgoe
René Mandl
Andre Marquand
Luke Mason
Maarten Mennes
Andreas Meyer-Lindenberg
Carolin Moessnang
Nico Mueller
Declan Murphy
Beth Oakley
Laurence O’Dwyer
Marianne Oldehinkel
Bob Oranje
Gahan Pandina
Antonio Persico
Annika Rausch
Barbara Ruggeri
Amber N. V. Ruigrok
Jessica Sabet
Roberto Sacco
Antonia San José Cáceres
Emily Simonoff
Will Spooren
Julian Tillmann
Roberto Toro
Heike Tost
Jack Waldman
Steven C. R. Williams
Caroline Wooldridge
Iva Ilioska
Ting Mei
Marcel P. Zwiers
Stringency of containment and closures on the growth of SARS-CoV-2 in Canada prior to accelerated vaccine roll-out
David Vickers
Stefan Baral
Sharmistha Mishra
Jeffrey C. Kwong
Maria Sundaram
Alan Katz
Andrew Calzavara
Mathieu Maheu-Giroux
Tyler Williamson
Population Genomics Approaches for Genetic Characterization of SARS-CoV-2 Lineages
Fatima Mostefai
I. Gamache
Arnaud N’Guessan
Justin Pelletier
Jessie Huang
Carmen Lia Murall
Ahmad Pesaranghader
Vanda Gaonac'h-Lovejoy
David J. Hamelin
Raphael Poujol
Jean-Christophe Grenier
Martin W. Smith
Étienne Caron
Morgan Craig
Smita Krishnaswamy
B. Jesse Shapiro
The genome of the Severe Acute Respiratory Syndrome coronavirus 2 (SARS-CoV-2), the pathogen that causes coronavirus disease 2019 (COVID-19)… (see more), has been sequenced at an unprecedented scale leading to a tremendous amount of viral genome sequencing data. To assist in tracing infection pathways and design preventive strategies, a deep understanding of the viral genetic diversity landscape is needed. We present here a set of genomic surveillance tools from population genetics which can be used to better understand the evolution of this virus in humans. To illustrate the utility of this toolbox, we detail an in depth analysis of the genetic diversity of SARS-CoV-2 in first year of the COVID-19 pandemic. We analyzed 329,854 high-quality consensus sequences published in the GISAID database during the pre-vaccination phase. We demonstrate that, compared to standard phylogenetic approaches, haplotype networks can be computed efficiently on much larger datasets. This approach enables real-time lineage identification, a clear description of the relationship between variants of concern, and efficient detection of recurrent mutations. Furthermore, time series change of Tajima's D by haplotype provides a powerful metric of lineage expansion. Finally, principal component analysis (PCA) highlights key steps in variant emergence and facilitates the visualization of genomic variation in the context of SARS-CoV-2 diversity. The computational framework presented here is simple to implement and insightful for real-time genomic surveillance of SARS-CoV-2 and could be applied to any pathogen that threatens the health of populations of humans and other organisms.