Publications

scHiCyclePred: a deep learning framework for predicting cell cycle phases from single-cell Hi-C data using multi-scale interaction information

Yingfu Wu

Zhenqi Shi

Xiangfei Zhou

Pengyu Zhang

Xiuhui Yang

Jun Ding

Hao Wu

2024-07-30

Communications Biology (publié)

doi.org

Diversity-aware Population Models: Quantifying Associations between Socio-Spatial Factors and Cognitive Development in the ABCD Cohort

Avram Holmes

Sarah W. Yip

Danilo Bzdok

Population-level analyses are inherently complex due to a myriad of latent confounding effects that underlie the interdisciplinary topics of… (voir plus) research interest. Despite the mounting demand for generative population models, the limited generalizability to underrepresented groups hinders their widespread adoption in downstream applications. Interpretability and reliability are essential for clinicians and policymakers, while accuracy and precision are prioritized from an engineering standpoint. Thus, in domains such as population neuroscience, the challenge lies in determining a suitable approach to model population data effectively. Notably, the traditional strata-agnostic nature of existing methods in this field reveals a pertinent gap in quantitative techniques that directly capture major sources of population stratification. The emergence of population-scale cohorts, like the Adolescent Brain Cognitive DevelopmentSM (ABCD) Study, provides unparalleled opportunities to explore and characterize neurobehavioral and sociodemographic relationships comprehensively. We propose diversity-aware population modeling, a framework poised to standardize systematic incorporation of diverse attributes, structured with respect to intrinsic population stratification to obtain holistic insights. Here, we leverage Bayesian multilevel regression and poststratification, to elucidate inter-individual differences in the relationships between socioeconomic status (SES) and cognitive development. We constructed 14 varying-intercepts and varying-slopes models to investigate 3 cognitive phenotypes and 5 sociodemographic variables (SDV), across 17 US states and 5 race subgroups. SDVs exhibited systemic socio-spatial effects that served as fundamental drivers of variation in cognitive outcomes. Low SES was disproportionately associated with cognitive development among Black and Hispanic children, while high SES was a robust predictor of cognitive development only among White and Asian children, consistent with the minorities’ diminished returns (MDRs) theory. Notably, adversity-susceptible subgroups demonstrated an expressive association with fluid cognition compared to crystallized cognition. Poststratification proved effective in correcting group attribution biases, particularly in Pennsylvania, highlighting sampling discrepancies in US states with the highest percentage of marginalized participants in the ABCD Study©. Our collective analyses underscore the inextricable link between race and geographic location within the US. We emphasize the importance of diversity-aware population models that consider the intersectional composition of society to derive precise and interpretable insights across applicable domains.

2024-07-29

Research Square (publié)

doi.org

Strong Gravitational Lensing as a Probe of Dark Matter

S. Vegetti

S. Birrer

G. Despali

C.D. Fassnacht

D. Gilman

Y. Hezaveh

L.

L. Perreault Levasseur

J.P. McKean

D.M. Powell

C.M. O'Riordan

G.

G. Vernardos

Dark matter structures within strong gravitational lens galaxies and along their line of sight leave a gravitational imprint on the multiple… (voir plus) images of lensed sources. Strong gravitational lensing provides, therefore, a key test of different dark matter models in a way that is independent of the baryonic content of matter structures on subgalactic scales. In this chapter, we describe how galaxy-scale strong gravitational lensing observations are sensitive to the physical nature of dark matter. We provide a historical perspective of the field, and review its current status. We discuss the challenges and advances in terms of data, treatment of systematic errors and theoretical predictions, that will enable one to deliver a stringent and robust test of different dark matter models in the near future. With the advent of the next generation of sky surveys, the number of known strong gravitational lens systems is expected to increase by several orders of magnitude. Coupled with high-resolution follow-up observations, these data will provide a key opportunity to constrain the properties of dark matter with strong gravitational lensing.

2024-07-29

Space Science Reviews (publié)

doi.org

arxiv.org

AAPM task group report 288: Recommendations for guiding radiotherapy event narratives

Bruce Thomadsen

Ajay Kapur

Bette Blankenship

Barrett Caldwell

Lindsey Claps

Joanne Cunningham

Jennifer Elee

Suzanne Evans

Eric Ford

Debbie Gilley

Sandra Hayden

Kathleen Hintenlang

Rishabh Kapoor

J. Kildea

Linda Kroger

Ksenija Kujundzic

Qing Liang

Sasa Mutic

Anita O'Donovan

Michael O'Hara … (voir 6 de plus)

Zoubir Ouhib

Jatinder Palta

Todd Pawlicki

William Salter

Stacey Schmidt

Sugata Tripathi

2024-07-28

Medical Physics (Lancaster) (publié)

doi.org

scCross: a deep generative model for unifying single-cell multi-omics with seamless integration, cross-modal generation, and in silico exploration

Xiuhui Yang

Koren K. Mann

Hao Wu

Jun Ding

Single-cell multi-omics data reveal complex cellular states, providing significant insights into cellular dynamics and disease. Yet, integra… (voir plus)tion of multi-omics data presents challenges. Some modalities have not reached the robustness or clarity of established transcriptomics. Coupled with data scarcity for less established modalities and integration intricacies, these challenges limit our ability to maximize single-cell omics benefits. We introduce scCross, a tool leveraging variational autoencoders, generative adversarial networks, and the mutual nearest neighbors (MNN) technique for modality alignment. By enabling single-cell cross-modal data generation, multi-omics data simulation, and in silico cellular perturbations, scCross enhances the utility of single-cell multi-omics studies. The online version contains supplementary material available at 10.1186/s13059-024-03338-z.

2024-07-28

Genome Biology (publié)

doi.org

The report of AAPM task group 288: Recommendations for guiding radiotherapy event narratives.

Bruce Thomadsen

Ajay Kapur

Bette Blankenship

Barrett Caldwell

Lindsey Claps

Joanne Cunningham

Jennifer Elee

Suzanne Evans

Eric Ford

Debbie Gilley

Sandra Hayden

Kathleen Hintenlang

Rishabh Kapoor

J. Kildea

Linda Kroger

Ksenija Kujundzic

Qing Liang

Sasa Mutic

Anita O'Donovan

Michael O'Hara … (voir 6 de plus)

Zoubir Ouhib

Jatinder Palta

Todd Pawlicki

William Salter

Stacey Schmidt

Sugata Tripathi

Incident reporting and learning systems provide an opportunity to identify systemic vulnerabilities that contribute to incidents and potenti… (voir plus)ally degrade quality. The narrative of an incident is intended to provide a clear, easy to understand description of an incident. Unclear, incomplete or poorly organized narratives compromise the ability to learn from them. This report provides guidance for drafting effective narratives, with particular attention to the use of narratives in incident reporting and learning systems (IRLS). Examples are given that compare effective and less than effective narratives. This report is mostly directed to organizations that maintain IRLS, but also may be helpful for individuals who desire to write a useful narrative for entry into such a system. Recommendations include the following: (1) Systems should allow a one- or two-sentence, free-text synopsis of an incident without guessing at causes; (2) Information included should form a sequence of events with chronology; and (3) Reporting and learning systems should consider using the headings suggested to guide the reporter through the narrative: (a) incident occurrences and actions by role; (b) prior circumstances and actions; (c) method by which the incident was identified; (d) equipment related details if relevant; (e) recovery actions by role; (f) relevant time span between responses; (g) and how individuals affected during or immediately after incident. When possible and appropriate, supplementary information including relevant data elements should be included using numerical scales or drop-down choices outside of the narrative. Information that should not be included in the narrative includes: (a) patient health information (PHI); (b) conjecture or blame; (c) jargon abbreviations or details without specifying their significance; (d) causal analysis.

2024-07-28

Medical Physics (Lancaster) (publié)

doi.org

Empowering Clinicians with Medical Decision Transformers: A Framework for Sepsis Treatment

Aamer Abdul Rahman

Pranav Agarwal

Rita Noumeir

Philippe Jouvet

Vincent Michalski

S Ebrahimi Kahou

Offline reinforcement learning has shown promise for solving tasks in safety-critical settings, such as clinical decision support. Its appli… (voir plus)cation, however, has been limited by the lack of interpretability and interactivity for clinicians. To address these challenges, we propose the medical decision transformer (MeDT), a novel and versatile framework based on the goal-conditioned reinforcement learning paradigm for sepsis treatment recommendation. MeDT uses the decision transformer architecture to learn a policy for drug dosage recommendation. During offline training, MeDT utilizes collected treatment trajectories to predict administered treatments for each time step, incorporating known treatment outcomes, target acuity scores, past treatment decisions, and current and past medical states. This analysis enables MeDT to capture complex dependencies among a patient's medical history, treatment decisions, outcomes, and short-term effects on stability. Our proposed conditioning uses acuity scores to address sparse reward issues and to facilitate clinician-model interactions, enhancing decision-making. Following training, MeDT can generate tailored treatment recommendations by conditioning on the desired positive outcome (survival) and user-specified short-term stability improvements. We carry out rigorous experiments on data from the MIMIC-III dataset and use off-policy evaluation to demonstrate that MeDT recommends interventions that outperform or are competitive with existing offline reinforcement learning methods while enabling a more interpretable, personalized and clinician-directed approach.

2024-07-27

ArXiv (prépublication)

doi.org

arxiv.org

On the benefits of pixel-based hierarchical policies for task generalization

T. Cristea-Platon

Bogdan Mazoure

Josh Susskind

Walter Talbott

Reinforcement learning practitioners often avoid hierarchical policies, especially in image-based observation spaces. Typically, the single-… (voir plus)task performance improvement over flat-policy counterparts does not justify the additional complexity associated with implementing a hierarchy. However, by introducing multiple decision-making levels, hierarchical policies can compose lower-level policies to more effectively generalize between tasks, highlighting the need for multi-task evaluations. We analyze the benefits of hierarchy through simulated multi-task robotic control experiments from pixels. Our results show that hierarchical policies trained with task conditioning can (1) increase performance on training tasks, (2) lead to improved reward and state-space generalizations in similar tasks, and (3) decrease the complexity of fine tuning required to solve novel tasks. Thus, we believe that hierarchical policies should be considered when building reinforcement learning architectures capable of generalizing between tasks.

2024-07-26

ArXiv (prépublication)

doi.org

arxiv.org

Canada's Provincial Covid-19 Pandemic Modelling Efforts: A Review of Mathematical Models and Their Impacts on the Responses

Yiqing Xia

Jorge Luis Flores Anato

Caroline Colijin

Naveed Janjua

Michael Otterstatter

Mike Irvine

Tyler Williamson

Marie B. Varughese

Michael Li

Nathaniel Osgood

David J. D. Earn

Beate Sander

Lauren E. Cipriano

Kumar Murty

Fanyu Xiu

Arnaud Godin

David L Buckeridge

Amy Hurford

Sharmistha Mishra

Mathieu Maheu-Giroux

2024-07-24

Canadian journal of public health (publié)

doi.org

Enhancing Agent Learning through World Dynamics Modeling

Xingdi Yuan

Large language models (LLMs) have been increasingly applied to tasks in language understanding and interactive decision-making, with their i… (voir plus)mpressive performance largely attributed to the extensive domain knowledge embedded within them. However, the depth and breadth of this knowledge can vary across domains. Many existing approaches assume that LLMs possess a comprehensive understanding of their environment, often overlooking potential gaps in their grasp of actual world dynamics. To address this, we introduce Discover, Verify, and Evolve (DiVE), a framework that discovers world dynamics from a small number of demonstrations, verifies the accuracy of these dynamics, and evolves new, advanced dynamics tailored to the current situation. Through extensive evaluations, we assess the impact of each component on performance and compare the dynamics generated by DiVE to human-annotated dynamics. Our results show that LLMs guided by DiVE make more informed decisions, achieving rewards comparable to human players in the Crafter environment and surpassing methods that require prior task-specific training in the MiniHack environment.

2024-07-24

arXiv (Cornell University) (prépublication)

doi.org

arxiv.org

Multi-Fidelity Active Learning with GFlowNets

Alex Hernández-García

Nikita Saxena

Moksh J. Jain

Cheng-Hao Liu

Yoshua Bengio

In the last decades, the capacity to generate large amounts of data in science and engineering applications has been growing steadily. Meanw… (voir plus)hile, the progress in machine learning has turned it into a suitable tool to process and utilise the available data. Nonetheless, many relevant scientific and engineering problems present challenges where current machine learning methods cannot yet efficiently leverage the available data and resources. For example, in scientific discovery, we are often faced with the problem of exploring very large, high-dimensional spaces, where querying a high fidelity, black-box objective function is very expensive. Progress in machine learning methods that can efficiently tackle such problems would help accelerate currently crucial areas such as drug and materials discovery. In this paper, we propose the use of GFlowNets for multi-fidelity active learning, where multiple approximations of the black-box function are available at lower fidelity and cost. GFlowNets are recently proposed methods for amortised probabilistic inference that have proven efficient for exploring large, high-dimensional spaces and can hence be practical in the multi-fidelity setting too. Here, we describe our algorithm for multi-fidelity active learning with GFlowNets and evaluate its performance in both well-studied synthetic tasks and practically relevant applications of molecular discovery. Our results show that multi-fidelity active learning with GFlowNets can efficiently leverage the availability of multiple oracles with different costs and fidelities to accelerate scientific discovery and engineering design.

2024-07-24

TMLR (accepté)

doi.org

openreview.net

Development of Error Passing Network for Optimizing the Prediction of VO$_2$ peak in Childhood Acute Leukemia Survivors

Nicolas Raymond

Hakima Laribi

Maxime Caru

Mehdi Mitiche

Valerie Marcil

Maja Krajinovic

Daniel Curnier

Daniel Sinnett

Martin Vallières

Approximately two-thirds of survivors of childhood acute lymphoblastic leukemia (ALL) cancer develop late adverse effects post-treatment. Pr… (voir plus)ior studies explored prediction models for personalized follow-up, but none integrated the usage of neural networks to date. In this work, we propose the Error Passing Network (EPN), a graph-based method that leverages relationships between samples to propagate residuals and adjust predictions of any machine learning model. We tested our approach to estimate patients’ \vo peak, a reliable indicator of their cardiac health. We used the EPN in conjunction with several baseline models and observed up to 12.16% improvement in the mean average percentage error compared to the last established equation predicting \vo peak in childhood ALL survivors. Along with this performance improvement, our final model is more efficient considering that it relies only on clinical variables that can be self-reported by patients, therefore removing the previous need of executing a resource-consuming physical test.

2024-07-23

Proceedings of the fifth Conference on Health, Inference, and Learning (publié)

proceedings.mlr.press

Mila sur Udemy

Publications du Fellowship en politiques de l'IA

La plateforme Mila Ventures

Publications

Mila sur Udemy

Publications du Fellowship en politiques de l'IA

La plateforme Mila Ventures

Mots-clés populaires:

Publications