Publications

An Attentive Approach for Building Partial Reasoning Agents from Pixels

Safa Alver

We study the problem of building reasoning agents that are able to generalize in an effective manner. Towards this goal, we propose an end-to-end approach for building model-based reinforcement learning agents that dynamically focus their reasoning on the relevant aspects of the environment: after automatically identifying the distinct aspects of the environment, these agents dynamically filter out the relevant ones and then pass them to their simulator to perform partial reasoning. Unlike existing approaches, our approach works with pixel-based inputs and it allows for interpreting the focal points of the agent. Our quantitative analyses show that the proposed approach allows for effective generalization in high-dimensional domains with raw observational inputs. We also perform ablation analyses to validate our design choices. Finally, we demonstrate through qualitative analyses that our approach actually allows for building agents that focus their reasoning on the relevant aspects of the environment.

2024-09-17

TMLR (accepted)

openreview.net

Deep Learning in Ultrasound Localization Microscopy: Applications and Perspectives.

Brice Rauby

Paul Xing

Maxime Gasse

Jean Provost

Ultrasound Localization Microscopy (ULM) is a novel super-resolution imaging technique that can image the vasculature in vivo at depth with resolution far beyond the conventional limit of diffraction. By relying on the localization and tracking of clinically approved microbubbles injected in the blood stream, ULM can provide not only anatomical visualization but also hemodynamic quantification of the microvasculature of different tissues. Various deep-learning approaches have been proposed to address challenges in ULM including denoising, improving microbubble localization, estimating blood flow velocity, or performing aberration correction. Proposed deep learning methods often outperform their conventional counterparts by improving image quality and reducing processing time. In addition, their robustness to high concentrations of microbubbles can lead to reduced acquisition times in ULM, addressing a major hindrance to ULM clinical application. Herein, we propose a comprehensive review of the diversity of deep learning applications in ULM focusing on approaches assuming a sparse microbubble distribution. We first provide an overview of how existing studies vary in the constitution of their datasets or in the tasks targeted by the deep learning model. We also take a deeper look into the numerous approaches that have been proposed to improve the localization of microbubbles since they differ highly in their formulation of the optimization problem, their evaluation, or their network architectures. We finally discuss the current limitations and challenges of these methods, as well as the promises and potential of deep learning for ULM in the future.

2024-09-17

IEEE Transactions on Ultrasonics, Ferroelectrics and Frequency Control (published)

doi.org

An Empirical Study of Sensitive Information in Logs

Roozbeh Aghili

Heng Li

Foutse Khomh

2024-09-17

ArXiv (preprint)

doi.org

arxiv.org

Protecting Privacy in Software Logs: What Should Be Anonymized?

Roozbeh Aghili

Heng Li

Foutse Khomh

2024-09-17

ArXiv (preprint)

arxiv.org

Rethinking Teacher-Student Curriculum Learning through the Cooperative Mechanics of Experience

Manfred Diaz

Liam Paull

Andrea Tacchetti

Teacher-Student Curriculum Learning (TSCL) is a curriculum learning framework that draws inspiration from human cultural transmission and learning. It involves a teacher algorithm shaping the learning process of a learner algorithm by exposing it to controlled experiences. Despite its success, understanding the conditions under which TSCL is effective remains challenging. In this paper, we propose a data-centric perspective to analyze the underlying mechanics of the teacher-student interactions in TSCL. We leverage cooperative game theory to describe how the composition of the set of experiences presented by the teacher to the learner, as well as their order, influences the performance of the curriculum that is found by TSCL approaches. To do so, we demonstrate that for every TSCL problem, there exists an equivalent cooperative game, and several key components of the TSCL framework can be reinterpreted using game-theoretic principles. Through experiments covering supervised learning, reinforcement learning, and classical games, we estimate the cooperative values of experiences and use value-proportional curriculum mechanisms to construct curricula, even in cases where TSCL struggles. The framework and experimental setup we present in this work represent a novel foundation for a deeper exploration of TSCL, shedding light on its underlying mechanisms and providing insights into its broader applicability in machine learning.

2024-09-17

TMLR (accepted)

doi.org

openreview.net

Deconvolving X-ray Galaxy Cluster Spectra Using a Recurrent Inference Machine

C. L. Rhea

J. Hlavacek-Larrondo

Alexandre Adam

Ralph P. Kraft

Ákos Bogdán

Laurence Perreault-Levasseur

Marine Prunier

Recent advances in machine learning algorithms have unlocked new insights in observational astronomy by allowing astronomers to probe new frontiers. In this article, we present a methodology to disentangle the intrinsic X-ray spectrum of galaxy clusters from the instrumental response function. Employing state-of-the-art modeling software and data mining techniques of the Chandra data archive, we construct a set of 100,000 mock Chandra spectra. We train a recurrent inference machine (RIM) to take in the instrumental response and mock observation and output the intrinsic X-ray spectrum. The RIM can recover the mock intrinsic spectrum below the 1-

2024-09-16

ArXiv (preprint)

doi.org

arxiv.org

Abstract PR-05: Endocrine beta-cell stress promotes pancreatic ductal adenocarcinoma through endocrine-exocrine cell crosstalk

Cathy C. Garcia

Aarthi Venkat

Daniel C. McQuaid

Sherry Agabiti

Alex Tong

Rebecca Cardone

Richard G. Kibbey

Smita Krishnaswamy

Mandar Deepak Muzumdar

For a long time, the pancreas was thought to have separate cellular compartments that functioned distinctly from one another. The endocrine pancreas (islets of Langerhans) regulates glucose homeostasis, while the exocrine pancreas (acini and ducts) produces and secretes digestive enzymes. However, it has recently become clear that the endocrine and exocrine compartments communicate with one another, and dysfunction in one leads to dysfunction in the other, resulting in diabetes or pancreatitis. However, whether and how the endocrine pancreas drives the development of pancreatic ductal adenocarcinoma (PDAC), an exocrine tumor, remains unresolved. Strikingly, we found that genetic ablation of insulin-producing islet beta (β) cells (Akita) in a faithful Kras/Trp53-driven PDAC model (KPC: Kras LSL-G12D/+; Trp53 172/+; Pdx1-Cre) suppressed PDAC progression. Conversely, obesity-induced β cell hormone dysregulation promoted Kras-driven PDAC development. Single-cell RNA sequencing (scRNA-seq) analysis of wild-type and obese mice (high-fat diet-fed and leptin-deficient (Lep ob/ob)) revealed increased expression of the peptide hormone cholecystokinin (CCK) in a subset of β cells concordant with increasing obesity, and transgenic β cell overexpression of CCK was sufficient to promote exocrine tumorigenesis in KC mice. Combined in silico (pseudotime (TrajectoryNET) and archetypal (AANet) analysis) and experimental (CreER) lineage tracing demonstrated that CCK-expressing β cells originated from a pre-existing immature β cell population (virgin β cells). Granger causality analysis of transcriptional networks uncovered a stress-induced JNK-cJun pathway that promotes CCK expression in β cells, which we confirmed using JNK inhibitors in β cell models. Together, our findings identify cellular and molecular mechanisms of β cell adaptation to obesity that contribute to obesity-driven pancreatic cancer. Furthermore, we define a critical role for endocrine-exocrine signaling in PDAC progression and stress-induced β cell pathways which could be leveraged to target the endocrine pancreas to subvert exocrine tumorigenesis.
Citation Format: Cathy Garcia, Aarthi Venkat, Daniel McQuaid, Sherry Agabiti, Alex Tong, Rebecca Cardone, Richard Kibbey, Smita Krishnaswamy, Mandar Muzumdar. Endocrine beta-cell stress promotes pancreatic ductal adenocarcinoma through endocrine-exocrine cell crosstalk [abstract]. In: Proceedings of the AACR Special Conference in Cancer Research: Advances in Pancreatic Cancer Research; 2024 Sep 15-18; Boston, MA. Philadelphia (PA): AACR; Cancer Res 2024;84(17 Suppl_2):Abstract nr PR-05.

2024-09-15

Cancer Research (published)

doi.org

GFlowNet Pretraining with Inexpensive Rewards

Mohit Pandey

Gopeshh Subbaraj

Emmanuel Bengio

Generative Flow Networks (GFlowNets), a class of generative models, have recently emerged as a suitable framework for generating diverse and high-quality molecular structures by learning from unnormalized reward distributions. Previous works in this direction often restrict exploration by using predefined molecular fragments as building blocks, limiting the chemical space that can be accessed. In this work, we introduce Atomic GFlowNets (A-GFNs), a foundational generative model leveraging individual atoms as building blocks to explore drug-like chemical space more comprehensively. We propose an unsupervised pre-training approach using offline drug-like molecule datasets, which conditions A-GFNs on inexpensive yet informative molecular descriptors such as drug-likeliness, topological polar surface area, and synthetic accessibility scores. These properties serve as proxy rewards, guiding A-GFNs towards regions of chemical space that exhibit desirable pharmacological properties. We further our method by implementing a goal-conditioned fine-tuning process, which adapts A-GFNs to optimize for specific target properties. In this work, we pretrain A-GFN on the ZINC15 offline dataset and employ robust evaluation metrics to show the effectiveness of our approach when compared to other relevant baseline methods in drug design.

2024-09-15

ArXiv (preprint)

doi.org

arxiv.org
