Publications

DiffKillR: Killing and Recreating Diffeomorphisms for Cell Annotation in Dense Microscopy Images
Chen Liu
Danqi Liao
Alejandro Parada-Mayorga
Alejandro Ribeiro
Marcello DiStasio
The proliferation of digital microscopy images, driven by advances in automated whole slide scanning, presents significant opportunities for… (see more) biomedical research and clinical diagnostics. However, accurately annotating densely packed information in these images remains a major challenge. To address this, we introduce DiffKillR, a novel framework that reframes cell annotation as the combination of archetype matching and image registration tasks. DiffKillR employs two complementary neural networks: one that learns a diffeomorphism-invariant feature space for robust cell matching and another that computes the precise warping field between cells for annotation mapping. Using a small set of annotated archetypes, DiffKillR efficiently propagates annotations across large microscopy images, reducing the need for extensive manual labeling. More importantly, it is suitable for any type of pixel-level annotation. We will discuss the theoretical properties of DiffKillR and validate it on three microscopy tasks, demonstrating its advantages over existing supervised, semi-supervised, and unsupervised methods.
Graph Neural Networks Meet Probabilistic Graphical Models: A Survey
Hyperedge Representations with Hypergraph Wavelets: Applications to Spatial Transcriptomics
Xingzhi Sun
Charles Xu
João Felipe Rocha
Chen Liu
Benjamin Hollander-Bodie
Laney Goldman
Marcello DiStasio
Michael Perlmutter
In many data-driven applications, higher-order relationships among multiple objects are essential in capturing complex interactions. Hypergr… (see more)aphs, which generalize graphs by allowing edges to connect any number of nodes, provide a flexible and powerful framework for modeling such higher-order relationships. In this work, we introduce hypergraph diffusion wavelets and describe their favorable spectral and spatial properties. We demonstrate their utility for biomedical discovery in spatially resolved transcriptomics by applying the method to represent disease-relevant cellular niches for Alzheimer’s disease.
Investigating the Effectiveness of Explainability Methods in Parkinson's Detection from Speech
Speech impairments in Parkinson's disease (PD) provide significant early indicators for diagnosis. While models for speech-based PD detectio… (see more)n have shown strong performance, their interpretability remains underexplored. This study systematically evaluates several explainability methods to identify PD-specific speech features, aiming to support the development of accurate, interpretable models for clinical decision-making in PD diagnosis and monitoring. Our methodology involves (i) obtaining attributions and saliency maps using mainstream interpretability techniques, (ii) quantitatively evaluating the faithfulness of these maps and their combinations obtained via union and intersection through a range of established metrics, and (iii) assessing the information conveyed by the saliency maps for PD detection from an auxiliary classifier. Our results reveal that, while explanations are aligned with the classifier, they often fail to provide valuable information for domain experts.
Latent Representation Learning for Multimodal Brain Activity Translation
Dhananjay Bhaskar
Erica Lindsey Busch
Laurent Caplette
Rahul Singh
Nicholas B Turk-Browne
Neuroscience employs diverse neuroimaging techniques, each offering distinct insights into brain activity, from electrophysiological recordi… (see more)ngs such as EEG, which have high temporal resolution, to hemodynamic modalities such as fMRI, which have increased spatial precision. However, integrating these heterogeneous data sources remains a challenge, which limits a comprehensive understanding of brain function. We present the Spatiotemporal Alignment of Multimodal Brain Activity (SAMBA) framework, which bridges the spatial and temporal resolution gaps across modalities by learning a unified latent space free of modality-specific biases. SAMBA introduces a novel attention-based wavelet decomposition for spectral filtering of electrophysiological recordings, graph attention networks to model functional connectivity between functional brain units, and recurrent layers to capture temporal autocorrelations in brain signal. We show that the training of SAMBA, aside from achieving translation, also learns a rich representation of brain information processing. We showcase this classify external stimuli driving brain activity from the representation learned in hidden layers of SAMBA, paving the way for broad downstream applications in neuroscience research and clinical contexts.
LMAC-TD: Producing Time Domain Explanations for Audio Classifiers
Neural networks are typically black-boxes that remain opaque with regards to their decision mechanisms. Several works in the literature have… (see more) proposed post-hoc explanation methods to alleviate this issue. This paper proposes LMAC-TD, a post-hoc explanation method that trains a decoder to produce explanations directly in the time domain. This methodology builds upon the foundation of L-MAC, Listenable Maps for Audio Classifiers, a method that produces faithful and listenable explanations. We incorporate SepFormer, a popular transformer-based time-domain source separation architecture. We show through a user study that LMAC-TD significantly improves the audio quality of the produced explanations while not sacrificing from faithfulness.
Principal Curvatures Estimation with Applications to Single Cell Data
Yanlei Zhang
Xingzhi Sun
Charles Xu
Kincaid MacDonald
Dhananjay Bhaskar
Bastian Rieck
What Are They Doing? Joint Audio-Speech Co-Reasoning
In audio and speech processing, tasks usually focus on either the audio or speech modality, even when both sounds and human speech are prese… (see more)nt in the same audio clip. Recent Auditory Large Language Models (ALLMs) have made it possible to process audio and speech simultaneously within a single model, leading to further considerations of joint audio-speech tasks. In this paper, we establish a novel benchmark to investigate how well ALLMs can perform joint audio-speech processing. Specifically, we introduce Joint Audio-Speech Co-Reasoning (JASCO), a novel task that unifies audio and speech processing, strictly requiring co-reasoning across both modalities. We also release a scene-reasoning dataset called "What Are They Doing". Additionally, we provide deeper insights into the models' behaviors by analyzing their dependence on each modality.
Progressive Multi-Source Domain Adaptation for Personalized Facial Expression Recognition
Muhammad Osama Zeeshan
Alessandro Lameiras Koerich
Eric Grange
Accelerated learning of a noninvasive human brain-computer interface via manifold geometry
Erica Lindsey Busch
E. Chandra Fincke
Nicholas B Turk-Browne
Evaluating and Enhancing Segmentation Model Robustness with Metamorphic Testing
Seif Mzoughi
Mohamed Elshafeia
Spinal Cord Tract Integrity in Degenerative Cervical Myelopathy
Newton Cho
Abdul Al-Shawwa
W. Bradley Jacobs
Nathan Evaniew
Jacques Bouchard
Steve Casha
Stephan duPlessis
Peter Lewkonia
Fred Nicholls
Alex Soroceanu
Ganesh Swamy
Kenneth C. Thomas
Michael M. H. Yang
David W. Cadotte
Degenerative cervical myelopathy (DCM) is the most common cause of spinal dysfunction globally. Despite surgical intervention, motor dysfunc… (see more)tion may persist in many patients. The purpose of this study was to comprehensively examine specific spinal cord tract changes in patients with DCM, to better understand potential substrates for compensatory recovery of function. Cervical spinal cord MRI scans with diffusion tensor imaging were performed in patients with DCM and in healthy volunteers. Spinal Cord Toolbox was used to register the PAM50 template, which includes a probabilistic atlas of the white matter tracts of the spinal cord, to the imaging data. Fractional anisotropy (FA) was extracted for each tract at C3 above the level of maximal compression and compared between patients with DCM and healthy volunteers and between patients with mild vs moderate to severe DCM. We included 25 patients with DCM (13 mild and 12 moderate to severe) and 6 healthy volunteers. FA was significantly reduced in DCM subjects relative to healthy volunteers for the lateral corticospinal tract (mild DCM vs healthy ∆ = −0.13, P = .018; moderate to severe DCM vs healthy ∆ = −0.11, P = .047), fasciculus gracilis (mild DCM vs healthy ∆ = −0.16, P = .010; moderate to severe DCM vs healthy ∆ = −0.13, P = .039), and fasciculus cuneatus (mild DCM vs healthy ∆ = −0.16, P = .007; moderate to severe DCM vs healthy ∆ = −0.15, P = .012). There were no differences in FA for all tracts between mild and moderate-to-severe DCM subjects. Patients with DCM had altered diffusion tensor imaging signal in their lateral corticospinal tract, fasciculus gracilis, and fasciculus cuneatus in comparison with healthy volunteers. These findings indicate that DCM is characterized by injury to these structures, which suggests that other tracts within the cord could potentially act as substrates for compensatory motor recovery.