Publications

DiffKillR: Killing and Recreating Diffeomorphisms for Cell Annotation in Dense Microscopy Images

Chen Liu

Danqi Liao

Alejandro Parada-Mayorga

Alejandro Ribeiro

Marcello DiStasio

The proliferation of digital microscopy images, driven by advances in automated whole slide scanning, presents significant opportunities for… (see more) biomedical research and clinical diagnostics. However, accurately annotating densely packed information in these images remains a major challenge. To address this, we introduce DiffKillR, a novel framework that reframes cell annotation as the combination of archetype matching and image registration tasks. DiffKillR employs two complementary neural networks: one that learns a diffeomorphism-invariant feature space for robust cell matching and another that computes the precise warping field between cells for annotation mapping. Using a small set of annotated archetypes, DiffKillR efficiently propagates annotations across large microscopy images, reducing the need for extensive manual labeling. More importantly, it is suitable for any type of pixel-level annotation. We will discuss the theoretical properties of DiffKillR and validate it on three microscopy tasks, demonstrating its advantages over existing supervised, semi-supervised, and unsupervised methods.

2025-04-05

ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (published)

doi.org

arxiv.org

Graph Neural Networks Meet Probabilistic Graphical Models: A Survey

Qian Zhang

2025-04-05

IEEE International Conference on Acoustics, Speech, and Signal Processing (published)

doi.org

Hyperedge Representations with Hypergraph Wavelets: Applications to Spatial Transcriptomics

Xingzhi Sun

Charles Xu

João Felipe Rocha

Chen Liu

Benjamin Hollander-Bodie

Laney Goldman

Marcello DiStasio

Michael Perlmutter

Smita Krishnaswamy

In many data-driven applications, higher-order relationships among multiple objects are essential in capturing complex interactions. Hypergr… (see more)aphs, which generalize graphs by allowing edges to connect any number of nodes, provide a flexible and powerful framework for modeling such higher-order relationships. In this work, we introduce hypergraph diffusion wavelets and describe their favorable spectral and spatial properties. We demonstrate their utility for biomedical discovery in spatially resolved transcriptomics by applying the method to represent disease-relevant cellular niches for Alzheimer’s disease.

2025-04-05

ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (published)

doi.org

arxiv.org

Investigating the Effectiveness of Explainability Methods in Parkinson's Detection from Speech

Paolo Torroni

Speech impairments in Parkinson's disease (PD) provide significant early indicators for diagnosis. While models for speech-based PD detectio… (see more)n have shown strong performance, their interpretability remains underexplored. This study systematically evaluates several explainability methods to identify PD-specific speech features, aiming to support the development of accurate, interpretable models for clinical decision-making in PD diagnosis and monitoring. Our methodology involves (i) obtaining attributions and saliency maps using mainstream interpretability techniques, (ii) quantitatively evaluating the faithfulness of these maps and their combinations obtained via union and intersection through a range of established metrics, and (iii) assessing the information conveyed by the saliency maps for PD detection from an auxiliary classifier. Our results reveal that, while explanations are aligned with the classifier, they often fail to provide valuable information for domain experts.

2025-04-05

2025 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW) (published)

doi.org

arxiv.org

Latent Representation Learning for Multimodal Brain Activity Translation

Arman Afrasiyabi

Dhananjay Bhaskar

Erica Lindsey Busch

Laurent Caplette

Rahul Singh

Guillaume Lajoie

Nicholas B Turk-Browne

Smita Krishnaswamy

Neuroscience employs diverse neuroimaging techniques, each offering distinct insights into brain activity, from electrophysiological recordi… (see more)ngs such as EEG, which have high temporal resolution, to hemodynamic modalities such as fMRI, which have increased spatial precision. However, integrating these heterogeneous data sources remains a challenge, which limits a comprehensive understanding of brain function. We present the Spatiotemporal Alignment of Multimodal Brain Activity (SAMBA) framework, which bridges the spatial and temporal resolution gaps across modalities by learning a unified latent space free of modality-specific biases. SAMBA introduces a novel attention-based wavelet decomposition for spectral filtering of electrophysiological recordings, graph attention networks to model functional connectivity between functional brain units, and recurrent layers to capture temporal autocorrelations in brain signal. We show that the training of SAMBA, aside from achieving translation, also learns a rich representation of brain information processing. We showcase this classify external stimuli driving brain activity from the representation learned in hidden layers of SAMBA, paving the way for broad downstream applications in neuroscience research and clinical contexts.

2025-04-05

ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (published)

doi.org

arxiv.org

LMAC-TD: Producing Time Domain Explanations for Audio Classifiers

Neural networks are typically black-boxes that remain opaque with regards to their decision mechanisms. Several works in the literature have… (see more) proposed post-hoc explanation methods to alleviate this issue. This paper proposes LMAC-TD, a post-hoc explanation method that trains a decoder to produce explanations directly in the time domain. This methodology builds upon the foundation of L-MAC, Listenable Maps for Audio Classifiers, a method that produces faithful and listenable explanations. We incorporate SepFormer, a popular transformer-based time-domain source separation architecture. We show through a user study that LMAC-TD significantly improves the audio quality of the produced explanations while not sacrificing from faithfulness.

2025-04-05

ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (published)

doi.org

arxiv.org

Principal Curvatures Estimation with Applications to Single Cell Data

Yanlei Zhang

Lydia Mezrag

Xingzhi Sun

Charles Xu

Kincaid MacDonald

Dhananjay Bhaskar

Smita Krishnaswamy

Guy Wolf

Bastian Rieck

2025-04-05

ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (published)

doi.org

arxiv.org

What Are They Doing? Joint Audio-Speech Co-Reasoning

Yingzhi Wang

Pooneh Mousavi

Artem Ploujnikov

Mirco Ravanelli

In audio and speech processing, tasks usually focus on either the audio or speech modality, even when both sounds and human speech are prese… (see more)nt in the same audio clip. Recent Auditory Large Language Models (ALLMs) have made it possible to process audio and speech simultaneously within a single model, leading to further considerations of joint audio-speech tasks. In this paper, we establish a novel benchmark to investigate how well ALLMs can perform joint audio-speech processing. Specifically, we introduce Joint Audio-Speech Co-Reasoning (JASCO), a novel task that unifies audio and speech processing, strictly requiring co-reasoning across both modalities. We also release a scene-reasoning dataset called "What Are They Doing". Additionally, we provide deeper insights into the models' behaviors by analyzing their dependence on each modality.

2025-04-05

ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (published)

doi.org

arxiv.org

Progressive Multi-Source Domain Adaptation for Personalized Facial Expression Recognition

Muhammad Osama Zeeshan

Marco Pedersoli

Alessandro Lameiras Koerich

Eric Grange

2025-04-04

ArXiv (preprint)

doi.org

arxiv.org

Accelerated learning of a noninvasive human brain-computer interface via manifold geometry

Erica Lindsey Busch

E. Chandra Fincke

Guillaume Lajoie

Smita Krishnaswamy

Nicholas B Turk-Browne

2025-04-02

bioRxiv (preprint)

doi.org

Evaluating and Enhancing Segmentation Model Robustness with Metamorphic Testing

Seif Mzoughi

Mohamed Elshafeia

Foutse Khomh

2025-04-02

ArXiv (preprint)

doi.org

arxiv.org

Spinal Cord Tract Integrity in Degenerative Cervical Myelopathy

Newton Cho

Abdul Al-Shawwa

W. Bradley Jacobs

Nathan Evaniew

Jacques Bouchard

Steve Casha

Stephan duPlessis

Peter Lewkonia

Fred Nicholls

Alex Soroceanu

Ganesh Swamy

Kenneth C. Thomas

Michael M. H. Yang

Julien Cohen-Adad

David W. Cadotte

Degenerative cervical myelopathy (DCM) is the most common cause of spinal dysfunction globally. Despite surgical intervention, motor dysfunc… (see more)tion may persist in many patients. The purpose of this study was to comprehensively examine specific spinal cord tract changes in patients with DCM, to better understand potential substrates for compensatory recovery of function. Cervical spinal cord MRI scans with diffusion tensor imaging were performed in patients with DCM and in healthy volunteers. Spinal Cord Toolbox was used to register the PAM50 template, which includes a probabilistic atlas of the white matter tracts of the spinal cord, to the imaging data. Fractional anisotropy (FA) was extracted for each tract at C3 above the level of maximal compression and compared between patients with DCM and healthy volunteers and between patients with mild vs moderate to severe DCM. We included 25 patients with DCM (13 mild and 12 moderate to severe) and 6 healthy volunteers. FA was significantly reduced in DCM subjects relative to healthy volunteers for the lateral corticospinal tract (mild DCM vs healthy ∆ = −0.13, P = .018; moderate to severe DCM vs healthy ∆ = −0.11, P = .047), fasciculus gracilis (mild DCM vs healthy ∆ = −0.16, P = .010; moderate to severe DCM vs healthy ∆ = −0.13, P = .039), and fasciculus cuneatus (mild DCM vs healthy ∆ = −0.16, P = .007; moderate to severe DCM vs healthy ∆ = −0.15, P = .012). There were no differences in FA for all tracts between mild and moderate-to-severe DCM subjects. Patients with DCM had altered diffusion tensor imaging signal in their lateral corticospinal tract, fasciculus gracilis, and fasciculus cuneatus in comparison with healthy volunteers. These findings indicate that DCM is characterized by injury to these structures, which suggests that other tracts within the cord could potentially act as substrates for compensatory motor recovery.

2025-04-02

Neurosurgery (unknown)

doi.org

Mila on Udemy

AI Policy Fellowship Publications

Mila Ventures Launchpad

Publications

Mila on Udemy

AI Policy Fellowship Publications

Mila Ventures Launchpad

Popular keywords:

Publications