Portrait de Jun Ding

Jun Ding

Membre affilié
Professeur adjoint, McGill University, Département de médecine
Sujets de recherche
Apprentissage automatique médical
Apprentissage de représentations
Biologie computationnelle

Biographie

Jun Ding est professeur adjoint au Département de médecine de la Faculté de médecine et des sciences de la santé de l'Université McGill. Aux côtés de son équipe, il se consacre à l'utilisation de techniques d'apprentissage automatique pour éclaircir les dynamiques complexes des cellules dans diverses maladies, telles que les troubles du développement, les maladies pulmonaires et les cancers. La nature diversifiée et complexe de ces affections nécessite l’usage d’approches innovantes, incitant à l'utilisation de technologies unicellulaires de pointe. Ces technologies offrent des possibilités sans précédent pour faire avancer la compréhension, notamment dans des domaines tels que la biologie du développement et du cancer. Cependant, elles posent également des défis dans le développement de modèles informatiques capables de relier ces données biomédicales complexes à des découvertes potentielles.

Jun Ding a comme objectif le développement et l'affinement des méthodologies d'apprentissage automatique, en particulier des modèles graphiques probabilistes, pour analyser, modéliser et visualiser efficacement des données omiques à la fois de cellules uniques et de cellules groupées, souvent avec des dimensions longitudinales ou spatiales. Le but de ses recherches est d'utiliser ces techniques avancées d'apprentissage automatique pour approfondir la compréhension des dynamiques cellulaires, afin de développer des stratégies diagnostiques et thérapeutiques novatrices susceptibles de bénéficier considérablement à la santé publique.

Étudiants actuels

Doctorat - McGill
Superviseur⋅e principal⋅e :

Publications

BCG immunization induces CX3CR1hi effector memory T cells to provide cross-protection via IFN-γ-mediated trained immunity.
Kim A. Tran
Erwan Pernet
Mina Sadeghi
Jeffrey Downey
Julia Chronopoulos
Elizabeth Lapshina
Oscar Tsai
Eva Kaufmann
Maziar Divangahi
GFETM: Genome Foundation-based Embedded Topic Model for scATAC-seq Modeling
Yimin Fan
Shi Han
Single-cell Assay for Transposase-Accessible Chromatin with sequencing (scATAC-seq) has emerged as a powerful technique for investigating op… (voir plus)en chromatin landscapes at single-cell resolution. However, analyzing scATAC-seq data remain challenging due to its sparsity and noise. Genome Foundation Models (GFMs), pre-trained on massive DNA sequences, have proven effective at genome analysis. Given that open chromatin regions (OCRs) harbour salient sequence features, we hypothesize that leveraging GFMs’ sequence embeddings can improve the accuracy and generalizability of scATAC-seq modeling. Here, we introduce the Genome Foundation Embedded Topic Model (GFETM), an interpretable deep learning framework that combines GFMs with the Embedded Topic Model (ETM) for scATAC-seq data analysis. By integrating the DNA sequence embeddings extracted by a GFM from OCRs, GFETM demonstrates superior accuracy and generalizability and captures cell-state specific TF activity both with zero-shot inference and attention mechanism analysis. Finally, the topic mixtures inferred by GFETM reveal biologically meaningful epigenomic signatures of kidney diabetes.
Unagi: Deep Generative Model for Deciphering Cellular Dynamics and In-Silico Drug Discovery in Complex Diseases
Yumin Zheng
Jonas C. Schupp
Taylor S Adams
Geremy Clair
Aurelien Justet
Farida Ahangari
Xiting Yan
Paul Hansen
Marianne Carlon
Emanuela Cortesi
Marie Vermant
Robin Vos
De Sadeleer J Laurens
Ivan O Rosas
Ricardo Pineda
John Sembrat
Melanie Königshoff
John E McDonough
Bart M. Vanaudenaerde … (voir 2 de plus)
Wim A Wuyts
Naftali Kaminski
Human diseases are characterized by intricate cellular dynamics. Single-cell sequencing provides critical insights, yet a persistent gap rem… (voir plus)ains in computational tools for detailed disease progression analysis and targeted in-silico drug interventions. Here, we introduce UNAGI, a deep generative neural network tailored to analyze time-series single-cell transcriptomic data. This tool captures the complex cellular dynamics underlying disease progression, enhancing drug perturbation modeling and discovery. When applied to a dataset from patients with Idiopathic Pulmonary Fibrosis (IPF), UNAGI learns disease-informed cell embeddings that sharpen our understanding of disease progression, leading to the identification of potential therapeutic drug candidates. Validation via proteomics reveals the accuracy of UNAGI’s cellular dynamics analyses, and the use of the Fibrotic Cocktail treated human Precision-cut Lung Slices confirms UNAGI’s predictions that Nifedipine, an antihypertensive drug, may have antifibrotic effects on human tissues. UNAGI’s versatility extends to other diseases, including a COVID dataset, demonstrating adaptability and confirming its broader applicability in decoding complex cellular dynamics beyond IPF, amplifying its utility in the quest for therapeutic solutions across diverse pathological landscapes.
scGeneRythm: Using Neural Networks and Fourier Transformation to Cluster Genes by Time-Frequency Patterns in Single-Cell Data
Yiming Jia
Hao Wu
scSniper: Single-cell Deep Neural Network-based Identification of Prominent Biomarkers
Mingyang Li
Yanshuo Chen
Unveiling the Impact of Arsenic Toxicity on Immune Cells in Atherosclerotic Plaques: Insights from Single-Cell Multi-Omics Profiling
Kiran Makhani
Xiuhui Yang
France Dierick
Nivetha Subramaniam
Natascha Gagnon
Talin Ebrahimian
Hao Wu
Koren K. Mann
scCobra: Contrastive cell embedding learning with domain-adaptation for single-cell data integration and harmonization
Bowen Zhao
Dong-Qing Wei
Yi Xiong
The rapid development of single-cell technologies has underscored the need for more effective methods in the integration and harmonization o… (voir plus)f single-cell sequencing data. The prevalent challenge of batch effects, resulting from technical and biological variations across studies, demands accurate and reliable solutions for data integration. Traditional tools often have limitations, both due to reliance on gene expression distribution assumptions and the common issue of over-correction, particularly in methods based on anchor alignments. Here we introduce scCobra, a deep neural network tool designed specifically to address these challenges. By leveraging a deep generative model that combines a contrastive neural network with domain adaptation, scCobra effectively mitigates batch effects and minimizes over-correction without depending on gene expression distribution assumptions. Additionally, scCobra enables online label transfer across datasets with batch effects, facilitating the continuous integration of new data without retraining, and offers features for batch effect simulation and advanced multi-omic batch integration. These capabilities make scCobra a versatile data integration and harmonization tool for achieving accurate and insightful biological interpretations from complex datasets.