Julie Hussin

Membre académique associé

Professeure adjointe, Université de Montréal

Sujets de recherche

Apprentissage automatique médical

Apprentissage multimodal

Apprentissage profond

Biologie computationnelle

Exploration des données

Site web

Google Scholar

Biographie

Julie Hussin est professeure agrégée au Département de médecine de l’Université de Montréal (UdeM) et chercheuse à l’Institut de cardiologie de Montréal (ICM). Elle est titulaire d’une Chaire de recherche du Canada (niveau 2) en science responsable des données multi-omiques et préside les programmes d’études supérieures en bio-informatique à l’UdeM.

Formée en génomique statistique et évolutive, la Dre Hussin possède une vaste expérience dans l’analyse de grands ensembles de données multi-omiques provenant de cohortes de population. Ses travaux en biologie computationnelle s’inscrivent dans les domaines de la génomique médicale et de la génomique des populations, auxquels elle a apporté plusieurs avancées méthodologiques. Son programme de recherche interdisciplinaire vise à développer des outils novateurs pour la médecine de précision. Ses projets portent notamment sur l’amélioration de la prédiction et de la gestion du risque de maladies cardiométaboliques, en particulier l’insuffisance cardiaque.

Ses approches intègrent différents types de données — cliniques, génétiques, transcriptomiques, protéomiques et métabolomiques — afin de mieux comprendre les déterminants biologiques des maladies du cœur, notamment grâce à des techniques d’apprentissage non supervisé. Dans le contexte de la pandémie de COVID-19, son équipe a également dirigé le développement d’algorithmes de science des données pour analyser les données génétiques virales, soutenir la surveillance épidémiologique et étudier les interactions hôte-pathogène ainsi que l’évolution virale.

Ses travaux s’intéressent aussi à l’interprétabilité, à la généralisabilité et à l’équité des algorithmes d’apprentissage automatique appliqués à la recherche en santé. La Dre Hussin milite pour une intelligence artificielle équitable, sécuritaire et transparente dans la recherche en santé, et s’engage à favoriser l’inclusivité et la représentativité afin que ses travaux bénéficient à l’ensemble de la population.

Elle enseigne plusieurs cours de premier et de deuxième cycles en biologie computationnelle, en génétique des populations et en apprentissage automatique appliqué à la génomique. Avant de se joindre à l’UdeM comme professeure, elle a été boursière postdoctorale du Human Frontier Science Program au Wellcome Trust Centre for Human Genetics de l’Université d’Oxford (Linacre College), ainsi que chercheuse invitée à l’Université McGill.

Étudiants actuels

Doctorat - UdeM

Postdoctorat - UdeM

Doctorat - UdeM

Camille Rochefort-Boulanger

Doctorat - UdeM

Co-superviseur⋅e :

Yoshua Bengio

Github

Issam Saadie

Maîtrise recherche - UdeM

Matthew Scicluna

Doctorat - UdeM

Co-superviseur⋅e :

Guy Wolf

Github

Google Scholar

Yonglin Zhu

Doctorat - UdeM

Publications

Lipidomics Identifies HFpEF Phenogroups and a High-Risk Metabolic Signature - The BElgian and CAnadian MEtabolomics in HFpEF (BECAME-HF) project.

Julie Hussin

Nassiba Menghoum

Anik Forest

Pamela Mehanna

Olivier Tastet

Julie Legault

Isabelle Robillard Frayne

Sibille Lejeune

David Vancraeynest

Clotilde Roy

Galadriel Briere

Gabrielle Boucher

L Bertrand

Sandrine Horman

David Rhainds

J.‐C. Tardif

Christophe Beauloye

Anne-Catherine Pouleur

Christine Des Rosiers

Rationale: Heart failure with preserved ejection fraction (HFpEF) is a heterogeneous syndrome with substantial unmet diagnostic and therapeu… (voir plus)tic needs. Circulating lipid metabolism is increasingly implicated in HFpEF pathophysiology but has not been systematically leveraged for molecular stratification. Objective: To determine whether plasma lipidomics can identify molecular phenogroups of HFpEF associated with distinct clinical characteristics and outcomes. Methods and Results: Untargeted plasma lipidomics was performed in non-HF subjects and HFpEF patients from a primary Belgian cohort and an independent Canadian cohort (n=177 in each cohort). In the Belgian cohort, 235 unique lipids spanning 19 subclasses were annotated, including 96 significantly associated with HFpEF (q<0.02). Unsupervised analyses revealed marked lipidomic heterogeneity, with a distinct HFpEF subgroup separable from non-HF subjects. Hierarchical clustering identified three phenogroups with divergent lipid profiles and clinical features. One phenogroup exhibited severe atrial dysfunction, congestion-related biomarkers, elevated indices of cardiac and liver fibrosis, and markedly reduced survival, a second was characterized by prominent metabolic syndrome features, and a third by preserved renal function. Cross-cohort comparison using a supervised classifier trained on 158 shared lipids confirmed analogous lower-risk phenogroups in the Canadian cohort, while the high-risk phenotype was underrepresented. A signature of 10 lipids across six subclasses, including long-chain acylcarnitines, ether phosphatidylcholines, and oxidized sphingomyelins, discriminated the high-risk group and correlated with markers of disease severity. Conclusion: Our findings demonstrate that HFpEF comprises metabolically distinct patient subgroups across cohorts, revealing specific lipidomic dysfunctions that deepen our understanding of HFpEF heterogeneity and underlying pathophysiology.

2026-04-01

medRxiv (accepté)

doi.org

CETP alternative splicing variation impacts human traits

Isabel Gamache

Marc‐André Legault

Jean‐Christophe Grenier

Éric Rhéaume

J.C. Tardif

Marie‐Pierre Dubé

Julie Hussin

Abstract The cholesteryl ester transfer protein (CETP) is an important protein in reverse cholesterol transport and has been identified as a… (voir plus) significant factor associated with cardiovascular disease (CVD), making it a widely studied pharmaceutical target. Three protein-coding isoforms of CETP exist, distinguished by the alternative splicing of one exon each. The isoform primarily responsible for cholesterol-related functions in the plasma is well studied, but specific functions of each isoform remain poorly understood. In this study, we demonstrate the significance of considering CETP’s isoforms in analyses of human traits. Using bulk RNA-seq data from multiple tissues, we characterized the expression patterns and genetic regulation determinants of CETP transcripts. Leveraging publicly available GWAS summary statistics, we conducted multivariable Mendelian Randomisation (MVMR) to estimate the impact of variation in isoform proportions on phenotypes, highlighting the importance of CETP’s isoforms in pituitary and thyroid glands. Furthermore, we uncovered tissue-specific associations between CETP’s isoforms and CVD-associated phenotypes. Additionally, we observed that the epistatic interaction previously reported between CETP and ADCY9 , a gene implicated in modulating a CETP modulator’s response, may be mediated through the regulation of alternative splicing of exon 9. Our results underscore the importance of a comprehensive understanding of CETP’s isoforms, which can significantly impact both fundamental and clinical research efforts.

2026-03-17

bioRxiv (prépublication)

doi.org

Chromatin landscape and enhancer-gene interaction differences between three cardiac cell types

Yan Zhu

Jean‐Christophe Grenier

Raphaël Poujol

Olivier Tastet

Caroline Lee

Svenja Koslowski

Marouane Benzaki

Talal Fawaz

Julie Hussin

Roger Foo

Chukwuemeka George Anene-Nzelu

Matthew Ackers-Johnson

2026-02-28

Journal of Molecular and Cellular Cardiology Plus (publié)

doi.org

Foundation models for electrocardiogram interpretation: clinical implications

Alexis Nolin-Lapalme

Achille Sowa

Jacques Delfrate

Olivier Tastet

Denis Corbin

Merve Kulbay

Derman Ozdemir

Marie-Jeanne Noël

François-Christophe Marois-Blanchet

François Harvey

Surbhi Sharma

Minhaj Ansari

I-Min Chiu

Valentina D'souza

Sam F. Friedman

Michael Chassé

Brian J. Potter

Jonathan Afilalo

Pierre Adil Elias

Gilbert Jabbour … (voir 13 de plus)

Mourad Bahani

Marie-Pierre Dubé

Patrick M. Boyle

Neal A. Chatterjee

Joshua Barrios

Geoffrey H. Tison

David Ouyang

Mahnaz Maddah

Shaan Khurshid

Julia Cadrin-Tourigny

Rafik Tadros

Julie Hussin

Robert Avram

The 12-lead electrocardiogram (ECG) remains a cornerstone of cardiac diagnostics, yet existing artificial intelligence (AI) solutions for au… (voir plus)tomated interpretation often lack generalizability, remain closed source, and are primarily trained using supervised learning (SL), which requires extensive labelled datasets and may limit adaptability across diverse clinical settings. Self-supervised learning (SSL) can potentially overcome these limitations by learning robust representations from unlabelled data. To address these challenges, this study developed and compared two open-source foundational ECG models: DeepECG-SL, a supervised multilabel ECG model, and DeepECG-SSL, a self-supervised model. Both models were trained on over 1 million ECGs using a standardized preprocessing pipeline and automated free-text extraction from ECG reports to predict 77 cardiac conditions. DeepECG-SSL leveraged unlabelled data through self-supervised contrastive learning and masked lead modelling before fine-tuning for downstream tasks, while DeepECG-SL was trained directly on labelled diagnostic data in an end-to-end fashion. Performance was evaluated across seven private, multilingual healthcare systems and four public ECG repositories, with assessment of fairness by age and sex, and investigation of privacy vulnerabilities as well as memory and compute requirements. DeepECG-SSL achieved micro-averaged area under the receiver operating characteristic curves (AUROCs) across all 77 cardiac conditions for ECG interpretation of 0.990 [95% confidence interval (CI): 0.990, 0.990] on the internal dataset (MHI-ds), 0.981 (95% CI: 0.981, 0.981) on external public datasets (UKB, CLSA, MIMIC-IV and PTB), and 0.983 (95% CI: 0.983, 0.983) on external private datasets (UW, UCSF, JGH, NYP, MGH, CSH and CHUM), while DeepECG-SL demonstrated AUROCs of 0.992 (95% CI: 0.992, 0.992), 0.980 (95% CI: 0.980, 0.980), and 0.983 (95% CI: 0.983, 0.984), respectively. Fairness analyses revealed minimal disparities (true-positive rate and false-positive rate difference <0.1) across age and sex groups for both models. DeepECG-SSL demonstrated superior performance on limited-data digital biomarker tasks, with the largest improvements in long QT syndrome (LQTS) genotype classification (AUROC 0.931 vs 0.850, P = .026, n = 127 ECGs) and 5 year atrial fibrillation risk prediction (AUROC 0.742 vs 0.734, P < 0.001, n = 132 050 ECGs), while achieving superior performance in left ventricular ejection fraction ≤40% classification (AUROC 0.926 vs 0.917, P < 0.001, n = 25 252 ECGs) and comparable performance in LQTS detection (AUROC 0.767 vs 0.735, P = 0.117, n = 934 ECGs). This study establishes SSL as a promising paradigm for ECG analysis, particularly in settings with limited annotated data, enhancing accessibility, generalizability, and fairness in AI-driven cardiac diagnostics. By releasing model weights, preprocessing tools, and validation code, this work aims to support robust, data-efficient AI diagnostics across diverse clinical environments and questions.

2026-01-21

European Heart Journal (publié)

doi.org

Chromatin landscape and enhancer-gene interaction differences between three cardiac cell types

Chukwuemeka George Anene-Nzelu

Yan Zhu

Jean‐Christophe Grenier

Raphaël Poujol

Svenja Koslowski

Olivier Tastet

Chang Jie Mick Lee

Matthew Ackers‐Johnson

Roger Foo

Julie Hussin

ABSTRACT Genome-wide association studies (GWAS) have identified numerous single nucleotide polymorphisms (SNP) associated with a specific tr… (voir plus)aits and diseases, however, uncovering the true disease-relevant SNPs remains challenging. One limitation for prioritizing true disease-relevant SNPs from GWAS is that most of the identified SNPs are non-coding, making it difficult to unravel their mechanism of action. Nevertheless, mapping non-coding SNPs to enhancers is a validated approach to link SNPs to their target genes through the analysis of enhancer-gene interactions (EGI) and thus provide insight into their mechanism of action. While previous studies linking cardiac disease-relevant SNPs to enhancers and their target genes have focused on the principal cardiac cell type, cardiomyocytes (CMs), the analysis of other non-CM cell types has been largely ignored and has only gained attention recently. We hypothesize that characterizing cell-type-specific enhancer-gene interactions (EGIs) for these non-CMs, namely cardiac fibroblasts (CFs), endothelial cells (ECs), and smooth muscle cells (SMCs), followed by mapping cardiac-disease-associated non-coding SNPs to those enhancers will identify novel disease-relevant genes and provide insights for future mechanistic research. To identify the landscape of cell-type-specific EGIs in these cardiac cells, we have employed the activity-by-Contact (ABC) model. It integrates assay for transposase-accessible chromatin sequencing (ATAC-seq), H3K27ac chromatin immunoprecipitation with sequencing (ChIP-seq), and high-throughput chromosome conformation capture with H3K27ac immunoprecipitation (H3K27ac HiChIP) data to identify EGIs. We have identified the landscape of cell-type-specific EGIs in these cardiac cells. Furthermore, a higher similarity of the chromatin accessibility profile (ATAC-seq) between CF and SMC, compared to CF and EC, and SMC and EC was observed. Finally, overlapping identified EGIs with cardiac-disease-associated non-coding variants has allowed the identification of a QT-interval-associated SNP that is mapped to the enhancer region of an EC-specific EGI.

2025-09-24

bioRxiv (prépublication)

doi.org

LIPTER, a cardiomyocyte-enriched long noncoding RNA, controls cardiac cytoskeletal maturation and is regulated by a cardiomyocyte-specific enhancer.

George Anene Nzelu

Mick Lee

Svenja Koslowski

Wenhao Zheng

Marouane Benzaki

Michelle Mak

Xiaohua Wang

Lek Wen Tan

Albert Dashi

Yonglin Zhu

Talal Fawaz

Kenneth Ng

Davy Pham

Francis LeBlanc

Guillaume Lettre

Julie Hussin

Roger Foo

Cardiac development is characterized by a complex series of molecular, cytoskeletal and electrophysiological changes that guarantee the prop… (voir plus)er functioning of adult cardiomyocytes (CMs). These changes are defined by cell-type-specific transcriptional rewiring of progenitor cells to form CMs, and are regulated by various epigenetic elements, such as long noncoding RNAs (lncRNAs). LncRNAs are versatile epigenetic regulators as they may act in cis or in trans to orchestrate important gene programs during cardiac development and may concurrently encode micropeptides. LIPTER is one such lncRNA, previously shown to regulate lipid droplet transport in cardiomyocytes and thus an important regulator of cardiomyocyte metabolism. Here we show that LIPTER also plays a role in the cytoskeletal maturation of CMs, as loss of LIPTER leads to persistent expression of fetal genes, changes in chromatin accessibility, disorganized sarcomeres and impaired calcium homeostasis in CMs. Furthermore, we have identified a cardiomyocyte-specific regulatory enhancer that regulates the expression of LIPTER in CMs. CRISPR-mediated inhibition of this enhancer led to reduced LIPTER expression in CMs and increased expression of fetal genes. This CM-specific enhancer could therefore be manipulated to control the expression of LIPTER for therapeutic benefit. In summary, we have unravelled a novel role of LIPTER in CMs cytoskeletal maturation and have identified a CM-specific enhancer for LIPTER.

2025-08-11

bioRxiv (prépublication)

doi.org

LIPTER, a cardiomyocyte-enriched long noncoding RNA, controls cardiac cytoskeletal maturation and is regulated by a cardiomyocyte-specific enhancer

Chukwuemeka George Anene-Nzelu

C.J Mick Lee

Svenja Koslowski

Wenhao Zheng

Marouane Benzaki

Michelle Mak

Xiaohua Wang

Lek Wen Tan

Albert Dashi

Yonglin Zhu

Talal Fawaz

Kenneth Ng

Davy Pham

Francis LeBlanc

Guillaume Lettre

Julie Hussin

Roger Foo

2025-08-11

BioRxiv (prépublication)

doi.org

Foundation models for generalizable electrocardiogram interpretation: comparison of supervised and self-supervised electrocardiogram foundation models

Alexis Nolin-Lapalme

Achille Sowa

Jacques Delfrate

Olivier Tastet

Denis Corbin

Merve Kulbay

Derman Ozdemir

Marie-Jeanne Noël

François-Christophe Marois-Blanchet

François Harvey

Surbhi Sharma

Minhaj Ansari

I-Min Chiu

Valentina Dsouza

Sam F. Friedman

Michael Chassé

Brian J. Potter

Jonathan Afilalo

Pierre Adil Elias

Gilbert Jabbour … (voir 13 de plus)

Mourad Bahani

Marie-Pierre Dubé

Patrick M. Boyle

Neal A. Chatterjee

Joshua Barrios

Geoffrey H. Tison

David Ouyang

Mahnaz Maddah

Shaan Khurshid

Julia Cadrin-Tourigny

Rafik Tadros

Julie Hussin

Robert Avram

The 12-lead electrocardiogram (ECG) remains a cornerstone of cardiac diagnostics, yet existing artificial intelligence (AI) solutions for au… (voir plus)tomated interpretation often lack generalizability, remain closed-source, and are primarily trained using supervised learning, limiting their adaptability across diverse clinical settings. To address these challenges, we developed and compared two open-source foundational ECG models: DeepECG-SSL, a self-supervised learning model, and DeepECG-SL, a supervised learning model. Both models were trained on over 1 million ECGs using a standardized preprocessing pipeline and automated free-text extraction from ECG reports to predict 77 cardiac conditions. DeepECG-SSL was pretrained using self-supervised contrastive learning and masked lead modeling. The models were evaluated on six multilingual private healthcare systems and four public datasets for ECG interpretation across 77 diagnostic categories. Fairness analyses assessed disparities in performance across age and sex groups, while also investigating fairness and resource utilization. DeepECG-SSL achieved AUROCs of 0.990 (95%CI 0.990, 0.990) on internal dataset, 0.981 (95%CI 0.981, 0.981) on external public datasets, and 0.983 (95%CI 0.983, 0.983) on external private datasets, while DeepECG-SL demonstrated AUROCs of 0.992 (95%CI 0.992, 0.992), 0.980 (95%CI 0.980, 0.980) and 0.983 (95%CI 0.983, 0.983) respectively. Fairness analyses revealed minimal disparities (true positive rate & false positive rate difference<0.010) across age and sex groups. Digital biomarker prediction (Long QT syndrome (LQTS) classification, 5-year atrial fibrillation prediction and left ventricular ejection fraction (LVEF) classification) with limited labeled data, DeepECG-SSL outperformed DeepECG-SL in predicting 5-year atrial fibrillation risk (N=132,050; AUROC 0.742 vs. 0.720; Δ=0.022; P<0.001), identifying reduced LVEF ≤40% (N=25,252; 0.928 vs. 0.900; Δ=0.028; P<0.001), and classifying LQTS syndrome subtypes (N=127; 0.931 vs. 0.853; Δ=0.078; P=0.026). By releasing model weights, preprocessing tools, and validation code, we aim to support robust, data-efficient AI diagnostics across diverse clinical environments. This study establishes self-supervised learning as a promising paradigm for ECG analysis, particularly in settings with limited annotated data, enhancing accessibility, generalizability, and fairness in AI-driven cardiac diagnostics. Can self-supervised (SSL) learning yield ECG-based AI foundational models with enhanced performance, fairness, privacy, and generalizability compared to traditional supervised learning (SL) approaches? Our evaluation of DeepECG-SL and DeepECG-SSL across seven external health center datasets and four international publicly accessible datasets demonstrated that while both models achieve comparable diagnostic accuracy for ECG interpretation, SSL outperforms SL on novel tasks with smaller datasets. We validated DeepECG-SL and DeepECG-SSL across public and private datasets and demonstrated that SSL model had a superior generalizability by addressing fairness, privacy, and efficiency, and open sourcing our models, we advance ethical, adaptable AI for equitable, real-world ECG diagnostics. Graphical abstract: DeepECG-SL and DeepECG-SSL, two open-source AI models for 12-lead ECG interpretation, were trained on over 1 million ECGs. DeepECG-SSL, utilizing self-supervised contrastive learning and masked lead modeling, outperformed DeepECG-SL in utilizing digital biomarkers to predict atrial fibrillation risk, reduced LVEF, and long QT syndrome subtypes, while both models achieved high diagnostic accuracy with minimal fairness disparities across age and sex. Validated on ten external datasets, our work provides a robust, reproducible framework for equitable, efficient ECG-based cardiac diagnostics.

2025-03-04

medRxiv (prépublication)

doi.org

The Canadian VirusSeq Data Portal and Duotang: open resources for SARS-CoV-2 viral sequences and genomic epidemiology

Erin E. Gill

Baofeng Jia

Carmen Lia Murall

Raphaël Poujol

Muhammad Zohaib Anwar

Nithu Sara John

Justin Richardsson

Ashley E. Hobb

Abayomi S. Olabode

Alexandru Lepsa

Ana T. Duggan

Andrea D. Tyler

Arnaud N’Guessan

Atul Kachru

Brandon Chan

Catherine Yoshida

Christina K. Yung

David Bujold

Dusan Andric

Edmund Su … (voir 47 de plus)

Emma Griffiths

Gary Van Domselaar

Gordon Jolly

Heather Ward

Henrich Feher

Jared Baker

Jared T. Simpson

Jaser Uddin

Jiannis Ragoussis

Jon Eubank

Jörg H. Fritz

José Héctor Gálvez

Karen Fang

Kim Cullion

Leonardo Landa Rivera

Qian Xiang

Matthew A. Croxen

Mitchell Shiell

Natalie Prystajecky

Pierre-Olivier Quirion

Rosita Bajari

Samantha Rich

Samira Mubareka

Sandrine Moreira

Scott Cain

Steven G. Sutcliffe

Susanne A. Kraemer

Yelizar Alturmessov

Yann Joly

VirusSeq Data Portal Academic and Health Network**

Marc Fiume

Terrance P. Snutch

Cindy Bell

Catalina López-Correa

Julie Hussin

Jeffrey B. Joy

Caroline Colijn

Paul M. K. Gordon

William Hsiao

Art F. Y. Poon

Natalie Knox

Mélanie Courtot

Lincoln Stein

Sarah P. Otto

Guillaume Bourque

B. Jesse Shapiro

Fiona S. L. Brinkman

The COVID-19 pandemic led to a large global effort to sequence SARS-CoV-2 genomes from patient samples to track viral evolution and inform t… (voir plus)he public health response. Millions of SARS-CoV-2 genome sequences have been deposited in global public repositories. The Canadian COVID-19 Genomics Network (CanCOGeN – VirusSeq), a consortium tasked with coordinating expanded sequencing of SARS-CoV-2 genomes across Canada early in the pandemic, created the Canadian VirusSeq Data Portal, with associated data pipelines and procedures, to support these efforts. The goal of VirusSeq was to allow open access to Canadian SARS-CoV-2 genomic sequences and enhanced, standardized contextual data that were unavailable in other repositories and that meet FAIR standards (Findable, Accessible, Interoperable and Reusable). In addition, the portal data submission pipeline contains data quality checking procedures and appropriate acknowledgement of data generators that encourages collaboration. From inception to execution, the portal was developed with a conscientious focus on strong data governance principles and practices. Extensive efforts ensured a commitment to Canadian privacy laws, data security standards, and organizational processes. This portal has been coupled with other resources, such as Viral AI, and was further leveraged by the Coronavirus Variants Rapid Response Network (CoVaRR-Net) to produce a suite of continually updated analytical tools and notebooks. Here we highlight this portal (https://virusseq-dataportal.ca/), including its contextual data not available elsewhere, and the Duotang (https://covarr-net.github.io/duotang/duotang.html), a web platform that presents key genomic epidemiology and modelling analyses on circulating and emerging SARS-CoV-2 variants in Canada. Duotang presents dynamic changes in variant composition of SARS-CoV-2 in Canada and by province, estimates variant growth, and displays complementary interactive visualizations, with a text overview of the current situation. The VirusSeq Data Portal and Duotang resources, alongside additional analyses and resources computed from the portal (COVID-MVP, CoVizu), are all open source and freely available. Together, they provide an updated picture of SARS-CoV-2 evolution to spur scientific discussions, inform public discourse, and support communication with and within public health authorities. They also serve as a framework for other jurisdictions interested in open, collaborative sequence data sharing and analyses.

2024-10-13

Microbial Genomics (publié)

doi.org

Refining SARS-CoV-2 intra-host variation by leveraging large-scale sequencing data

Fatima Mostefai

Jean-Christophe Grenier

Raphaël Poujol

Julie Hussin

Understanding viral genome evolution during host infection is crucial for grasping viral diversity and evolution. Analyzing intra-host singl… (voir plus)e nucleotide variants (iSNVs) offers insights into new lineage emergence, which is important for predicting and mitigating future viral threats. Despite next-generation sequencing’s potential, challenges persist, notably sequencing artifacts leading to false iSNVs. We developed a workflow to enhance iSNV detection in large NGS libraries, using over 130 000 SARS-CoV-2 libraries to distinguish mutations from errors. Our approach integrates bioinformatics protocols, stringent quality control, and dimensionality reduction to tackle batch effects and improve mutation detection reliability. Additionally, we pioneer the application of the PHATE visualization approach to genomic data and introduce a methodology that quantifies how related groups of data points are represented within a two-dimensional space, enhancing clustering structure explanation based on genetic similarities. This workflow advances accurate intra-host mutation detection, facilitating a deeper understanding of viral diversity and evolution.

2024-09-27

NAR Genomics and Bioinformatics (publié)

doi.org

Temperature-dependent Spike-ACE2 interaction of Omicron subvariants is associated with viral transmission

Mehdi Benlarbi

Shilei Ding

Étienne Bélanger

Alexandra Tauzin

Raphaël Poujol

Halima Medjahed

Omar El Ferri

Yuxia Bo

Catherine Bourassa

Julie Hussin

Judith Fafard

Marzena Pazgier

Inès Levade

Cameron Abrams

Marceline Côté

Andrés Finzi

The continued evolution of severe acute respiratory syndrome 2 (SARS-CoV-2) requires persistent monitoring of its subvariants. Omicron subva… (voir plus)riants are responsible for the vast majority of SARS-CoV-2 infections worldwide, with XBB and BA.2.86 sublineages representing more than 90% of circulating strains as of January 2024. To better understand parameters involved in viral transmission, we characterized the functional properties of Spike glycoproteins from BA.2.75, CH.1.1, DV.7.1, BA.4/5, BQ.1.1, XBB, XBB.1, XBB.1.16, XBB.1.5, FD.1.1, EG.5.1, HK.3, BA.2.86 and JN.1. We tested their capacity to evade plasma-mediated recognition and neutralization, binding to angiotensin-converting enzyme 2 (ACE2), their susceptibility to cold inactivation, Spike processing, as well as the impact of temperature on Spike-ACE2 interaction. We found that compared to the early wild-type (D614G) strain, most Omicron subvariants' Spike glycoproteins evolved to escape recognition and neutralization by plasma from individuals who received a fifth dose of bivalent (BA.1 or BA.4/5) mRNA vaccine and improve ACE2 binding, particularly at low temperatures. Moreover, BA.2.86 had the best affinity for ACE2 at all temperatures tested. We found that Omicron subvariants’ Spike processing is associated with their susceptibility to cold inactivation. Intriguingly, we found that Spike-ACE2 binding at low temperature was significantly associated with growth rates of Omicron subvariants in humans. Overall, we report that Spikes from newly emerged Omicron subvariants are relatively more stable and resistant to plasma-mediated neutralization, present improved affinity for ACE2 which is associated, particularly at low temperatures, with their growth rates. The persistent evolution of SARS-CoV-2 gave rise to a wide range of variants harboring new mutations in their Spike glycoproteins. Several factors have been associated with viral transmission and fitness such as plasma-neutralization escape and ACE2 interaction. To better understand whether additional factors could be of importance in SARS-CoV-2 variants’ transmission, we characterize the functional properties of Spike glycoproteins from several Omicron subvariants. We found that the Spike glycoprotein of Omicron subvariants presents an improved escape from plasma-mediated recognition and neutralization, Spike processing, and ACE2 binding which was further improved at low temperature. Intriguingly, Spike-ACE2 interaction at low temperature is strongly associated with viral growth rate, as such, low temperatures could represent another parameter affecting viral transmission.

2024-07-01

mBio (publié)

doi.org

The Canadian VirusSeq Data Portal & Duotang: open resources for SARS-CoV-2 viral sequences and genomic epidemiology

Erin E. Gill

Baofeng Jia

Carmen Lia Murall

Raphaël Poujol

Muhammad Zohaib Anwar

Nithu Sara John

Justin Richardsson

Ashley E. Hobb

Abayomi S. Olabode

Alexandru Lepsa

Ana T. Duggan

Andrea D. Tyler

Arnaud N’Guessan

Atul Kachru

Brandon Chan

Catherine Yoshida

Christina K. Yung

David Bujold

Dusan Andric

Edmund Su … (voir 50 de plus)

Emma Griffiths

Gary Van Domselaar

Gordon Jolly

Heather Ward

Henrich Feher

Jared Baker

Jared T. Simpson

Jaser Uddin

Jiannis Ragoussis

Jon Eubank

Fritz, Jörg H.

José Héctor Gálvez

Karen Fang

Kim Cullion

Leonardo Landa Rivera

Linda Xiang

Matthew A. Croxen

Mitchell Shiell

Natalie Prystajecky

Pierre-Olivier Quirion

Rosita Bajari

Samantha Rich

Samira Mubareka

Sandrine Moreira

Scott Cain

Steven G. Sutcliffe

Susanne A. Kraemer

Yann Joly

Yelizar Alturmessov

Cphln Consortium

CanCOGeN Consortium

Academic, VirusSeq Data Portal

network, Health

Marc Fiume

Terrance P Snutch

Cindy Bell

Catalina López-Correa

Julie Hussin

Jeffrey B. Joy

Caroline Colijn

Paul M. K. Gordon

William Hsiao

Art F. Y. Poon

Natalie Knox

Mélanie Courtot

Lincoln Stein

Sarah P. Otto

Guillaume Bourque

B. Jesse Shapiro

Fiona S. L. Brinkman

The COVID-19 pandemic led to a large global effort to sequence SARS-CoV-2 genomes from patient samples to track viral evolution and inform p… (voir plus)ublic health response. Millions of SARS-CoV-2 genome sequences have been deposited in global public repositories. The Canadian COVID-19 Genomics Network (CanCOGeN - VirusSeq), a consortium tasked with coordinating expanded sequencing of SARS-CoV-2 genomes across Canada early in the pandemic, created the Canadian VirusSeq Data Portal, with associated data pipelines and procedures, to support these efforts. The goal of VirusSeq was to allow open access to Canadian SARS-CoV-2 genomic sequences and enhanced, standardized contextual data that were unavailable in other repositories and that meet FAIR standards (Findable, Accessible, Interoperable and Reusable). The Portal data submission pipeline contains data quality checking procedures and appropriate acknowledgement of data generators that encourages collaboration. Here we also highlight Duotang, a web platform that presents genomic epidemiology and modeling analyses on circulating and emerging SARS-CoV-2 variants in Canada. Duotang presents dynamic changes in variant composition of SARS-CoV-2 in Canada and by province, estimates variant growth, and displays complementary interactive visualizations, with a text overview of the current situation. The VirusSeq Data Portal and Duotang resources, alongside additional analyses and resources computed from the Portal (COVID-MVP, CoVizu), are all open-source and freely available. Together, they provide an updated picture of SARS-CoV-2 evolution to spur scientific discussions, inform public discourse, and support communication with and within public health authorities. They also serve as a framework for other jurisdictions interested in open, collaborative sequence data sharing and analyses.

2024-05-07

ArXiv (prépublication)

doi.org

arxiv.org

Publications du Fellowship en politiques de l'IA

La plateforme Mila Ventures

Boussole des politiques en IA

Julie Hussin

Biographie

Étudiants actuels

Publications

Publications du Fellowship en politiques de l'IA

La plateforme Mila Ventures

Boussole des politiques en IA

Mots-clés populaires:

Julie Hussin

Biographie

Étudiants actuels

Publications