Ali Saberi

PhD - McGill

Principal supervisor

Amin Emad

Research topics

Computational biology

Publications

A long-context RNA foundation model for predicting transcriptome architecture

Ali Saberi

Benedict Choi

Sean Wang

Aldo Hernández-Corchado

Mohsen Naghipourfar

Arsham Mikaeili Namini

Vijay Ramani

Amin Emad

Hamed S. Najafabadi

Hani Goodarzi

Linking DNA sequence to genomic function remains one of the grand challenges in genetics and genomics. Here, we combine large-scale single-molecule transcriptome sequencing of diverse cancer cell lines with cutting-edge machine learning to build LoRNASH, an RNA foundation model that learns how the nucleotide sequence of unspliced pre-mRNA dictates transcriptome architecture: the relative abundances and molecular structures of mRNA isoforms. Owing to its use of the StripedHyena architecture, LoRNASH handles extremely long sequence inputs (∼65 kilobase pairs), allowing for quantitative, zero-shot prediction of all aspects of transcriptome architecture, including isoform abundance, isoform structure, and the impact of DNA sequence variants on transcript structure and abundance. We anticipate that our public data release and proof-of-concept model will accelerate varying aspects of RNA biotechnology. More broadly, we envision the use of LoRNASH as a foundation for fine-tuning of any transcriptome-related downstream prediction task, including cell-type specific gene expression, splicing, and general RNA processing.

2024-08-27

bioRxiv (preprint)

doi.org
