Publications

Autonomous optimization of neuroprosthetic stimulation parameters that drive the motor cortex and spinal cord outputs in rats and monkeys

Marco Bonizzato

Rose Guay Hottin

Sandrine L. Côté

Elena Massai

Leo Choiniere

Uzay Macar

Samuel Laferrière

Parikshat Sirpal

Stephan Quessy

Guillaume Lajoie

Marina Martinez

Numa Dancause

2023-04-11

Cell Reports Medicine (publié)

doi.org

Finite time analysis of temporal difference learning with linear function approximation: Tail averaging and regularisation

Gandharv Patil

Prashanth L.A.

Dheeraj M. Nagaraj

Doina Precup

We study the finite-time behaviour of the popular temporal difference (TD) learning algorithm, when combined with tail-averaging. We derive … (voir plus)finite time bounds on the parameter error of the tail-averaged TD iterate under a step-size choice that does not require information about the eigenvalues of the matrix underlying the projected TD fixed point. Our analysis shows that tail-averaged TD converges at the optimal O (1/t) rate, both in expectation and with high probability. In addition, our bounds exhibit a sharper rate of decay for the initial error (bias), which is an improvement over averaging all iterates. We also propose and analyse a variant of TD that incorporates regularisation, and show that this variant fares favourably in problems with ill-conditioned features.

2023-04-11

Proceedings of The 26th International Conference on Artificial Intelligence and Statistics (publié)

doi.org

arxiv.org

A Novel Stochastic Gradient Descent Algorithm for LearningPrincipal Subspaces

Charline Le Lan

Joshua Greaves

Jesse Farebrother

Mark Rowland

Fabian Pedregosa

Rishabh Agarwal

Marc Gendron-Bellemare

In this paper, we derive an algorithm that learns a principal subspace from sample entries, can be applied when the approximate subspace i… (voir plus)s represented by a neural network, and hence can bescaled to datasets with an effectively infinite number of rows and columns. Our method consistsin defining a loss function whose minimizer is the desired principal subspace, and constructing agradient estimate of this loss whose bias can be controlled.

2023-04-11

Proceedings of The 26th International Conference on Artificial Intelligence and Statistics (publié)

doi.org

openreview.net

A surprisingly simple technique to control the pretraining bias for better transfer: Expand or Narrow your representation

Florian Bordes

Samuel Lavoie

Randall Balestriero

Nicolas Ballas

Pascal Vincent

2023-04-11

ArXiv (prépublication)

doi.org

arxiv.org

Conservative objective models are a special kind of contrastive divergence-based energy model

Christopher Beckham

Chris Pal

In this work we theoretically show that conservative objective models (COMs) for offline model-based optimisation (MBO) are a special kind o… (voir plus)f contrastive divergence-based energy model, one where the energy function represents both the unconditional probability of the input and the conditional probability of the reward variable. While the initial formulation only samples modes from its learned distribution, we propose a simple fix that replaces its gradient ascent sampler with a Langevin MCMC sampler. This gives rise to a special probabilistic model where the probability of sampling an input is proportional to its predicted reward. Lastly, we show that better samples can be obtained if the model is decoupled so that the unconditional and conditional probabilities are modelled separately.

2023-04-07

ArXiv (prépublication)

doi.org

arxiv.org

Approach Intelligent Writing Assistants Usability with Seven Stages of Action

Avinash Bhat

Disha Shrivastava

Jin Guo

2023-04-06

ArXiv (prépublication)

doi.org

arxiv.org

MARSY: a multitask deep-learning framework for prediction of drug combination synergy scores

Mohamed Reda El Khili

Safyan Aman Memon

Amin Emad

Motivation Combination therapies have emerged as a treatment strategy for cancers to reduce the probability of drug resistance and to improv… (voir plus)e outcome. Large databases curating the results of many drug screening studies on preclinical cancer cell lines have been developed, capturing the synergistic and antagonistic effects of combination of drugs in different cell lines. However, due to the high cost of drug screening experiments and the sheer size of possible drug combinations, these databases are quite sparse. This necessitates the development of transductive computational models to accurately impute these missing values. Results Here, we developed MARSY, a deep learning multi-task model that incorporates information on gene expression profile of cancer cell lines, as well as the differential expression signature induced by each drug to predict drug-pair synergy scores. By utilizing two encoders to capture the interplay between the drug-pairs, as well as the drug-pairs and cell lines, and by adding auxiliary tasks in the predictor, MARSY learns latent embeddings that improve the prediction performance compared to state-of-the-art and traditional machine learning models. Using MARSY, we then predicted the synergy scores of 133,722 new drug-pair cell line combinations, which we have made available to the community as part of this study. Moreover, we validated various insights obtained from these novel predictions using independent studies, confirming the ability of MARSY in making accurate novel predictions. Availability and Implementation An implementation of the algorithms in Python and cleaned input datasets are provided in https://github.com/Emad-COMBINE-lab/MARSY. Contact amin.emad@mcgill.ca Supplementary Information Online-only supplementary data is available at the journal’s website.

2023-04-06

Bioinformatics (publié)

doi.org

PopulAtion Parameter Averaging (PAPA)

Alexia Jolicoeur-Martineau

Emy Gervais

Kilian Fatras

Yan Zhang

Simon Lacoste-Julien

Ensemble methods combine the predictions of multiple models to improve performance, but they require significantly higher computation costs … (voir plus)at inference time. To avoid these costs, multiple neural networks can be combined into one by averaging their weights. However, this usually performs significantly worse than ensembling. Weight averaging is only beneficial when different enough to benefit from combining them, but similar enough to average well. Based on this idea, we propose PopulAtion Parameter Averaging (PAPA): a method that combines the generality of ensembling with the efficiency of weight averaging. PAPA leverages a population of diverse models (trained on different data orders, augmentations, and regularizations) while slowly pushing the weights of the networks toward the population average of the weights. We also propose PAPA variants (PAPA-all, and PAPA-2) that average weights rarely rather than continuously; all methods increase generalization, but PAPA tends to perform best. PAPA reduces the performance gap between averaging and ensembling, increasing the average accuracy of a population of models by up to 0.8% on CIFAR-10, 1.9% on CIFAR-100, and 1.6% on ImageNet when compared to training independent (non-averaged) models.

2023-04-06

ArXiv (prépublication)

doi.org

arxiv.org

PopulAtion Parameter Averaging (PAPA)

Alexia Jolicoeur-Martineau

Emy Gervais

Kilian Fatras

Yan Zhang

Simon Lacoste-Julien

Ensemble methods combine the predictions of multiple models to improve performance, but they require significantly higher computation costs … (voir plus)at inference time. To avoid these costs, multiple neural networks can be combined into one by averaging their weights. However, this usually performs significantly worse than ensembling. Weight averaging is only beneficial when different enough to benefit from combining them, but similar enough to average well. Based on this idea, we propose PopulAtion Parameter Averaging (PAPA): a method that combines the generality of ensembling with the efficiency of weight averaging. PAPA leverages a population of diverse models (trained on different data orders, augmentations, and regularizations) while slowly pushing the weights of the networks toward the population average of the weights. We also propose PAPA variants (PAPA-all, and PAPA-2) that average weights rarely rather than continuously; all methods increase generalization, but PAPA tends to perform best. PAPA reduces the performance gap between averaging and ensembling, increasing the average accuracy of a population of models by up to 0.8% on CIFAR-10, 1.9% on CIFAR-100, and 1.6% on ImageNet when compared to training independent (non-averaged) models.

2023-04-06

ArXiv (prépublication)

doi.org

arxiv.org

Source-free Domain Adaptation Requires Penalized Diversity

Laya Rafiee Sevyeri

Ivaxi Sheth

Farhood Farahnak

Alexandre See

Samira Ebrahimi Kahou

Thomas Fevens

Mohammad Havaei

While neural networks are capable of achieving human-like performance in many tasks such as image classification, the impressive performance… (voir plus) of each model is limited to its own dataset. Source-free domain adaptation (SFDA) was introduced to address knowledge transfer between different domains in the absence of source data, thus, increasing data privacy. Diversity in representation space can be vital to a model`s adaptability in varied and difficult domains. In unsupervised SFDA, the diversity is limited to learning a single hypothesis on the source or learning multiple hypotheses with a shared feature extractor. Motivated by the improved predictive performance of ensembles, we propose a novel unsupervised SFDA algorithm that promotes representational diversity through the use of separate feature extractors with Distinct Backbone Architectures (DBA). Although diversity in feature space is increased, the unconstrained mutual information (MI) maximization may potentially introduce amplification of weak hypotheses. Thus we introduce the Weak Hypothesis Penalization (WHP) regularizer as a mitigation strategy. Our work proposes Penalized Diversity (PD) where the synergy of DBA and WHP is applied to unsupervised source-free domain adaptation for covariate shift. In addition, PD is augmented with a weighted MI maximization objective for label distribution shift. Empirical results on natural, synthetic, and medical domains demonstrate the effectiveness of PD under different distributional shifts.

2023-04-06

ArXiv (prépublication)

doi.org

arxiv.org

Bugs in machine learning-based systems: a faultload benchmark

Mohammad Mehdi Morovati

Amin Nikanjam

Foutse Khomh

Z. Jiang

2023-04-05

Empirical Software Engineering (publié)

doi.org

arxiv.org

Abstract 2987: BamQuery: a new proteogenomic tool to explore the immunopeptidome and prioritize actionable tumor antigens

Maria-Virginia Ruiz Cuevas

Marie-Pierre Hardy

Jean-David Larouche

Anca Apavaloaei

Eralda Kina

Krystel Vincent

Patrick Gendron

Jean-Philippe Laverdure

Chantal Durette

Pierre Thibault

Sébastien Lemieux

Claude Perreault

Grégory Ehx

MHC class I-associated peptides (MAPs), collectively referred to as the immunopeptidome, have a pivotal role in cancer immunosurveillance. W… (voir plus)hile MAPs were long thought to be solely generated by the degradation of canonical proteins, recent advances in the field of proteogenomics (genomically-informed proteomics) evidenced that ∼10% of them originate from allegedly noncoding genomic sequences. Among these sequences, endogenous retroelements (EREs) are under intense scrutiny as a possible source of actionable tumor antigens (TAs). With the increasing number of cancer-oriented immunopeptidomic and proteogenomic studies comes the need to accurately attribute an RNA expression level to each MAP identified by mass-spectrometry. Here, we introduce BamQuery (BQ), a computational tool to attribute an exhaustive RNA expression to MAPs of any genomic origin (exon, intron, UTR, intergenic) from bulk and single-cell RNA-sequencing data. By using BQ on large datasets of published MAPs identified by mass spectrometry, we show that many of them can arise from more than one genomic region. Indeed, 27% of MAPs reported as deriving from protein-coding exons (canonical MAPs) could also arise from non-canonical genomic regions, sometimes with greater probability, and 61% of non-canonical MAPs could arise from more than a single genomic origin (334 possible regions on average per non-canonical MAP; up to 35,343 for EREs). The consideration of all these origins evidenced an unsuspected high RNA expression in normal human tissues of (i) published neoantigens/TAs (mutated or not); (ii) MAPs derived from proteasomal splicing, supposedly not genomically templated, and (iii) MAPs derived from viruses. In particular, the high expression of candidate immunotherapeutic targets such as TAs highlights the relevance of BamQuery and the necessity of using it to validate such antigens before translating their usage in clinical trials. We also demonstrate that BamQuery can be used to directly identify safe and actionable TAs as well as to predict their immunogenicity through our freely accessible web portal (https://bamquery.iric.ca/search). Therefore, BQ could become an essential tool in any TA prioritization pipeline in the near future. Citation Format: Maria-Virginia Ruiz Cuevas, Marie-Pierre Hardy, Jean-David Larouche, Anca Apavaloaei, Eralda Kina, Krystel Vincent, Patrick Gendron, Jean-Philippe Laverdure, Chantal Durette, Pierre Thibault, Sebastien Lemieux, Claude Perreault, Gregory Ehx. BamQuery: a new proteogenomic tool to explore the immunopeptidome and prioritize actionable tumor antigens [abstract]. In: Proceedings of the American Association for Cancer Research Annual Meeting 2023; Part 1 (Regular and Invited Abstracts); 2023 Apr 14-19; Orlando, FL. Philadelphia (PA): AACR; Cancer Res 2023;83(7_Suppl):Abstract nr 2987.

2023-04-04

Cancer Research (publié)

doi.org

Programme d’apprentissage IA sur mesure

Mil'Haq Fest 2025

Communauté de pratique de Mila

Demandes de supervision

Publications

Programme d’apprentissage IA sur mesure

Mil'Haq Fest 2025

Communauté de pratique de Mila

Demandes de supervision

Mots-clés populaires:

Publications