Publications

The semantic distance between a linguistic prime and a natural scene target predicts reaction times in a visual search experiment

Katerina Marie Simkova

Jasper JF van den Bosch

Damiano Grignolio

Clayton Hickey

Ian Charest

2023-08-01

Journal of Vision (publié)

doi.org

Variational Nested Dropout

Yufei Cui

Yu Mao

Ziquan Liu

Qiao Li

Antoni B. Chan

Xue (Steve) Liu

Tei-Wei Kuo

Chun Jason Xue

Nested dropout is a variant of dropout operation that is able to order network parameters or features based on the pre-defined importance du… (voir plus)ring training. It has been explored for: I. Constructing nested nets Cui et al. 2020, Cui et al. 2021: the nested nets are neural networks whose architectures can be adjusted instantly during testing time, e.g., based on computational constraints. The nested dropout implicitly ranks the network parameters, generating a set of sub-networks such that any smaller sub-network forms the basis of a larger one. II. Learning ordered representation Rippel et al. 2014: the nested dropout applied to the latent representation of a generative model (e.g., auto-encoder) ranks the features, enforcing explicit order of the dense representation over dimensions. However, the dropout rate is fixed as a hyper-parameter during the whole training process. For nested nets, when network parameters are removed, the performance decays in a human-specified trajectory rather than in a trajectory learned from data. For generative models, the importance of features is specified as a constant vector, restraining the flexibility of representation learning. To address the problem, we focus on the probabilistic counterpart of the nested dropout. We propose a variational nested dropout (VND) operation that draws samples of multi-dimensional ordered masks at a low cost, providing useful gradients to the parameters of nested dropout. Based on this approach, we design a Bayesian nested neural network that learns the order knowledge of the parameter distributions. We further exploit the VND under different generative models for learning ordered latent distributions. In experiments, we show that the proposed approach outperforms the nested network in terms of accuracy, calibration, and out-of-domain detection in classification tasks. It also outperforms the related generative models on data generation tasks.

2023-08-01

IEEE Transactions on Pattern Analysis and Machine Intelligence (publié)

doi.org

arxiv.org

Do visual mental imagery and exteroceptive perception rely on the same mechanisms?

Catherine Landry

Jasper JF van den Bosch

Ian Charest

Frédéric Gosselin

Vincent Taschereau-Dumouchel

2023-08-01

Journal of Vision (publié)

doi.org

Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering

Vaibhav Adlakha

Parishad BehnamGhader

Xing Han Lu

Nicholas Meade

Siva Reddy

Retriever-augmented instruction-following models are attractive alternatives to fine-tuned approaches for information-seeking tasks such as … (voir plus)question answering (QA). By simply prepending retrieved documents in its input along with an instruction, these models can be adapted to various information domains and tasks without additional fine-tuning. While the model responses tend to be natural and fluent, the additional verbosity makes traditional QA evaluation metrics such as exact match (EM) and F1 unreliable for accurately quantifying model performance. In this work, we investigate the performance of instruction-following models across three information-seeking QA tasks. We use both automatic and human evaluation to evaluate these models along two dimensions: 1) how well they satisfy the user's information need (correctness), and 2) whether they produce a response based on the provided knowledge (faithfulness). Guided by human evaluation and analysis, we highlight the shortcomings of traditional metrics for both correctness and faithfulness. We then propose simple token-overlap based and model-based metrics that reflect the true performance of these models. Our analysis reveals that instruction-following models are competitive, and sometimes even outperform fine-tuned models for correctness. However, these models struggle to stick to the provided knowledge and often hallucinate in their responses. We hope our work encourages a more holistic evaluation of instruction-following models for QA. Our code and data is available at https://github.com/McGill-NLP/instruct-qa

2023-07-31

ArXiv (prépublication)

doi.org

arxiv.org

GRouNdGAN: GRN-guided simulation of single-cell RNA-seq data using causal generative adversarial networks

Yazdan Zinati

Abdulrahman Takiddeen

Amin Emad

We introduce GRouNdGAN, a gene regulatory network (GRN)-guided causal implicit generative model for simulating single-cell RNA-seq data, in-… (voir plus)silico perturbation experiments, and benchmarking GRN inference methods. Through the imposition of a user-defined GRN in its architecture, GRouNdGAN simulates steady-state and transient-state single-cell datasets where genes are causally expressed under the control of their regulating transcription factors (TFs). Training on three experimental datasets, we show that our model captures non-linear TF-gene dependences and preserves gene identities, cell trajectories, pseudo-time ordering, and technical and biological noise, with no user manipulation and only implicit parameterization. Despite imposing rigid causality constraints, it outperforms state-of-the-art simulators in generating realistic cells. GRouNdGAN learns meaningful causal regulatory dynamics, allowing sampling from both observational and interventional distributions. This enables it to synthesize cells under conditions that do not occur in the dataset at inference time, allowing to perform in-silico TF knockout experiments. Our results show that in-silico knockout of cell type-specific TFs significantly reduces cells of that type being generated. Interactions imposed through the GRN are emphasized in the simulated datasets, resulting in GRN inference algorithms assigning them much higher scores than interactions not imposed but of equal importance in the experimental training dataset. Benchmarking various GRN inference algorithms reveals that GRouNdGAN effectively bridges the existing gap between simulated and biological data benchmarks of GRN inference algorithms, providing gold standard ground truth GRNs and realistic cells corresponding to the biological system of interest. Our results show that GRouNdGAN is a stable, realistic, and effective simulator with various applications in single-cell RNA-seq analysis.

2023-07-31

bioRxiv (prépublication)

doi.org

Multivariate analytical approaches for investigating brain-behavior relationships

E. Leighton Durham

Karam Ghanem

Andrew J. Stier

Carlos Cardenas-Iniguez

Gabrielle E. Reimann

Hee Jung Jeong

Randolph M. Dupont

Xiaoyu Dong

Tyler M. Moore

Marc G. Berman

Benjamin B. Lahey

Danilo Bzdok

Antonia N. Kaczkurkin

2023-07-31

Frontiers in Neuroscience (publié)

doi.org

FASHION AND SUSTAINABILITY: A SYSTEMATIC LITERATURE REVIEW

Osmud Rahman

Dingtao Hu

Benjamin Fung

2023-07-30

Global Fashion Management Conference (publié)

doi.org

GPS++: Reviving the Art of Message Passing for Molecular Property Prediction

Dominic Masters

Josef Dean

Kerstin Klaeser

Zhiyi Li

Samuel Maddrell-Mander

Adam Sanders

Hatem Helal

Deniz Beker

Andrew William Fitzgibbon

Shenyang Huang

Ladislav Rampášek

Dominique Beaini

2023-07-30

TMLR (accepté)

doi.org

openreview.net

Online Interior-point Methods for Time-varying Equality-constrained Optimization

Jean-Luc Lupien

Iman Shames

Antoine Lesage-Landry

2023-07-30

ArXiv (prépublication)

doi.org

arxiv.org

Repurposing Density Functional Theory to Suit Deep Learning

Alexander Mathiasen

Hatem Helal

Paul Balanca

Kerstin Klaeser

Josef Dean

Carlo Luschi

Dominique Beaini

Andrew William Fitzgibbon

Dominic Masters

Density Functional Theory (DFT) accurately predicts the properties of molecules given their atom types and positions, and often serves as gr… (voir plus)ound truth for molecular property prediction tasks. Neural Networks (NN) are popular tools for such tasks and are trained on DFT datasets, with the aim to approximate DFT at a fraction of the computational cost. Research in other areas of machine learning has shown that generalisation performance of NNs tends to improve with increased dataset size, however, the computational cost of DFT limits the size of DFT datasets. We present PySCFIPU, a DFT library that allows us to iterate on both dataset generation and NN training. We create QM10X, a dataset with 100M conformers, in 13 hours, on which we subsequently train SchNet in 12 hours. We show that the predictions of SchNet improve solely by increasing training data without incorporating further inductive biases.

2023-07-28

ICML.cc/2023/Workshop/SynS_and_ML (publié)

openreview.net

What if We Enrich day-ahead Solar Irradiance Time Series Forecasting with Spatio-Temporal Context?

Oussama Boussif

Ghait Boukachab

Dan Assouline

Stefano Massaroli

Tianle Yuan

Loubna Benabbou

Yoshua Bengio

The global integration of solar power into the electrical grid could have a crucial impact on climate change mitigation, yet poses a challen… (voir plus)ge due to solar irradiance variability. We present a deep learning architecture which uses spatio-temporal context from satellite data for highly accurate day-ahead time-series forecasting, in particular Global Horizontal Irradiance (GHI). We provide a multi-quantile variant which outputs a prediction interval for each time-step, serving as a measure of forecasting uncertainty. In addition, we suggest a testing scheme that separates easy and difficult scenarios, which appears useful to evaluate model performance in varying cloud conditions. Our approach exhibits robust performance in solar irradiance forecasting, including zero-shot generalization tests at unobserved solar stations, and holds great promise in promoting the effective use of solar power and the resulting reduction of CO

2023-07-28

ICML.cc/2023/Workshop/SynS_and_ML (publié)

openreview.net

What if We Enrich day-ahead Solar Irradiance Time Series Forecasting with Spatio-Temporal Context?

Oussama Boussif

Ghait Boukachab

Dan Assouline

Stefano Massaroli

Tianle Yuan

Loubna Benabbou

Yoshua Bengio

2023-07-28

ICML.cc/2023/Workshop/SynS_and_ML (publié)

doi.org

openreview.net

Le traitement du langage naturel à l'ère de l'IA générative

Boussole des politiques en IA

Vie étudiante et ressources

Publications

Le traitement du langage naturel à l'ère de l'IA générative

Boussole des politiques en IA

Vie étudiante et ressources

Mots-clés populaires:

Publications