Publications

Open, Closed, or Small Language Models for Text Classification?
Hao Yu
Zachary Yang
Kellin Pelrine
Jean-François Godbout
Recent advancements in large language models have demonstrated remarkable capabilities across various NLP tasks. But many questions remain, including whether open-source models match closed ones, why these models excel or struggle with certain tasks, and what types of practical procedures can improve performance. We address these questions in the context of classification by evaluating three classes of models using eight datasets across three distinct tasks: named entity recognition, political party prediction, and misinformation detection. While larger LLMs often lead to improved performance, open-source models can rival their closed-source counterparts by fine-tuning. Moreover, supervised smaller models, like RoBERTa, can achieve similar or even greater performance in many datasets compared to generative LLMs. On the other hand, closed models maintain an advantage in hard tasks that demand the most generalizability. This study underscores the importance of model selection based on task requirements.
Pontomedullary junction as a reference for spinal cord cross-sectional area: validation across neck positions
Sandrine Bédard
Maxime Bouthillier
GTM-decon: guided-topic modeling of single-cell transcriptomes enables sub-cell-type and disease-subtype deconvolution of bulk transcriptomes
Lakshmipuram Seshadri Swapna
Michael Huang
YORC: Yoruba Reading Comprehension dataset
Aremu Anuoluwapo
Jesujoba Oluwadara Alabi
In this paper, we create YORC: a new multi-choice Yoruba Reading Comprehension dataset that is based on Yoruba high-school reading comprehension examinations. We provide baseline results by performing cross-lingual transfer using the existing English RACE dataset based on a pre-trained encoder-only model. Additionally, we provide results by prompting large language models (LLMs) like GPT-4.
Age-related bias and artificial intelligence: a scoping review
Charlene H Chu
Simon Donato-Woodger
Shehroz S Khan
Rune Nyrup
Kathleen Leslie
Alexandra Lyn
Tianyu Shi
Andria Bianchi
Amanda Grenier
Consciousness in Artificial Intelligence: Insights from the Science of Consciousness
Patrick Mark Butlin
R. Long
Eric Elmoznino
Jonathan C. P. Birch
Axel Constant
George Deane
S. Fleming
C. Frith
Xuanxiu Ji
Ryota Kanai
C. Klein
Grace W. Lindsay
Matthias Michel
Liad Mudrik
Megan A. K. Peters
Eric Schwitzgebel
Jonathan Simon
Rufin Vanrullen
Hitting the High-Dimensional Notes: An ODE for SGD learning dynamics on GLMs and multi-index models
Elizabeth Collins-Woodfin
Elliot Paquette
Inbar Seroussi
AstroPhot: Fitting Everything Everywhere All at Once in Astronomical Images
Connor J Stone
Stéphane Courteau
Jean-Charles Cuillandre
Nikhil Arora
BamQuery: a proteogenomic tool to explore the immunopeptidome and prioritize actionable tumor antigens
Maria-Virginia Ruiz Cuevas
Marie-Pierre Hardy
Jean-David Larouche
Anca Apavaloaei
Eralda Kina
Krystel Vincent
Patrick Gendron
Jean-Philippe Laverdure
Chantal Durette
Pierre Thibault
Claude Perreault
Grégory Ehx
Morphological Parameters and Associated Uncertainties for 8 Million Galaxies in the Hyper Suprime-Cam Wide Survey
Aritra Ghosh
C. Urry
Aayush Mishra
P. Natarajan
D. Sanders
Daisuke Nagai
Chuan Tian
Nico Cappelluti
J. Kartaltepe
M. Powell
Amrit Rau
Ezequiel Treister
We use the Galaxy Morphology Posterior Estimation Network (GaMPEN) to estimate morphological parameters and associated uncertainties for ∼8 million galaxies in the Hyper Suprime-Cam Wide survey with z ≤ 0.75 and m ≤ 23. GaMPEN is a machine-learning framework that estimates Bayesian posteriors for a galaxy’s bulge-to-total light ratio (L_B/L_T), effective radius (R_e), and flux (F). By first training on simulations of galaxies and then applying transfer learning using real data, we trained GaMPEN with 1% of our data set. This two-step process will be critical for applying machine-learning algorithms to future large imaging surveys, such a
Using Confounded Data in Latent Model-Based Reinforcement Learning
Damien Grasset
Guillaume Gaudron
Pierre-Yves Oudeyer
The evolution of SARS-CoV-2 seroprevalence in Canada: a time-series study, 2020–2023
Tanya J. Murphy
Hanna Swail
Jaspreet Jain
Maureen Anderson
Philip Awadalla
Lesley Behl
P. Brown
C. Charlton
Karen Colwill
S. Drews
A. Gingras
Deena Hinshaw
P. Jha
J. Kanji
Victoria A. Kirsh
Amanda Lang
Marc-André Langlois
Stephen Lee
Antoine Lewin
Sheila F O’Brien
Chantale Pambrun
Kimberly Skead
David A. Stephens
Derek R. Stein
G. Tipples
Paul G. Van Caeseele
Timothy Grant Evans
Olivia Oxlade
Bruce D. Mazer
Background: During the first year of the COVID-19 pandemic, the proportion of reported cases of COVID-19 among Canadians was under 6%. Although high vaccine coverage was achieved in Canada by fall 2021, the Omicron variant caused unprecedented numbers of infections, overwhelming testing capacity and making it difficult to quantify the trajectory of population immunity. Methods: Using a time-series approach and data from more than 900 000 samples collected by 7 research studies collaborating with the COVID-19 Immunity Task Force (CITF), we estimated trends in SARS-CoV-2 seroprevalence owing to infection and vaccination for the Canadian population over 3 intervals: prevaccination (March to November 2020), vaccine roll-out (December 2020 to November 2021), and the arrival of the Omicron variant (December 2021 to March 2023). We also estimated seroprevalence by geographical region and age. Results: By November 2021, 9.0% (95% credible interval [CrI] 7.3%–11%) of people in Canada had humoral immunity to SARS-CoV-2 from an infection. Seroprevalence increased rapidly after the arrival of the Omicron variant; by Mar. 15, 2023, 76% (95% CrI 74%–79%) of the population had detectable antibodies from infections. The rapid rise in infection-induced antibodies occurred across Canada and was most pronounced in younger age groups and in the Western provinces: Manitoba, Saskatchewan, Alberta and British Columbia. Interpretation: Data up to March 2023 indicate that most people in Canada had acquired antibodies against SARS-CoV-2 through natural infection and vaccination. However, given variations in population seropositivity by age and geography, the potential for waning antibody levels, and new variants that may escape immunity, public health policy and clinical decisions should be tailored to local patterns of population immunity.