Publications

A flaw in using pre-trained pLLMs in protein-protein interaction inference models

Joseph Szymborski

With the growing pervasiveness of pre-trained protein large language models (pLLMs), pLLM-based methods are increasingly being put forward f… (see more)or the protein-protein interaction (PPI) inference task. Here, we identify and confirm that existing pre-trained pLLMs are a source of data leakage for the downstream PPI task. We characterize the extent of the data leakage problem by training and comparing small and efficient pLLMs on a dataset that controls for data leakage (“strict”) with one that does not (“non-strict”). While data leakage from pre-trained pLLMs cause measurable inflation of testing scores, we find that this does not necessarily extend to other, non-paired biological tasks such as protein keyword annotation. Further, we find no connection between the context-lengths of pLLMs and the performance of pLLM-based PPI inference methods on proteins with sequence lengths that surpass it. Furthermore, we show that pLLM-based and non-pLLM-based models fail to generalize in tasks such as prediction of the human-SARS-CoV-2 PPIs or the effect of point mutations on binding-affinities. This study demonstrates the importance of extending existing protocols for the evaluation of pLLM-based models applied to paired biological datasets and identifies areas of weakness of current pLLM models.

2025-04-23

bioRxiv (preprint)

doi.org

Representation Learning via Non-Contrastive Mutual Information

Zhaohan Daniel Guo

Bernardo Avila Pires

Khimya Khetarpal

Dale Schuurmans

Bo Dai

2025-04-23

ArXiv (preprint)

doi.org

arxiv.org

Representation Learning via Non-Contrastive Mutual Information

Zhaohan Daniel Guo

Bernardo Avila Pires

Khimya Khetarpal

Dale Schuurmans

Bo Dai

2025-04-23

ArXiv (preprint)

arxiv.org

LLMs are Greedy Agents: Effects of RL Fine-tuning on Decision-Making Abilities

Thomas Schmied

Jorg Bornschein

Jordi Grau-Moya

Markus Wulfmeier

Razvan Pascanu

2025-04-22

ArXiv (preprint)

arxiv.org

Neural Kinematic Bases for Fluids

Yibo Liu

Paul Kry

Kenny Erleben

Noam Aigerman

Sune Darkner

Teseo Schneider

2025-04-22

ArXiv (preprint)

arxiv.org

Refining sequence-to-expression modelling with chromatin accessibility

Orsolya Lapohos

Gregory J. Fonseca

Amin Emad

2025-04-21

bioRxiv (preprint)

doi.org

Cortical differences across psychiatric disorders and associated common and rare genetic variants

Kuldeep Kumar

Zhijie Liao

Jakub Kopal

Clara Moreau

Christopher R. K. Ching

Claudia Modenato

Will Snyder

Sayeh Kazem

Charles-Olivier Martin

C.O. Martin

Anne-Marie Bélanger

Valérie K. Fontaine

Khadije Jizi

Rune Boen

Guillaume Huguet

Zohra Saci

Leila Kushan

Ana I. Silva

Marianne B.M. van den Bree

David E.J. Linden … (see 16 more)

Michael J. Owen

Jeremy Hall

Sarah Lippé

Guillaume Dumas

Bogdan Draganski

Laura Almasy

Sophia I. Thomopoulos

Neda Jahanshad

Ida E. Sønderby

Ole A. Andreassen

David C. Glahn

Armin Raznahan

Carrie Bearden

Tomas Paus

Paul M. Thompson

Sébastien Jacquemont

2025-04-19

medRxiv (preprint)

doi.org

Impact of Reducing Time Lived With Colostomies on Social Stigma Affecting Children With Anorectal Malformations in Southwestern Uganda.

Felix Oyania

Caroline Q. Stephens

Sarah Ullrich

Meera Kotagal

Amy M. Shui

Caleb Tuhumwire

G. Rukundo

Joseph Ngonzi

Ava Yap

Francis Bajunirwe

Dan Poenaru

Doruk Ozgediz

2025-04-19

World Journal of Surgery (published)

doi.org

Impact of Reducing Time Lived With Colostomies on Social Stigma Affecting Children With Anorectal Malformations in Southwestern Uganda.

Felix Oyania

Caroline Q. Stephens

Sarah Ullrich

Meera Kotagal

Amy M. Shui

Caleb Tuhumwire

G. Rukundo

Joseph Ngonzi

Ava Yap

Francis Bajunirwe

Dan Poenaru

Doruk Ozgediz

BACKGROUND The social stigma of families of children living with colostomies due to anorectal malformation (ARM) is significant in low-incom… (see more)e countries (LICs). Improved access to pediatric surgery has resulted in more 1-stage ARM procedures in Southwestern Uganda, avoiding colostomy creation, but the impact on social stigma experienced by families is unknown. We hypothesized that this change would decrease the social stigma experienced by families. METHODS A single-center mixed retrospective and prospective cohort study with combined qualitative data of families of children with ARM who underwent corrective surgery compared the stigma experienced by those with colostomies to those without. The Kilifi Stigma Scale of Epilepsy (KSSE) was used to assess social stigma. Multivariable regression analysis assessed differences in the stigma experienced, controlling for age at diagnosis, rurality, distance traveled, sex, and parental education. Subgroup analysis assessed the impact of colostomy duration on stigma, stratified over parental education. RESULTS Patient/family dyads with 238 ARM were included; 177 (74%) received a colostomy. Most patients were male (51%), lived in rural areas (71%), and had parents with primary school education (65%). For those without a colostomy, the median KSSE was 0 (Q1-Q3 0-0), compared to 11 (Q1-Q3 3-20) for colostomy. On multivariable analysis, after controlling for age at diagnosis, rurality, distance traveled, sex, and parental education attainment, families of patients with ARM who received a colostomy had a median KSSE score 7.8 points higher than those who did not receive a colostomy (coefficient 7.78, 95% 3.14-12.43, and p = 0.001). When the duration of colostomy (in years) was examined, the median KSSE score increased by 1.58 points for each additional year for a patient who had a colostomy (IRR 1.58, 95% CI: 0.76-2.40, and p 0.001). CONCLUSION Adopting a 1-stage ARM repair for the select types, which avoids colostomy creation, significantly reduces the exper

2025-04-19

World Journal of Surgery (published)

doi.org

Learning from Stochastic Teacher Representations Using Student-Guided Knowledge Distillation

Muhammad Haseeb Aslam

Clara Martinez

Marco Pedersoli

Alessandro Lameiras Koerich

Ali Etemad

Eric Granger

2025-04-19

ArXiv (preprint)

arxiv.org

Locate 3D: Real-World Object Localization via Self-Supervised Learning in 3D

Sergio Arnaud

Paul McVay

Ada Martin

Arjun Majumdar

Krishna Murthy

Phillip Thomas

Ruslan Partsey

Daniel Dugas

Abha Gejji

Alexander Sax

Vincent-Pierre Berges

Mikael Henaff

Ayush Jain

Ang Cao

Ishita Prasad

Mrinal Kalakrishnan

Michael Rabbat

Nicolas Ballas

Mido Assran

Oleksandr Maksymets … (see 2 more)

Aravind Rajeswaran

Franziska Meier

2025-04-19

ArXiv (preprint)

arxiv.org

Online HD-tRNS over the right temporoparietal junction modulates social inference but not motor coordination

Quentin Moreau

Vincent Chamberland

Lisane Moses

Gabriela Milanova

Guillaume Dumas

2025-04-19

bioRxiv (preprint)

doi.org

AI Advantage

Leveraging AI for a Sustainable Future

Mila AI Policy Fellowship

AI Advantage

Leveraging AI for a Sustainable Future

Publications

AI Advantage

Leveraging AI for a Sustainable Future

Mila AI Policy Fellowship

AI Advantage

Leveraging AI for a Sustainable Future

Popular keywords:

Publications