Almer Van Der Sloot

Scientific Facilitator, Scientific Directorate

Publications

Protein Language Models: Is Scaling Necessary?
Quentin Fournier
Robert M. Vernon
Almer van der Sloot
Benjamin Schulz
Christopher James Langmead
Public protein sequence databases contain samples from the fitness landscape explored by nature. Protein language models (pLMs) pre-trained on these sequences aim to capture this landscape for tasks like property prediction and protein design. Following the same trend as in natural language processing, pLMs have continuously been scaled up. However, the premise that scale leads to better performance assumes that source databases provide an accurate representation of the underlying fitness landscape, which is likely false. By developing an efficient codebase, designing a modern architecture, and addressing data quality concerns such as sample bias, we introduce AMPLIFY, a best-in-class pLM that is orders of magnitude less expensive to train and deploy than previous models. Furthermore, to support the scientific community and democratize the training of pLMs, we have open-sourced AMPLIFY’s pre-training codebase, data, and model checkpoints.
RECOVER identifies synergistic drug combinations in vitro through sequential model optimization
Paul Bertin
Jarrid Rector-Brooks
Deepak Sharma
Thomas Gaudelet
Andrew Anighoro
Torsten Gross
Francisco Martínez-Peña
Eileen L. Tang
M.S. Suraj
Cristian Regep
Jeremy B.R. Hayter
Maksym Korablyov
Nicholas Valiante
Almer van der Sloot
Mike Tyers
Charles E.S. Roberts
Michael M. Bronstein
Luke L. Lairson
Jake P. Taylor-King