Quentin Fournier

quentin.fournier@mila.quebec

Fellow de recherche, Talent et écosystème

Billets de blogue

19 décembre 2025

Optimiser la conception CAO grâce aux LLM

par

Prashant Govindarajan

Davide Baldelli

Quentin Fournier

Sarath Chandar

Lire l'article

A digital picture of Bert from Sesame street, wering black trench coat and sunglasses

3 mars 2025

NeoBERT: une nouvelle frontière pour les modèles de langage encodeurs open-source

par

Lola Le Breton

Quentin Fournier

Sarath Chandar

Lire l'article

Publications

Exploring Quantization for Efficient Pre-Training of Transformer Language Models

Kamran Chitsaz

Quentin Fournier

Goncalo Mordido

A. Chandar

The increasing scale of Transformer models has led to an increase in their pre-training computational requirements. While quantization has p… (voir plus)roven to be effective after pre-training and during fine-tuning, applying quantization in Transformers during pre-training has remained largely unexplored at scale for language modeling. This study aims to explore the impact of quantization for efficient pre-training of Transformers, with a focus on linear layer components. By systematically applying straightforward linear quantization to weights, activations, gradients, and optimizer states, we assess its effects on model efficiency, stability, and performance during training. By offering a comprehensive recipe of effective quantization strategies to be applied during the pre-training of Transformers, we promote high training efficiency from scratch while retaining language modeling ability. Code is available at https://github.com/chandar-lab/EfficientLLMs.

2023-12-31

EMNLP (Findings) (publié)

doi.org

arxiv.org

Mila sur Udemy

Publications du Fellowship en politiques de l'IA

La plateforme Mila Ventures

Quentin Fournier

Billets de blogue

Publications

Mila sur Udemy

Publications du Fellowship en politiques de l'IA

La plateforme Mila Ventures

Mots-clés populaires:

Quentin Fournier

Billets de blogue

Publications