Gaurav Iyer

Research Master's - McGill

Principal supervisor

David Rolnick

Research topics

Deep learning

Google Scholar

Publications

Linear Weight Interpolation Leads to Transient Performance Gains

Gaurav Iyer

Gintare Karolina Dziugaite

David Rolnick

2023-12-31

Trans. Mach. Learn. Res. (published)

openreview.net

Maximal Initial Learning Rates in Deep ReLU Networks

Gaurav Iyer

Boris Hanin

David Rolnick

Training a neural network requires choosing a suitable learning rate, which involves a trade-off between speed and effectiveness of convergence. While there has been considerable theoretical and empirical analysis of how large the learning rate can be, most prior work focuses only on late-stage training. In this work, we introduce the maximal initial learning rate

2023-07-02

Proceedings of the 40th International Conference on Machine Learning (published)

doi.org

proceedings.mlr.press

