Portrait de Michael Rabbat n'est pas disponible

Michael Rabbat

Membre industriel associé
Chercheur scientifique, AMI Labs
Sujets de recherche
Apprentissage de représentations
Optimisation
Systèmes distribués

Biographie

Mike Rabbat est membre affilié de Mila et cofondateur et vice-président des « World Models » chez Advanced Machine Intelligence (AMI). Ses intérêts de recherche portent sur l'apprentissage de représentations efficace et robuste, l'apprentissage auto-supervisé, ainsi que la planification et le contrôle basés sur des modèles. Il s'intéresse également à l'optimisation pour l'entraînement efficace de modèles.

Publications

Self-Supervised Learning from Images with a Joint-Embedding Predictive Architecture
Mahmoud Assran
Quentin Duval
Ishan Misra
Piotr Bojanowski
This paper demonstrates an approach for learning highly semantic image representations without relying on hand-crafted data-augmentations. W… (voir plus)e introduce the Image-based Joint-Embedding Predictive Architecture (I-JEPA), a non-generative approach for self-supervised learning from images. The idea behind I-JEPA is simple: from a single context block, predict the representations of various target blocks in the same image. A core design choice to guide I-JEPA towards producing semantic representations is the masking strategy; specifically, it is crucial to (a) sample target blocks with sufficiently large scale (semantic), and to (b) use a sufficiently informative (spatially distributed) context block. Empirically, when combined with Vision Transformers, we find I-JEPA to be highly scalable. For instance, we train a ViT-Huge/14 on ImageNet using 16 A100 GPUs in under 72 hours to achieve strong downstream performance across a wide range of tasks, from linear classification to object counting and depth prediction.