Komal Kumar Teru

Alumni

Publications

Tracing the Representation Geometry of Language Models from Pretraining to Post-training
Melody Zixuan Li
Adam Santoro
Blake A. Richards
Standard training metrics like loss fail to explain the emergence of complex capabilities in large language models. We take a spectral approach to investigate the geometry of learned representations across pretraining and post-training, measuring effective rank (RankMe) and eigenspectrum decay (