Publications

Faithfulness Measurable Masked Language Models

Andreas Madsen

Siva Reddy

Sarath Chandar

2024-05-01

ICML.cc/2024/Conference (spotlight)

Generative AI in Software Engineering Must Be Human-Centered: The Copenhagen Manifesto

Daniel Russo

Sebastian Baltes

Niels van Berkel

Paris Avgeriou

Fabio Calefato

Beatriz Cabrero-Daniel

Gemma Catolino

Jürgen Cito

Neil Ernst

Thomas Fritz

Hideaki Hata

Reid Holmes

Maliheh Izadi

Mikkel Baun Kjærgaard

Grischa Liebel

Alberto Lluch Lafuente

Stefano Lambiase

Walid Maalej

Gail Murphy … (voir 15 de plus)

Nils Brede Moe

Gabrielle O'Brien

Elda Paja

Mauro Pezzè

John Stouby Persson

Rafael Prikladnicki

Paul Ralph

Martin P. Robillard

Thiago Rocha Silva

Klaas-Jan Stol

Margaret-Anne Storey

Viktoria Stray

Paolo Tell

Christoph Treude

Bogdan Vasilescu

2024-05-01

Journal of Systems and Software (publié)

Generative AI in Software Engineering Must Be Human-Centered: The Copenhagen Manifesto

Daniel Russo

Sebastian Baltes

Niels van Berkel

Paris Avgeriou

Fabio Calefato

Beatriz Cabrero-Daniel

Gemma Catolino

Jürgen Cito

Neil Ernst

Thomas Fritz

Hideaki Hata

Reid Holmes

Maliheh Izadi

Mikkel Baun Kjærgaard

Grischa Liebel

Alberto Lluch Lafuente

Stefano Lambiase

Walid Maalej

Gail Murphy … (voir 15 de plus)

Nils Brede Moe

Gabrielle O'Brien

Elda Paja

Mauro Pezzè

John Stouby Persson

Rafael Prikladnicki

Paul Ralph

Martin P. Robillard

Thiago Rocha Silva

Klaas-Jan Stol

Margaret-Anne Storey

Viktoria Stray

Paolo Tell

Christoph Treude

Bogdan Vasilescu

2024-05-01

Journal of Systems and Software (publié)

Generative AI in Software Engineering Must Be Human-Centered: The Copenhagen Manifesto

Daniel Russo

Sebastian Baltes

Niels van Berkel

Paris Avgeriou

Fabio Calefato

Beatriz Cabrero-Daniel

Gemma Catolino

Jürgen Cito

Neil Ernst

Thomas Fritz

Hideaki Hata

Reid Holmes

Maliheh Izadi

Mikkel Baun Kjærgaard

Grischa Liebel

Alberto Lluch Lafuente

Stefano Lambiase

Walid Maalej

Gail Murphy … (voir 15 de plus)

Nils Brede Moe

Gabrielle O'Brien

Elda Paja

Mauro Pezzè

John Stouby Persson

Rafael Prikladnicki

Paul Ralph

Martin P. Robillard

Thiago Rocha Silva

Klaas-Jan Stol

Margaret-Anne Storey

Viktoria Stray

Paolo Tell

Christoph Treude

Bogdan Vasilescu

2024-05-01

Journal of Systems and Software (publié)

Generative AI in Software Engineering Must Be Human-Centered: The Copenhagen Manifesto

Daniel Russo

Sebastian Baltes

Niels van Berkel

Paris Avgeriou

Fabio Calefato

Beatriz Cabrero-Daniel

Gemma Catolino

Jürgen Cito

Neil Ernst

Thomas Fritz

Hideaki Hata

Reid Holmes

Maliheh Izadi

Mikkel Baun Kjærgaard

Grischa Liebel

Alberto Lluch Lafuente

Stefano Lambiase

Walid Maalej

Gail Murphy … (voir 15 de plus)

Nils Brede Moe

Gabrielle O'Brien

Elda Paja

Mauro Pezzè

John Stouby Persson

Rafael Prikladnicki

Paul Ralph

Martin P. Robillard

Thiago Rocha Silva

Klaas-Jan Stol

Margaret-Anne Storey

Viktoria Stray

Paolo Tell

Christoph Treude

Bogdan Vasilescu

2024-05-01

Journal of Systems and Software (publié)

Generative AI in Software Engineering Must Be Human-Centered: The Copenhagen Manifesto

Daniel Russo

Sebastian Baltes

Niels van Berkel

Paris Avgeriou

Fabio Calefato

Beatriz Cabrero-Daniel

Gemma Catolino

Jürgen Cito

Neil Ernst

Thomas Fritz

Hideaki Hata

Reid Holmes

Maliheh Izadi

Mikkel Baun Kjærgaard

Grischa Liebel

Alberto Lluch Lafuente

Stefano Lambiase

Walid Maalej

Gail Murphy … (voir 15 de plus)

Nils Brede Moe

Gabrielle O'Brien

Elda Paja

Mauro Pezzè

John Stouby Persson

Rafael Prikladnicki

Paul Ralph

Martin P. Robillard

Thiago Rocha Silva

Klaas-Jan Stol

Margaret-Anne Storey

Viktoria Stray

Paolo Tell

Christoph Treude

Bogdan Vasilescu

2024-05-01

Journal of Systems and Software (publié)

Harmony in Diversity: Merging Neural Networks with Canonical Correlation Analysis

Stefan Horoi

Albert Manuel Orozco Camacho

Eugene Belilovsky

Guy Wolf

Ensembling multiple models enhances predictive performance by utilizing the varied learned features of the different models but incurs signi… (voir plus)ficant computational and storage costs. Model fusion, which combines parameters from multiple models into one, aims to mitigate these costs but faces practical challenges due to the complex, non-convex nature of neural network loss landscapes, where learned minima are often separated by high loss barriers. Recent works have explored using permutations to align network features, reducing the loss barrier in parameter space. However, permutations are restrictive since they assume a one-to-one mapping between the different models' neurons exists. We propose a new model merging algorithm, CCA Merge, which is based on Canonical Correlation Analysis and aims to maximize the correlations between linear combinations of the model features. We show that our method of aligning models leads to better performances than past methods when averaging models trained on the same, or differing data splits. We also extend this analysis into the harder many models setting where more than 2 models are merged, and we find that CCA Merge works significantly better in this setting than past methods.

2024-05-01

ICML.cc/2024/Conference (poster)

Gintare Karolina Dziugaite

Information Complexity of Stochastic Convex Optimization: Applications to Generalization and Memorization

Idan Attias

MAHDI HAGHIFAM

Roi Livni

Daniel M. Roy

In this work, we investigate the interplay between memorization and learning in the context of \emph{stochastic convex optimization} (SCO). … (voir plus)We define memorization via the information a learning algorithm reveals about its training data points. We then quantify this information using the framework of conditional mutual information (CMI) proposed by Steinke and Zakynthinou (2020). Our main result is a precise characterization of the tradeoff between the accuracy of a learning algorithm and its CMI, answering an open question posed by Livni (2023). We show that, in the

2024-05-01

ICML.cc/2024/Conference (présentation orale)

Iterated Denoising Energy Matching for Sampling from Boltzmann Densities

Cheng-Hao Liu

Efficiently generating statistically independent samples from an unnormalized probability distribution, such as equilibrium samples of many-… (voir plus)body systems, is a foundational problem in science. In this paper, we propose Iterated Denoising Energy Matching (iDEM), an iterative algorithm that uses a novel stochastic score matching objective leveraging solely the energy function and its gradient---and no data samples---to train a diffusion-based sampler. Specifically, iDEM alternates between (I) sampling regions of high model density from a diffusion-based sampler and (II) using these samples in our stochastic matching objective to further improve the sampler. iDEM is scalable to high dimensions as the inner matching objective, is *simulation-free*, and requires no MCMC samples. Moreover, by leveraging the fast mode mixing behavior of diffusion, iDEM smooths out the energy landscape enabling efficient exploration and learning of an amortized sampler. We evaluate iDEM on a suite of tasks ranging from standard synthetic energy functions to invariant

2024-05-01

ICML.cc/2024/Conference (poster)

Language-guided Skill Learning with Temporal Variational Inference

Haotian Fu

Pratyusha Sharma

Elias Stengel-Eskin

George Konidaris

Nicolas Le Roux

Marc-Alexandre Côté

Xingdi Yuan

2024-05-01

ICML.cc/2024/Conference (poster)

Learning to Scale Logits for Temperature-Conditional GFlowNets

Joohwan Ko

Woo Chang Kim

Jinkyoo Park

Emmanuel Bengio

Yoshua Bengio

GFlowNets are probabilistic models that sequentially generate compositional structures through a stochastic policy. Among GFlowNets, tempera… (voir plus)ture-conditional GFlowNets can introduce temperature-based controllability for exploration and exploitation. We propose \textit{Logit-scaling GFlowNets} (Logit-GFN), a novel architectural design that greatly accelerates the training of temperature-conditional GFlowNets. It is based on the idea that previously proposed approaches introduced numerical challenges in the deep network training, since different temperatures may give rise to very different gradient profiles as well as magnitudes of the policy's logits. We find that the challenge is greatly reduced if a learned function of the temperature is used to scale the policy's logits directly. Also, using Logit-GFN, GFlowNets can be improved by having better generalization capabilities in offline learning and mode discovery capabilities in online learning, which is empirically verified in various biological and chemical tasks. Our code is available at https://github.com/dbsxodud-11/logit-gfn

2024-05-01

ICML.cc/2024/Conference (poster)