Publications

AmbieGen at the SBFT 2024 Tool Competition - CPS-UAV Track

Dmytro Humeniuk

Foutse Khomh

2024-01-01

SBFT@ICSE (published)

doi.org

AmbieGenVAE at the SBFT 2024 Tool Competition - Cyber-Physical Systems Track

Dmytro Humeniuk

Foutse Khomh

2024-01-01

SBFT@ICSE (published)

doi.org

An Analysis of Quantile Temporal-Difference Learning

Mark Rowland

Remi Munos

Mohammad Gheshlaghi Azar

Yunhao Tang

Georg Ostrovski

Anna Harutyunyan

K. Tuyls

Marc Gendron-Bellemare

Will Dabney

We analyse quantile temporal-difference learning (QTD), a distributional reinforcement learning algorithm that has proven to be a key compon… (see more)ent in several successful large-scale applications of reinforcement learning. Despite these empirical successes, a theoretical understanding of QTD has proven elusive until now. Unlike classical TD learning, which can be analysed with standard stochastic approximation tools, QTD updates do not approximate contraction mappings, are highly non-linear, and may have multiple fixed points. The core result of this paper is a proof of convergence to the fixed points of a related family of dynamic programming procedures with probability 1, putting QTD on firm theoretical footing. The proof establishes connections between QTD and non-linear differential inclusions through stochastic approximation theory and non-smooth analysis.

arxiv.org

An Analytic Hierarchy Process based approach for assessing the performance of photovoltaic solar power plants

Meryam Chafiq

Loubna Benabbou

Hanane Dagdougui

Ismail Belhaj

Abdelali Djdiaa

Hicham Bouzekri

Abdelaziz Berrado

2024-01-01

IFAC-PapersOnLine (published)

doi.org

Application-Driven Innovation in Machine Learning

David Rolnick

Alan Aspuru-Guzik

Sara Beery

Bistra Dilkina

Priya L. Donti

Marzyeh Ghassemi

Hannah Kerner

Claire Monteleoni

Esther Rolf

Milind Tambe

Adam White

As applications of machine learning proliferate, innovative algorithms inspired by specific real-world challenges have become increasingly i… (see more)mportant. Such work offers the potential for significant impact not merely in domains of application but also in machine learning itself. In this paper, we describe the paradigm of application-driven research in machine learning, contrasting it with the more standard paradigm of methods-driven research. We illustrate the benefits of application-driven machine learning and how this approach can productively synergize with methods-driven work. Despite these benefits, we find that reviewing, hiring, and teaching practices in machine learning often hold back application-driven innovation. We outline how these processes may be improved.

2024-01-01

ICML (published)

doi.org

arxiv.org

Are LLMs Breaking MT Metrics? Results of the WMT24 Metrics Shared Task

Markus Freitag

Nitika Mathur

Daniel Deutsch

Chi-kiu Lo

Eleftherios Avramidis

Ricardo Rei

Brian Thompson

Frédéric Blain

Tom Kocmi

Jiayi Wang

David Ifeoluwa Adelani

Marianna Buchicchio

Chrysoula Zerva

2024-01-01

Conference on Machine Translation (published)

doi.org

Are LLMs Breaking MT Metrics? Results of the WMT24 Metrics Shared Task

Markus Freitag

Nitika Mathur

Daniel Deutsch

Chi-kiu Lo

Eleftherios Avramidis

Ricardo Rei

Brian Thompson

Frédéric Blain

Tom Kocmi

Jiayi Wang

David Ifeoluwa Adelani

Marianna Buchicchio

Chrysoula Zerva

2024-01-01

Conference on Machine Translation (published)

doi.org

AsmDocGen: Generating Functional Natural Language Descriptions for Assembly Code

Jesia Yuki

Mohammadhossein Amouei

Benjamin Fung

Philippe Charland

Andrew Walenstein

2024-01-01

International Conference on Software and Data Technologies (published)

doi.org

Assessing Neural Network Representations During Training Using Noise-Resilient Diffusion Spectral Entropy

Danqi Liao

Chen Liu

Benjamin W Christensen

Alexander Tong

Guillaume Huguet

Guy Wolf

Maximilian Nickel

Ian Adelstein

Smita Krishnaswamy

Entropy and mutual information in neural networks provide rich information on the learning process, but they have proven difficult to comput… (see more)e reliably in high dimensions. Indeed, in noisy and high-dimensional data, traditional estimates in ambient dimensions approach a fixed entropy and are prohibitively hard to compute. To address these issues, we leverage data geometry to access the underlying manifold and reliably compute these information-theoretic measures. Specifically, we define diffusion spectral entropy (DSE) in neural representations of a dataset as well as diffusion spectral mutual information (DSMI) between different variables representing data. First, we show that they form noise-resistant measures of intrinsic dimensionality and relationship strength in high-dimensional simulated data that outperform classic Shannon entropy, nonparametric estimation, and mutual information neural estimation (MINE). We then study the evolution of representations in classification networks with supervised learning, self-supervision, or overfitting. We observe that (1) DSE of neural representations increases during training; (2) DSMI with the class label increases during generalizable learning but stays stagnant during overfitting; (3) DSMI with the input signal shows differing trends: on MNIST it increases, while on CIFAR-10 and STL-10 it decreases. Finally, we show that DSE can be used to guide better network initialization and that DSMI can be used to predict downstream classification accuracy across 962 models on ImageNet.

2024-01-01

CISS (published)

doi.org

openreview.net

Asymmetry in the complexity of the multi-commodity network pricing problem

Quang Minh Bui

Margarida Carvalho

José Neto

2024-01-01

Math. Program. (published)

doi.org

arxiv.org