Learn how to leverage generative AI to support and improve your productivity at work. The next cohort will take place online on April 28 and 30, 2026, in French.
We use cookies to analyze the browsing and usage of our website and to personalize your experience. You can disable these technologies at any time, but this may limit certain functionalities of the site. Read our Privacy Policy for more information.
Setting cookies
You can enable and disable the types of cookies you wish to accept. However certain choices you make could affect the services offered on our sites (e.g. suggestions, personalised ads, etc.).
Essential cookies
These cookies are necessary for the operation of the site and cannot be deactivated. (Still active)
Analytics cookies
Do you accept the use of cookies to measure the audience of our sites?
Multimedia Player
Do you accept the use of cookies to display and allow you to watch the video content hosted by our partners (YouTube, etc.)?
Publications
Adaptation Odyssey in LLMs: Why Does Additional Pretraining Sometimes Fail to Improve?
An Analysis of Quantile Temporal-Difference Learning
Mark Rowland
Remi Munos
Mohammad Gheshlaghi Azar
Yunhao Tang
Georg Ostrovski
Anna Harutyunyan
K. Tuyls
Bellemare Marc-Emmanuel
Will Dabney
We analyse quantile temporal-difference learning (QTD), a distributional reinforcement learning algorithm that has proven to be a key compon… (see more)ent in several successful large-scale applications of reinforcement learning. Despite these empirical successes, a theoretical understanding of QTD has proven elusive until now. Unlike classical TD learning, which can be analysed with standard stochastic approximation tools, QTD updates do not approximate contraction mappings, are highly non-linear, and may have multiple fixed points. The core result of this paper is a proof of convergence to the fixed points of a related family of dynamic programming procedures with probability 1, putting QTD on firm theoretical footing. The proof establishes connections between QTD and non-linear differential inclusions through stochastic approximation theory and non-smooth analysis.
As applications of machine learning proliferate, innovative algorithms inspired by specific real-world challenges have become increasingly i… (see more)mportant. Such work offers the potential for significant impact not merely in domains of application but also in machine learning itself. In this paper, we describe the paradigm of application-driven research in machine learning, contrasting it with the more standard paradigm of methods-driven research. We illustrate the benefits of application-driven machine learning and how this approach can productively synergize with methods-driven work. Despite these benefits, we find that reviewing, hiring, and teaching practices in machine learning often hold back application-driven innovation. We outline how these processes may be improved.
Over the past decade, hyperscanning has emerged as an important methodology to study neural processes underlying human interaction using fMR… (see more)I, EEG, fNIRS, and MEG. However, many methodological decisions regarding preprocessing and analysis of hyperscanning data have not yet been standardized in the hyperscanning community, yet may affect inter-brain estimates. Here we systematically investigate the effects common methodological choices can have on estimates of phase-based inter-brain synchronization (IBS) measures, using real and simulated hyperscanning (dual) EEG data. Notably, we introduce a new method to compute circular correlation (CCorr) coefficients in IBS studies, which performs more reliably in comparison to the standard approach, showing that the conventional CCorr implementation leads to large fluctuations in IBS estimates due to fluctuations in circular mean directions. Furthermore, we demonstrate how short epoch durations (of 1 second or less) can lead to inflated IBS estimates in scenarios with no strong underlying interaction. Finally, we show how signal-to-noise ratios and temporal factors may confound IBS estimates, particularly when comparing e.g., resting states with conditions involving motor actions. For each of these investigated effects, we provide recommendations for future research employing hyperscanning-EEG techniques, aimed at increasing validity and replicability of inter-brain synchronization studies.
Over the past two decades, scientists have increasingly realized the importance of the three-dimensional (3D) genome organization in regulat… (see more)ing cellular activity. Hi-C and related experiments yield 2D contact matrices that can be used to infer 3D models of chromosome structure. Visualizing and analyzing genomes in 3D space remains challenging. Here, we present ARGV, an augmented reality 3D Genome Viewer. ARGV contains more than 350 pre-computed and annotated genome structures inferred from Hi-C and imaging data. It offers interactive and collaborative visualization of genomes in 3D space, using standard mobile phones or tablets. A user study comparing ARGV to existing tools demonstrates its benefits.