Homayoun Honari

Collaborateur·rice de recherche - UdeM

Superviseur⋅e principal⋅e

Glen Berseth

Sujets de recherche

Apprentissage de représentations

Apprentissage par renforcement

Causalité

Cognition

Conscience

Généralisation

IA inspirée du cerveau

IAG (Intelligence Artificielle Générale)

Méthodes inspirées de la causalité

Raisonnement

Robotique

Théorie de l'apprentissage automatique

Google Scholar

GitHub

Publications

Training PPO-Clip with Parallelized Data Generation: A Case of Fixed-Point Convergence

Homayoun Honari

Roger Creus Castanyer

Pablo Samuel Castro

Glen Berseth

In recent years, with the increase in the compute power of GPUs, parallelized data collection has become the dominant approach for training … (voir plus)reinforcement learning (RL) agents. Proximal Policy Optimization (PPO) is one of the widely-used on-policy methods for training RL agents. In this paper, we focus on the training behavior of PPO-Clip with the increase in the number of parallel environments. In particular, we show that as we increase the amount of data used to train PPO-Clip, the optimized policy would converge to a fixed distribution. We use the results to study the behavior of PPO-Clip in two case studies: the effect of change in the minibatch size and the effect of increase in the number of parallel environments versus the increase in the rollout lengths. The experiments show that settings with high-return PPO runs result in slower convergence to the fixed-distribution and higher consecutive KL divergence changes. Our results aim to offer a better understanding for the prediction of the performance of PPO with the scaling of the parallel environments.

2025-06-22

rl-conference.cc/RLC/2025/Workshop/IBRL (publié)

openreview.net

Conférence sur les politiques de l'IA de Mila

À l’avant-garde d’une nouvelle ère

TRAIL : IA responsable pour les professionnels et les leaders

Homayoun Honari

Publications

Conférence sur les politiques de l'IA de Mila

À l’avant-garde d’une nouvelle ère

TRAIL : IA responsable pour les professionnels et les leaders

Mots-clés populaires:

Homayoun Honari

Publications