Giovanni Beltrame

Attention-Based Multi-Agent RL for Multi-Machine Tending Using Mobile Robots

Abdalwhab Abdalwhab

David St-Onge

Robotics can help address the growing worker shortage challenge of the manufacturing industry. As such, machine tending is a task collaborat… (voir plus)ive robots can tackle that can also greatly boost productivity. Nevertheless, existing robotics systems deployed in that sector rely on a fixed single-arm setup, whereas mobile robots can provide more flexibility and scalability. We introduce a multi-agent multi-machine-tending learning framework using mobile robots based on multi-agent reinforcement learning (MARL) techniques, with the design of a suitable observation and reward. Moreover, we integrate an attention-based encoding mechanism into the Multi-Agent Proximal Policy Optimization (MAPPO) algorithm to boost its performance for machine-tending scenarios. Our model (AB-MAPPO) outperforms MAPPO in this new challenging scenario in terms of task success, safety, and resource utilization. Furthermore, we provided an extensive ablation study to support our design decisions.

2025-10-01

Applied Informatics (publié)

doi.org

Attention-Based Multi-Agent RL for Multi-Machine Tending Using Mobile Robots

Abdalwhab Bakheet Mohamed Abdalwhab

Giovanni Beltrame

Samira Ebrahimi Kahou

David St-Onge

2025-10-01

AI (publié)

doi.org

Attention-Based Multi-Agent RL for Multi-Machine Tending Using Mobile Robots

Abdalwhab Abdalwhab

Giovanni Beltrame

Samira Ebrahimi Kahou

David St-Onge

Robotics can help address the growing worker shortage challenge of the manufacturing industry. As such, machine tending is a task collaborat… (voir plus)ive robots can tackle that can also greatly boost productivity. Nevertheless, existing robotics systems deployed in that sector rely on a fixed single-arm setup, whereas mobile robots can provide more flexibility and scalability. We introduce a multi-agent multi-machine-tending learning framework using mobile robots based on multi-agent reinforcement learning (MARL) techniques, with the design of a suitable observation and reward. Moreover, we integrate an attention-based encoding mechanism into the Multi-Agent Proximal Policy Optimization (MAPPO) algorithm to boost its performance for machine-tending scenarios. Our model (AB-MAPPO) outperforms MAPPO in this new challenging scenario in terms of task success, safety, and resource utilization. Furthermore, we provided an extensive ablation study to support our design decisions.

2025-10-01

AI (publié)

doi.org

A Blockchain Framework for Equitable and Secure Task Allocation in Robot Swarms

Alexandre Pacheco

Marco Dorigo

Recent studies demonstrate the potential of blockchain to enable robots in a swarm to achieve secure consensus about the environment, partic… (voir plus)ularly when robots are homogeneous and perform identical tasks. Typically, robots receive rewards for their contributions to consensus achievement, but no studies have yet targeted heterogeneous swarms, in which the robots have distinct physical capabilities suited to different tasks. We present a novel framework that leverages domain knowledge to decompose the swarm mission into a hierarchy of tasks within smart contracts. This allows the robots to reach a consensus about both the environment and the action plan, allocating tasks among robots with diverse capabilities to improve their performance while maintaining security against faults and malicious behaviors. We refer to this concept as equitable and secure task allocation. Validated in Simultaneous Localization and Mapping missions, our approach not only achieves equitable task allocation among robots with varying capabilities, improving mapping accuracy and efficiency, but also shows resilience against malicious attacks.

2025-10-01

IEEE Robotics and Automation Letters (publié)

doi.org

GNN-based Decentralized Perception in Multirobot Systems for Predicting Worker Actions

Ali Imran

Giovanni Beltrame

David St-Onge

In industrial environments, predicting human actions is essential for ensuring safe and effective collaboration between humans and robots. T… (voir plus)his paper introduces a perception framework that enables mobile robots to understand and share information about human actions in a decentralized way. The framework first allows each robot to build a spatial graph representing its surroundings, which it then shares with other robots. This shared spatial data is combined with temporal information to track human behavior over time. A swarm-inspired decision-making process is used to ensure all robots agree on a unified interpretation of the human's actions. Results show that adding more robots and incorporating longer time sequences improve prediction accuracy. Additionally, the consensus mechanism increases system resilience, making the multi-robot setup more reliable in dynamic industrial settings.

2025-06-01

IEEE Robotics and Automation Letters (publié)

doi.org

arxiv.org

Learning Multi-agent Multi-machine Tending by Mobile Robots

Abdalwhab Abdalwhab

Giovanni Beltrame

Samira Ebrahimi Kahou

David St-Onge

Robotics can help address the growing worker shortage challenge of the manufacturing industry. As such, machine tending is a task collaborat… (voir plus)ive robots can tackle that can also highly boost productivity. Nevertheless, existing robotics systems deployed in that sector rely on a fixed single-arm setup, whereas mobile robots can provide more flexibility and scalability. In this work, we introduce a multi-agent multi-machine tending learning framework by mobile robots based on Multi-agent Reinforcement Learning (MARL) techniques with the design of a suitable observation and reward. Moreover, an attention-based encoding mechanism is developed and integrated into Multi-agent Proximal Policy Optimization (MAPPO) algorithm to boost its performance for machine tending scenarios. Our model (AB-MAPPO) outperformed MAPPO in this new challenging scenario in terms of task success, safety, and resources utilization. Furthermore, we provided an extensive ablation study to support our various design decisions.

2025-02-25

AAAI.org/2025/Workshop/MARW (publié)

doi.org

openreview.net

GNN-based Decentralized Perception in Multirobot Systems for Predicting Worker Actions

Ali Imran

Giovanni Beltrame

David St-Onge

In industrial environments, predicting human actions is essential for ensuring safe and effective collaboration between humans and robots. T… (voir plus)his paper introduces a perception framework that enables mobile robots to understand and share information about human actions in a decentralized way. The framework first allows each robot to build a spatial graph representing its surroundings, which it then shares with other robots. This shared spatial data is combined with temporal information to track human behavior over time. A swarm-inspired decision-making process is used to ensure all robots agree on a unified interpretation of the human's actions. Results show that adding more robots and incorporating longer time sequences improve prediction accuracy. Additionally, the consensus mechanism increases system resilience, making the multi-robot setup more reliable in dynamic industrial settings.

2025-01-08

ArXiv (prépublication)

arxiv.org

3D Foundation Model-Based Loop Closing for Decentralized Collaborative SLAM

Pierre-Yves Lajoie

Benjamin Ramtoula

Daniele De Martini

Giovanni Beltrame

Decentralized Collaborative Simultaneous Localization and Mapping (C-SLAM) techniques often struggle to identify map overlaps due to signifi… (voir plus)cant viewpoint variations among robots. Motivated by recent advancements in 3D foundation models, which can register images despite large viewpoint differences, we propose a robust loop closing approach that leverages these models to establish inter-robot measurements. In contrast to resource-intensive methods requiring full 3D reconstruction within a centralized map, our approach integrates foundation models into existing SLAM pipelines, yielding scalable and robust multi-robot mapping. Our contributions include: 1) integrating 3D foundation models to reliably estimate relative poses from monocular image pairs within decentralized C-SLAM; 2) introducing robust outlier mitigation techniques critical to the use of these relative poses and 3) developing specialized pose graph optimization formulations that efficiently resolve scale ambiguities. We evaluate our method against state-of-the-art approaches, demonstrating improvements in localization and mapping accuracy, alongside significant gains in computational and memory efficiency. These results highlight the potential of our approach for deployment in large-scale multi-robot scenarios.

2025-01-01

IEEE Robotics Autom. Lett. (publié)

doi.org

Multi-Robot Decentralized Collaborative SLAM in Planetary Analogue Environments: Dataset, Challenges, and Lessons Learned

Pierre-Yves Lajoie

Karthik Soma

Haechan Mark Bong

Alice Lemieux-Bourque

Rongge Zhang

Vivek Shankar Vardharajan

Giovanni Beltrame

2025-01-01

IEEE Transactions on Field Robotics (publié)

doi.org

A Multi-Robot Exploration Planner for Space Applications

Vivek Shankar Vardharajan

Giovanni Beltrame

2025-01-01

IEEE Robotics and Automation Letters (publié)

doi.org

A Multi-Robot Exploration Planner for Space Applications

Vivek Shankar Vardharajan

Giovanni Beltrame

We propose a distributed multi-robot exploration planning method designed for complex, unconstrained environments featuring steep elevation … (voir plus)changes. The method employs a two-tiered approach: a local exploration planner that constructs a grid graph to maximize exploration gain and a global planner that maintains a sparse navigational graph to track visited locations and frontier information. The global graphs are periodically synchronized among robots within communication range to maintain an updated representation of the environment. Our approach integrates localization loop closure estimates to correct global graph drift. In simulation and field tests, the proposed method achieves 50% lower computational runtime compared to state-of-the-art methods while demonstrating superior exploration coverage. We evaluate its performance in two simulated subterranean environments and in field experiments at a Mars-analog terrain.

2025-01-01

IEEE Robotics and Automation Letters (publié)

doi.org

Neural Incremental Dynamic Inversion Control of a Multirotor Robotic Airship

Ely Carneiro de Paiva

José Raul Azinheira

Rafael de Angelis Cordeiro

José Reginaldo H. Carvalho

Apolo Marton

Giovanni Beltrame

2025-01-01

International Journal of Intelligent Systems (publié)

doi.org

Conférence sur les politiques de l'IA de Mila

À l’avant-garde d’une nouvelle ère

TRAIL : IA responsable pour les professionnels et les leaders

Giovanni Beltrame

Biographie

Étudiants actuels

Publications

Conférence sur les politiques de l'IA de Mila

À l’avant-garde d’une nouvelle ère

TRAIL : IA responsable pour les professionnels et les leaders

Mots-clés populaires:

Giovanni Beltrame

Biographie

Étudiants actuels

Publications