Giovanni Beltrame

Affiliate Member
Full Professor, Polytechnique Montréal, Department of Computer Engineering and Software Engineering
Research Topics
Online learning
Reinforcement learning
Swarm intelligence
Human-robot interaction
Autonomous robot navigation
Robotics
Distributed systems
Computer vision

Biography

Giovanni Beltrame received his Ph.D. in computer engineering from the Politecnico di Milano in 2006, after which he worked as a microelectronics engineer at the European Space Agency (ESA) on a number of projects, ranging from radiation-tolerant systems to computer-aided design. In 2010 he moved to Montréal, where he is currently a professor in the Department of Computer Engineering and Software Engineering at Polytechnique Montréal. He leads the MIST Lab, dedicated to space technologies, where he supervises more than 25 students and postdoctoral researchers. He has carried out several projects in collaboration with industry and government agencies in the areas of robotics, disaster response, and space exploration. With his team, he has taken part in several field missions with ESA, the Canadian Space Agency (CSA), and NASA (BRAILLE, PANGAEA-X, and IGLUNA, among others). His research focuses on the modelling and design of embedded systems, artificial intelligence, and robotics, topics on which he has published numerous papers in leading journals and conferences.

Current Students

Ph.D. - Polytechnique
Research Collaborator - Polytechnique Montréal
Ph.D. - Polytechnique
Ph.D. - Polytechnique

Publications

BlabberSeg: Real-Time Embedded Open-Vocabulary Aerial Segmentation
Haechan Mark Bong
Ricardo de Azambuja
Real-time aerial image segmentation plays an important role in the environmental perception of Uncrewed Aerial Vehicles (UAVs). We introduce BlabberSeg, an optimized Vision-Language Model built on CLIPSeg for on-board, real-time processing of aerial images by UAVs. BlabberSeg improves the efficiency of CLIPSeg by reusing prompt and model features, reducing computational overhead while achieving real-time open-vocabulary aerial segmentation. We validated BlabberSeg in a safe-landing scenario using the Dynamic Open-Vocabulary Enhanced SafE-Landing with Intelligence (DOVESEI) framework, which uses visual servoing and open-vocabulary segmentation. BlabberSeg reduces computational costs significantly, with a speed increase of 927.41% (16.78 Hz) on an NVIDIA Jetson Orin AGX (64 GB) compared with the original CLIPSeg (1.81 Hz), achieving real-time aerial segmentation with negligible loss in accuracy (2.1%, as the ratio of the correctly segmented area with respect to CLIPSeg). BlabberSeg's source code is open and available online.
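The feature-reuse idea behind this speedup can be illustrated with a minimal sketch: open-vocabulary prompts stay fixed across video frames, so their embeddings can be computed once and cached rather than re-encoded every frame. The `CachedPromptEncoder` and `slow_encode` names below are illustrative stand-ins, not the actual CLIPSeg API:

```python
import hashlib

class CachedPromptEncoder:
    """Caches text-prompt embeddings so repeated frames reuse them
    instead of re-running the text encoder on every frame."""

    def __init__(self, encode_fn):
        self.encode_fn = encode_fn   # expensive text encoder (stand-in)
        self.cache = {}
        self.hits = 0
        self.misses = 0

    def __call__(self, prompt: str):
        key = hashlib.sha1(prompt.encode()).hexdigest()
        if key in self.cache:
            self.hits += 1
        else:
            self.misses += 1
            self.cache[key] = self.encode_fn(prompt)
        return self.cache[key]

# Stand-in for an expensive encoder (e.g. a CLIP-style text tower).
def slow_encode(prompt):
    return [float(ord(c)) for c in prompt]

encoder = CachedPromptEncoder(slow_encode)
for _ in range(100):                 # 100 video frames, same prompts
    for p in ("grass", "water", "pavement"):
        encoder(p)

print(encoder.misses, encoder.hits)  # → 3 297
```

Each unique prompt is encoded exactly once; all later frames hit the cache, which is where the bulk of the per-frame savings would come from in a setup like this.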
Physical Simulation for Multi-agent Multi-machine Tending
Abdalwhab Abdalwhab
David St-Onge
Multi-Objective Risk Assessment Framework for Exploration Planning Using Terrain and Traversability Analysis
Riana Gagnon Souleiman
Vivek Shankar Vardharajan
Frequency-based View Selection in Gaussian Splatting Reconstruction
Monica Li
Pierre-Yves Lajoie
Three-dimensional reconstruction is a fundamental problem in robotics perception. We examine the problem of active view selection to perform 3D Gaussian Splatting reconstructions with as few input images as possible. Although 3D Gaussian Splatting has made significant progress in image rendering and 3D reconstruction, the quality of the reconstruction is strongly impacted by the selection of 2D images and the estimation of camera poses through Structure-from-Motion (SfM) algorithms. Current view-selection methods that rely directly on uncertainties from occlusions, depth ambiguities, or neural network predictions are insufficient to handle the issue and struggle to generalize to new scenes. By ranking the potential views in the frequency domain, we are able to effectively estimate the potential information gain of new viewpoints without ground-truth data. By overcoming current constraints on model architecture and efficacy, our method achieves state-of-the-art results in view selection, demonstrating its potential for efficient image-based 3D reconstruction.
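The frequency-domain ranking idea can be caricatured in one dimension (this is an illustration of the general principle, not the paper's actual pipeline): score each hypothetical candidate view by the energy in the upper frequency bins of its rendered signal, so that views containing more fine detail rank higher:

```python
import cmath
import math

def dft(signal):
    """Naive discrete Fourier transform (fine for tiny toy signals)."""
    n = len(signal)
    return [sum(signal[t] * cmath.exp(-2j * cmath.pi * k * t / n)
                for t in range(n)) for k in range(n)]

def high_freq_energy(signal, cutoff):
    """Energy in frequency bins above `cutoff` — a crude proxy for the
    detail a new viewpoint would contribute to the reconstruction."""
    spectrum = dft(signal)
    half = len(spectrum) // 2 + 1
    return sum(abs(spectrum[k]) ** 2 for k in range(cutoff, half))

# Hypothetical 1-D "renders" of three candidate views:
flat   = [1.0] * 16                                     # textureless wall
smooth = [math.cos(2 * math.pi * t / 16) for t in range(16)]
detail = [1.0 if t % 2 else -1.0 for t in range(16)]    # fine texture

views = {"flat": flat, "smooth": smooth, "detailed": detail}
ranked = sorted(views, key=lambda v: high_freq_energy(views[v], cutoff=4),
                reverse=True)
print(ranked[0])  # → detailed
```

The flat and low-frequency views carry essentially no energy above the cutoff, so the finely textured view is selected first; a real system would of course operate on 2-D image spectra rather than toy 1-D signals.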
Swarming Out of the Lab: Comparing Relative Localization Methods for Collective Behavior
Rafael Gomes Braga
Vivek Shankar Vardharajan
David St-Onge
Learning Multi-agent Multi-machine Tending by Mobile Robots
Abdalwhab Abdalwhab
David St-Onge
Robotics can help address the growing worker-shortage challenge in the manufacturing industry. Machine tending is one such task that collaborative robots can tackle, and one that can greatly boost productivity. Nevertheless, existing robotic systems deployed in this sector rely on a fixed single-arm setup, whereas mobile robots can provide more flexibility and scalability. In this work, we introduce a multi-agent multi-machine tending learning framework for mobile robots based on Multi-agent Reinforcement Learning (MARL) techniques, with a suitable observation and reward design. Moreover, an attention-based encoding mechanism is developed and integrated into the Multi-agent Proximal Policy Optimization (MAPPO) algorithm to boost its performance for machine-tending scenarios. Our model (AB-MAPPO) outperformed MAPPO in this new challenging scenario in terms of task success, safety, and resource utilization. Furthermore, we provide an extensive ablation study to support our various design decisions.
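Attention-based encoding in this kind of multi-agent setting can be sketched abstractly as scaled dot-product attention: each robot weights teammate/machine features by their relevance to its own observation. The code below is a generic illustration with made-up two-dimensional features, not the AB-MAPPO architecture itself:

```python
import math

def softmax(xs):
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    s = sum(exps)
    return [e / s for e in exps]

def attention_encode(query, keys, values):
    """Scaled dot-product attention: blend the value vectors, weighted
    by how well each key matches the query (this robot's observation)."""
    d = len(query)
    scores = [sum(q * k for q, k in zip(query, key)) / math.sqrt(d)
              for key in keys]
    weights = softmax(scores)
    return [sum(w * v[i] for w, v in zip(weights, values))
            for i in range(len(values[0]))]

# Hypothetical features: this robot attends over three machines' states
# (keys and values are the same vectors here, for simplicity).
own_obs  = [1.0, 0.0]
machines = [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]]
encoded = attention_encode(own_obs, machines, machines)
print(encoded)  # a blend biased toward the first, most similar machine
```

Because the first machine's feature vector aligns with the query, it receives the largest attention weight, so the encoded summary leans toward it while still mixing in the others.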
Active Semantic Mapping and Pose Graph Spectral Analysis for Robot Exploration
Rongge Zhang
Haechan Mark Bong
LiDAR-based Real-Time Object Detection and Tracking in Dynamic Environments
Wenqiang Du
In dynamic environments, the ability to detect and track moving objects in real time is crucial for autonomous robots to navigate safely and effectively. Traditional methods for dynamic object detection rely on high-accuracy odometry and maps to detect and track moving objects. However, these methods are not suitable for long-term operation in dynamic environments where the surroundings are constantly changing. To solve this problem, we propose a novel system for detecting and tracking dynamic objects in real time using only LiDAR data. By emphasizing the extraction of low-frequency components from LiDAR data as feature points for foreground objects, our method significantly reduces the time required for object clustering and movement analysis. Additionally, we have developed a tracking approach that employs intensity-based ego-motion estimation along with a sliding-window technique to assess object movements. This enables the precise identification of moving objects and enhances the system's resilience to odometry drift. Our experiments show that this system can detect and track dynamic objects in real time with an average detection accuracy of 88.7% and a recall rate of 89.1%. Furthermore, our system demonstrates resilience against the prolonged drift typically associated with front-end-only LiDAR odometry. All of the source code, labeled dataset, and the annotation tool are available at: https://github.com/MISTLab/lidar_dynamic_objects_detection.git
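The low-frequency-component idea can be caricatured in one dimension: low-pass filter a ring of range readings and flag the points that deviate strongly from the smooth background estimate as foreground candidates. This is an illustration of the principle only; the actual system operates on full 3-D LiDAR scans:

```python
def moving_average(values, window=5):
    """Simple low-pass filter over a circular ring of range readings."""
    n = len(values)
    half = window // 2
    return [sum(values[(i + d) % n] for d in range(-half, half + 1)) / window
            for i in range(n)]

def foreground_points(ranges, threshold=0.5):
    """Flag points whose range deviates strongly from the low-frequency
    background estimate (potential foreground/moving objects)."""
    baseline = moving_average(ranges)
    return [i for i, (r, b) in enumerate(zip(ranges, baseline))
            if abs(r - b) > threshold]

# Hypothetical 360-degree scan: a flat wall at 10 m with one close object.
scan = [10.0] * 36
scan[18] = 4.0           # e.g. a pedestrian 4 m away
print(foreground_points(scan))  # → [16, 17, 18, 19, 20]
```

The object itself and its filter-window neighbours are flagged (the smoothing smears the deviation over the window), while the static wall is suppressed entirely; clustering then only has to process this small candidate set.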
Variable Time Step Reinforcement Learning for Robotic Applications
Dong Wang
Traditional reinforcement learning (RL) generates discrete control policies, assigning one action per cycle. These policies are usually implemented in a fixed-frequency control loop. This rigidity presents challenges, as the optimal control frequency is task-dependent; suboptimal frequencies increase computational demands and reduce exploration efficiency. Variable Time Step Reinforcement Learning (VTS-RL) addresses these issues with adaptive control frequencies, executing actions only when necessary, thus reducing computational load and extending the action space to include action durations. In this paper, we introduce the Multi-Objective Soft Elastic Actor-Critic (MOSEAC) method to perform VTS-RL, validating it through theoretical analysis and experimentation in simulation and on real robots. Results show faster convergence, better training results, and reduced energy consumption with respect to other variable- or fixed-frequency approaches.
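The core VTS-RL idea — an action is a control input plus how long to hold it — can be sketched with a toy 1-D tracking task, where a variable-step policy pays a per-decision cost far less often than a fixed-frequency one. The dynamics and policies below are hypothetical, not the MOSEAC algorithm:

```python
def rollout(policy, horizon=100, decision_cost=0.01):
    """Toy 1-D tracking task: each decision picks a control AND a hold
    duration, so the agent only 'thinks' when it needs to."""
    x, target, t = 0.0, 1.0, 0
    reward, decisions = 0.0, 0
    while t < horizon:
        u, hold = policy(x, target)        # action = (control, duration)
        decisions += 1
        reward -= decision_cost            # pay only when we re-decide
        for _ in range(min(hold, horizon - t)):
            x += 0.1 * u
            reward -= abs(target - x)      # tracking error every tick
            t += 1
    return reward, decisions

# Fixed-frequency baseline: re-decide on every tick (hold = 1).
fixed = lambda x, tgt: (1.0 if x < tgt else -1.0, 1)
# Variable-step policy: once near the target, hold a zero control for 10 ticks.
adaptive = lambda x, tgt: (1.0, 1) if tgt - x > 0.05 else (0.0, 10)

r_fixed, n_fixed = rollout(fixed)
r_adapt, n_adapt = rollout(adaptive)
print(n_fixed, n_adapt)  # → 100 19
```

The adaptive policy makes roughly a fifth as many decisions over the same horizon while tracking at least as well, which is the computational-load argument for variable time steps in miniature.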
MOSEAC: Streamlined Variable Time Step Reinforcement Learning
Dong Wang
Hierarchies define the scalability of robot swarms
Vivek Shankar Vardharajan
Karthik Soma
Sepand Dyanatkar
Pierre-Yves Lajoie
The emerging behaviors of swarms have fascinated scientists and garnered significant interest in the field of robotics. Traditionally, swarms are viewed as egalitarian, with robots sharing identical roles and capabilities. However, recent findings highlight the importance of hierarchy for deploying robot swarms more effectively in diverse scenarios. Despite nature's preference for hierarchies, the robotics field has clung to the egalitarian model, partly due to a lack of empirical evidence for the conditions favoring hierarchies. Our research demonstrates that while egalitarian swarms excel in environments proportionate to their collective sensing abilities, they struggle in larger or more complex settings. Hierarchical swarms, conversely, extend their sensing reach efficiently, proving successful in larger, more unstructured environments with fewer resources. We validated these concepts through simulations and physical robot experiments, using a complex radiation cleanup task. This study paves the way for developing adaptable, hierarchical swarm systems applicable in areas like planetary exploration and autonomous vehicles. Moreover, these insights could deepen our understanding of hierarchical structures in biological organisms.
Overcoming boundaries: Interdisciplinary challenges and opportunities in cognitive neuroscience
Arnaud Brignol
Anita Paas
Luis Sotelo-Castro
David St-Onge
Emily B.J. Coffey