Hanqing Zhao

Affiliate Member

Assistant Professor, Université Laval, Electrical and Computer Engineering

Research Topics

Multi-Agent Systems

Multitask Learning

Reinforcement Learning

Robotics

Swarm Intelligence

Website

Google Scholar

Biography

Hanqing Zhao is an Assistant Professor in the Département de génie électrique et de génie informatique of Université Laval. He is a member of the Laboratoire de Vision et Systèmes Numériques (LVSN).

Hanqing began his academic journey at the École Centrale de Pékin (Université Beihang). He earned an Ingénieur civil en informatique degree from École Polytechnique de Bruxelles (Université libre de Bruxelles), supervised by Marco Dorigo, and later received his Ph.D. in Computer Science (robotics) from McGill University, supervised by Gregory Dudek and Xue (Steve) Liu. He was then a Postdoctoral Researcher at the MIST Lab of École Polytechnique de Montréal, supervised by Giovanni Beltrame.

His research focuses on enabling robots to accomplish complex tasks while remaining resilient to faults and external disturbances. He leverages machine learning, adaptive control, and advanced consensus achievement techniques, such as reinforcement learning, supervised learning, and blockchain technologies, to develop robust robot systems, especially multi-robot systems.

Publications

A Blockchain Framework for Equitable and Secure Task Allocation in Robot Swarms

Alexandre Pacheco

Marco Dorigo

Recent studies demonstrate the potential of blockchain to enable robots in a swarm to achieve secure consensus about the environment, particularly when robots are homogeneous and perform identical tasks. Typically, robots receive rewards for their contributions to consensus achievement, but no studies have yet targeted heterogeneous swarms, in which the robots have distinct physical capabilities suited to different tasks. We present a novel framework that leverages domain knowledge to decompose the swarm mission into a hierarchy of tasks within smart contracts. This allows the robots to reach a consensus about both the environment and the action plan, allocating tasks among robots with diverse capabilities to improve their performance while maintaining security against faults and malicious behaviors. We refer to this concept as equitable and secure task allocation. Validated in Simultaneous Localization and Mapping missions, our approach not only achieves equitable task allocation among robots with varying capabilities, improving mapping accuracy and efficiency, but also shows resilience against malicious attacks.

2025-10-01

IEEE Robotics and Automation Letters (published)

A Generic Framework for Byzantine-Tolerant Consensus Achievement in Robot Swarms

Hanqing Zhao

Alexandre Pacheco

Volker Strobel

Andreagiovanni Reina

Xue (Steve) Liu

Gregory Dudek

Marco Dorigo

Recent studies show that some security features that blockchains grant to decentralized networks on the internet can be ported to swarm robotics. Although the integration of blockchain technology and swarm robotics shows great promise, thus far, research has been limited to proof-of-concept scenarios where the blockchain-based mechanisms are tailored to a particular swarm task and operating environment. In this study, we propose a generic framework based on a blockchain smart contract that enables robot swarms to achieve secure consensus in an arbitrary observation space. This means that our framework can be customized to fit different swarm robotics missions, while providing methods to identify and neutralize Byzantine robots, that is, robots which exhibit detrimental behaviours stemming from faults or malicious tampering.

2023-10-01

2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (published)

Zero-Shot Fault Detection for Manipulators Through Bayesian Inverse Reinforcement Learning

Hanqing Zhao

Xue (Steve) Liu

Gregory Dudek

We consider the detection of faults in robotic manipulators, with particular emphasis on faults that have not been observed or identified in advance, which naturally includes those that occur very infrequently. Recent studies indicate that the reward function obtained through Inverse Reinforcement Learning (IRL) can help detect anomalies caused by faults in a control system (i.e. fault detection). Current IRL methods for fault detection, however, either use a linear reward representation or require extensive sampling from the environment to estimate the policy, rendering them inappropriate for safety-critical situations where sampling of failure observations via fault injection can be expensive and dangerous. To address this issue, this paper proposes a zero-shot and exogenous fault detector based on an approximate variational reward imitation learning (AVRIL) structure. The fault detector recovers a reward signal as a function of externally observable information to describe the normal operation, which can then be used to detect anomalies caused by faults. Our method incorporates expert knowledge through a customizable reward prior distribution, allowing the fault detector to learn the reward solely from normal operation samples, without the need for a simulator or costly interactions with the environment. We evaluate our approach for exogenous partial fault detection in multi-stage robotic manipulator tasks, comparing it with several baseline methods. The results demonstrate that our method more effectively identifies unseen faults even when they occur within just three controller time steps.

2023-10-01

2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (published)
