Portrait of Hanqing Zhao

Hanqing Zhao

Affiliate Member
Assistant Professor, Université Laval, Electrical and Computer Engineering
Research Topics
Multi-Agent Systems
Multitask Learning
Reinforcement Learning
Robotics
Swarm Intelligence

Biography

Hanqing Zhao is an Assistant Professor in the Département de génie électrique et de génie informatique of Université Laval. He is a member of the Laboratoire de Vision et Systèmes Numériques (LVSN).

Hanqing began his academic journey at the École Centrale de Pékin (Université Beihang). He earned an Ingénieur civil en informatique degree from École Polytechnique de Bruxelles (Université libre de Bruxelles), supervised by Marco Dorigo; and later received his Ph.D. in Computer Science (robotics) from McGill University, supervised by Gregory Dudek and Xue (Steve) Liu. He was then a Postdoctoral Researcher at the MIST Lab of École Polytechnique de Montréal, supervised by Giovanni Beltrame.

His research focuses on enabling robots to accomplish complex tasks while remaining resilient to faults and external disturbances. He leverages machine learning, adaptive control, and advanced consensus achievement techniques, such as reinforcement learning, supervised learning, Blockchain technologies to develop robust, (especially multi-)robot systems.

Publications

A Generic Framework for Byzantine-Tolerant Consensus Achievement in Robot Swarms
Alexandre Pacheco
Volker Strobel
Andreagiovanni Reina
Marco Dorigo
Recent studies show that some security features that blockchains grant to decentralized networks on the internet can be ported to swarm robo… (see more)tics. Although the integration of blockchain technology and swarm robotics shows great promise, thus far, research has been limited to proof-of-concept scenarios where the blockchain-based mechanisms are tailored to a particular swarm task and operating environment. In this study, we propose a generic framework based on a blockchain smart contract that enables robot swarms to achieve secure consensus in an arbitrary observation space. This means that our framework can be customized to fit different swarm robotics missions, while providing methods to identify and neutralize Byzantine robots, that is, robots which exhibit detrimental behaviours stemming from faults or malicious tampering.
Zero-Shot Fault Detection for Manipulators Through Bayesian Inverse Reinforcement Learning
We consider the detection of faults in robotic manipulators, with particular emphasis on faults that have not been observed or identified in… (see more) advance, which naturally includes those that occur very infrequently. Recent studies indicate that the reward function obtained through Inverse Reinforcement Learning (IRL) can help detect anomalies caused by faults in a control system (i.e. fault detection). Current IRL methods for fault detection, however, either use a linear reward representation or require extensive sampling from the environment to estimate the policy, rendering them inappropriate for safety-critical situations where sampling of failure observations via fault injection can be expensive and dangerous. To address this issue, this paper proposes a zero-shot and exogenous fault detector based on an approximate variational reward imitation learning (AVRIL) structure. The fault detector recovers a reward signal as a function of externally observable information to describe the normal operation, which can then be used to detect anomalies caused by faults. Our method incorporates expert knowledge through a customizable reward prior distribution, allowing the fault detector to learn the reward solely from normal operation samples, without the need for a simulator or costly interactions with the environment. We evaluate our approach for exogenous partial fault detection in multi-stage robotic manipulator tasks, comparing it with several baseline methods. The results demonstrate that our method more effectively identifies unseen faults even when they occur within just three controller time steps.