2021-11
Scalable Operator Allocation for Multi-Robot Assistance: A Restless Bandit Approach.
2021-10
Robustness and sample complexity of model-based MARL for general-sum Markov games.
2021-08
A relaxed technical assumption for posterior sampling-based reinforcement learning for control of unknown linear systems
Scalable regret for learning to control network-coupled subsystems with unknown dynamics.
2021-06
Structure-aware reinforcement learning for node-overload protection in mobile edge computing
2021-04
Maintenance of a collection of machines under partial observability: Indexability and computation of Whittle index.
2021-01
Multi-Agent Estimation and Filtering for Minimizing Team Mean-Squared Error
Optimal control of network-coupled subsystems: Spectral decomposition and low-dimensional solutions
IEEE Transactions on Control of Network Systems
(2021-01-01)
xplorestaging.ieee.org [Also on arXiv e-prints (2020-09-25)]
2020-12
Team Optimal Control of Coupled Major-Minor Subsystems with Mean-Field Sharing
Reinforcement Learning in Decentralized Stochastic Control Systems with Partial History Sharing.
Team-Optimal Solution of Finite Number of Mean-Field Coupled LQG Subsystems.
Team Optimal Control of Coupled Subsystems with Mean-Field Sharing
2020-11
Thompson sampling for linear quadratic mean-field teams.
2020-10
Approximate information state for approximate planning and reinforcement learning in partially observed systems.
2020-09
Networked control of coupled subsystems: Spectral decomposition and low-dimensional solutions
(venue unknown)
(2020-09-25)
2020-08
Conditions for indexability of restless bandits and an algorithm to compute Whittle index
Optimal Local and Remote Controllers With Unreliable Uplink Channels: An Elementary Proof
Renewal Monte Carlo: Renewal Theory-Based Reinforcement Learning
2020-07
Counterexamples on the Monotonicity of Delay Optimal Strategies for Energy Harvesting Transmitters
2020-06
Cross-layer communication over fading channels with adaptive decision feedback
Restless bandits: Indexability and computation of Whittle index
2020-05
Remote Estimation Over a Packet-Drop Channel With Markovian State
2020-04
Decentralized linear quadratic systems with major and minor agents and non-Gaussian noise
2020-02
Publications collected and formatted using Paperoni