Xue (Steve) Liu

Ralph Cui

Postdoctorate

Dheeraj Dheeraj Vattikonda

Master's Research - McGill University

Linfeng Du

PhD - McGill University

Lyu Fuyuan

PhD - McGill University

Co-supervisor :

Kaiyuan Hu

PhD - McGill University

Chengming Hu

PhD - McGill University

Li Jiang Jiang

PhD - McGill University

Yili Jin

PhD - McGill University

Hong Kang

PhD - McGill University

Zonglun Li

PhD - McGill University

Yuyan Lin

Master's Research - McGill University

Junliang Luo

PhD - McGill University

Shaoxiang Qin

PhD - McGill University

Alejandro Salinas-Medina

PhD - McGill University

Zipeng Sun

PhD - McGill University

Haolun Wu

PhD - McGill University

Co-supervisor :

Fernando Diaz

Ye Yuan

PhD - McGill University

Dun Yuan

PhD - McGill University

Importance-Aware Co-Teaching for Offline Model-Based Optimization

Weixu Zhang

PhD - McGill University

Blog Posts

Co-enseignement sensible à l’importance pour optimisation hors-ligne fondée sur un modèle

July 1, 2024

Ye Yuan

Can Chen

Zixuan Liu

Willie Neiswanger

Xue Liu

Read the article

September 20, 2023

Bidirectional Learning for Offline Model-based Optimization

Can Chen

Yingxue Zhang

Xue Liu

Mark Coates

Read the article

Publications

A Cost-Efficient Metadata Scheme for High-Performance Deduplication Systems

Yuxuan Mo

Yu Hua

Pengfei Li

Qin Cao

Data deduplication has been widely used in backup systems to eliminate redundant data, which speeds up the backup process and reduces the st… (see more)orage overhead. Deduplication packs multiple chunks into a large, fixed-size container as a storage unit to maintain the locality and achieve efficient compression. We observe that the traditional containers have low filling ratios due to a large amount of metadata generated by small files. Unfilled containers require more space to store a backup, which decreases the storage efficiency and reduces restore performance. In order to address this problem, we propose a Metadata region Adaptive Container Structure, called MACS. MACS maintains a tag to record the length of metadata region in the container. The boundary between meta-data region and data region is dynamically decided to ensure the maximum space efficiency of the containers. Moreover, we propose a container metadata length-based indexing and cache replacement strategy to allow MACS to be practical in data backup systems. We demonstrate the advantages of MACS with three real world backup datasets. MACS achieves over 95% average container filling ratio, which is significantly higher than existing designs. MACS further achieves better restore performance than the traditional container structure. When combined with existing rewriting method, MACS achieves an efficient trade-off between deduplication ratio and restore performance.

2021-12-20

2021 IEEE 23rd Int Conf on High Performance Computing & Communications; 7th Int Conf on Data Science & Systems; 19th Int Conf on Smart City; 7th Int Conf on Dependability in Sensor, Cloud & Big Data Systems & Application (HPCC/DSS/SmartCity/DependSys) (published)

Learning Assisted Identification of Scenarios Where Network Optimization Algorithms Under-Perform

Dmitriy Rivkin

David Meger

Di Wu

X. T. Chen

Gregory Dudek

We present a generative adversarial method that uses deep learning to identify network load traffic conditions in which network optimization… (see more) algorithms under-perform other known algorithms: the Deep Convolutional Failure Generator (DCFG). The spatial distribution of network load presents challenges for network operators for tasks such as load balancing, in which a network optimizer attempts to maintain high quality communication while at the same time abiding capacity constraints. Testing a network optimizer for all possible load distributions is challenging if not impossible. We propose a novel method that searches for load situations where a target network optimization method underperforms baseline, which are key test cases that can be used for future refinement and performance optimization. By modeling a realistic network simulator's quality assessments with a deep network and, in parallel, optimizing a load generation network, our method efficiently searches the high dimensional space of load patterns and reliably finds cases in which a target network optimization method under-performs a baseline by a significant margin.

2021-12-01

Global Communications Conference (published)

Design and Implementation of Smooth Renewable Power in Cloud Data Centers

Xinxin Liu

Yu Hua

Ling Yang

Yuanyuan Sun

The renewable power has been widely used in modern cloud data centers, which also produce large electricity bills and the negative impacts o… (see more)n environments. However, frequent fluctuation and intermittency of renewable power often cause the challenges in terms of the stability of both electricity grid and data centers, as well as decreasing the utilization of renewable power. Existing schemes fail to alleviate the renewable power fluctuation, which is caused by the essential properties of renewable power. In order to address this problem, we propose an efficient and easy-to-use smooth renewable power-aware scheme, called Smoother, which consists of Flexible Smoothing (FS) and Active Delay (AD). First, in order to smooth the fluctuation of renewable power, FS carries out the optimized charge/discharge operation via computing the minimum variance of the renewable power that is supplied to data centers per interval. Second, AD improves the utilization of renewable power via actively adjusting the execution time of deferrable workloads. Extensive experimental results via examining the traces of real-world data centers demonstrate that Smoother significantly reduces the negative impact of renewable power fluctuations on data centers and improves the utilization of renewable power by 250.88 percent on average. We have released the source codes for public use.

2021-04-30

IEEE Transactions on Cloud Computing (published)

Smart Futures Based Resource Trading and Coalition Formation for Real-Time Mobile Data Processing

Ruitao Chen

Xianbin Wang

Collaboration among mobile devices (MDs) is becoming more important, as it could augment computing capacity at the network edge through peer… (see more)-to-peer service provisioning, and directly enhance real-time computational performance in smart Internet-of-Things applications. As an important aspect of collaboration mechanism, conventional resource trading (RT) among MDs relies on an onsite interaction process, i.e., price negotiation between service providers and requesters, which, however, inevitably incurs excessive latency and degrades RT efficiency. To overcome this challenge, this article adopts the concept of futures contract (FC) used in financial market, and proposes a smart futures for low latency RT. This new technique enables MDs to form trading coalitions and negotiate multilateral forward contracts applied to a collaboration term in the future. To maximize the benefits of self-interested MDs, the negotiation process of FC is modelled as a coalition formation game comprised of three components executed in an iterative manner, i.e., futures resource allocation, revenue sharing and payment allocation, and distributed decision-making of individual MD. Additionally, a FC enforcement scheme is implemented to efficiently manage the onsite resource sharing via recording resource balances of different task-types and MDs. Simulation results prove the superiority of smart futures in RT latency reduction and trading fairness provisioning.

2021-02-18

IEEE Transactions on Services Computing (published)

Smart Futures Based Resource Trading and Coalition Formation for Real-Time Mobile Data Processing

Ruitao Chen

Xianbin Wang

2021-02-18

IEEE Transactions on Services Computing (published)

Variational Nested Dropout

Yufei Cui

Yushun Mao

Ziquan Liu

Qiao Li

Antoni Bert Chan

Tei-Wei Kuo

Chun Jason Xue

Nested dropout is a variant of dropout operation that is able to order network parameters or features based on the pre-defined importance du… (see more)ring training. It has been explored for: I. Constructing nested nets Cui et al. 2020, Cui et al. 2021: the nested nets are neural networks whose architectures can be adjusted instantly during testing time, e.g., based on computational constraints. The nested dropout implicitly ranks the network parameters, generating a set of sub-networks such that any smaller sub-network forms the basis of a larger one. II. Learning ordered representation Rippel et al. 2014: the nested dropout applied to the latent representation of a generative model (e.g., auto-encoder) ranks the features, enforcing explicit order of the dense representation over dimensions. However, the dropout rate is fixed as a hyper-parameter during the whole training process. For nested nets, when network parameters are removed, the performance decays in a human-specified trajectory rather than in a trajectory learned from data. For generative models, the importance of features is specified as a constant vector, restraining the flexibility of representation learning. To address the problem, we focus on the probabilistic counterpart of the nested dropout. We propose a variational nested dropout (VND) operation that draws samples of multi-dimensional ordered masks at a low cost, providing useful gradients to the parameters of nested dropout. Based on this approach, we design a Bayesian nested neural network that learns the order knowledge of the parameter distributions. We further exploit the VND under different generative models for learning ordered latent distributions. In experiments, we show that the proposed approach outperforms the nested network in terms of accuracy, calibration, and out-of-domain detection in classification tasks. It also outperforms the related generative models on data generation tasks.

2021-01-27

ArXiv (preprint)