Xue (Steve) Liu

Ralph Cui

Postdoctoral fellow

chengming.hu@mail.mcgill.ca

Lyu Fuyuan

PhD - McGill

Co-supervisor:

PhD - McGill

Yili Jin

PhD - McGill

Zonglun Li

PhD - McGill

Junliang Luo

PhD - McGill

Github

Shaoxiang Qin

PhD - McGill

Alejandro Salinas-Medina

PhD - McGill

Research Master's - McGill

Haolun Wu

PhD - McGill

Co-supervisor:

Fernando Diaz

Ye Yuan

PhD - McGill

ye.yuan3@mail.mcgill.ca

Github

Dun Yuan

Research Master's - McGill

Importance-aware Co-teaching for Offline Model-based Optimization

Weixu Zhang

PhD - McGill

Github

Shuhao Zheng

PhD - McGill

Blog posts

July 1, 2024

by

Ye Yuan

Can Chen

Zixuan Liu

Willie Neiswanger

Xue Liu

Read the article

Bidirectional Learning for Offline Model-based Optimization

September 20, 2023

by

Can Chen

Yingxue Zhang

Xue Liu

Mark Coates

Read the article

Publications

Learning Multi-Objective Curricula for Robotic Policy Learning

Jikun Kang

Miao Liu

Abhinav Gupta

Chris Pal

Jie Fu

2023-03-06

Proceedings of The 6th Conference on Robot Learning (published)

proceedings.mlr.press

openreview.net

Ternary Quantization: A Survey

Danyang Liu

Inference time, model size, and accuracy are critical for deploying deep neural network models. Numerous research efforts have been made to compress neural network models with faster inference and higher accuracy. Pruning and quantization are mainstream methods to this end. During model quantization, converting individual float values of layer weights to low-precision ones can substantially reduce the computational overhead and improve the inference speed. Many quantization methods have been studied, for example, vector quantization, low-bit quantization, and binary/ternary quantization. This survey focuses on ternary quantization. We review the evolution of ternary quantization and investigate the relationships among existing ternary quantization methods from the perspective of projection function and optimization methods.

2023-03-02

ArXiv (preprint)

Design and Implementation of Smooth Renewable Power in Cloud Data Centers

Xinxin Liu

Yu Hua

Ling Yang

Yuanyuan Sun

The renewable power has been widely used in modern cloud data centers, which also produce large electricity bills and the negative impacts on environments. However, frequent fluctuation and intermittency of renewable power often cause the challenges in terms of the stability of both electricity grid and data centers, as well as decreasing the utilization of renewable power. Existing schemes fail to alleviate the renewable power fluctuation, which is caused by the essential properties of renewable power. In order to address this problem, we propose an efficient and easy-to-use smooth renewable power-aware scheme, called Smoother, which consists of Flexible Smoothing (FS) and Active Delay (AD). First, in order to smooth the fluctuation of renewable power, FS carries out the optimized charge/discharge operation via computing the minimum variance of the renewable power that is supplied to data centers per interval. Second, AD improves the utilization of renewable power via actively adjusting the execution time of deferrable workloads. Extensive experimental results via examining the traces of real-world data centers demonstrate that Smoother significantly reduces the negative impact of renewable power fluctuations on data centers and improves the utilization of renewable power by 250.88 percent on average. We have released the source codes for public use.

2023-03-01

IEEE Transactions on Cloud Computing (published)

Learning From FM Communications: Toward Accurate, Efficient, All-Terrain Vehicle Localization

X. T. Chen

Qiao Xiang

Linghe Kong

Huisan Xu

Vehicle localization service is a fundamental component of intelligent transportation systems. The widely used satellite navigation systems perform poorly in urban areas because the lines of sight to satellites are blocked by complex terrain characteristics, e.g., buildings, elevated streets and interchanges. In this paper, we design RadioLoc, a novel system achieving accurate, efficient, all-terrain vehicle localization with two key design points. First, RadioLoc harvests the frequency modulation (FM) signal, which has higher availability than satellite signal in complex terrains, as the signal source for localization. Second, RadioLoc integrates modern machine learning techniques into the processing of FM signals to efficiently learn the accurate vehicle localization in all-terrain environments. We validate the feasibility of FM-based vehicle localization and corresponding challenges and practical issues via field tests (e.g., signal distortion, signal inconsistency and limited in-vehicle radio bandwidth), and develop a series of advanced techniques in RadioLoc to address them, including adaptive batching, frequency sweeping, a novel multipath delay spread filter, a reconstructive PCA denoiser and a tailored FM feature extractor. We then develop a generic, modular localization module in RadioLoc, and design different learning-based 3D position identification algorithms for this module. We implement a prototype of RadioLoc and perform extensive field experiments to evaluate its efficiency and efficacy. Results show that (1) RadioLoc achieves a real-time localization latency of less than 100 milliseconds; (2) RadioLoc achieves a worst-case localization accuracy of 99.6% even in an underground parking lot, and (3) the horizontal error of RadioLoc is only one sixth of a dedicated GPS device even when the vehicle is moving at a high-speed (i.e., 80 km/h) in a complex highway scenario.

2023-02-01

IEEE/ACM Transactions on Networking (published)

Hyperspherical Quantization: Toward Smaller and More Accurate Models

Dan Liu

X. T. Chen

Chen Ma

Model quantization enables the deployment of deep neural networks under resource-constrained devices. Vector quantization aims at reducing the model size by indexing model weights with full-precision embeddings, i.e., codewords, while the index needs to be restored to 32-bit during computation. Binary and other low-precision quantization methods can reduce the model size up to 32×, however, at the cost of a considerable accuracy drop. In this paper, we propose an efficient framework for ternary quantization to produce smaller and more accurate compressed models. By integrating hyperspherical learning, pruning and reinitialization, our proposed Hyperspherical Quantization (HQ) method reduces the cosine distance between the full-precision and ternary weights, thus reducing the bias of the straight-through gradient estimator during ternary quantization. Compared with existing work at similar compression levels (~30×, ~40×), our method significantly improves the test accuracy and reduces the model size.

2023-01-02

2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) (published)

Bayes-MIL: A New Probabilistic Perspective on Attention-based Multiple Instance Learning for Whole Slide Images

Yufei Cui

Ziquan Liu

Xiangyu Liu

Cong Wang

Tei-Wei Kuo

Chun Jason Xue

Antoni B. Chan

Multiple instance learning (MIL) is a popular weakly-supervised learning model on the whole slide image (WSI) for AI-assisted pathology diagnosis. The recent advance in attention-based MIL allows the model to find its region-of-interest (ROI) for interpretation by learning the attention weights for image patches of WSI slides. However, we empirically find that the interpretability of some related methods is either untrustworthy as the principle of MIL is violated or unsatisfactory as the high-attention regions are not consistent with experts’ annotations. In this paper, we propose Bayes-MIL to address the problem from a probabilistic perspective. The induced patch-level uncertainty is proposed as a new measure of MIL interpretability, which outperforms previous methods in matching doctors annotations. We design a slide-dependent patch regularizer (SDPR) for the attention, imposing constraints derived from the MIL assumption, on the attention distribution. SDPR explicitly constrains the model to generate correct attention values. The spatial information is further encoded by an approximate convolutional conditional random field (CRF), for better interpretability. Experimental results show Bayes-MIL outperforms the related methods in patch-level and slide-level metrics and provides much better interpretable ROI on several large-scale WSI datasets.

2023-01-01

ICLR (published)

dblp.uni-trier.de

A Distributed Pricing Strategy for Edge Computation Offloading Optimization in Autonomous Driving

Jie Tang

Weilin Zhu

Xiaoming Li

Shaoshan Liu

The increase of on-vehicle applications has brought explosive computation demands to autonomous vehicles and overwhelmed their limited onboard resources. Edge computing can offload application load and effectively alleviate this problem. However, the introduction of edge computing faces significant challenges, including the considerable amount of resource contention due to the scarcity of edge resources and the competition among edge computing resource providers to earn users’ services requests. We notice that the problem is not purely technical as solutions for these two problems can become conflicting to each other. In this paper, we propose a distributed pricing strategy to achieve full use of computing resources at the edge and maximize the revenue of service operators, both with guaranteed quality-of-service of on-vehicle applications. More specifically, we first use the multi-leader multi-follower Stackelberg game theory to model the pricing of on-vehicle task offloading under edge computing. Next, we propose a distributed pricing strategy to enable edge servers to adjust their local price distributions so that edge servers can bargain with offloading requesters independently. Experimental results confirm that the proposed distributed pricing strategy can provide more optimized server computing resource utilization while guaranteeing the performance of in-vehicle applications.

2023-01-01

IEEE Network (published)

A Survey of Diversification Metrics and Approaches in Retrieval Systems: From the Perspective of Search and Recommendation

Haolun Wu

Yansen Zhang

Chen Ma

Fuyuan Lyu

Fernando Diaz

Diversifying search results is an important research topic in retrieval systems in order to satisfy both the various interests of customers and the equal market exposure of providers. There has been a growing attention on diversity-aware research during recent years, accompanied by a proliferation of literature on methods to promote diversity in search and recommendation. However, the diversity-aware studies in retrieval systems lack a systematic organization and are rather fragmented. In this survey, we are the first to propose a unified taxonomy for classifying the metrics and approaches of diversification in both search and recommendation, which are two of the most extensively researched fields of retrieval systems. We begin the survey with a brief discussion of why diversity is important in retrieval systems

Dynamic Consolidation for Continual Learning

Hang Li

Chen Ma

X. T. Chen

Training deep learning models from a stream of nonstationary data is a critical problem to be solved to achieve general artificial intelligence. As a promising solution, the continual learning (CL) technique aims to build intelligent systems that have the plasticity to learn from new information without forgetting the previously obtained knowledge. Unfortunately, existing CL methods face two nontrivial limitations. First, when updating a model with new data, existing CL methods usually constrain the model parameters within the vicinity of the parameters optimized for old data, limiting the exploration ability of the model; second, the important strength of each parameter (used to consolidate the previously learned knowledge) is fixed and thus is suboptimal for the dynamic parameter updates. To address these limitations, we first relax the vicinity constraints with a global definition of the important strength, which allows us to explore the full parameter space. Specifically, we define the important strength as the sensitivity of the global loss function to the model parameters. Moreover, we propose adjusting the important strength adaptively to align it with the dynamic parameter updates. Through extensive experiments on popular data sets, we demonstrate that our proposed method outperforms the strong baselines by up to 24% in terms of average accuracy.

2022-12-14

Neural Computation (published)

Adapting Triplet Importance of Implicit Feedback for Personalized Recommendation

Haolun Wu

Chen Ma

Yingxue Zhang

Ruiming Tang

Mark Coates

2022-10-17

Proceedings of the 31st ACM International Conference on Information & Knowledge Management (published)

OptEmbed: Learning Optimal Embedding Table for Click-through Rate Prediction

Fuyuan Lyu

Xing Tang

Hong Zhu

Huifeng Guo

Yingxue Zhang

Ruiming Tang

Click-through rate (CTR) prediction model usually consists of three components: embedding table, feature interaction layer, and classifier. Learning embedding table plays a fundamental role in CTR prediction from the view of the model performance and memory usage. The embedding table is a two-dimensional tensor, with its axes indicating the number of feature values and the embedding dimension, respectively. To learn an efficient and effective embedding table, recent works either assign various embedding dimensions for feature fields and reduce the number of embeddings respectively or mask the embedding table parameters. However, all these existing works cannot get an optimal embedding table. On the one hand, various embedding dimensions still require a large amount of memory due to the vast number of features in the dataset. On the other hand, decreasing the number of embeddings usually suffers from performance degradation, which is intolerable in CTR prediction. Finally, pruning embedding parameters will lead to a sparse embedding table, which is hard to be deployed. To this end, we propose an optimal embedding table learning framework OptEmbed, which provides a practical and general method to find an optimal embedding table for various base CTR models. Specifically, we propose pruning the redundant embeddings regarding corresponding features' importance by learnable pruning thresholds. Furthermore, we consider assigning various embedding dimensions as one single candidate architecture. To efficiently search the optimal embedding dimensions, we design a uniform embedding dimension sampling scheme to equally train all candidate architectures, meaning architecture-related parameters and learnable thresholds are trained simultaneously in one supernet. We then propose an evolution search method based on the supernet to find the optimal embedding dimensions for each field. Experiments on public datasets show that OptEmbed can learn a compact embedding table which can further improve the model performance.

2022-10-17

Proceedings of the 31st ACM International Conference on Information & Knowledge Management (published)