Follow
Weichao Mao
Title
Cited by
Cited by
Year
Provably efficient reinforcement learning in decentralized general-sum Markov games
W Mao, T Başar
Dynamic Games and Applications 13 (1), 165-186, 2023
632023
On improving model-free algorithms for decentralized multi-agent reinforcement learning
W Mao, L Yang, K Zhang, T Basar
International Conference on Machine Learning, 15007-15049, 2022
61*2022
Near-Optimal Model-Free Reinforcement Learning in Non-Stationary Episodic MDPs
W Mao, K Zhang, R Zhu, D Simchi-Levi, T Basar
International Conference on Machine Learning, 7447-7458, 2021
44*2021
Pricing for revenue maximization in IoT data markets: An information design perspective
W Mao, Z Zheng, F Wu
IEEE INFOCOM 2019-IEEE Conference on Computer Communications, 1837-1845, 2019
382019
Information State Embedding in Partially Observable Cooperative Multi-Agent Reinforcement Learning
W Mao, K Zhang, E Miehling, T Başar
2020 59th IEEE Conference on Decision and Control (CDC), 6124-6131, 2020
222020
Reinforcement learning for resource management in multi-tenant serverless platforms
H Qiu, W Mao, A Patke, C Wang, H Franke, ZT Kalbarczyk, T Başar, ...
Proceedings of the 2nd European Workshop on Machine Learning and Systems, 20-28, 2022
182022
POLY-HOOT: Monte-Carlo Planning in Continuous Space MDPs with Non-Asymptotic Analysis
W Mao, K Zhang, Q Xie, T Başar
Advances in Neural Information Processing Systems 33, 2020
182020
Adjusting Matching Algorithm to Adapt to Workload Fluctuations in Content-based Publish/Subscribe Systems
S Qian, W Mao, J Cao, F Le Mouël, M Li
IEEE INFOCOM 2019-IEEE Conference on Computer Communications, 1936-1944, 2019
162019
Model-free non-stationary rl: Near-optimal regret and applications in multi-agent rl and inventory control
W Mao, K Zhang, R Zhu, D Simchi-Levi, T Başar
arXiv preprint arXiv:2010.03161, 2020
152020
Online Pricing for Revenue Maximization with Unknown Time Discounting Valuations.
W Mao, Z Zheng, F Wu, G Chen
IJCAI, 440-446, 2018
142018
A mean-field game approach to cloud resource management with function approximation
W Mao, H Qiu, C Wang, H Franke, Z Kalbarczyk, R Iyer, T Basar
Advances in Neural Information Processing Systems 35, 36243-36258, 2022
122022
{AWARE}: Automate Workload Autoscaling with Reinforcement Learning in Production Cloud Systems
H Qiu, W Mao, C Wang, H Franke, A Youssef, ZT Kalbarczyk, T Başar, ...
2023 USENIX Annual Technical Conference (USENIX ATC 23), 387-402, 2023
112023
SIMPPO: a scalable and incremental online learning framework for serverless resource management
H Qiu, W Mao, A Patke, C Wang, H Franke, ZT Kalbarczyk, T Başar, ...
Proceedings of the 13th Symposium on Cloud Computing, 306-322, 2022
112022
A fast and anti-matchability matching algorithm for content-based publish/subscribe systems
S Qian, J Cao, W Mao, Y Zhu, J Yu, M Li, J Wang
Computer Networks 149, 213-225, 2019
102019
Challenges and Opportunities in IoT Data Markets
Z Zheng, W Mao, F Wu, G Chen
Proceedings of the Fourth International Workshop on Social Sensing, 1-2, 2019
72019
Controlgym: Large-Scale Safety-Critical Control Environments for Benchmarking Reinforcement Learning Algorithms
X Zhang, W Mao, S Mowlavi, M Benosman, T Başar
arXiv preprint arXiv:2311.18736, 2023
32023
Action Dynamics Task Graphs for Learning Plannable Representations of Procedural Tasks
W Mao, R Desai, ML Iuzzolino, N Kamra
arXiv preprint arXiv:2302.05330, 2023
32023
Adjusting matching algorithm to adapt to dynamic subscriptions in content-based publish/subscribe systems
S Qian, W Mao, J Cao, G Xue, J Yu, Y Zhu, M Li, W Li
2018 IEEE Intl Conf on Parallel & Distributed Processing with Applications …, 2018
32018
Semiparametric information state embedding for policy search under imperfect information
S Bhatt, W Mao, A Koppel, T Başar
2021 60th IEEE Conference on Decision and Control (CDC), 4501-4506, 2021
22021
On the Promise and Challenges of Foundation Models for Learning-based Cloud Systems Management
H Qiu, W Mao, CWH Franke, ZT Kalbarczyk, T Basar, RK Iyer
Annual Conference on Neural Information Processing Systems, 2023
12023
The system can't perform the operation now. Try again later.
Articles 1–20