Learning attentional communication for multi-agent cooperation J Jiang, Z Lu Advances in neural information processing systems 31, 2018 | 491 | 2018 |
Graph convolutional reinforcement learning J Jiang, C Dun, T Huang, Z Lu arXiv preprint arXiv:1810.09202, 2018 | 391 | 2018 |
Learning fairness in multi-agent systems J Jiang, Z Lu Advances in Neural Information Processing Systems 32, 2019 | 56 | 2019 |
Towards human-level bimanual dexterous manipulation with reinforcement learning Y Chen, T Wu, S Wang, X Feng, J Jiang, Z Lu, S McAleer, H Dong, ... Advances in Neural Information Processing Systems 35, 5150-5163, 2022 | 49 | 2022 |
The emergence of individuality J Jiang, Z Lu International Conference on Machine Learning, 4992-5001, 2021 | 35 | 2021 |
Offline decentralized multi-agent reinforcement learning J Jiang, Z Lu arXiv preprint arXiv:2108.01832, 2021 | 32 | 2021 |
Model-based opponent modeling X Yu, J Jiang, W Zhang, H Jiang, Z Lu Advances in Neural Information Processing Systems 35, 28208-28221, 2022 | 18 | 2022 |
I2Q: A fully decentralized Q-learning algorithm J Jiang, Z Lu Advances in Neural Information Processing Systems 35, 20469-20481, 2022 | 11 | 2022 |
MA2QL: A minimalist approach to fully decentralized multi-agent reinforcement learning K Su, S Zhou, J Jiang, C Gan, X Wang, Z Lu arXiv preprint arXiv:2209.08244, 2022 | 7 | 2022 |
Generative exploration and exploitation J Jiang, Z Lu Proceedings of the AAAI Conference on Artificial Intelligence 34 (04), 4337-4344, 2020 | 6 | 2020 |
Online tuning for offline decentralized multi-agent reinforcement learning J Jiang, Z Lu Proceedings of the AAAI Conference on Artificial Intelligence 37 (7), 8050-8059, 2023 | 4 | 2023 |
Towards General Computer Control: A Multimodal Agent for Red Dead Redemption II as a Case Study W Tan, Z Ding, W Zhang, B Li, B Zhou, J Yue, H Xia, J Jiang, L Zheng, ... arXiv preprint arXiv:2403.03186, 2024 | 2 | 2024 |
Learning from Visual Observation via Offline Pretrained State-to-Go Transformer B Zhou, K Li, J Jiang, Z Lu Advances in Neural Information Processing Systems 36, 2024 | 1 | 2024 |
Best possible Q-learning J Jiang, Z Lu arXiv preprint arXiv:2302.01188, 2023 | 1 | 2023 |
A Survey on Game Playing Agents and Large Models: Methods, Applications, and Challenges X Xu, Y Wang, C Xu, Z Ding, J Jiang, Z Ding, BF Karlsson arXiv preprint arXiv:2403.10249, 2024 | | 2024 |
Fully Decentralized Cooperative Multi-Agent Reinforcement Learning: A Survey J Jiang, K Su, Z Lu arXiv preprint arXiv:2401.04934, 2024 | | 2024 |
Opponent Modeling based on Sub-Goal Inference XP Yu, J Jiang, Z Lu | | 2023 |
Model-Based Decentralized Policy Optimization H Luo, J Jiang, Z Lu arXiv preprint arXiv:2302.08139, 2023 | | 2023 |
Adaptive Learning Rates for Multi-Agent Reinforcement Learning J Jiang, Z Lu | | 2020 |