Yi Wu

Cited by

	All	Since 2019
Citations	9493	9015
h-index	24	24
i10-index	33	32

Citations per year

	2017	2018	2019	2020	2021	2022	2023	2024
Citations	113	319	666	1029	1491	2054	2726	1041

Public access

12 articles available, 0 articles not available (based on funding mandates)

Co-authors

Aviv Tamar, Technion (verified email at technion.ac.il)
Stuart Russell, Professor of Computer Science, University of California, Berkeley (verified email at cs.berkeley.edu)
Yu Wang (汪玉), Department of Electronic Engineering, Tsinghua University, China (verified email at mail.tsinghua.edu.cn)
Yuandong Tian, Research Scientist, Meta AI (FAIR) (verified email at fb.com)
Fei Fang, Carnegie Mellon University (verified email at cmu.edu)
Igor Mordatch, Google DeepMind (verified email at google.com)
Pieter Abbeel, UC Berkeley | Covariant (verified email at cs.berkeley.edu)
Huazhe Xu, Tsinghua University (verified email at berkeley.edu)
Xiaolong Wang, Assistant Professor, UC San Diego (verified email at ucsd.edu)
Ryan Lowe, OpenAI (verified email at openai.com)
Jean Harb, OpenAI (verified email at openai.com)
Chao Yu (于超), Tsinghua University (verified email at mail.tsinghua.edu.cn)
Akash Velu, Student, Stanford University (verified email at stanford.edu)
Eugene Vinitsky, Assistant Professor, NYU (verified email at nyu.edu)
Georgia Gkioxari, Caltech (verified email at caltech.edu)
Yunfei Li, Tsinghua University (verified email at mails.tsinghua.edu.cn)
Shusheng Xu, IIIS, Tsinghua University (verified email at mails.tsinghua.edu.cn)
Alexandre Bayen, Professor, Electrical Engineering and Computer Science, UC Berkeley (verified email at berkeley.edu)
Yuxin Wu (verified email at google.com)
Ingmar Kanitscheider, OpenAI (verified email at openai.com)

Yi Wu

Institute for Interdisciplinary Information Sciences, Tsinghua University

Verified email at mail.tsinghua.edu.cn

Reinforcement Learning, Human-AI Interaction, Multi-Agent Learning, Robot Learning


Title	Cited by	Year
Multi-agent actor-critic for mixed cooperative-competitive environments. R Lowe, Y Wu, A Tamar, J Harb, P Abbeel, I Mordatch. Advances in Neural Information Processing Systems 30, 2017	4688	2017
The surprising effectiveness of PPO in cooperative multi-agent games. C Yu, A Velu, E Vinitsky, J Gao, Y Wang, A Bayen, Y Wu. Advances in Neural Information Processing Systems 35, 24611-24624, 2022	932	2022
Emergent tool use from multi-agent autocurricula. B Baker, I Kanitscheider, T Markov, Y Wu, G Powell, B McGrew, ... arXiv preprint arXiv:1909.07528, 2019	806	2019
Value iteration networks. A Tamar, Y Wu, G Thomas, S Levine, P Abbeel. Advances in Neural Information Processing Systems 29, 2016	731	2016
Building generalizable agents with a realistic and rich 3D environment. Y Wu, Y Wu, G Gkioxari, Y Tian. arXiv preprint arXiv:1801.02209, 2018	370	2018
Robust multi-agent reinforcement learning via minimax deep deterministic policy gradient. S Li, Y Wu, X Cui, H Dong, F Fang, S Russell. Proceedings of the AAAI Conference on Artificial Intelligence 33 (01), 4213-4220, 2019	300	2019
Adversarial training for relation extraction. Y Wu, D Bamman, S Russell. Proceedings of the 2017 Conference on Empirical Methods in Natural Language …, 2017	245	2017
Multi-task reinforcement learning with soft modularization. R Yang, H Xu, Y Wu, X Wang. Advances in Neural Information Processing Systems 33, 4767-4777, 2020	171	2020
Influence-based multi-agent exploration. T Wang, J Wang, Y Wu, C Zhang. arXiv preprint arXiv:1910.05512, 2019	132	2019
Bayesian relational memory for semantic visual navigation. Y Wu, Y Wu, A Tamar, S Russell, G Gkioxari, Y Tian. Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2019	122*	2019
Evolutionary population curriculum for scaling multi-agent reinforcement learning. Q Long, Z Zhou, A Gupta, F Fang, Y Wu, X Wang. arXiv preprint arXiv:2003.10423, 2020	105	2020
NovelD: A simple yet effective exploration criterion. T Zhang, H Xu, X Wang, Y Wu, K Keutzer, JE Gonzalez, Y Tian. Advances in Neural Information Processing Systems 34, 25217-25230, 2021	97*	2021
Deep reinforcement learning for green security games with real-time information. Y Wang, ZR Shi, L Yu, Y Wu, R Singh, L Joppa, F Fang. Proceedings of the AAAI Conference on Artificial Intelligence 33 (01), 1401-1408, 2019	88	2019
Sequence level contrastive learning for text summarization. S Xu, X Zhang, Y Wu, F Wei. Proceedings of the AAAI Conference on Artificial Intelligence 36 (10), 11556 …, 2022	73	2022
Unsupervised extractive summarization by pre-training hierarchical transformers. S Xu, X Zhang, Y Wu, F Wei, M Zhou. arXiv preprint arXiv:2010.08242, 2020	54	2020
Discovering diverse multi-agent strategic behavior via reward randomization. Z Tang, C Yu, B Chen, H Xu, X Wang, F Fang, S Du, Y Wang, Y Wu. arXiv preprint arXiv:2103.04564, 2021	48	2021
Swift: Compiled inference for probabilistic programming languages. Y Wu, L Li, S Russell, R Bodik. arXiv preprint arXiv:1606.09242, 2016	40*	2016
Maximum entropy population-based training for zero-shot human-AI coordination. R Zhao, J Song, Y Yuan, H Hu, Y Gao, Y Wu, Z Sun, W Yang. Proceedings of the AAAI Conference on Artificial Intelligence 37 (5), 6145-6153, 2023	39	2023
Meta-learning MCMC proposals. T Wang, Y Wu, D Moore, SJ Russell. Advances in Neural Information Processing Systems 31, 2018	38	2018
Revisiting some common practices in cooperative multi-agent reinforcement learning. W Fu, C Yu, Z Xu, J Yang, Y Wu. arXiv preprint arXiv:2206.07505, 2022	37	2022
