Luowei Zhou
Research Scientist, Google Brain
Verified email at google.com
Title
Cited by
Year
Unified vision-language pre-training for image captioning and VQA
L Zhou, H Palangi, L Zhang, H Hu, J Corso, J Gao
Proceedings of the AAAI Conference on Artificial Intelligence 34 (07), 13041 …, 2020
Cited by: 486; Year: 2020
End-to-end dense video captioning with masked transformer
L Zhou, Y Zhou, JJ Corso, R Socher, C Xiong
Proceedings of the IEEE conference on computer vision and pattern …, 2018
Cited by: 401; Year: 2018
Towards automatic learning of procedures from web instructional videos
L Zhou, C Xu, JJ Corso
Thirty-Second AAAI Conference on Artificial Intelligence, 2018
Cited by: 396; Year: 2018
Less is more: ClipBERT for video-and-language learning via sparse sampling
J Lei, L Li, L Zhou, Z Gan, TL Berg, M Bansal, J Liu
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021
Cited by: 214; Year: 2021
Grounded video description
L Zhou, Y Kalantidis, X Chen, JJ Corso, M Rohrbach
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019
Cited by: 151; Year: 2019
Florence: A new foundation model for computer vision
L Yuan, D Chen, YL Chen, N Codella, X Dai, J Gao, H Hu, X Huang, B Li, ...
arXiv preprint arXiv:2111.11432, 2021
Cited by: 123; Year: 2021
Watch what you just said: Image captioning with text-conditional attention
L Zhou, C Xu, P Koch, JJ Corso
Proceedings of the on Thematic Workshops of ACM Multimedia 2017, 305-313, 2017
Cited by: 79; Year: 2017
Weakly-Supervised Video Object Grounding from Text by Loss Weighting and Object Interaction
L Zhou, N Louis, JJ Corso
British Machine Vision Conference, 2018
Cited by: 65; Year: 2018
BEVT: BERT pretraining of video transformers
R Wang, D Chen, Z Wu, Y Chen, X Dai, M Liu, YG Jiang, L Zhou, L Yuan
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
Cited by: 53; Year: 2022
Dense video captioning
Y Zhou, L Zhou, C Xiong, R Socher
US Patent 10,542,270, 2020
Cited by: 50; Year: 2020
Multiagent reinforcement learning with sparse interactions by negotiation and knowledge transfer
L Zhou, P Yang, C Chen, Y Gao
IEEE transactions on cybernetics 47 (5), 1238-1250, 2016
Cited by: 49; Year: 2016
Image caption generation with text-conditional semantic attention
L Zhou, C Xu, P Koch, JJ Corso
arXiv preprint arXiv:1606.04621 2, 2016
Cited by: 40; Year: 2016
VALUE: A multi-task benchmark for video-and-language understanding evaluation
L Li, J Lei, Z Gan, L Yu, YC Chen, R Pillai, Y Cheng, L Zhou, XE Wang, ...
arXiv preprint arXiv:2106.04632, 2021
Cited by: 39; Year: 2021
UC2: Universal cross-lingual cross-modal vision-and-language pre-training
M Zhou, L Zhou, S Wang, Y Cheng, L Li, Z Yu, J Liu
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021
Cited by: 38; Year: 2021
A balanced heuristic mechanism for multirobot task allocation of intelligent warehouses
L Zhou, Y Shi, J Wang, P Yang
Mathematical Problems in Engineering 2014, 2014
Cited by: 25; Year: 2014
RegionCLIP: Region-based language-image pretraining
Y Zhong, J Yang, P Zhang, C Li, N Codella, LH Li, L Zhou, X Dai, L Yuan, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
Cited by: 24; Year: 2022
Cluster-former: Clustering-based sparse transformer for question answering
S Wang, L Zhou, Z Gan, YC Chen, Y Fang, S Sun, Y Cheng, J Liu
Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021 …, 2021
Cited by: 22*; Year: 2021
CLIP-Event: Connecting text and images with event structures
M Li, R Xu, S Wang, L Zhou, X Lin, C Zhu, M Zeng, H Ji, SF Chang
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
Cited by: 19; Year: 2022
ProcNets: Learning to segment procedures in untrimmed and unconstrained videos
L Zhou, C Xu, JJ Corso
arXiv preprint arXiv:1703.09788 2 (6), 7, 2017
Cited by: 14*; Year: 2017
Dynamic graph modules for modeling object-object interactions in activity recognition
H Huang, L Zhou, W Zhang, JJ Corso, C Xu
arXiv preprint arXiv:1812.05637, 2018
Cited by: 10*; Year: 2018