Zhu Zhang
Zhu Zhang
Verified email at zju.edu.cn
Title
Cited by
Cited by
Year
Open-Ended Long-form Video Question Answering via Adaptive Hierarchical Reinforced Networks.
Z Zhao, Z Zhang, S Xiao, Z Yu, J Yu, D Cai, F Wu, Y Zhuang
IJCAI, 3683-3689, 2018
202018
Cross-modal interaction networks for query-based moment retrieval in videos
Z Zhang, Z Lin, Z Zhao, Z Xiao
Proceedings of the 42nd International ACM SIGIR Conference on Research and …, 2019
172019
Weakly-Supervised Video Moment Retrieval via Semantic Completion Network
Z Lin, Z Zhao, Z Zhang, Q Wang, H Liu
arXiv preprint arXiv:1911.08199, 2019
82019
Multi-turn video question answering via hierarchical attention context reinforced networks
Z Zhao, Z Zhang, X Jiang, D Cai
IEEE Transactions on Image Processing 28 (8), 3860-3872, 2019
62019
Localizing unseen activities in video via image query
Z Zhang, Z Zhao, Z Lin, J Song, D Cai
arXiv preprint arXiv:1906.12165, 2019
52019
Moment Retrieval via Cross-Modal Interaction Networks With Query Reconstruction
Z Lin, Z Zhao, Z Zhang, Z Zhang, D Cai
IEEE Transactions on Image Processing 29, 3750-3762, 2020
42020
Where Does It Exist: Spatio-Temporal Video Grounding for Multi-Form Sentences
Z Zhang, Z Zhao, Y Zhao, Q Wang, H Liu, L Gao
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2020
42020
Long-form video question answering via dynamic hierarchical reinforced networks
Z Zhao, Z Zhang, S Xiao, Z Xiao, X Yan, J Yu, D Cai, F Wu
IEEE Transactions on Image Processing 28 (12), 5939-5952, 2019
42019
Object-Aware Multi-Branch Relation Networks for Spatio-Temporal Video Grounding
Z Zhang, Z Zhao, Z Lin, B Huai, NJ Yuan
arXiv preprint arXiv:2008.06941, 2020
12020
Text-Guided Image Inpainting
Z Zhang, Z Zhao, Z Zhang, B Huai, J Yuan
Proceedings of the 28th ACM International Conference on Multimedia, 4079-4087, 2020
2020
Regularized Two-Branch Proposal Networks for Weakly-Supervised Moment Retrieval in Videos
Z Zhang, Z Lin, Z Zhao, J Zhu, X He
Proceedings of the 28th ACM International Conference on Multimedia, 4098-4106, 2020
2020
Temporal Textual Localization in Video via Adversarial Bi-Directional Interaction Networks
Z Zhang, Z Zhao, Z Zhang, Z Lin, Q Wang, R Hong
IEEE Transactions on Multimedia, 2020
2020
Open-ended long-form video question answering via hierarchical convolutional self-attention networks
Z Zhang, Z Zhao, Z Lin, J Song, X He
arXiv preprint arXiv:1906.12158, 2019
2019
The system can't perform the operation now. Try again later.
Articles 1–13