Deep captioning with multimodal recurrent neural networks (m-rnn) J Mao, W Xu, Y Yang, J Wang, Z Huang, A Yuille arXiv preprint arXiv:1412.6632, 2014 | 969 | 2014 |
Cnn-rnn: A unified framework for multi-label image classification J Wang, Y Yang, J Mao, Z Huang, C Huang, W Xu Proceedings of the IEEE conference on computer vision and pattern …, 2016 | 681 | 2016 |
Are you talking to a machine? dataset and methods for multilingual image question answering H Gao, J Mao, J Zhou, Z Huang, L Wang, W Xu arXiv preprint arXiv:1505.05612, 2015 | 432 | 2015 |
Generation and comprehension of unambiguous object descriptions J Mao, J Huang, A Toshev, O Camburu, AL Yuille, K Murphy Proceedings of the IEEE conference on computer vision and pattern …, 2016 | 423 | 2016 |
Explain images with multimodal recurrent neural networks J Mao, W Xu, Y Yang, J Wang, AL Yuille arXiv preprint arXiv:1410.1090, 2014 | 339 | 2014 |
Deep compositional captioning: Describing novel object categories without paired training data LA Hendricks, S Venugopalan, M Rohrbach, R Mooney, K Saenko, ... Proceedings of the IEEE conference on computer vision and pattern …, 2016 | 218 | 2016 |
Attention correctness in neural image captioning C Liu, J Mao, F Sha, A Yuille Proceedings of the AAAI Conference on Artificial Intelligence 31 (1), 2017 | 161 | 2017 |
Learning like a child: Fast novel visual concept learning from sentence descriptions of images J Mao, X Wei, Y Yang, J Wang, Z Huang, AL Yuille Proceedings of the IEEE international conference on computer vision, 2533-2541, 2015 | 139 | 2015 |
Training and evaluating multimodal word embeddings with large-scale web annotated images J Mao, J Xu, Y Jing, A Yuille arXiv preprint arXiv:1611.08321, 2016 | 55* | 2016 |
Multilingual image question answering H Gao, J Mao, J Zhou, Z Huang, L Wang, W Xu US Patent App. 15/137,179, 2016 | 32 | 2016 |
Learning from weakly supervised data by the expectation loss svm (e-svm) algorithm J Zhu, J Mao, AL Yuille Advances in neural information processing systems 27, 1125-1133, 2014 | 31 | 2014 |
Scale based region growing for scene text detection J Mao, H Li, W Zhou, S Yan, Q Tian Proceedings of the 21st ACM international conference on Multimedia, 1007-1016, 2013 | 30 | 2013 |
An active patch model for real world texture and appearance classification J Mao, J Zhu, AL Yuille European Conference on Computer Vision, 140-155, 2014 | 17 | 2014 |
Intelligent image captioning J Mao, W Xu, Y Yang, J Wang, Z Huang US Patent 10,423,874, 2019 | 14 | 2019 |
STINet: Spatio-temporal-interactive network for pedestrian detection and trajectory prediction Z Zhang, J Gao, J Mao, Y Liu, D Anguelov, C Li Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2020 | 9 | 2020 |
Systems and methods for fast novel visual concept learning from sentence descriptions of images J Mao, W Xu, Y Yang, J Wang, Z Huang US Patent 10,504,010, 2019 | 7 | 2019 |
Neural networks for coarse-and fine-object classifications J Mao, C Li, Y Song US Patent 10,867,210, 2020 | | 2020 |
Object classification using extra-regional context J Mao, Q Yu, C Li US Patent App. 16/230,187, 2020 | | 2020 |
Training a classifier to detect open vehicle doors J Mao, LP Tsui, C Li, ES Walker US Patent App. 16/231,297, 2020 | | 2020 |
Searching an autonomous vehicle sensor data repository Z Guo, N Abdo, J Mao, C Li, ES Walker US Patent App. 16/726,060, 2020 | | 2020 |