Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups G Hinton, L Deng, D Yu, GE Dahl, A Mohamed, N Jaitly, A Senior, ... IEEE Signal processing magazine 29 (6), 82-97, 2012 | 9151 | 2012 |
Large Scale Distributed Deep Networks AYN Jeffrey Dean, Greg S. Corrado, Rajat Monga, Kai NIPS, 2012 | 3117* | 2012 |
Wavenet: A generative model for raw audio A Oord, S Dieleman, H Zen, K Simonyan, O Vinyals, A Graves, ... arXiv preprint arXiv:1609.03499, 2016 | 2625 | 2016 |
Long short-term memory recurrent neural network architectures for large scale acoustic modeling H Sak, AW Senior, F Beaufays | 1924 | 2014 |
Guide to biometrics RM Bolle, JH Connell, S Pankanti, NK Ratha, AW Senior Springer Science & Business Media, 2013 | 1040 | 2013 |
Convolutional, long short-term memory, fully connected deep neural networks TN Sainath, O Vinyals, A Senior, H Sak 2015 IEEE international conference on acoustics, speech and signal …, 2015 | 973 | 2015 |
Recent advances in the automatic recognition of audiovisual speech G Potamianos, C Neti, G Gravier, A Garg, AW Senior Proceedings of the IEEE 91 (9), 1306-1326, 2003 | 816 | 2003 |
Statistical parametric speech synthesis using deep neural networks H Zen, A Senior, M Schuster Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International …, 2013 | 813 | 2013 |
Improving the speed of neural networks on CPUs V Vanhoucke, A Senior, MZ Mao Advances in Neural Information Processing Systems, 2011 | 698 | 2011 |
Long short-term memory based recurrent neural network architectures for large vocabulary speech recognition H Sak, A Senior, F Beaufays arXiv preprint arXiv:1402.1128, 2014 | 609 | 2014 |
On rectified linear units for speech processing MD Zeiler, M Ranzato, R Monga, M Mao, K Yang, QV Le, P Nguyen, ... 2013 IEEE International Conference on Acoustics, Speech and Signal …, 2013 | 513 | 2013 |
Appearance models for occlusion handling A Senior, A Hampapur, YL Tian, L Brown, S Pankanti, R Bolle Image and Vision Computing 24 (11), 1233-1243, 2006 | 474 | 2006 |
Learning the speech front-end with raw waveform CLDNNs TN Sainath, RJ Weiss, A Senior, KW Wilson, O Vinyals Sixteenth Annual Conference of the International Speech Communication …, 2015 | 434 | 2015 |
Lip reading sentences in the wild J Son Chung, A Senior, O Vinyals, A Zisserman Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2017 | 395* | 2017 |
Fast and accurate recurrent neural network acoustic models for speech recognition H Sak, A Senior, K Rao, F Beaufays arXiv preprint arXiv:1507.06947, 2015 | 378 | 2015 |
An off-line cursive handwriting recognition system AW Senior, AJ Robinson IEEE transactions on pattern analysis and machine intelligence 20 (3), 309-321, 1998 | 355 | 1998 |
Enabling video privacy through computer vision A Senior, S Pankanti, A Hampapur, L Brown, YL Tian, A Ekin, J Connell, ... IEEE Security & Privacy 3 (3), 50-57, 2005 | 333 | 2005 |
Improved protein structure prediction using potentials from deep learning AW Senior, R Evans, J Jumper, J Kirkpatrick, L Sifre, T Green, C Qin, ... Nature 577 (7792), 706-710, 2020 | 328 | 2020 |
Application of pretrained deep neural networks to large vocabulary speech recognition N Jaitly, P Nguyen, A Senior, V Vanhoucke | 315 | 2012 |
System and method for automatically setting image acquisition controls RM Bolle, JH Connell, A Hampapur, AW Senior US Patent 6,301,440, 2001 | 298 | 2001 |