Niki Parmar
Niki Parmar
Senior Research Scientist, Google Brain
Verified email at
Cited by
Cited by
Attention is all you need
A Vaswani, N Shazeer, N Parmar, J Uszkoreit, L Jones, AN Gomez, ...
arXiv preprint arXiv:1706.03762, 2017
Advances in neural information processing systems
A Vaswani, N Shazeer, N Parmar, J Uszkoreit, L Jones, AN Gomez, ...
Neural Information Processing Systems Foundation, 5998-6008, 2017
Tensor2tensor for neural machine translation
A Vaswani, S Bengio, E Brevdo, F Chollet, AN Gomez, S Gouws, L Jones, ...
arXiv preprint arXiv:1803.07416, 2018
Image transformer
N Parmar, A Vaswani, J Uszkoreit, Ł Kaiser, N Shazeer, A Ku, D Tran
arXiv preprint arXiv:1802.05751, 2018
The best of both worlds: Combining recent advances in neural machine translation
MX Chen, O Firat, A Bapna, M Johnson, W Macherey, G Foster, L Jones, ...
arXiv preprint arXiv:1804.09849, 2018
One model to learn them all
L Kaiser, AN Gomez, N Shazeer, A Vaswani, N Parmar, L Jones, ...
arXiv preprint arXiv:1706.05137, 2017
Stand-alone self-attention in vision models
P Ramachandran, N Parmar, A Vaswani, I Bello, A Levskaya, J Shlens
arXiv preprint arXiv:1906.05909, 2019
Mesh-tensorflow: Deep learning for supercomputers
N Shazeer, Y Cheng, N Parmar, D Tran, A Vaswani, P Koanantakool, ...
arXiv preprint arXiv:1811.02084, 2018
Purity homophily in social networks.
M Dehghani, K Johnson, J Hoover, E Sagi, J Garten, NJ Parmar, S Vaisey, ...
Journal of Experimental Psychology: General 145 (3), 366, 2016
Conformer: Convolution-augmented transformer for speech recognition
A Gulati, J Qin, CC Chiu, N Parmar, Y Zhang, J Yu, W Han, S Wang, ...
arXiv preprint arXiv:2005.08100, 2020
Fast decoding in sequence models using discrete latent variables
L Kaiser, S Bengio, A Roy, A Vaswani, N Parmar, J Uszkoreit, N Shazeer
International Conference on Machine Learning, 2390-2399, 2018
Corpora generation for grammatical error correction
J Lichtarge, C Alberti, S Kumar, N Shazeer, N Parmar, S Tong
arXiv preprint arXiv:1904.05780, 2019
TACIT: An open-source text analysis, crawling, and interpretation tool
M Dehghani, KM Johnson, J Garten, R Boghrati, J Hoover, ...
Behavior research methods 49 (2), 538-547, 2017
Towards a better understanding of vector quantized autoencoders
A Roy, A Vaswani, N Parmar, A Neelakantan
Bottleneck transformers for visual recognition
A Srinivas, TY Lin, N Parmar, J Shlens, P Abbeel, A Vaswani
arXiv preprint arXiv:2101.11605, 2021
Weakly supervised grammatical error correction using iterative decoding
J Lichtarge, C Alberti, S Kumar, N Shazeer, N Parmar
arXiv preprint arXiv:1811.01710, 2018
High resolution medical image analysis with spatial partitioning
L Hou, Y Cheng, N Shazeer, N Parmar, Y Li, P Korfiatis, TM Drucker, ...
arXiv preprint arXiv:1909.03108, 2019
Attention-based image generation neural networks
NM Shazeer, LM Kaiser, JD Uszkoreit, N Parmar, AT Vaswani
US Patent 10,839,259, 2020
Machine translation using neural network models
Z Chen, MR Hughes, Y Wu, M Schuster, X Chen, LO Jones, NJ Parmar, ...
US Patent App. 16/521,780, 2020
Natural language processing
D Parsing
Proceedings of the ACL Workshop on Statistical NLP and Weighted Automata …, 2016
The system can't perform the operation now. Try again later.
Articles 1–20