Language modeling with gated convolutional networks YN Dauphin, A Fan, M Auli, D Grangier Proceedings of the 34th International Conference on Machine Learning-Volume …, 2017 | 1017 | 2017 |
fairseq: A Fast, Extensible Toolkit for Sequence Modeling M Ott, S Edunov, A Baevski, A Fan, S Gross, N Ng, D Grangier, M Auli arXiv preprint arXiv:1904.01038, 2019 | 619 | 2019 |
Hierarchical Neural Story Generation A Fan, M Lewis, Y Dauphin arXiv preprint arXiv:1805.04833, 2018 | 316 | 2018 |
Pay Less Attention with Lightweight and Dynamic Convolutions F Wu, A Fan, A Baevski, YN Dauphin, M Auli arXiv preprint arXiv:1901.10430, 2019 | 206 | 2019 |
Wizard of Wikipedia: Knowledge-Powered Conversational agents E Dinan, S Roller, K Shuster, A Fan, M Auli, J Weston arXiv preprint arXiv:1811.01241, 2018 | 169 | 2018 |
Controllable abstractive summarization A Fan, D Grangier, M Auli arXiv preprint arXiv:1711.05217, 2017 | 123 | 2017 |
Reducing Transformer Depth on Demand with Structured Dropout A Fan, E Grave, A Joulin arXiv preprint arXiv:1909.11556, 2019 | 92 | 2019 |
Integration of responses within and across Arabidopsis natural accessions uncovers loci controlling root systems architecture U Rosas, A Cibrian-Jaramillo, D Ristova, JA Banta, ML Gifford, AH Fan, ... Proceedings of the National Academy of Sciences 110 (37), 15133-15138, 2013 | 71 | 2013 |
Strategies for Structuring Story Generation A Fan, M Lewis, Y Dauphin arXiv preprint arXiv:1902.01109, 2019 | 61 | 2019 |
Learning to Speak and Act in a Fantasy Text Adventure Game J Urbanek, A Fan, S Karamcheti, S Jain, S Humeau, E Dinan, ... arXiv preprint arXiv:1903.03094, 2019 | 41 | 2019 |
ELI5: Long Form Question Answering A Fan, Y Jernite, E Perez, D Grangier, J Weston, M Auli arXiv preprint arXiv:1907.09190, 2019 | 37 | 2019 |
Generative question answering: Learning to answer the whole question M Lewis, A Fan International Conference on Learning Representations, 2018 | 30 | 2018 |
Using Local Knowledge Graph Construction to Scale Seq2Seq Models to Multi-Document Inputs A Fan, C Gardent, C Braud, A Bordes arXiv preprint arXiv:1910.08435, 2019 | 23 | 2019 |
Training with quantization noise for extreme model compression A Fan, P Stock, B Graham, E Grave, R Gribonval, H Jégou, A Joulin arXiv e-prints, arXiv: 2004.07320, 2020 | 18* | 2020 |
Queens are Powerful too: Mitigating Gender Bias in Dialogue Generation E Dinan, A Fan, A Williams, J Urbanek, D Kiela, J Weston arXiv preprint arXiv:1911.03842, 2019 | 14 | 2019 |
KILT: a Benchmark for Knowledge Intensive Language Tasks F Petroni, A Piktus, A Fan, P Lewis, M Yazdani, N De Cao, J Thorne, ... arXiv preprint arXiv:2009.02252, 2020 | 11 | 2020 |
Multi-Dimensional Gender Bias Classification E Dinan, A Fan, L Wu, J Weston, D Kiela, A Williams arXiv preprint arXiv:2005.00614, 2020 | 11 | 2020 |
Assessing topic model relevance: Evaluation and informative priors A Fan, F Doshi‐Velez, L Miratrix Statistical Analysis and Data Mining: The ASA Data Science Journal 12 (3 …, 2019 | 11* | 2019 |
Augmenting Transformers with KNN-Based Composite Memory for Dialogue A Fan, C Gardent, C Braud, A Bordes arXiv preprint arXiv:2004.12744, 2020 | 6 | 2020 |
Generating interactive worlds with text A Fan, J Urbanek, P Ringshia, E Dinan, E Qian, S Karamcheti, ... Proceedings of the AAAI Conference on Artificial Intelligence 34 (02), 1693-1700, 2020 | 6 | 2020 |