Follow
Pierre-Luc Bacon
Pierre-Luc Bacon
University of Montreal
Verified email at mila.quebec - Homepage
Title
Cited by
Cited by
Year
The option-critic architecture
PL Bacon, J Harb, D Precup
Proceedings of the AAAI conference on artificial intelligence 31 (1), 2017
11672017
Conditional computation in neural networks for faster models
E Bengio, PL Bacon, J Pineau, D Precup
arXiv preprint arXiv:1511.06297, 2015
3212015
When waiting is not an option: Learning options with a deliberation cost
J Harb, PL Bacon, M Klissarov, D Precup
Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018
1502018
The primacy bias in deep reinforcement learning
E Nikishin, M Schwarzer, P D’Oro, PL Bacon, A Courville
International conference on machine learning, 16828-16847, 2022
972022
Learnings options end-to-end for continuous action tasks
M Klissarov, PL Bacon, J Harb, D Precup
arXiv preprint arXiv:1712.00004, 2017
552017
Convergent tree backup and retrace with function approximation
A Touati, PL Bacon, D Precup, P Vincent
International Conference on Machine Learning, 4955-4964, 2018
462018
Learning robust options
D Mankowitz, T Mann, PL Bacon, D Precup, S Mannor
Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018
442018
Options of interest: Temporal abstraction with interest functions
K Khetarpal, M Klissarov, M Chevalier-Boisvert, PL Bacon, D Precup
Proceedings of the AAAI Conference on Artificial Intelligence 34 (04), 4444-4451, 2020
422020
Sample-efficient reinforcement learning by breaking the replay ratio barrier
P D'Oro, M Schwarzer, E Nikishin, PL Bacon, MG Bellemare, A Courville
Deep Reinforcement Learning Workshop NeurIPS 2022, 2022
402022
Understanding the curse of horizon in off-policy evaluation via conditional importance sampling
Y Liu, PL Bacon, E Brunskill
International Conference on Machine Learning, 6184-6193, 2020
392020
Policy evaluation networks
J Harb, T Schaul, D Precup, PL Bacon
arXiv preprint arXiv:2002.11833, 2020
382020
Control-oriented model-based reinforcement learning with implicit differentiation
E Nikishin, R Abachi, R Agarwal, PL Bacon
Proceedings of the AAAI Conference on Artificial Intelligence 36 (7), 7886-7894, 2022
282022
Temporal Representation Learning
PL Bacon
McGill University (Canada), 2018
272018
Learning with options that terminate off-policy
A Harutyunyan, P Vrancx, PL Bacon, D Precup, A Nowe
Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018
232018
Direct behavior specification via constrained reinforcement learning
J Roy, R Girgis, J Romoff, PL Bacon, C Pal
arXiv preprint arXiv:2112.12228, 2021
202021
An information-theoretic perspective on credit assignment in reinforcement learning
D Arumugam, P Henderson, PL Bacon
arXiv preprint arXiv:2103.06224, 2021
192021
Xlvin: executed latent value iteration nets
A Deac, P Veličković, O Milinković, PL Bacon, J Tang, M Nikolić
arXiv preprint arXiv:2010.13146, 2020
182020
Continuous-time meta-learning with forward mode differentiation
T Deleu, D Kanaa, L Feng, G Kerg, Y Bengio, G Lajoie, PL Bacon
arXiv preprint arXiv:2203.01443, 2022
152022
Neural algorithmic reasoners are implicit planners
AI Deac, P Veličković, O Milinkovic, PL Bacon, J Tang, M Nikolic
Advances in Neural Information Processing Systems 34, 15529-15542, 2021
152021
The barbados 2018 list of open issues in continual learning
T Schaul, H van Hasselt, J Modayil, M White, A White, PL Bacon, J Harb, ...
arXiv preprint arXiv:1811.07004, 2018
132018
The system can't perform the operation now. Try again later.
Articles 1–20