The option-critic architecture
PL Bacon, J Harb, D Precup
Proceedings of the AAAI Conference on Artificial Intelligence 31 (1), 2017
Mandate: Natural Sciences and Engineering Research Council of Canada
When waiting is not an option: Learning options with a deliberation cost
J Harb, PL Bacon, M Klissarov, D Precup
Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018
Mandate: Natural Sciences and Engineering Research Council of Canada
Options of interest: Temporal abstraction with interest functions
K Khetarpal, M Klissarov, M Chevalier-Boisvert, PL Bacon, D Precup
Proceedings of the AAAI Conference on Artificial Intelligence 34 (04), 4444-4451, 2020
Mandate: Natural Sciences and Engineering Research Council of Canada
Understanding the curse of horizon in off-policy evaluation via conditional importance sampling
Y Liu, PL Bacon, E Brunskill
International Conference on Machine Learning, 6184-6193, 2020
Mandate: US National Science Foundation, US Department of Defense
Neural algorithmic reasoners are implicit planners
AI Deac, P Veličković, O Milinkovic, PL Bacon, J Tang, M Nikolic
Advances in Neural Information Processing Systems 34, 15529-15542, 2021
Mandate: Natural Sciences and Engineering Research Council of Canada
Constructing temporal abstractions autonomously in reinforcement learning
PL Bacon, D Precup
AI Magazine 39 (1), 39-50, 2018
Mandate: Natural Sciences and Engineering Research Council of Canada
Policy optimization in a noisy neighborhood: On return landscapes in continuous control
N Rahn, P D'Oro, H Wiltzer, PL Bacon, M Bellemare
Advances in Neural Information Processing Systems 36, 2024
Mandate: Fonds de recherche du Québec - Nature et technologies
