Ranked reward: Enabling self-play reinforcement learning for combinatorial optimization A Laterre, Y Fu, MK Jabri, AS Cohen, D Kas, K Hajjar, TS Dahl, A Kerkeni, ... arXiv preprint arXiv:1807.01672, 2018 | 121 | 2018 |
Learning compositional neural programs with recursive tree search and planning T Pierrot, G Ligner, SE Reed, O Sigaud, N Perrin, A Laterre, D Kas, ... Advances in Neural Information Processing Systems 32, 2019 | 46 | 2019 |
Early computational detection of potential high-risk SARS-CoV-2 variants K Beguir, MJ Skwark, Y Fu, T Pierrot, NL Carranza, A Laterre, I Kadri, ... Computers in biology and medicine 155, 106618, 2023 | 35 | 2023 |
Reinforcement learning for branch-and-bound optimisation using retrospective trajectories CWF Parsonson, A Laterre, TD Barrett Proceedings of the AAAI Conference on Artificial Intelligence 37 (4), 4061-4069, 2023 | 31 | 2023 |
Jumanji: a diverse suite of scalable reinforcement learning environments in jax C Bonnet, D Luo, D Byrne, S Surana, S Abramowitz, P Duckworth, ... arXiv preprint arXiv:2306.09884, 2023 | 29* | 2023 |
A foundational large language model for edible plant genomes J Mendoza-Revilla, E Trop, L Gonzalez, M Roller, H Dalla-Torre, ... Communications Biology 7 (1), 835, 2024 | 27 | 2024 |
Combinatorial optimization with policy adaptation using latent space search F Chalumeau, S Surana, C Bonnet, N Grinsztajn, A Pretorius, A Laterre, ... Advances in Neural Information Processing Systems 36, 7947-7959, 2023 | 25 | 2023 |
Mava: A research framework for distributed multi-agent reinforcement learning A Pretorius, K Tessera, AP Smit, C Formanek, SJ Grimbly, K Eloff, ... arXiv e-prints, arXiv: 2107.01460, 2021 | 22* | 2021 |
A game-theoretic analysis of networked system control for common-pool resource management using multi-agent reinforcement learning A Pretorius, S Cameron, E Van Biljon, T Makkink, S Mawjee, J du Plessis, ... Advances in neural information processing systems 33, 9983-9994, 2020 | 15 | 2020 |
Offline reinforcement learning hands-on L Monier, J Kmec, A Laterre, T Pierrot, V Courgeau, O Sigaud, K Beguir arXiv preprint arXiv:2011.14379, 2020 | 13 | 2020 |
Learning to solve combinatorial graph partitioning problems via efficient exploration TD Barrett, CWF Parsonson, A Laterre arXiv preprint arXiv:2205.14105, 2022 | 12 | 2022 |
Jumanji: a diverse suite of scalable reinforcement learning environments in jax, 2024 C Bonnet, D Luo, D Byrne, S Surana, S Abramowitz, P Duckworth, ... URL https://arxiv. org/abs/2306.09884, 0 | 12 | |
Chatnt: A multimodal conversational agent for dna, rna and protein tasks G Richard, BP de Almeida, H Dalla-Torre, C Blum, L Hexemer, P Pandey, ... bioRxiv, 2024.04. 30.591835, 2024 | 11 | 2024 |
SegmentNT: annotating the genome at single-nucleotide resolution with DNA foundation models BP de Almeida, H Dalla-Torre, G Richard, C Blum, L Hexemer, M Gélard, ... bioRxiv, 2024.03. 14.584712, 2024 | 11 | 2024 |
Flashbax: Streamlining experience replay buffers for reinforcement learning with jax, 2023 E Toledo, L Midgley, D Byrne, CR Tilbury, M Macfarlane, C Courtot, ... URL https://github. com/instadeepai/flashbax 7, 0 | 10 | |
One step at a time: Pros and cons of multi-step meta-gradient reinforcement learning C Bonnet, P Caron, T Barrett, I Davies, A Laterre arXiv preprint arXiv:2111.00206, 2021 | 5 | 2021 |
Factored action spaces in deep reinforcement learning T Pierrot, V Macé, JB Sevestre, L Monier, A Laterre, N Perrin, K Beguir, ... | 5 | 2021 |
Learning compositional neural programs for continuous control T Pierrot, N Perrin, F Behbahani, A Laterre, O Sigaud, K Beguir, ... arXiv preprint arXiv:2007.13363, 2020 | 4 | 2020 |
Designing a prospective COVID-19 therapeutic with reinforcement learning MJ Skwark, NL Carranza, T Pierrot, J Phillips, S Said, A Laterre, A Kerkeni, ... arXiv preprint arXiv:2012.01736, 2020 | 3 | 2020 |
Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Function C Bonnet, L Midgley, A Laterre arXiv preprint arXiv:2211.10550, 2022 | 1 | 2022 |