Dries Smit
Verified email at sun.ac.za
Title
Cited by
Year
Jumanji: a diverse suite of scalable reinforcement learning environments in JAX
C Bonnet, D Luo, D Byrne, S Surana, V Coyette, P Duckworth, LI Midgley, ...
URL https://arxiv.org/abs/2306.09884, 2023
Cited by 22*, 2023
Mava: A research framework for distributed multi-agent reinforcement learning
A Pretorius, K Tessera, AP Smit, C Formanek, SJ Grimbly, K Eloff, ...
arXiv e-prints, arXiv:2107.01460, 2021
Cited by 19*, 2021
Scaling multi-agent reinforcement learning to full 11 versus 11 simulated robotic football
A Smit, HA Engelbrecht, W Brink, A Pretorius
Autonomous Agents and Multi-Agent Systems 37 (1), 20, 2023
Cited by 4, 2023
Learning to communicate through imagination with model-based deep multi-agent reinforcement learning
A Pretorius, S Cameron, AP Smit, E van Biljon, L Francis, F Azeez, ...
Cited by 3, 2020
Are we going MAD? Benchmarking Multi-Agent Debate between Language Models for Medical Q&A
A Smit, P Duckworth, N Grinsztajn, K Tessera, TD Barrett, A Pretorius
arXiv preprint arXiv:2311.17371, 2023
Cited by 2, 2023
Merging deep neural networks and probabilistic models using Sum product networks
AP Smit
Stellenbosch: Stellenbosch University, 2020
Cited by 1, 2020
Generative Model for Small Molecules with Latent Space RL Fine-Tuning to Protein Targets
UAM Sob, Q Li, M Arbesú, O Bent, AP Smit, A Pretorius
arXiv preprint arXiv:2407.13780, 2024
2024
Generative Model for Small Molecules with Latent Space RL Fine-Tuning to Protein Targets
UA Mbou Sob, Q Li, M Arbesú, O Bent, AP Smit, A Pretorius
arXiv e-prints, arXiv:2407.13780, 2024
2024
Should we be going MAD? A Look at Multi-Agent Debate Strategies for LLMs
AP Smit, N Grinsztajn, P Duckworth, TD Barrett, A Pretorius
Forty-first International Conference on Machine Learning, 2024
2024
Offline RL for generative design of protein binders
D Tarasov, UA Mbou Sob, M Arbesu, N Siboni, S Boyer, M Skwark, A Smit, ...
bioRxiv, 2023.11.29.569328, 2023
2023
Scaling multi-agent reinforcement learning to eleven aside simulated robot soccer
A Smit
Stellenbosch: Stellenbosch University, 2022
2022