Seguir
Denis Tarasov
Denis Tarasov
Dirección de correo verificada de ethz.ch - Página principal
Título
Citado por
Citado por
Año
CORL: Research-oriented deep offline reinforcement learning library
D Tarasov, A Nikulin, D Akimov, V Kurenkov, S Kolesnikov
Advances in Neural Information Processing Systems 36, 2024
382024
Anti-exploration by random network distillation
A Nikulin, V Kurenkov, D Tarasov, S Kolesnikov
International Conference on Machine Learning, 26228-26244, 2023
132023
Q-ensemble for offline rl: Don't scale the ensemble, scale the batch size
A Nikulin, V Kurenkov, D Tarasov, D Akimov, S Kolesnikov
arXiv preprint arXiv:2211.11092, 2022
82022
Let offline rl flow: Training conservative agents in the latent space of normalizing flows
D Akimov, V Kurenkov, A Nikulin, D Tarasov, S Kolesnikov
arXiv preprint arXiv:2211.11096, 2022
72022
Revisiting the minimalist approach to offline reinforcement learning
D Tarasov, V Kurenkov, A Nikulin, S Kolesnikov
Advances in Neural Information Processing Systems 36, 2024
62024
Katakomba: Tools and benchmarks for data-driven nethack
V Kurenkov, A Nikulin, D Tarasov, S Kolesnikov
Advances in Neural Information Processing Systems 36, 2024
22024
Predicting perceived ethnicity with data on personal names in Russia
A Bessudnov, D Tarasov, V Panasovets, V Kostenko, I Smirnov, ...
Journal of Computational Social Science 6 (2), 589-608, 2023
22023
Prompts and pre-trained language models for offline reinforcement learning
D Tarasov, V Kurenkov, S Kolesnikov
ICLR 2022 Workshop on Generalizable Policy Learning in Physical World, 2022
22022
Distilling LLMs' Decomposition Abilities into Compact Language Models
D Tarasov, K Shridhar
arXiv preprint arXiv:2402.01812, 2024
12024
Revisiting Behavior Regularized Actor-Critic
D Tarasov, V Kurenkov, A Nikulin, S Kolesnikov
Workshop on Reincarnating Reinforcement Learning at ICLR 2023, 2023
2023
Offline RL for generative design of protein binders
D Tarasov, UA Mbou Sob, M Arbesu, N Siboni, S Boyer, M Skwark, A Smit, ...
bioRxiv, 2023.11. 29.569328, 2023
2023
Fixing 1-bit Adam and 1-bit LAMB algorithms
D Tarasov, VA Ershov
Computing 15 (4), 86-97, 2022
2022
Predicting ethnicity with data on personal names in Russia
A Bessudnov, D Tarasov, V Panasovets, V Kostenko, I Smirnov, ...
2021
El sistema no puede realizar la operación en estos momentos. Inténtalo de nuevo más tarde.
Artículos 1–13