Sobhan Miryoosefi
Title
Cited by
Year
Bellman Eluder dimension: New rich classes of RL problems, and sample-efficient algorithms
C Jin, Q Liu, S Miryoosefi
Advances in Neural Information Processing Systems 34, 13406-13418, 2021
Cited by: 213. Year: 2021
Reinforcement learning with convex constraints
S Miryoosefi, K Brantley, H Daumé III, M Dudík, R Schapire
Advances in Neural Information Processing Systems 32, 14093-14102, 2019
Cited by: 93. Year: 2019
Constrained episodic reinforcement learning in concave-convex and knapsack settings
K Brantley, M Dudík, T Lykouris, S Miryoosefi, M Simchowitz, A Slivkins, ...
Advances in Neural Information Processing Systems 33, 16315-16326, 2020
Cited by: 48. Year: 2020
Provable reinforcement learning with a short-term memory
Y Efroni, C Jin, A Krishnamurthy, S Miryoosefi
International Conference on Machine Learning, 5832-5850, 2022
Cited by: 29. Year: 2022
A simple reward-free approach to constrained reinforcement learning
S Miryoosefi, C Jin
International Conference on Machine Learning, 15666-15698, 2022
Cited by: 27. Year: 2022
ReST meets ReAct: Self-improvement for multi-step reasoning LLM agent
R Aksitov, S Miryoosefi, Z Li, D Li, S Babayan, K Kopparapu, Z Fisher, ...
arXiv preprint arXiv:2312.10003, 2023
Cited by: 5. Year: 2023
Efficient training of language models using few-shot learning
SJ Reddi, S Miryoosefi, S Karp, S Krishnan, S Kale, S Kim, S Kumar
International Conference on Machine Learning, 14553-14568, 2023
Cited by: 2. Year: 2023
Efficient Stagewise Pretraining via Progressive Subnetworks
A Panigrahi, N Saunshi, K Lyu, S Miryoosefi, S Reddi, S Kale, S Kumar
arXiv preprint arXiv:2402.05913, 2024
Year: 2024
Provable Reinforcement Learning with Constraints and Function Approximation
SSM Yoosefi
Princeton University, 2022
Year: 2022