SOLO: search online, learn offline for combinatorial optimization problems J Oren, C Ross, M Lefarov, F Richter, A Taitler, Z Feldman, D Di Castro, ... Proceedings of the international symposium on combinatorial search 12 (1 …, 2021 | 22 | 2021 |
On-policy model errors in reinforcement learning LP Fröhlich, M Lefarov, MN Zeilinger, F Berkenkamp arXiv preprint arXiv:2110.07985, 2021 | 7 | 2021 |
Method and Device for Optimum Parameterization of a Driving Dynamics Control System for Vehicles A Doerr, F Berkenkamp, M Lefarov, V Loeffelmann US Patent App. 17/809,587, 2023 | | 2023 |
Device and method to improve learning of a policy for robots F Berkenkamp, L Froehlich, M Lefarov, A Doerr US Patent App. 17/652,983, 2022 | | 2022 |
Method and device for an industrial system FM Richter, M Lefarov US Patent App. 17/365,851, 2022 | | 2022 |
Device and method for scheduling a set of jobs for a plurality of machines A Taitler, C Daniel, D Di Castro, FM Richter, J Oren, M Lefarov, NM Dizbin, ... US Patent App. 17/179,702, 2021 | | 2021 |
Model-based policy search for learning mulitvariate PID gain scheduling control M Lefarov | | 2018 |