Markus Wulfmeier
Markus Wulfmeier
DeepMind
Dirección de correo verificada de google.com - Página principal
Título
Citado por
Citado por
Año
Reverse curriculum generation for reinforcement learning
C Florensa, D Held, M Wulfmeier, M Zhang, P Abbeel
arXiv preprint arXiv:1707.05300, 2017
1612017
Maximum entropy deep inverse reinforcement learning
M Wulfmeier, P Ondruska, I Posner
arXiv preprint arXiv:1507.04888, 2015
1482015
Watch this: Scalable cost-function learning for path planning in urban environments
M Wulfmeier, DZ Wang, I Posner
2016 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2016
602016
Deep inverse reinforcement learning
M Wulfmeier, P Ondruska, I Posner
CoRR, abs/1507.04888, 2015
452015
Large-scale cost function learning for path planning using deep inverse reinforcement learning
M Wulfmeier, D Rao, DZ Wang, P Ondruska, I Posner
The International Journal of Robotics Research 36 (10), 1073-1087, 2017
442017
Addressing appearance change in outdoor robotics with adversarial domain adaptation
M Wulfmeier, A Bewley, I Posner
2017 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2017
402017
Design and implementation of a particle image velocimetry method for analysis of running gear–soil interaction
C Senatore, M Wulfmeier, I Vlahinić, J Andrade, K Iagnemma
Journal of Terramechanics 50 (5-6), 311-326, 2013
352013
Incremental Adversarial Domain Adaptation for Continually Changing Environments
M Wulfmeier, A Bewley, I Posner
arXiv preprint arXiv:1712.07436, 2017
322017
Mutual alignment transfer learning
M Wulfmeier, I Posner, P Abbeel
arXiv preprint arXiv:1707.07907, 2017
302017
Investigation of stress and failure in granular soils for lightweight robotic vehicle applications
C Senatore, M Wulfmeier, J MacLennan, P Jayakumar, K Iagnemma
ARMY TANK AUTOMOTIVE RESEARCH DEVELOPMENT AND ENGINEERING CENTER WARREN MI, 2012
232012
Taco: Learning task decomposition via temporal alignment for control
K Shiarlis, M Wulfmeier, S Salter, S Whiteson, I Posner
arXiv preprint arXiv:1803.01840, 2018
222018
Incorporating human domain knowledge into large scale cost function learning
M Wulfmeier, D Rao, I Posner
arXiv preprint arXiv:1612.04318, 2016
132016
Compositional Transfer in Hierarchical Reinforcement Learning
M Wulfmeier, A Abdolmaleki, R Hafner, JT Springenberg, M Neunert, ...
12*2019
Voronoi-based heuristic for nonholonomic search-based path planning
Q Wang, M Wulfmeier, B Wagner
Intelligent Autonomous Systems 13, 445-458, 2016
10*2016
Neural Stethoscopes: Unifying analytic, auxiliary and adversarial network probing
FB Fuchs, O Groth, AR Kosiorek, A Bewley, M Wulfmeier, A Vedaldi, ...
arXiv, 2018
52018
Development of a particle image velocimetry method for analysis of mars rover wheel-terrain interaction phenomena
M Wulfmeier
BS Thesis, Gottfried Wilhelm Leibniz Universitaet Hannover, 2012
52012
Efficient supervision for robot learning via imitation, simulation, and adaptation
M Wulfmeier
KI-Künstliche Intelligenz 33 (4), 401-405, 2019
32019
Attention-Privileged Reinforcement Learning
S Salter, D Rao, M Wulfmeier, R Hadsell, I Posner
arXiv, arXiv: 1911.08363, 2019
2*2019
On Machine Learning and Structure for Mobile Robots
M Wulfmeier
arXiv preprint arXiv:1806.06003, 2018
22018
Neural Information Processing Systems (NIPS) Workshop on Acting and Interacting in the Real World: Challenges in Robot Learning
I Posner, R Hadsell, M Riedmiller, M Wulfmeier, R Paul
22016
El sistema no puede realizar la operación en estos momentos. Inténtalo de nuevo más tarde.
Artículos 1–20