Herke van Hoof
Title | Cited by | Year
Addressing Function Approximation Error in Actor-Critic Methods
S Fujimoto, H van Hoof, D Meger
arXiv preprint arXiv:1802.09477, 2018
Cited by: 175 | Year: 2018
Towards Learning Hierarchical Skills for Multi-Phase Manipulation Tasks
O Kroemer, C Daniel, G Neumann, H van Hoof, J Peters
Proceedings of the International Conference on Robotics and Automation, 2015
Cited by: 65 | Year: 2015
Learning Robot In-Hand Manipulation with Tactile Features
H van Hoof, T Hermans, G Neumann, J Peters
Cited by: 57 | Year: 2015
Probabilistic inference for determining options in reinforcement learning
C Daniel, H van Hoof, J Peters, G Neumann
Machine Learning 104 (2-3), 337-357, 2016
Cited by: 54 | Year: 2016
Probabilistic Segmentation and Targeted Exploration of Objects in Cluttered Environments
H van Hoof, O Kroemer, J Peters
IEEE Transactions on Robotics, 2014
Cited by: 54 | Year: 2014
Stable reinforcement learning with autoencoders for tactile and visual data
H van Hoof, N Chen, M Karl, P van der Smagt, J Peters
2016 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2016
Cited by: 44 | Year: 2016
Stabilizing novel objects by learning to predict tactile slip
F Veiga, H van Hoof, J Peters, T Hermans
2015 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2015
Cited by: 41 | Year: 2015
Learning to Predict Phases of Manipulation Tasks as Hidden States
O Kroemer, H van Hoof, G Neumann, J Peters
IEEE International Conference on Robotics and Automation, 2014
Cited by: 37 | Year: 2014
Maximally Informative Interaction Learning for Scene Exploration
H van Hoof, O Kroemer, HB Amor, J Peters
Intelligent Robots and Systems, 2012
Cited by: 37 | Year: 2012
Learning of Non-Parametric Control Policies with High-Dimensional State Features
H van Hoof, J Peters, G Neumann
Proceedings of the Eighteenth International Conference on Artificial …, 2015
Cited by: 34 | Year: 2015
Attention, Learn to Solve Routing Problems!
W Kool, H van Hoof, M Welling
arXiv preprint arXiv:1803.08475, 2018
Cited by: 31 | Year: 2018
Active tactile object exploration with gaussian processes
Z Yi, R Calandra, F Veiga, H van Hoof, T Hermans, Y Zhang, J Peters
2016 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2016
Cited by: 31 | Year: 2016
Attention solves your TSP, approximately
W Kool, H van Hoof, M Welling
stat 1050, 22, 2018
Cited by: 28 | Year: 2018
Policy Search For Learning Robot Control Using Sparse Data
B Bischoff, D Nguyen-Tuong, H van Hoof, A McHutchon, CE Rasmussen, ...
International Conference on Robotics and Automation, 2014
Cited by: 18 | Year: 2014
BanditSum: Extractive Summarization as a Contextual Bandit
Y Dong, Y Shen, E Crawford, H van Hoof, JCK Cheung
arXiv preprint arXiv:1809.09672, 2018
Cited by: 15 | Year: 2018
Non-parametric policy search with limited information loss
H van Hoof, G Neumann, J Peters
The Journal of Machine Learning Research 18 (1), 2472-2517, 2017
Cited by: 12 | Year: 2017
Probabilistic Interactive Segmentation for Anthropomorphic Robots in Cluttered Environments
H van Hoof, O Kroemer, J Peters
International Conference on Humanoid Robotics, 2013
Cited by: 10 | Year: 2013
An Inference-Based Policy Gradient Method for Learning Options
M Smith, H van Hoof, J Pineau
International Conference on Machine Learning, 4710-4719, 2018
Cited by: 7 | Year: 2018
Stochastic Beams and Where to Find Them: The Gumbel-Top-k Trick for Sampling Sequences Without Replacement
W Kool, H van Hoof, M Welling
arXiv preprint arXiv:1903.06059, 2019
Cited by: 5 | Year: 2019
Generalized exploration in policy search
H van Hoof, D Tanneberg, J Peters
Machine Learning 106 (9-10), 1705-1724, 2017
Cited by: 4 | Year: 2017