A multi-objective auto-tuning framework for parallel codes H Jordan, P Thoman, JJ Durillo, S Pellegrini, P Gschwandtner, ... SC'12: Proceedings of the International Conference on High Performance …, 2012 | 101 | 2012 |
A taxonomy of task-based parallel programming technologies for high-performance computing P Thoman, K Dichev, T Heller, R Iakymchuk, X Aguilar, K Hasanov, ... The Journal of Supercomputing 74 (4), 1422-1434, 2018 | 74 | 2018 |
INSPIRE: The Insieme parallel intermediate representation H Jordan, S Pellegrini, P Thoman, K Kofler, T Fahringer Proceedings of the 22nd international conference on Parallel architectures …, 2013 | 48 | 2013 |
Automatic OpenCL device characterization: Guiding optimized kernel design P Thoman, K Kofler, H Studt, J Thomson, T Fahringer European Conference on Parallel Processing, 438-452, 2011 | 47 | 2011 |
Automatic OpenMP loop scheduling: a combined compiler and runtime approach P Thoman, H Jordan, S Pellegrini, T Fahringer International Workshop on OpenMP, 88-101, 2012 | 33 | 2012 |
GPU-based multigrid: Real-time performance in high resolution nonlinear image processing H Grossauer, P Thoman International Conference on Computer Vision Systems, 141-150, 2008 | 32 | 2008 |
Application-level energy awareness for openmp F Alessi, P Thoman, G Georgakoudis, T Fahringer, DS Nikolopoulos International Workshop on OpenMP, 219-232, 2015 | 31 | 2015 |
Adaptive granularity control in task parallel programs using multiversioning P Thoman, H Jordan, T Fahringer European Conference on Parallel Processing, 164-177, 2013 | 25 | 2013 |
On the quality of implementation of the c++ 11 thread support library P Thoman, P Gschwandtner, T Fahringer 2015 23rd euromicro international conference on parallel, distributed, and …, 2015 | 12 | 2015 |
A context-aware primitive for nested recursive parallelism H Jordan, P Thoman, P Zangerl, T Heller, T Fahringer European Conference on Parallel Processing, 149-161, 2016 | 11 | 2016 |
Compiler multiversioning for automatic task granularity control P Thoman, H Jordan, T Fahringer Concurrency and Computation: Practice and Experience 26 (14), 2367-2385, 2014 | 10 | 2014 |
Insieme-rs: A compiler-supported parallel runtime system P Thoman na, 2013 | 9 | 2013 |
The allscale runtime application model H Jordan, T Heller, P Gschwandtner, P Zangerl, P Thoman, D Fey, ... 2018 IEEE International Conference on Cluster Computing (CLUSTER), 445-455, 2018 | 8 | 2018 |
Scalo: Scalability-aware parallelism orchestration for multi-threaded workloads G Georgakoudis, H Vandierendonck, P Thoman, BRD Supinski, ... ACM Transactions on Architecture and Code Optimization (TACO) 14 (4), 1-25, 2017 | 6 | 2017 |
Multigrid Methods on GPUs P Thoman VDM, Saarbrücken, 62, 2008 | 6 | 2008 |
Celerity: High-level c++ for accelerator clusters P Thoman, P Salzmann, B Cosenza, T Fahringer European Conference on Parallel Processing, 291-303, 2019 | 5 | 2019 |
Optimizing task parallelism with library-semantics-aware compilation P Thoman, S Moosbrugger, T Fahringer European Conference on Parallel Processing, 237-249, 2015 | 5 | 2015 |
The AllScale Runtime Interface—Theoretical Foundation and Concept A Hendricks, T Heller, H Jordan, P Thoman, T Fahringer, D Fey 2016 9th Workshop on Many-Task Computing on Clouds, Grids, and …, 2016 | 4 | 2016 |
A high-level ir transformation system H Jordan, P Thoman, T Fahringer European Conference on Parallel Processing, 647-656, 2013 | 4 | 2013 |
Exploring the semantic gap in compiling embedded DSLs P Zangerl, H Jordan, P Thoman, P Gschwandtner, T Fahringer Proceedings of the 18th International Conference on Embedded Computer …, 2018 | 3 | 2018 |