Solving large problem sizes of index-digit algorithms on GPU: FFT and tridiagonal system solvers | AP Diéguez, M Amor, J Lobeiras, R Doallo | IEEE Transactions on Computers 67 (1), 86-101, 2017 | 23 | 2017 |
New tridiagonal systems solvers on GPU architectures | AP Dieguez, M Amor, R Doallo | 2015 IEEE 22nd International Conference on High Performance Computing (HiPC …, 2015 | 13 | 2015 |
Efficient scan operator methods on a GPU | AP Diéguez, M Amor, R Doallo | 2014 IEEE 26th International Symposium on Computer Architecture and High …, 2014 | 11 | 2014 |
Tree partitioning reduction: A new parallel partition method for solving tridiagonal systems | AP Diéguez, M Amor, R Doallo | ACM Transactions on Mathematical Software (TOMS) 45 (3), 1-26, 2019 | 6 | 2019 |
Parallel prefix operations on GPU: tridiagonal system solvers and scan operators | AP Diéguez, M Amor, R Doallo | The Journal of Supercomputing 75, 1510-1523, 2019 | 5 | 2019 |
Solving multiple tridiagonal systems on a multi-GPU platform | AP Dieguez, MA Lopez, RD Biempica | 2018 26th Euromicro International Conference on Parallel, Distributed and …, 2018 | 5 | 2018 |
BPLG–BMCS: GPU-sorting algorithm using a tuning skeleton library | AP Diéguez, M Amor, R Doallo | The Journal of Supercomputing 73 (1), 4-16, 2017 | 4 | 2017 |
Efficient high-precision integer multiplication on the GPU | AP Dieguez, M Amor, R Doallo, A Nukada, S Matsuoka | The International Journal of High Performance Computing Applications 36 (3 …, 2022 | 3 | 2022 |
ML-based performance portability for time-dependent density functional theory in HPC environments | AP Dieguez, M Choi, X Zhu, BM Wong, KZ Ibrahim | 2022 IEEE/ACM International Workshop on Performance Modeling, Benchmarking …, 2022 | 2 | 2022 |
TRAVOLTA: GPU acceleration and algorithmic improvements for constructing quantum optimal control fields in photo-excited systems | JM Rodríguez-Borbón, X Wang, AP Diéguez, KZ Ibrahim, BM Wong | Computer Physics Communications 296, 109017, 2024 | 1 | 2024 |
Parallel prefix operations on heterogeneous platforms | AP Diéguez | Universidade da Coruña, 2019 | 1 | 2019 |
Efficient Solving of Scan Primitive on Multi-GPU Systems | AP Diéguez, M Amor, R Doallo, A Nukada, S Matsuoka | 2018 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2018 | 1 | 2018 |
Cost-Effective Methodology for Complex Tuning Searches in HPC: Navigating Interdependencies and Dimensionality | AP Dieguez, M Choi, M Okyay, M Del Ben, BM Wong, KZ Ibrahim | arXiv preprint arXiv:2403.08131, 2024 | | 2024 |
Performance Tuning for GPU-Embedded Systems: Machine-Learning-Based and Analytical Model-Driven Tuning Methodologies | AP Diéguez, MA López | 2023 IEEE 35th International Symposium on Computer Architecture and High …, 2023 | | 2023 |
Parallel prefix operations on heterogeneous platforms | A Pérez Diéguez | | | 2018 |
Techniques for Autotuning Algorithms on Heterogenous Platforms | AP Diéguez, M Amor, R Doallo | | | 2016 |
CUDA Techniques Optimization For Scan Operator | R Doallo, M Amor, AP Dieguez | | | 2014 |