Title | Authors | Venue | Cited by | Year |
Towards Cross-Platform Performance Portability of DNN Models using SYCL | M Goli, K Narasimhan, R Reyes, B Tracy, D Soutar, S Georgiev, ... | 2020 IEEE/ACM International Workshop on Performance, Portability and …, 2020 | 16 | 2020 |
Optimizing geometric multigrid method computation using a DSL approach | V Vasista, K Narasimhan, S Bhat, U Bondhugula | Proceedings of the International Conference for High Performance Computing …, 2017 | 7 | 2017 |
A practical tile size selection model for affine loop nests | K Narasimhan, A Acharya, A Baid, U Bondhugula | Proceedings of the ACM International Conference on Supercomputing, 27-39, 2021 | 6 | 2021 |
Towards performance portability of AI models using SYCL-DNN | M Tanvir, K Narasimhan, M Goli, O El Farouki, S Georgiev, I Ault | International Workshop on OpenCL, 1-3, 2022 | 5 | 2022 |
Improving performance of SYCL applications on CPU architectures using LLVM-directed compilation flow | P Ghiglio, U Dolinsky, M Goli, K Narasimhan | Proceedings of the Thirteenth International Workshop on Programming Models …, 2022 | 4 | 2022 |
User-Driven Online Kernel Fusion for SYCL | V Perez, L Sommer, V Lomüller, K Narasimhan, M Goli | ACM Transactions on Architecture and Code Optimization, 2023 | 2 | 2023 |
Towards performance portability of AI graphs using SYCL | K Narasimhan, O El Farouki, M Goli, M Tanvir, S Georgiev, I Ault | 2022 IEEE/ACM International Workshop on Performance, Portability and …, 2022 | 1 | 2022 |
Accelerating Neural Networks Using Open Standard Software on RISC-V | K Narasimhan, M Goli | International Conference on High Performance Computing, 552-564, 2023 | | 2023 |
Vetter, Jeffrey 45 T Ben-Nun, E Chereshnev, T Deakin, W Elwasif, EM Fomenko, T Gamblin, ... | | |