FPDeep: Acceleration and load balancing of CNN training on FPGA clusters T Geng, T Wang, A Sanaullah, C Yang, R Xu, R Patel, M Herbordt 2018 IEEE 26th Annual International Symposium on Field-Programmable Custom …, 2018 | 47 | 2018 |
A Framework for Acceleration of CNN Training on Deeply-Pipelined FPGA Clusters with Work and Weight Load Balancing T Geng, T Wang, A Sanaullah, C Yang, R Patel, M Herbordt 2018 28th International Conference on Field Programmable Logic and …, 2018 | 28* | 2018 |
Fully integrated FPGA molecular dynamics simulations C Yang, T Geng, T Wang, R Patel, Q Xiong, A Sanaullah, C Wu, J Sheng, ... Proceedings of the International Conference for High Performance Computing …, 2019 | 17 | 2019 |
Fully integrated FPGA molecular dynamics simulations C Yang, T Geng, T Wang, R Patel, Q Xiong, A Sanaullah, C Wu, J Sheng, ... Proceedings of the International Conference for High Performance Computing …, 2019 | 17 | 2019 |
High performance dynamic communication on reconfigurable clusters J Sheng, C Yang, T Wang, M Herbordt 2018 IEEE 26th Annual International Symposium on Field-Programmable Custom …, 2018 | 17 | 2018 |
LP-BNN: Ultra-low-Latency BNN Inference with Layer Parallelism T Geng, T Wang, C Wu, C Yang, SL Song, A Li, M Herbordt 2019 IEEE 30th International Conference on Application-specific Systems …, 2019 | 16 | 2019 |
O3BNN: an out-of-order architecture for high-performance binarized neural network inference with fine-grained pruning T Geng, T Wang, C Wu, C Yang, W Wu, A Li, MC Herbordt Proceedings of the ACM International Conference on Supercomputing, 461-472, 2019 | 14 | 2019 |
BSTC: a novel binarized-soft-tensor-core design for accelerating bit-based approximated neural nets A Li, T Geng, T Wang, M Herbordt, SL Song, K Barker Proceedings of the International Conference for High Performance Computing …, 2019 | 12 | 2019 |
Molecular Dynamics Range-Limited Force Evaluation Optimized for FPGAs C Yang, T Geng, T Wang, C Lin, J Sheng, V Sachdeva, W Sherman, ... 2019 IEEE 30th International Conference on Application-specific Systems …, 2019 | 12 | 2019 |
UWB-GCN: Hardware Acceleration of Graph-Convolution-Network through Runtime Workload Rebalancing T Geng, A Li, T Wang, C Wu, Y Li, A Tumeo, M Herbordt arXiv preprint arXiv:1908.10834, 2019 | 9* | 2019 |
A Scalable Framework for Acceleration of CNN Training on Deeply-Pipelined FPGA Clusters with Weight and Workload Balancing T Geng, T Wang, A Li, X Jin, M Herbordt arXiv preprint arXiv:1901.01007, 2019 | 9 | 2019 |
An accelerating solution for-body mond simulation with fpga-soc B Peng, T Wang, X Jin, C Wang International Journal of Reconfigurable Computing 2016, 2016 | 9 | 2016 |
FPDeep: Scalable Acceleration of CNN Training on Deeply-Pipelined FPGA Clusters T Wang, T Geng, A Li, X Jin, M Herbordt IEEE Transactions on Computers 69 (8), 1143-1158, 2020 | 8 | 2020 |
A 56-ps multi-phase clock time-to-digital convertor based on Artix-7 FPGA T Xiang, L Zhao, X Jin, T Wang, S Chu, C Ma, S Liu, Q An 2014 19th IEEE-NPSS Real Time Conference, 1-4, 2014 | 8 | 2014 |
Accelerating AP3M-Based Computational Astrophysics Simulations with Reconfigurable Clusters T Wang, T Geng, X Jin, M Herbordt 2019 IEEE 30th International Conference on Application-specific Systems …, 2019 | 7 | 2019 |
FP-AMR: A Reconfigurable Fabric Framework for Adaptive Mesh Refinement Applications T Wang, T Geng, X Jin, M Herbordt 2019 IEEE 27th Annual International Symposium on Field-Programmable Custom …, 2019 | 7 | 2019 |
O3BNN-R: An Out-of-Order Architecture for High-Performance and Regularized BNN Inference T Geng, A Li, T Wang, C Wu, Y Li, R Shi, W Wu, M Herbordt IEEE Transactions on Parallel and Distributed Systems 32 (1), 199-213, 2020 | 5 | 2020 |
FP-AMG: FPGA-Based Acceleration Framework for Algebraic Multigrid Solvers P Haghi, T Geng, A Guo, T Wang, M Herbordt 2020 IEEE 28th Annual International Symposium on Field-Programmable Custom …, 2020 | 4 | 2020 |
Soft-Core. Multiple-Lane, FPGA-based ADCs for a Liquid Helium Environment Z Xiang, T Wang, T Geng, T Xiang, X Jin, M Herbordt 2018 IEEE High Performance extreme Computing Conference (HPEC), 1-6, 2018 | 4 | 2018 |
An access-pattern-aware on-chip vector memory system with automatic loading for SIMD architectures T Geng, E Diken, T Wang, L Jozwiak, M Herbordt 2018 IEEE High Performance extreme Computing Conference (HPEC), 1-7, 2018 | 3 | 2018 |