Scan primitives for GPU computing S Sengupta, M Harris, Y Zhang, JD Owens Proceedings of the 22nd ACM SIGGRAPH/EUROGRAPHICS symposium on Graphics …, 2007 | 846 | 2007 |
Parallel computing experiences with CUDA M Garland, S Le Grand, J Nickolls, J Anderson, J Hardwick, S Morton, ... IEEE micro 28 (4), 13-27, 2008 | 773 | 2008 |
A quantitative performance analysis model for GPU architectures Y Zhang, JD Owens 2011 IEEE 17th international symposium on high performance computer …, 2011 | 393 | 2011 |
Fast tridiagonal solvers on the GPU Y Zhang, J Cohen, JD Owens ACM SIGPLAN Notices 45 (5), 127-136, 2010 | 335 | 2010 |
Parallel lossless data compression on the GPU RA Patel, Y Zhang, J Mak, A Davidson, JD Owens 2012 Innovative Parallel Computing (InPar), 1-9, 2012 | 136 | 2012 |
An auto-tuned method for solving large tridiagonal systems on the GPU A Davidson, Y Zhang, JD Owens 2011 IEEE International Parallel & Distributed Processing Symposium, 956-965, 2011 | 123 | 2011 |
Rapid aerodynamic performance prediction on a cluster of graphics processing units EH Phillips, Y Zhang, RL Davis, JD Owens AIAA Proceedings.[np]. 05-08 Jan, 2009 | 97 | 2009 |
Dynamic detection of uniform and affine vectors in GPGPU computations S Collange, D Defour, Y Zhang Euro-Par 2009–Parallel Processing Workshops, 46-55, 2010 | 89 | 2010 |
CUDPP: CUDA data parallel primitives library M Harris, J Owens, S Sengupta, Y Zhang, A Davidson 2015-04-05]. http://code. google. com/p/cudpp, 2007 | 67 | 2007 |
Improving Performance Portability in OpenCL Programs Y Zhang, MII Sinclair, AA Chien International Supercomputing Conference (ISC '13), 2013 | 63 | 2013 |
GPGPU parallel algorithms for structured-grid CFD codes C Stone, E Duque, Y Zhang, D Car, R Davis, J Owens 20th AIAA Computational Fluid Dynamics Conference, 3221, 2011 | 31 | 2011 |
A hybrid method for solving tridiagonal systems on the GPU Y Zhang, J Cohen, AA Davidson, JD Owens Gpu Computing Gems Jade Edition, 117, 2011 | 28 | 2011 |
A parallel error diffusion implementation on a GPU Y Zhang, JL Recker, R Ulichney, GB Beretta, I Tastl, IJ Lin, JD Owens Proceedings of SPIE 7872, 78720K, 2011 | 18 | 2011 |
Plane-dependent error diffusion on a GPU Y Zhang, JL Recker, R Ulichney, I Tastl, JD Owens Image Processing: Algorithms and Systems X; and Parallel Processing for …, 2012 | 11 | 2012 |
An efficient high quality color transformation I Tastl, JL Recker, Y Zhang, G Beretta Proc. 17th Col. Img. Conf, 111-116, 2009 | 9 | 2009 |
Acceleration of 2-D compressible flow solvers with graphics processing unit clusters EH Phillips, Y Zhang, RL Davis, JD Owens Journal of Aerospace Computing, Information, and Communication 8 (8), 237-249, 2011 | 4 | 2011 |