Follow
Yunquan Zhang
Title
Cited by
Cited by
Year
AUGEM: automatically generate high performance dense linear algebra kernels on x86 CPUs
Q Wang, X Zhang, Y Zhang, Q Yi
Proceedings of the international conference on high performance computing …, 2013
2742013
Model-driven level 3 BLAS performance optimization on Loongson 3A processor
Z Xianyi, W Qian, Z Yunquan
2012 IEEE 18th international conference on parallel and distributed systems …, 2012
2472012
yaSpMV: Yet another SpMV framework on GPUs
S Yan, C Li, Y Zhang, H Zhou
Acm Sigplan Notices 49 (8), 107-118, 2014
1792014
StreamScan: fast scan algorithms for GPUs without global barrier synchronization
S Yan, G Long, Y Zhang
Proceedings of the 18th ACM SIGPLAN symposium on Principles and practice of …, 2013
1212013
Parallel processing systems for big data: a survey
Y Zhang, T Cao, S Li, X Tian, L Yuan, H Jia, AV Vasilakos
Proceedings of the IEEE 104 (11), 2114-2136, 2016
1122016
Models of parallel computation: a survey and classification
Y Zhang, G Chen, G Sun, Q Miao
Frontiers of Computer Science in China 1, 156-165, 2007
492007
MPFFT: An auto-tuning FFT library for OpenCL GPUs
Y Li, YQ Zhang, YQ Liu, GP Long, HP Jia
Journal of Computer Science and Technology 28 (1), 90-105, 2013
432013
GPURoofline: a model for guiding performance optimizations on GPUs
H Jia, Y Zhang, G Long, J Xu, S Yan, Y Li
Euro-Par 2012 Parallel Processing: 18th International Conference, Euro-Par …, 2012
432012
Study on parallel computing
GL Chen, GZ Sun, YQ Zhang, ZY Mo
Journal of Computer Science and Technology 21, 665-673, 2006
402006
Optimizing SpMV for diagonal sparse matrices on GPU
X Sun, Y Zhang, T Wang, X Zhang, L Yuan, L Rao
2011 International conference on parallel processing, 492-501, 2011
392011
A parallel shortest path algorithm based on graph-partitioning and iterative correcting
Y Tang, Y Zhang, H Chen
2008 10th IEEE International Conference on High Performance Computing and …, 2008
392008
Accelerating viola-jones facce detection algorithm on gpus
H Jia, Y Zhang, W Wang, J Xu
2012 IEEE 14th International Conference on High Performance Computing and …, 2012
372012
Performance evaluation of allgather algorithms on terascale linux cluster with fast ethernet
J Chen, L Zhang, Y Zhang, W Yuan
Eighth International Conference on High-Performance Computing in Asia …, 2005
352005
Cache-oblivious MPI all-to-all communications based on Morton order
S Li, Y Zhang, T Hoefler
IEEE Transactions on Parallel and Distributed Systems 29 (3), 542-555, 2017
292017
Performance evaluation of multithreaded sparse matrix-vector multiplication using openmp
S Liu, Y Zhang, X Sun, RR Qiu
2009 11th IEEE International Conference on High Performance Computing and …, 2009
292009
Parallelization and performance optimization on face detection algorithm with OpenCL: A case study
W Wang, Y Zhang, S Yan, Y Zhang, H Jia
Tsinghua Science and Technology 17 (3), 287-295, 2012
242012
DRAM (h): a parallel computation model for high performance numerical computing
YQ Zhang
CHINESE JOURNAL OF COMPUTERS-CHINESE EDITION- 26 (12), 1660-1670, 2003
242003
pVOCL: Power-aware dynamic placement and migration in virtualized GPU environments
P Lama, Y Li, AM Aji, P Balaji, J Dinan, S Xiao, Y Zhang, W Feng, ...
2013 IEEE 33rd International Conference on Distributed Computing Systems …, 2013
222013
LogGPH: A parallel computational model with hierarchical communication awareness
L Yuan, Y Zhang, Y Tang, L Rao, X Sun
2010 13th IEEE International Conference on Computational Science and …, 2010
212010
Earth system model: CAS-ESM
Z Guangqing, Z Yunquan, J Jinrong, Z He, W Baodong, C Hang, W Tianyi, ...
Frontiers of Data and Domputing 2 (1), 38-54, 2020
202020
The system can't perform the operation now. Try again later.
Articles 1–20