Improving main memory hash joins on intel xeon phi processors: An experimental approach S Jha, B He, M Lu, X Cheng, HP Huynh Proceedings of the VLDB Endowment 8 (6), 642-653, 2015 | 98 | 2015 |
Efficient GPU spatial-temporal multitasking Y Liang, HP Huynh, K Rupnow, RSM Goh, D Chen IEEE Transactions on Parallel and Distributed Systems 26 (3), 748-760, 2014 | 91 | 2014 |
Optimizing the mapreduce framework on intel xeon phi coprocessor M Lu, L Zhang, HP Huynh, Z Ong, Y Liang, B He, RSM Goh, R Huynh 2013 IEEE International Conference on Big Data, 125-130, 2013 | 82 | 2013 |
Improving GPGPU energy-efficiency through concurrent kernel execution and DVFS Q Jiao, M Lu, HP Huynh, T Mitra 2015 IEEE/ACM International Symposium on Code Generation and Optimization …, 2015 | 69 | 2015 |
Scalable framework for mapping streaming applications onto multi-GPU systems HP Huynh, A Hagiescu, WF Wong, RSM Goh Proceedings of the 17th ACM SIGPLAN symposium on Principles and Practice of …, 2012 | 66 | 2012 |
Optimizing and auto-tuning scale-free sparse matrix-vector multiplication on Intel Xeon Phi WT Tang, R Zhao, M Lu, Y Liang, HP Huynh, X Li, RSM Goh Code Generation and Optimization (CGO), 2015 IEEE/ACM International …, 2015 | 60 | 2015 |
Mrphi: An optimized mapreduce framework on intel xeon phi coprocessors M Lu, Y Liang, HP Huynh, Z Ong, B He, RSM Goh IEEE Transactions on Parallel and Distributed Systems 26 (11), 3066-3078, 2014 | 42 | 2014 |
Automated architecture-aware mapping of streaming applications onto GPUs A Hagiescu, HP Huynh, WF Wong, RSM Goh 2011 IEEE International Parallel & Distributed Processing Symposium, 467-478, 2011 | 41 | 2011 |
An efficient framework for dynamic reconfiguration of instruction-set customization HP Huynh, JE Sim, T Mitra Design Automation for Embedded Systems 13 (1), 91-113, 2009 | 39 | 2009 |
Hierarchical parallel algorithm for modularity-based community detection using GPUs CY Cheong, HP Huynh, D Lo, RSM Goh European conference on parallel processing, 775-787, 2013 | 38 | 2013 |
Runtime Adaptive Extensible Embedded Processors—A Survey HP Huynh, T Mitra International Workshop on Embedded Computer Systems, 215-225, 2009 | 25 | 2009 |
Exploiting sparsity to accelerate fully connected layers of cnn-based applications on mobile socs X Xie, D Du, Q Li, Y Liang, WT Tang, ZL Ong, M Lu, HP Huynh, RSM Goh ACM Transactions on Embedded Computing Systems (TECS) 17 (2), 1-25, 2017 | 20 | 2017 |
Efficient custom instructions generation for system-level design HP Huynh, Y Liang, T Mitra 2010 International Conference on Field-Programmable Technology, 445-448, 2010 | 16 | 2010 |
Evaluating design trade-offs in customizable processors UD Bordoloi, HP Huynh, S Chakraborty, T Mitra 2009 46th ACM/IEEE Design Automation Conference, 244-249, 2009 | 16 | 2009 |
Mapping streaming applications onto GPU systems HP Huynh, A Hagiescu, OZ Liang, WF Wong, RSM Goh IEEE Transactions on Parallel and Distributed Systems 25 (9), 2374-2385, 2013 | 15 | 2013 |
Runtime reconfiguration of custom instructions for real-time embedded systems HP Huynh, T Mitra 2009 Design, Automation & Test in Europe Conference & Exhibition, 1536-1541, 2009 | 14 | 2009 |
Efficient query processing on many-core architectures: A case study with intel xeon phi processor X Cheng, B He, M Lu, CT Lau, HP Huynh, RSM Goh Proceedings of the 2016 International Conference on Management of Data, 2081 …, 2016 | 13 | 2016 |
Instruction-set customization for real-time embedded systems HP Huynh, T Mitra 2007 Design, Automation & Test in Europe Conference & Exhibition, 1-6, 2007 | 11 | 2007 |
Design space exploration of instruction set customizable MPSoCs for multimedia applications UD Bordoloi, HP Huynh, T Mitra, S Chakraborty 2010 International Conference on Embedded Computer Systems: Architectures …, 2010 | 9 | 2010 |
Scale-free sparse matrix-vector multiplication on many-core architectures Y Liang, WT Tang, R Zhao, M Lu, HP Huynh, RSM Goh IEEE Transactions on Computer-Aided Design of Integrated Circuits and …, 2017 | 8 | 2017 |