Jaewoong Sim

Cited by

	All	Since 2019
Citations	2972	2056
h-index	20	18
i10-index	23	23

Citations per year
Year	Citations
2012	14
2013	44
2014	69
2015	100
2016	121
2017	189
2018	332
2019	374
2020	391
2021	471
2022	412
2023	335
2024	73

Public access (based on funding mandates)

4 articles available
2 articles not available

Co-authors

Hyesoon Kim, Georgia Tech (verified email at cc.gatech.edu)
Gabriel H. Loh, AMD Research and Advanced Development (RAD) (verified email at amd.com)
Asit Mishra, Nvidia (verified email at nvidia.com)
Srivatsan Krishnan, Harvard University (verified email at seas.harvard.edu)
Mike O'Connor, NVIDIA Research (verified email at nvidia.com)
Lifeng Nai, Google (verified email at google.com)
Chris Wilkerson, Intel (verified email at intel.com)
Alaa R. Alameldeen, Simon Fraser University (verified email at cs.sfu.ca)
Philip H.W. Leong, Professor of Computer Systems, The University of Sydney (verified email at sydney.edu.au)
Zeshan Chishti, Staff Research Scientist, Intel Corporation (verified email at intel.com)
Mithuna Thottethodi, Purdue University (verified email at purdue.edu)
Vilas Sridharan, AMD, Inc. (verified email at amd.com)
Richard Vuduc, Georgia Institute of Technology (verified email at cc.gatech.edu)
Moinuddin Qureshi, Professor, Georgia Institute of Technology (verified email at gatech.edu)
Jaekyu Lee, Arm Research (verified email at arm.com)

Jaewoong Sim

Seoul National University

Verified email at snu.ac.kr

Computer Architecture, Machine Learning


Title	Cited by	Year
Can FPGAs beat GPUs in accelerating next-generation deep neural networks? E Nurvitadhi, G Venkatesh, J Sim, D Marr, R Huang, J Ong Gee Hock, ... Proceedings of the 2017 ACM/SIGDA International Symposium on Field …, 2017	561	2017
Accelerating binarized neural networks: Comparison of FPGA, CPU, GPU, and ASIC. E Nurvitadhi, D Sheffield, J Sim, A Mishra, G Venkatesh, D Marr. 2016 International Conference on Field-Programmable Technology (FPT), 77-84, 2016	386	2016
GraphPIM: Enabling instruction-level PIM offloading in graph computing frameworks. L Nai, R Hadidi, J Sim, H Kim, P Kumar, H Kim. 2017 IEEE International Symposium on High Performance Computer Architecture …, 2017	326	2017
A performance analysis framework for identifying potential benefits in GPGPU applications. J Sim, A Dasgupta, H Kim, R Vuduc. Proceedings of the 17th ACM SIGPLAN Annual Symposium on Principles and …, 2012	266	2012
Accelerating recurrent neural networks in analytics servers: Comparison of FPGA, CPU, GPU, and ASIC. E Nurvitadhi, J Sim, D Sheffield, A Mishra, S Krishnan, D Marr. 2016 26th International Conference on Field Programmable Logic and …, 2016	233	2016
Transparent hardware management of stacked DRAM as part of memory. J Sim, AR Alameldeen, Z Chishti, C Wilkerson, H Kim. 2014 47th Annual IEEE/ACM International Symposium on Microarchitecture, 13-24, 2014	149	2014
A mostly-clean DRAM cache for effective hit speculation and self-balancing dispatch. J Sim, GH Loh, H Kim, M O'Connor, M Thottethodi. 2012 45th Annual IEEE/ACM International Symposium on Microarchitecture, 247-257, 2012	126	2012
Dynamically configuring regions of a main memory in a write-back mode or a write-through mode. J Sim, MS Thottethodi, GH Loh. US Patent 9,552,294, 2017	109	2017
A customizable matrix multiplication framework for the Intel HARPv2 Xeon+FPGA platform: A deep learning case study. DJM Moss, S Krishnan, E Nurvitadhi, P Ratuszniak, C Johnson, J Sim, ... Proceedings of the 2018 ACM/SIGDA International Symposium on Field …, 2018	100	2018
High performance binary neural networks on the Xeon+FPGA™ platform. DJM Moss, E Nurvitadhi, J Sim, A Mishra, D Marr, S Subhaschandra, ... 2017 27th International Conference on Field Programmable Logic and …, 2017	93	2017
MacSim: A CPU-GPU heterogeneous simulation framework user guide. H Kim, J Lee, NB Lakshminarayana, J Sim, J Lim, T Pho. Georgia Institute of Technology, 1-57, 2012	85	2012
BSSync: Processing near memory for machine learning workloads with bounded staleness consistency models. JH Lee, J Sim, H Kim. 2015 International Conference on Parallel Architecture and Compilation (PACT …, 2015	80	2015
Why compete when you can work together: FPGA-ASIC integration for persistent RNNs. E Nurvitadhi, D Kwon, A Jafari, A Boutros, J Sim, P Tomson, H Sumbul, ... 2019 IEEE 27th Annual International Symposium on Field-Programmable Custom …, 2019	67	2019
Batch-aware unified memory management in GPUs for irregular workloads. H Kim, J Sim, P Gera, R Hadidi, H Kim. Proceedings of the Twenty-Fifth International Conference on Architectural …, 2020	65	2020
Resilient die-stacked DRAM caches. J Sim, GH Loh, V Sridharan, M O'Connor. ACM SIGARCH Computer Architecture News 41 (3), 416-427, 2013	65	2013
FLEXclusion: Balancing cache capacity and on-chip bandwidth via flexible exclusion. J Sim, J Lee, MK Qureshi, H Kim. ACM SIGARCH Computer Architecture News 40 (3), 321-332, 2012	54	2012
Partitioning caches for sub-entities in computing devices. GH Loh, J Sim. US Patent 9,098,417, 2015	35	2015
Method and apparatus for implementing a heterogeneous memory subsystem. CB Wilkerson, AR Alameldeen, ZA Chishti, J Sim. US Patent 9,472,248, 2016	26	2016
CoolPIM: Thermal-aware source throttling for efficient PIM instruction offloading. L Nai, R Hadidi, H Xiao, H Kim, J Sim, H Kim. 2018 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2018	23	2018
Specializing FGPU for persistent deep learning. R Ma, JC Hsu, T Tan, E Nurvitadhi, D Sheffield, R Pelt, M Langhammer, ... ACM Transactions on Reconfigurable Technology and Systems (TRETS) 14 (2), 1-23, 2021	20	2021
