A reconfigurable fabric for accelerating large-scale datacenter services A Putnam, AM Caulfield, ES Chung, D Chiou, K Constantinides, J Demme, ...
ACM SIGARCH Computer Architecture News 42 (3), 13-24, 2014
1575 2014 A cloud-scale acceleration architecture AM Caulfield, ES Chung, A Putnam, H Angepat, J Fowers, M Haselman, ...
2016 49th Annual IEEE/ACM international symposium on microarchitecture …, 2016
896 2016 Accelerating deep convolutional neural networks using specialized hardware K Ovtcharov, O Ruwase, JY Kim, J Fowers, K Strauss, ES Chung
Microsoft Research Whitepaper 2, 2015
532 2015 A reconfigurable fabric for accelerating large-scale datacenter services A Putnam, AM Caulfield, ES Chung, D Chiou, K Constantinides, J Demme, ...
IEEE Micro 35 (3), 10-22, 2015
178 2015 A 201.4 GOPS 496 mW real-time multi-object recognition processor with bio-inspired neural perception engine JY Kim, M Kim, S Lee, J Oh, K Kim, HJ Yoo
IEEE Journal of Solid-State Circuits 45 (1), 32-45, 2009
157 2009 A scalable high-bandwidth architecture for lossless compression on fpgas J Fowers, JY Kim, D Burger, S Hauck
2015 IEEE 23rd Annual International Symposium on Field-Programmable Custom …, 2015
130 2015 Convolutional neural networks on hardware accelerators E Chung, K Strauss, K Ovtcharov, JY Kim, O Ruwase
US Patent App. 14/754,367, 2016
114 2016 A 125 GOPS 583 mW network-on-chip based parallel processor with bio-inspired visual attention engine K Kim, S Lee, JY Kim, M Kim, HJ Yoo
IEEE Journal of Solid-State Circuits 44 (1), 136-147, 2008
92 2008 Toward accelerating deep learning at scale using specialized hardware in the datacenter K Ovtcharov, O Ruwase, JY Kim, J Fowers, K Strauss, ES Chung
2015 IEEE Hot Chips 27 Symposium (HCS), 1-38, 2015
91 2015 Bitwise competition logic for compact digital comparator JY Kim, HJ Yoo
2007 IEEE Asian Solid-State Circuits Conference, 59-62, 2007
77 2007 Familiarity based unified visual attention model for fast and robust object recognition S Lee, K Kim, JY Kim, M Kim, HJ Yoo
Pattern Recognition 43 (3), 1116-1128, 2010
62 2010 Dfx: A low-latency multi-fpga appliance for accelerating transformer-based text generation S Hong, S Moon, J Kim, S Lee, M Kim, D Lee, JY Kim
2022 55th IEEE/ACM International Symposium on Microarchitecture (MICRO), 616-630, 2022
57 2022 A reconfigurable fabric for accelerating large-scale datacenter services A Putnam, AM Caulfield, ES Chung, D Chiou, K Constantinides, J Demme, ...
Communications of the ACM 59 (11), 114-122, 2016
56 2016 Configurable clouds AM Caulfield, ES Chung, A Putnam, H Angepat, D Firestone, J Fowers, ...
IEEE Micro 37 (3), 52-61, 2017
53 2017 An overview of processing-in-memory circuits for artificial intelligence and machine learning D Kim, C Yu, S Xie, Y Chen, JY Kim, B Kim, JP Kulkarni, TTH Kim
IEEE Journal on Emerging and Selected Topics in Circuits and Systems 12 (2 …, 2022
49 2022 An 81.6 GOPS object recognition processor based on NoC and visual image processing memory D Kim, K Kim, JY Kim, S Lee, HJ Yoo
2007 IEEE Custom Integrated Circuits Conference, 443-446, 2007
49 2007 Z-PIM: A sparsity-aware processing-in-memory architecture with fully variable weight bit-precision for energy-efficient deep neural networks JH Kim, J Lee, J Lee, J Heo, JY Kim
IEEE Journal of Solid-State Circuits 56 (4), 1093-1104, 2021
46 2021 Solutions for real chip implementation issues of NoC and their application to memory-centric NoC D Kim, K Kim, JY Kim, SJ Lee, HJ Yoo
First International Symposium on Networks-on-Chip (NOCS'07), 30-39, 2007
46 2007 24-GOPS 4.5- Digital Cellular Neural Network for Rapid Visual Attention in an Object-Recognition SoC S Lee, M Kim, K Kim, JY Kim, HJ Yoo
IEEE transactions on neural networks 22 (1), 64-73, 2010
34 2010 A 118.4 gb/s multi-casting network-on-chip with hierarchical star-ring combined topology for real-time object recognition JY Kim, J Park, S Lee, M Kim, J Oh, HJ Yoo
IEEE Journal of Solid-State Circuits 45 (7), 1399-1409, 2010
31 2010