Zheng Qu
Title
Cited by
Year
A network-centric hardware/algorithm co-design to accelerate distributed training of deep neural networks
Y Li, J Park, M Alian, Y Yuan, Z Qu, P Pan, R Wang, A Schwing, ...
2018 51st Annual IEEE/ACM International Symposium on Microarchitecture …, 2018
Cited by: 71; Year: 2018
DUET: Boosting Deep Neural Network Efficiency on Dual-Module Architecture
L Liu, Z Qu, L Deng, F Tu, S Li, X Hu, Z Gu, Y Ding, Y Xie
2020 53rd Annual IEEE/ACM International Symposium on Microarchitecture …, 2020
Cited by: 16; Year: 2020
H2learn: High-efficiency learning accelerator for high-accuracy spiking neural networks
L Liang, Z Qu, Z Chen, F Tu, Y Wu, L Deng, G Li, P Li, Y Xie
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 2021
Cited by: 9; Year: 2021
Efficient tensor core-based GPU kernels for structured sparsity under reduced precision
Z Chen, Z Qu, L Liu, Y Ding, Y Xie
Proceedings of the International Conference for High Performance Computing …, 2021
Cited by: 3; Year: 2021
INSPIRE: in-storage private information retrieval via protocol and architecture co-design
J Lin, L Liang, Z Qu, I Ahmad, L Liu, F Tu, T Gupta, Y Ding, Y Xie
Proceedings of the 49th Annual International Symposium on Computer …, 2022
Cited by: 1; Year: 2022
Tensor train decomposition for solving large-scale linear equations
H Chen, L Deng, Z Qu, L Liang, T Yan, Y Xie, G Li
Neurocomputing 464, 203-217, 2021
Cited by: 1; Year: 2021
Improving Streaming Graph Processing Performance using Input Knowledge
A Basak, Z Qu, J Lin, AR Alameldeen, Z Chishti, Y Ding, Y Xie
MICRO-54: 54th Annual IEEE/ACM International Symposium on Microarchitecture …, 2021
Cited by: 1; Year: 2021
ENMC: Extreme Near-Memory Classification via Approximate Screening
L Liu, J Lin, Z Qu, Y Ding, Y Xie
MICRO-54: 54th Annual IEEE/ACM International Symposium on Microarchitecture …, 2021
Cited by: 1; Year: 2021
Hardware-Enabled Efficient Data Processing with Tensor-Train Decomposition
Z Qu, L Deng, B Wang, H Chen, J Lin, L Liang, G Li, Z Zhang, Y Xie
IEEE Transactions on Computer-Aided Design of Integrated Circuits and …, 2021
Cited by: 1; Year: 2021
Dynamic N:M Fine-grained Structured Sparse Attention Mechanism
Z Chen, Y Quan, Z Qu, L Liu, Y Ding, Y Xie
arXiv preprint arXiv:2203.00091, 2022
2022
DOTA: detect and omit weak attentions for scalable transformer acceleration
Z Qu, L Liu, F Tu, Z Chen, Y Ding, Y Xie
Proceedings of the 27th ACM International Conference on Architectural …, 2022
2022
Transformer Acceleration with Dynamic Sparse Attention
L Liu, Z Qu, Z Chen, Y Ding, Y Xie
arXiv preprint arXiv:2110.11299, 2021
2021
DFSSATTEN: Dynamic Fine-grained Structured Sparse Attention Mechanism
Z Chen, L Liu, Y Quan, Z Qu, Y Ding, Y Xie
2021
Efficient Processing of Sparse Tensor Decomposition via Unified Abstraction and PE-interactive Architecture
B Wang, L Deng, Z Qu, S Li, Z Zhang, Y Xie
IEEE Transactions on Computers 71 (2), 266-281, 2021
2021