Adams Wei Yu

Cited by

	All	Since 2019
Citations	8602	8328
h-index	25	23
i10-index	27	24

3700

1850

925

2775

20162017201820192020202120222023202426 49 165 382 381 486 1134 3634 2284

Public access

View all

4 articles

0 articles

available

not available

Based on funding mandates

Adams Wei Yu

Research Scientist, Google DeepMind

Verified email at cs.cmu.edu - Homepage

Artificial Intelligence


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Finetuned language models are zero-shot learners J Wei, M Bosma, VY Zhao, K Guu, AW Yu, B Lester, N Du, AM Dai, QV Le ICLR 2022, 2022	2041	2022
Scaling instruction-finetuned language models HW Chung, L Hou, S Longpre, B Zoph, Y Tay, W Fedus, Y Li, X Wang, ... Journal of Machine Learning Research 25 (70), 1-53, 2024	1851*	2024
QANet: Combining Local Convolution with Global Self-Attention for Reading Comprehension AW Yu, D Dohan, MT Luong, R Zhao, K Chen, M Norouzi, QV Le ICLR 2018, 2018	1293*	2018
Simvlm: Simple visual language model pretraining with weak supervision Z Wang, J Yu, AW Yu, Z Dai, Y Tsvetkov, Y Cao ICLR 2022, 2022	659	2022
Gemini: a family of highly capable multimodal models G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ... arXiv preprint arXiv:2312.11805, 2023	510	2023
Glam: Efficient scaling of language models with mixture-of-experts N Du, Y Huang, AM Dai, S Tong, D Lepikhin, Y Xu, M Krikun, Y Zhou, ... ICML 2022, 2022	472*	2022
Deepfusion: Lidar-camera deep fusion for multi-modal 3d object detection Y Li, AW Yu, T Meng, B Caine, J Ngiam, D Peng, J Shen, Y Lu, D Zhou, ... CVPR 2022, 2022	246	2022
Orthogonal weight normalization: Solution to optimization over multiple dependent stiefel manifolds in deep neural networks L Huang, X Liu, B Lang, AW Yu, B Li AAAI 2018, 2017	218	2017
Combined scaling for zero-shot transfer learning H Pham, Z Dai, G Ghiasi, H Liu, AW Yu, MT Luong, M Tan, QV Le arXiv preprint arXiv:2111.10050, 2021	188*	2021
Learning to skim text AW Yu, H Lee, QV Le ACL 2017, 2017	158	2017
Neural symbolic reader: Scalable integration of distributed and symbolic representations for reading comprehension X Chen, C Liang, AW Yu, D Zhou, D Song, QV Le ICLR 2020, 2019	110	2019
Large language models cannot self-correct reasoning yet J Huang, X Chen, S Mishra, HS Zheng, AW Yu, X Song, D Zhou arXiv preprint arXiv:2310.01798, 2023	93	2023
Compositional generalization via neural-symbolic stack machines X Chen, C Liang, AW Yu, D Song, D Zhou NeurIPS 2020, 2020	89	2020
Adadelay: Delay adaptive distributed stochastic convex optimization S Sra, AW Yu, M Li, AJ Smola AISTATS 2016, 2016	84*	2016
Towards zero-label language learning Z Wang, AW Yu, O Firat, Y Cao arXiv preprint arXiv:2109.09193, 2021	75	2021
AutoHAS: Efficient hyperparameter and architecture search X Dong, M Tan, AW Yu, D Peng, B Gabrys, QV Le arXiv preprint arXiv:2006.03656, 2020	67*	2020
On computationally tractable selection of experiments in measurement-constrained regression models Y Wang, AW Yu, A Singh The Journal of Machine Learning Research 18 (1), 5238-5278, 2017	66*	2017
An improved gap-dependency analysis of the noisy power method MF Balcan, SS Du, Y Wang, AW Yu COLT 2016, 2016	64	2016
Dscovr: Randomized primal-dual block coordinate algorithms for asynchronous distributed optimization L Xiao, AW Yu, Q Lin, W Chen Journal of Machine Learning Research 20 (43), 1-58, 2019	53	2019
BLOCK-NORMALIZED GRADIENT METHOD: AN EMPIRICAL STUDY FOR TRAINING DEEP NEURAL NETWORK AW Yu, L Huang, Q Lin, R Salakhutdinov, J Carbonell	45*	2018

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by