Yuhuai(Tony) Wu

Cited by

	All	Since 2019
Citations	15297	14591
h-index	34	33
i10-index	45	45

Citations per year

Year	Citations
2016	40
2017	148
2018	475
2019	816
2020	1585
2021	1988
2022	2746
2023	5136
2024	2275

Public access

17 articles available

0 articles not available

Based on funding mandates

Co-authors

Roger Grosse	Associate Professor, University of Toronto	Verified email at cs.toronto.edu
Jimmy Ba	University of Toronto	Verified email at cs.toronto.edu
Christian Szegedy	Researcher	Verified email at szegedy.org
Yoshua Bengio	Professor of computer science, University of Montreal, Mila, IVADO, CIFAR	Verified email at umontreal.ca
Ruslan Salakhutdinov	UPMC Professor, Machine Learning Department, CMU	Verified email at cs.cmu.edu
Behnam Neyshabur	Senior Staff Research Scientist, DeepMind	Verified email at google.com
David Duvenaud	Associate Professor, University of Toronto	Verified email at cs.toronto.edu
Pieter Abbeel	UC Berkeley | Covariant	Verified email at cs.berkeley.edu
Albert Q. Jiang	University of Cambridge | Mistral AI	Verified email at mistral.ai
Percy Liang	Associate Professor of Computer Science, Stanford University	Verified email at cs.stanford.edu
Saizheng Zhang
Oriol Vinyals	Research Scientist at Google DeepMind	Verified email at google.com

Yuhuai(Tony) Wu

Co-Founder of xAI

Verified email at x.ai - Homepage

Machine Learning, Machine Reasoning, Theorem Proving


Title	Authors	Venue	Cited by	Year
Grandmaster level in StarCraft II using multi-agent reinforcement learning	O Vinyals, I Babuschkin, WM Czarnecki, M Mathieu, A Dudzik, J Chung, ...	Nature 575 (7782), 350-354, 2019	4627*	2019
On the opportunities and risks of foundation models	R Bommasani, DA Hudson, E Adeli, R Altman, S Arora, S von Arx, ...	arXiv preprint arXiv:2108.07258, 2021	2763	2021
OpenAI baselines	P Dhariwal, C Hesse, O Klimov, A Nichol, M Plappert, A Radford, ...		1835*	2017
Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation	Y Wu, E Mansimov, RB Grosse, S Liao, J Ba	Advances in Neural Information Processing Systems, 5283-5292, 2017	789	2017
PaLM 2 technical report	R Anil, AM Dai, O Firat, M Johnson, D Lepikhin, A Passos, S Shakeri, ...	arXiv preprint arXiv:2305.10403, 2023	783	2023
Holistic evaluation of language models	P Liang, R Bommasani, T Lee, D Tsipras, D Soylu, M Yasunaga, Y Zhang, ...	arXiv preprint arXiv:2211.09110, 2022	608	2022
Solving quantitative reasoning problems with language models	A Lewkowycz, A Andreassen, D Dohan, E Dyer, H Michalewski, ...	Advances in Neural Information Processing Systems 35, 3843-3857, 2022	424	2022
Backpropagation through the void: Optimizing control variates for black-box gradient estimation	W Grathwohl, D Choi, Y Wu, G Roeder, D Duvenaud	ICLR 2018, 2017	312	2017
STaR: Bootstrapping reasoning with reasoning	E Zelikman, Y Wu, ND Goodman	arXiv preprint arXiv:2203.14465, 2022	275*	2022
On the quantitative analysis of decoder-based generative models	Y Wu, Y Burda, R Salakhutdinov, R Grosse	5th International Conference on Learning Representations (ICLR 2017), 2016	266	2016
Sticking the landing: Simple, lower-variance gradient estimators for variational inference	G Roeder, Y Wu, DK Duvenaud	Advances in Neural Information Processing Systems 30, 2017	257*	2017
Architectural complexity measures of recurrent neural networks	S Zhang, Y Wu, T Che, Z Lin, R Memisevic, RR Salakhutdinov, Y Bengio	Advances in Neural Information Processing Systems 29, 2016	190	2016
STDP-compatible approximation of backpropagation in an energy-based model	Y Bengio, T Mesnard, A Fischer, S Zhang, Y Wu	Neural Computation 29 (3), 555-577, 2017	182*	2017
On multiplicative integration with recurrent neural networks	Y Wu, S Zhang, Y Zhang, Y Bengio, RR Salakhutdinov	Advances in Neural Information Processing Systems 29, 2016	179	2016
Memorizing Transformers	Y Wu, MN Rabe, DL Hutchins, C Szegedy	International Conference on Learning Representations 2022, 2022	164	2022
The Importance of Sampling in Meta-Reinforcement Learning	B Stadie, G Yang, R Houthooft, P Chen, Y Duan, Y Wu, P Abbeel, ...	Advances in Neural Information Processing Systems, 9299-9309, 2018	164*	2018
Understanding Short-Horizon Bias in Stochastic Meta-Optimization	Y Wu, M Ren, R Liao, RB Grosse	Sixth International Conference on Learning Representations (ICLR 2018), 2018	132	2018
Invariant Causal Representation Learning for Out-of-Distribution Generalization	C Lu, Y Wu, JM Hernández-Lobato, B Schölkopf	International Conference on Learning Representations, 2022	121*	2022
Exploring length generalization in large language models	C Anil, Y Wu, A Andreassen, A Lewkowycz, V Misra, V Ramasesh, ...	Advances in Neural Information Processing Systems 35, 38546-38556, 2022	104	2022
Autoformalization with large language models	Y Wu, AQ Jiang, W Li, M Rabe, C Staats, M Jamnik, C Szegedy	Advances in Neural Information Processing Systems 35, 32353-32368, 2022	95	2022

Articles 1–20