‪Yihan Du‬ - ‪Google Scholar‬

Cited by

	All	Since 2019
Citations	129	129
h-index	6	6
i10-index	5	5

Year	2020	2021	2022	2023	2024
Citations	8	24	25	36	36

Public access

5 articles available, 0 articles not available (based on funding mandates)

Co-authors

Longbo Huang, Professor, IIIS @ Tsinghua University, China, ACM Distinguished Scientist. Verified email at tsinghua.edu.cn
Wei Chen (陈卫), Microsoft Research. Verified email at microsoft.com
Haoyu Zhao, Princeton University. Verified email at princeton.edu
R. Srikant, University of Illinois at Urbana-Champaign. Verified email at illinois.edu
Wen Sun, Assistant Professor, Cornell University. Verified email at cornell.edu

Yihan Du

Postdoc, University of Illinois at Urbana-Champaign

Verified email at illinois.edu

Online Learning, Reinforcement Learning, Representation Learning


Title	Authors	Venue	Cited by	Year
Object-adaptive LSTM network for real-time visual tracking with adversarial data augmentation	Y Du, Y Yan, S Chen, Y Hua	Neurocomputing 384, 67-83, 2020	23	2020
Combinatorial pure exploration with full-bandit or partial linear feedback	Y Du, Y Kuroki, W Chen	Proceedings of the AAAI Conference on Artificial Intelligence (AAAI) 35 (8 …, 2021	22*	2021
Provably efficient risk-sensitive reinforcement learning: iterated CVaR and worst path	Y Du, S Wang, L Huang	International Conference on Learning Representations (ICLR), 2023	20	2023
Collaborative pure exploration in kernel bandit	Y Du, W Chen, Y Kuroki, L Huang	International Conference on Learning Representations (ICLR), 2023	14	2023
Combinatorial pure exploration for dueling bandit	W Chen, Y Du, L Huang, H Zhao (*in alphabetical order)	International Conference on Machine Learning (ICML), 1531-1541, 2020	12	2020
Provably safe reinforcement learning with step-wise violation constraints	N Xiong, Y Du, L Huang	Advances in Neural Information Processing Systems (NeurIPS) 36, 2024	6	2024
Object-adaptive LSTM network for visual tracking	Y Du, Y Yan, S Chen, Y Hua, H Wang	International Conference on Pattern Recognition (ICPR), 1719-1724, 2018	6	2018
A one-size-fits-all solution to conservative bandit problems	Y Du, S Wang, L Huang	Proceedings of the AAAI Conference on Artificial Intelligence (AAAI) 35 (8 …, 2021	5	2021
Exploration-driven policy optimization in RLHF: Theoretical insights on efficient data utilization	Y Du, A Winnicki, G Dalal, S Mannor, R Srikant	arXiv preprint arXiv:2402.10342, 2024	4	2024
Continuous mean-covariance bandits	Y Du, S Wang, Z Fang, L Huang	Advances in Neural Information Processing Systems (NeurIPS) 34, 875-886, 2021	4	2021
Dueling bandits: from two-dueling to multi-dueling	Y Du, S Wang, L Huang	International Conference on Autonomous Agents and Multiagent Systems (AAMAS …, 2020	4	2020
Provably efficient iterated CVaR reinforcement learning with function approximation	Y Chen, Y Du, P Hu, S Wang, D Wu, L Huang	International Conference on Learning Representations (ICLR), 2023	3	2023
Multi-task Representation Learning for Pure Exploration in Linear Bandits	Y Du, L Huang, W Sun	International Conference on Machine Learning (ICML), 2023	3	2023
Combinatorial pure exploration with bottleneck reward function	Y Du, Y Kuroki, W Chen	Advances in Neural Information Processing Systems (NeurIPS) 34, 23956-23967, 2021	3	2021
Cascading Reinforcement Learning	Y Du, R Srikant, W Chen	International Conference on Learning Representations (ICLR, spotlight), 2024		2024
Branching reinforcement learning	Y Du, W Chen	International Conference on Machine Learning (ICML), 2022		2022
