Follow
Yihan Du
Title
Cited by
Cited by
Year
Object-adaptive LSTM network for real-time visual tracking with adversarial data augmentation
Y Du, Y Yan, S Chen, Y Hua
Neurocomputing 384, 67-83, 2020
222020
Combinatorial pure exploration with full-bandit or partial linear feedback
Y Du, Y Kuroki, W Chen
Proceedings of the AAAI Conference on Artificial Intelligence (AAAI) 35 (8 …, 2021
21*2021
Provably efficient risk-sensitive reinforcement learning: iterated CVaR and worst path
Y Du, S Wang, L Huang
International Conference on Learning Representations (ICLR), 2023
162023
Collaborative pure exploration in kernel bandit
Y Du, W Chen, Y Yuroki, L Huang
International Conference on Learning Representations (ICLR), 2023
142023
Combinatorial pure exploration for dueling bandit
W Chen, Y Du, L Huang, H Zhao (*in alphabetical order)
International Conference on Machine Learning (ICML), 1531-1541, 2020
122020
Object-adaptive LSTM network for visual tracking
Y Du, Y Yan, S Chen, Y Hua, H Wang
International Conference on Pattern Recognition (ICPR), 1719-1724, 2018
62018
A one-size-fits-all solution to conservative bandit problems
Y Du, S Wang, L Huang
Proceedings of the AAAI Conference on Artificial Intelligence (AAAI) 35 (8 …, 2021
52021
Continuous mean-covariance bandits
Y Du, S Wang, Z Fang, L Huang
Advances in Neural Information Processing Systems (NeurIPS) 34, 875-886, 2021
42021
Provably safe reinforcement learning with step-wise violation constraints
N Xiong, Y Du, L Huang
Advances in Neural Information Processing Systems (NeurIPS) 36, 2024
32024
Combinatorial pure exploration with bottleneck reward function
Y Du, Y Kuroki, W Chen
Advances in Neural Information Processing Systems (NeurIPS) 34, 23956-23967, 2021
32021
Dueling bandits: from two-dueling to multi-dueling
Y Du, S Wang, L Huang
International Conference on Autonomous Agents and Multiagent Systems (AAMAS …, 2020
32020
Provably efficient iterated cvar reinforcement learning with function approximation
Y Chen, Y Du, P Hu, S Wang, D Wu, L Huang
International Conference on Learning Representations (ICLR), 2023
22023
Multi-task Representation Learning for Pure Exploration in Linear Bandits
Y Du, L Huang, W Sun
International Conference on Machine Learning (ICML), 2023
22023
Exploration-Driven Policy Optimization in RLHF: Theoretical Insights on Efficient Data Utilization
Y Du, A Winnicki, G Dalal, S Mannor, R Srikant
arXiv preprint arXiv:2402.10342, 2024
2024
Cascading Reinforcement Learning
Y Du, R Srikant, W Chen
International Conference on Learning Representations (ICLR, spotlight), 2024
2024
Branching reinforcement learning
Y Du, W Chen
International Conference on Machine Learning (ICML), 2022
2022
The system can't perform the operation now. Try again later.
Articles 1–16