Obserwuj
Roshan Shariff
Tytuł
Cytowane przez
Cytowane przez
Rok
Conservative bandits
Y Wu, R Shariff, T Lattimore, C Szepesvári
International Conference on Machine Learning, 1254-1262, 2016
1222016
Differentially private contextual linear bandits
R Shariff, O Sheffet
Advances in Neural Information Processing Systems 31, 2018
1152018
Discounted reinforcement learning is not an optimization problem
A Naik, R Shariff, N Yasui, H Yao, RS Sutton
arXiv preprint arXiv:1910.02140, 2019
682019
Efficient planning in large MDPs with weak linear function approximation
R Shariff, C Szepesvári
Advances in Neural Information Processing Systems 33, 19163-19174, 2020
272020
Exploiting symmetries to construct efficient MCMC algorithms with an application to SLAM
R Shariff, A György, C Szepesvári
Artificial Intelligence and Statistics, 866-874, 2015
82015
Five Properties of Specific Curiosity You Didn't Know Curious Machines Should Have
NM Ady, R Shariff, J Günther, PM Pilarski
arXiv preprint arXiv:2212.00187, 2022
32022
Prototyping three key properties of specific curiosity in computational reinforcement learning
NM Ady, R Shariff, J Günther, PM Pilarski
arXiv preprint arXiv:2205.10407, 2022
12022
Lunar Lander: A Continous-Action Case Study for Policy-Gradient Actor-Critic Algorithms
R Shariff, T Dick
RLDM, 2013
12013
A Value Function Basis for Nexting and Multi-step Prediction
A Jacobsen, V Liu, R Shariff, A White, M White
Nie można teraz wykonać tej operacji. Spróbuj ponownie później.
Prace 1–9