Conservative bandits Y Wu, R Shariff, T Lattimore, C Szepesvári International Conference on Machine Learning, 1254-1262, 2016 | 100 | 2016 |
Differentially private contextual linear bandits R Shariff, O Sheffet Advances in Neural Information Processing Systems 31, 2018 | 79 | 2018 |
Discounted reinforcement learning is not an optimization problem A Naik, R Shariff, N Yasui, H Yao, RS Sutton arXiv preprint arXiv:1910.02140, 2019 | 41 | 2019 |
Efficient planning in large MDPs with weak linear function approximation R Shariff, C Szepesvári Advances in Neural Information Processing Systems 33, 19163-19174, 2020 | 20 | 2020 |
Exploiting symmetries to construct efficient MCMC algorithms with an application to SLAM R Shariff, A György, C Szepesvári Artificial Intelligence and Statistics, 866-874, 2015 | 7 | 2015 |
Lunar Lander: A Continous-Action Case Study for Policy-Gradient Actor-Critic Algorithms R Shariff, T Dick RLDM, 2013 | 1 | 2013 |
Five Properties of Specific Curiosity You Didn't Know Curious Machines Should Have NM Ady, R Shariff, J Günther, PM Pilarski arXiv preprint arXiv:2212.00187, 2022 | | 2022 |
Prototyping three key properties of specific curiosity in computational reinforcement learning NM Ady, R Shariff, J Günther, PM Pilarski arXiv preprint arXiv:2205.10407, 2022 | | 2022 |
A Value Function Basis for Nexting and Multi-step Prediction A Jacobsen, V Liu, R Shariff, A White, M White | | |