Shalabh Bhatnagar

Cytowane przez

	Wszystkie	Od 2019
Cytowania	7281	3877
h-indeks	35	25
i10-indeks	90	51

1100

550

275

825

200320042005200620072008200920102011201220132014201520162017201820192020202120222023202427 30 59 71 64 62 87 133 231 232 255 281 280 294 309 423 520 524 731 753 1085 261

Dostęp publiczny

Wyświetl wszystko

33 artykuły

10 artykułów

dostępne

niedostępne

Objęte finansowaniem

Obserwuj

Shalabh Bhatnagar

Professor in the Department of Computer Science and Automation, Indian Institute of Science

Zweryfikowany adres z iisc.ac.in - Strona główna

Stochastic systems control simulation optimization


Tytuł Sortuj wg cytatów Sortuj wg roku Sortuj wg tytułu	Cytowane przez Cytowane przez	Rok
Natural Actor Critic Algorithms S Bhatnagar, R Sutton, M Ghavamzadeh, Mohammed and Lee Automatica 45 (11), 2471-2482, 2009	910	2009
Fast gradient-descent methods for temporal-difference learning with linear function approximation RS Sutton, HR Maei, D Precup, S Bhatnagar, D Silver, C Szepesvári, ... Proceedings of the 26th annual international conference on machine learning …, 2009	699	2009
Stochastic Recursive Algorithms for Optimization: Simultaneous Perturbation Methods HLPLAP S.Bhatnagar Stochastic Recursive Algorithms for Optimization: Simultaneous Perturbation …, 2013	445*	2013
Reinforcement learning with function approximation for traffic signal control LA Prashanth, S Bhatnagar IEEE Transactions on Intelligent Transportation Systems 12 (2), 412-421, 2010	373	2010
Toward off-policy learning control with function approximation. HR Maei, C Szepesvári, S Bhatnagar, RS Sutton ICML 10, 719-726, 2010	332	2010
Convergent temporal-difference learning with arbitrary smooth function approximation H Maei, C Szepesvari, S Bhatnagar, D Precup, D Silver, RS Sutton Advances in neural information processing systems 22, 2009	329	2009
An online actor–critic algorithm with function approximation for constrained markov decision processes S Bhatnagar, K Lakshmanan Journal of Optimization Theory and Applications 153, 688-708, 2012	319	2012
An actor–critic algorithm with function approximation for discounted cost constrained Markov decision processes S Bhatnagar Systems & Control Letters 59 (12), 760-766, 2010	263	2010
Incremental natural actor-critic algorithms S Bhatnagar, M Ghavamzadeh, M Lee, RS Sutton Advances in neural information processing systems 20, 2007	242	2007
Memory-based deep reinforcement learning for obstacle avoidance in UAV with limited environment knowledge A Singla, S Padakandla, S Bhatnagar IEEE transactions on intelligent transportation systems 22 (1), 107-118, 2019	204	2019
Reinforcement learning algorithm for non-stationary environments S Padakandla, P KJ, S Bhatnagar Applied Intelligence 50 (11), 3590-3606, 2020	130	2020
Two-timescale simultaneous perturbation stochastic approximation using deterministic perturbation sequences S Bhatnagar, MC Fu, SI Marcus, IJ Wang ACM Transactions on Modeling and Computer Simulation (TOMACS) 13 (2), 180-209, 2003	115	2003
Multi-agent reinforcement learning for traffic signal control KJ Prabuchandran, HK AN, S Bhatnagar 17th International IEEE Conference on Intelligent Transportation Systems …, 2014	111	2014
A time aggregation approach to Markov decision processes XR Cao, Z Ren, S Bhatnagar, M Fu, S Marcus Automatica 38 (6), 929-943, 2002	89	2002
Reinforcement learning with average cost for adaptive control of traffic lights at intersections LA Prashanth, S Bhatnagar 2011 14th International IEEE Conference on Intelligent Transportation …, 2011	85	2011
Adaptive multivariate three-timescale stochastic approximation algorithms for simulation based optimization S Bhatnagar ACM Transactions on Modeling and Computer Simulation (TOMACS) 15 (1), 74-107, 2005	78	2005
Two-timescale algorithms for learning Nash equilibria in general-sum stochastic games HL Prasad, P LA, S Bhatnagar Proceedings of the 2015 International Conference on Autonomous Agents and …, 2015	68	2015
Two time-scale stochastic approximation with controlled Markov noise and off-policy temporal-difference learning P Karmakar, S Bhatnagar Mathematics of Operations Research 43 (1), 130-151, 2018	67	2018
Adaptive Newton-based multivariate smoothed functional algorithms for simulation optimization S Bhatnagar ACM Transactions on Modeling and Computer Simulation (TOMACS) 18 (1), 1-35, 2007	67	2007
Two-timescale algorithms for simulation optimization of hidden Markov models S Bhatnagar, MC Fu, SI Marcus, S Bhatnagar Iie Transactions 33 (3), 245-258, 2001	59	2001

Nie można teraz wykonać tej operacji. Spróbuj ponownie później.

Prace 1–20

Cytowania rocznie

Powielone cytowania

Scalone cytowania

Dodaj współautorówWspółautorzy

Obserwuj

Cytowane przez