Hamid Maei
Hamid Maei
Cruise Automation
Zweryfikowany adres z getcruise.com
Tytuł
Cytowane przez
Cytowane przez
Rok
Fast gradient-descent methods for temporal-difference learning with linear function approximation
RS Sutton, HR Maei, D Precup, S Bhatnagar, D Silver, C Szepesvári, ...
Proceedings of the 26th Annual International Conference on Machine Learning …, 2009
5072009
Involvement of the anterior cingulate cortex in the expression of remote spatial memory
CM Teixeira, SR Pomedli, HR Maei, N Kee, PW Frankland
Journal of Neuroscience 26 (29), 7555-7564, 2006
2682006
Toward off-policy learning control with function approximation
HR Maei, C Szepesvári, S Bhatnagar, RS Sutton
Proceedings of the 27th International Conference on Machine Learning (ICML …, 2010
2482010
Convergent temporal-difference learning with arbitrary smooth function approximation
HR Maei, S Szepesvári, Csaba, Bhatnagar, D Precup, D Silver, RS Sutton
Advances in Neural Information Processing Systems, 1204-1212, 2009
2352009
What is the most sensitive measure of water maze probe test performance?
HR Maei, K Zaslavsky, CM Teixeira, PW Frankland
Frontiers in integrative neuroscience 3, 4, 2009
1982009
A convergent o (n) temporal-difference algorithm for off-policy learning with linear function approximation
RS Sutton, C Szepesvári, HR Maei
NIPS, 2008
1832008
Optimal demand response using device-based reinforcement learning
Z Wen, D O’Neill, H Maei
IEEE Transactions on Smart Grid 6 (5), 2312-2324, 2015
1742015
A convergent O (n) algorithm for off-policy temporal-difference learning with linear function approximation
RS Sutton, C Szepesvári, HR Maei
Advances in neural information processing systems 21 (21), 1609-1616, 2008
1642008
GQ (lambda): A general gradient algorithm for temporal-difference prediction learning with eligibility traces
HR Maei, RS Sutton
3d Conference on Artificial General Intelligence (AGI-2010), 2010
1532010
Gradient temporal-difference learning algorithms
HR Maei
1342011
Deep reinforcement learning for visual object tracking in videos
D Zhang, H Maei, X Wang, YF Wang
arXiv preprint arXiv:1701.08936, 2017
892017
Design challenges of implantable pressure monitoring system
G Jiang
Frontiers in neuroscience 4, 2, 2010
572010
Randomly connected networks have short temporal memory
E Wallace, HR Maei, PE Latham
Neural computation 25 (6), 1408-1439, 2013
352013
Correlated quantum percolation in the lowest Landau level
N Sandler, HR Maei, J Kondev
Physical Review B 70 (4), 045309, 2004
232004
A batch, off-policy, actor-critic algorithm for optimizing the average reward
SA Murphy, Y Deng, EB Laber, HR Maei, RS Sutton, K Witkiewitz
arXiv preprint arXiv:1607.05047, 2016
202016
Development and validation of a sensitive entropy-based measure for the water maze
HR Maei, K Zaslavsky, AH Wang, AP Yiu, CM Teixeira, SA Josselyn, ...
Frontiers in integrative neuroscience 3, 33, 2009
182009
Convergent actor-critic algorithms under off-policy training and function approximation
HR Maei
arXiv preprint arXiv:1802.07842, 2018
172018
Quantum and classical localization in the lowest Landau level
N Sandler, HR Maei, J Kondev
Physical Review B 68 (20), 205315, 2003
142003
How can realistic networks process time-varying signals?
H Maei
University of London, 2005
22005
Convergent Temporal-Difference Learning with Arbitrary Differentiable Function Approximator
HR Maei, C Szepesvári, S Bhathnagar, D Silver, D Precup, R Sutton
2010
Nie można teraz wykonać tej operacji. Spróbuj ponownie później.
Prace 1–20