Fast gradient-descent methods for temporal-difference learning with linear function approximation RS Sutton, HR Maei, D Precup, S Bhatnagar, D Silver, C Szepesvári, ...
Proceedings of the 26th Annual International Conference on Machine Learning …, 2009
586 2009 Involvement of the anterior cingulate cortex in the expression of remote spatial memory CM Teixeira, SR Pomedli, HR Maei, N Kee, PW Frankland
Journal of Neuroscience 26 (29), 7555-7564, 2006
286 2006 Toward off-policy learning control with function approximation HR Maei, C Szepesvári, S Bhatnagar, RS Sutton
ICML, 2010
277 2010 Convergent temporal-difference learning with arbitrary smooth function approximation HR Maei, S Szepesvári, Csaba, Bhatnagar, D Precup, D Silver, RS Sutton
Advances in Neural Information Processing Systems, 1204-1212, 2009
268 2009 Optimal demand response using device-based reinforcement learning Z Wen, D O’Neill, H Maei
IEEE Transactions on Smart Grid 6 (5), 2312-2324, 2015
225 2015 A Convergent Temporal-difference Algorithm for Off-policy Learning with Linear Function Approximation RS Sutton, H Maei, C Szepesvári
Advances in neural information processing systems 21, 2008
216 2008 What is the most sensitive measure of water maze probe test performance? HR Maei, K Zaslavsky, CM Teixeira, PW Frankland
Frontiers in integrative neuroscience 3, 4, 2009
215 2009 A convergent O (n) algorithm for off-policy temporal-difference learning with linear function approximation RS Sutton, C Szepesvári, HR Maei
Advances in neural information processing systems 21 (21), 1609-1616, 2008
186 2008 Gradient temporal-difference learning algorithms HR Maei
161 2011 GQ (λ): A general gradient algorithm for temporal-difference prediction learning with eligibility traces HR Maei, RS Sutton
Proceedings of the third conference on artificial general intelligence 1, 91-96, 2010
159 2010 Deep reinforcement learning for visual object tracking in videos D Zhang, H Maei, X Wang, YF Wang
arXiv preprint arXiv:1701.08936, 2017
107 2017 Randomly connected networks have short temporal memory E Wallace, HR Maei, PE Latham
Neural computation 25 (6), 1408-1439, 2013
38 2013 Correlated quantum percolation in the lowest Landau level N Sandler, HR Maei, J Kondev
Physical Review B 70 (4), 045309, 2004
24 2004 A batch, off-policy, actor-critic algorithm for optimizing the average reward SA Murphy, Y Deng, EB Laber, HR Maei, RS Sutton, K Witkiewitz
arXiv preprint arXiv:1607.05047, 2016
23 2016 Convergent actor-critic algorithms under off-policy training and function approximation HR Maei
arXiv preprint arXiv:1802.07842, 2018
22 2018 Development and validation of a sensitive entropy-based measure for the water maze HR Maei, K Zaslavsky, AH Wang, AP Yiu, CM Teixeira, SA Josselyn, ...
Frontiers in integrative neuroscience 3, 33, 2009
20 2009 Quantum and classical localization in the lowest Landau level N Sandler, HR Maei, J Kondev
Physical Review B 68 (20), 205315, 2003
14 2003 How can realistic networks process time-varying signals? H Maei
PQDT-Global, 2005
2 2005 A novel analytic measure for the water maze utilizing the concept of entropy HR Lee, BK Kaang
Frontiers in Neuroscience, 2, 2010
2010 Convergent Temporal-Difference Learning with Arbitrary Differentiable Function Approximator HR Maei, C Szepesvári, S Bhathnagar, D Silver, D Precup, R Sutton
2010