A. Rupam Mahmood

Citado por

	Todos	Desde 2019
Citações	1399	1112
Índice h	17	15
Índice i10	19	17

280

140

210

2013201420152016201720182019202020212022202320247 19 34 55 57 106 122 175 233 234 262 86

Acesso público

Ver todos

6 artigos

0 artigo

disponível

não disponível

Com base nas autorizações de financiamento

Coautores

Richard S. SuttonKeen, Amii, and University of AlbertaE-mail confirmado em richsutton.com
Martha WhiteUniversity of AlbertaE-mail confirmado em ualberta.ca
Gautham VasanAmii, University of AlbertaE-mail confirmado em ualberta.ca
Dmytro KorenkevychMeta AIE-mail confirmado em meta.com
James BergstraE-mail confirmado em uwaterloo.ca
Hado van HasseltResearch Scientist, DeepMind; Honorary Professor, UCLE-mail confirmado em google.com
Patrick M. PilarskiUniversity of Alberta, Amii (Alberta Machine Intelligence Institute)E-mail confirmado em ualberta.ca
Qingfeng LanPhD student @ University of AlbertaE-mail confirmado em ualberta.ca
Shibhansh DoharePhD Student, University of AlbertaE-mail confirmado em ualberta.ca
Harm van SeijenSony AIE-mail confirmado em sony.com
Brent KomerPhD Student, University of WaterlooE-mail confirmado em uwaterloo.ca
Marlos C. MachadoUniversity of AlbertaE-mail confirmado em ualberta.ca
Doina PrecupDeepMind and McGill UniversityE-mail confirmado em cs.mcgill.ca
Thomas DegrisDeepMindE-mail confirmado em google.com
Oliver LimoyoUniversity of Toronto Institute for Aerospace StudiesE-mail confirmado em mail.utoronto.ca
Bryan ChanUniversity of AlbertaE-mail confirmado em ualberta.ca
Jonathan KellyUniversity of Toronto Institute for Aerospace StudiesE-mail confirmado em utias.utoronto.ca

Seguir

A. Rupam Mahmood

University of Alberta, Amii

E-mail confirmado em ualberta.ca - Página inicial

Reinforcement learning robotics artificial intelligence machine learning


Título Ordenar por citações Ordenar por ano Ordenar por título	Citado por Citado por	Ano
An emphatic approach to the problem of off-policy temporal-difference learning RS Sutton, AR Mahmood, M White (JMLR) Journal of Machine Learning Research 17, 2016	274	2016
Benchmarking reinforcement learning algorithms on real-world robots AR Mahmood, D Korenkevych, G Vasan, W Ma, J Bergstra (CoRL) Proceedings of the 2nd Annual Conference on Robot Learning, 2018	183	2018
Weighted importance sampling for off-policy learning with linear function approximation AR Mahmood, H van Hasselt, RS Sutton (NeurIPS) Advances in Neural Information Processing Systems 27, 2014	165	2014
True online temporal-difference learning H van Seijen, AR Mahmood, PM Pilarski, MC Machado, RS Sutton (JMLR) Journal of Machine Learning Research 17, 2016	110	2016
Setting up a reinforcement learning task with a real-world robot AR Mahmood, D Korenkevych, BJ Komer, J Bergstra (IROS) 2018 IEEE/RSJ International Conference on Intelligent Robots and …, 2018	84	2018
Tuning-free step-size adaptation AR Mahmood, RS Sutton, T Degris, PM Pilarski (ICASSP) Acoustics, Speech and Signal Processing, 2012 IEEE International …, 2012	78	2012
Loss of Plasticity in Deep Continual Learning S Dohare, JF Hernandez-Garcia, P Rahman, RS Sutton, AR Mahmood arXiv preprint arXiv:2306.13812, 2023	53*	2023
Multi-step off-policy learning without importance sampling ratios AR Mahmood, H Yu, RS Sutton arXiv preprint arXiv:1702.03006, 2017	49	2017
Representation Search through Generate and Test AR Mahmood, RS Sutton Workshops at the Twenty-Seventh AAAI Conference on Artificial Intelligence, 2013	48	2013
Off-policy TD (λ) with a true online equivalence H van Hasselt, AR Mahmood, RS Sutton (UAI) Proceedings of the 30th Conference on Uncertainty in Artificial …, 2014	45	2014
A new Q (λ) with interim forward view and Monte Carlo equivalence RS Sutton, AR Mahmood, D Precup, M CA, H van Hasselt, U CA (ICML) In International Conference on Machine Learning, 2014	40	2014
On generalized Bellman equations and temporal-difference learning H Yu, AR Mahmood, RS Sutton (JMLR) The Journal of Machine Learning Research 19 (1), 1864-1912, 2018	39	2018
Emphatic temporal-difference learning AR Mahmood, H Yu, M White, RS Sutton In European Workshops on Reinforcement Learning, 2015	37	2015
Off-policy learning based on weighted importance sampling with linear computational complexity AR Mahmood, RS Sutton (UAI) Proceedings of the 31st Conference on Uncertainty in Artificial …, 2015	30	2015
Autoregressive policies for continuous control deep reinforcement learning D Korenkevych, AR Mahmood, G Vasan, J Bergstra (IJCAI) Proceedings of the 28th International Joint Conference on Artificial …, 2019	25	2019
Incremental Off-policy Reinforcement Learning Algorithms A Mahmood University of Alberta, 2017	18	2017
Greedification operators for policy optimization: investigating forward and reverse KL divergences A Chan, H Silva, S Lim, T Kozuno, AR Mahmood, M White (JMLR) Journal of Machine Learning Research, 2022	17	2022
Structure Learning of Causal Bayesian Networks: A Survey A Mahmood Department of Computing Science, University of Alberta, Edmonton, Canada …, 2011	11	2011
Automatic Step-size Adaptation In Incremental Supervised Learning A Mahmood University of Alberta, 2010	11	2010
Asynchronous reinforcement learning for real-time control of physical robots Y Yuan, AR Mahmood (ICRA) In Proceedings of the 2022 International Conference on Robotics and …, 2022	9	2022

O sistema não pode executar a operação agora. Tente novamente mais tarde.

Artigos 1–20

Citações por ano

Citações duplicadas

Citações mescladas

Adicionar coautoresCoautores

Seguir

Citado por

Coautores