Mohammad Ghavamzadeh

Cited by

	All	Since 2019
Citations	12999	9588
h-index	57	44
i10-index	118	110

Citations per year

Year	Citations
2005	36
2006	50
2007	58
2008	69
2009	107
2010	104
2011	181
2012	232
2013	200
2014	263
2015	297
2016	355
2017	412
2018	578
2019	916
2020	1186
2021	1675
2022	2093
2023	2824
2024	886

Public access (based on funding mandates)

14 articles available
0 articles not available

Co-authors

Yinlam Chow	Research Scientist, Google Research	Verified email at google.com
Alessandro Lazaric	Research Scientist, Facebook Artificial Intelligence Research	Verified email at inria.fr
Shie Mannor	Professor of Electrical Engineering @ Technion & Researcher @ Nvidia Research	Verified email at technion.ac.il
Branislav Kveton	Amazon	Verified email at amazon.com
Sridhar Mahadevan	Director, Data Science Lab, Adobe Research & Professor, University of Massachusetts, Amherst	Verified email at cs.umass.edu
Rémi Munos	DeepMind	Verified email at inria.fr
Csaba Szepesvari	DeepMind & University of Alberta	Verified email at cs.ualberta.ca
Georgios Theocharous	Adobe Research	Verified email at adobe.com
Amir-massoud Farahmand	University of Toronto	Verified email at cs.toronto.edu
Craig Boutilier	Principal Scientist, Google	Verified email at google.com
Marek Petrik	University of New Hampshire	Verified email at cs.unh.edu
Philip Thomas	University of Massachusetts Amherst	Verified email at cs.umass.edu
Ofir Nachum	OpenAI	Verified email at openai.com
Shalabh Bhatnagar	Professor in the Department of Computer Science and Automation, Indian Institute of Science	Verified email at iisc.ac.in
Richard S. Sutton	Keen, Amii, and University of Alberta	Verified email at richsutton.com
Hung Bui	Research Scientist, Google DeepMind	Verified email at google.com
Zheng Wen	Google DeepMind	Verified email at google.com
Aviv Tamar	Technion	Verified email at technion.ac.il
Bo Liu	Ex-Associate Professor, AAAI SM, IEEE SM	Verified email at cs.umass.edu
Matthieu Geist	Cohere (ex Google, on leave of Professor, Université de Lorraine)	Verified email at univ-lorraine.fr

Mohammad Ghavamzadeh

Amazon

Verified email at amazon.com

Reinforcement Learning, Online Learning, Machine Learning, Control, AI


Title	Authors	Venue	Cited by	Year
A Review of Uncertainty Quantification in Deep Learning: Techniques, Applications and Challenges	M Abdar, F Pourpanah, S Hussain, D Rezazadegan, L Liu, ...	Information Fusion, 2021	1658	2021
Natural Actor–critic Algorithms	S Bhatnagar, RS Sutton, M Ghavamzadeh, M Lee	Automatica 45 (11), 2471-2482, 2009	1084*	2009
A Lyapunov-based Approach to Safe Reinforcement Learning	Y Chow, O Nachum, E Duenez-Guzman, M Ghavamzadeh	Neural Information Processing Systems, 8103-8112, 2018	522	2018
Bayesian Reinforcement Learning: A Survey	M Ghavamzadeh, S Mannor, J Pineau, A Tamar	Foundations and Trends in Machine Learning 8 (5-6), 359-483, 2015	522	2015
Risk-constrained Reinforcement Learning with Percentile Risk Criteria	Y Chow, M Ghavamzadeh, L Janson, M Pavone	Journal of Machine Learning Research (JMLR) 18, 6070-6120, 2017	495	2017
Algorithms for CVaR Optimization in MDPs	Y Chow, M Ghavamzadeh	Advances in Neural Information Processing Systems, 3509-3517, 2014	398	2014
Best Arm Identification: A Unified Approach to Fixed Budget and Fixed Confidence	V Gabillon, M Ghavamzadeh, A Lazaric	Neural Information Processing Systems, 3221-3229, 2012	338	2012
Actor-Critic Algorithms for Risk-sensitive MDPs	LA Prashanth, M Ghavamzadeh	Neural Information Processing Systems, 252-260, 2013	335*	2013
High-confidence Off-policy Evaluation	P Thomas, G Theocharous, M Ghavamzadeh	AAAI, 3000-3006, 2015	303	2015
More Robust Doubly Robust Off-policy Evaluation	M Farajtabar, Y Chow, M Ghavamzadeh	ICML, 1447-1456, 2018	251	2018
Safe Policy Learning for Continuous Control	Y Chow, O Nachum, A Faust, E Duenez-Guzman, M Ghavamzadeh	Conference on Robot Learning (CoRL), 2020	244*	2020
High Confidence Policy Improvement	P Thomas, G Theocharous, M Ghavamzadeh	ICML, 2380-2388, 2015	213	2015
Speedy Q-learning	M Ghavamzadeh, H Kappen, M Azar, R Munos	Neural Information Processing Systems 24, 2411-2419, 2011	201*	2011
Supervised actor-critic reinforcement learning	MT Rosenstein, AG Barto, J Si, A Barto, W Powell, D Wunsch	Learning and approximate dynamic programming: scaling up to the real world …, 2004	197	2004
Hierarchical Multi-agent Reinforcement Learning	R Makar, S Mahadevan, M Ghavamzadeh	International Conference on Autonomous Agents, 246-253, 2001	194	2001
Personalized Ad Recommendation Systems for Life-time Value Optimization with Guarantees	G Theocharous, PS Thomas, M Ghavamzadeh	IJCAI, 1806-1812, 2015	189*	2015
Benchmarking Batch Deep Reinforcement Learning Algorithms	S Fujimoto, E Conti, M Ghavamzadeh, J Pineau	arXiv preprint arXiv:1910.01708, 2019	188	2019
Finite-Sample Analysis of Proximal Gradient TD Algorithms	B Liu, J Liu, M Ghavamzadeh, S Mahadevan, M Petrik	UAI, 504-513, 2015	172*	2015
Hierarchical Multi-agent Reinforcement Learning	M Ghavamzadeh, S Mahadevan, R Makar	Journal of Autonomous Agents and Multi-Agent Systems (JAAMAS) 13 (2), 197-229, 2006	172	2006
Regularized Policy Iteration	AM Farahmand, M Ghavamzadeh, C Szepesvári, S Mannor	Neural Information Processing Systems, 441-448, 2008	162	2008
