Kavosh Asadi

Cited by

	All	Since 2019
Citations	2296	2118
h-index	13	13
i10-index	16	15

560

280

140

420

2017201820192020202120222023202442 125 146 253 396 466 552 277

Public access

View all

3 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Michael LittmanBrown UniversityVerified email at brown.edu
Alex SmolaBoson AIVerified email at smola.org
George KonidarisBrownVerified email at cs.brown.edu
Dipendra MisraMicrosoft Research New YorkVerified email at microsoft.com
Rasool FakoorAmazon Web ServicesVerified email at amazon.com
Jason D. WilliamsAppleVerified email at apple.com
David AbelResearch Scientist, DeepMindVerified email at deepmind.com
Seungchan KimCarnegie Mellon UniversityVerified email at cs.cmu.edu
Cameron S. AllenPostdoc, UC BerkeleyVerified email at berkeley.edu
Yuu JinnaiCyberAgent, Inc.Verified email at cyberagent.co.jp
Dilip ArumugamPh.D. Candidate - Stanford UniversityVerified email at cs.stanford.edu
Shoham SabachAssociate Professor, Technion, Faculty of Data and Decision SciencesVerified email at technion.ac.il
Omer GottesmanAmazonVerified email at amazon.com
Abdelrahman MohamedResearch scientist, Facebook AI ResearchVerified email at fb.com
Ronald ParrProfessor of Computer Science, Duke UniversityVerified email at cs.duke.edu
Lawson L.S. WongAssistant Professor, CCIS, Northeastern UniversityVerified email at ccs.neu.edu
Erwan LecarpentierPhD in Computer ScienceVerified email at isae-supaero.fr
Yao LiuAmazonVerified email at stanford.edu
Taesup KimAssistant Professor, Seoul National UniversityVerified email at snu.ac.kr

Kavosh Asadi

Research Scientist, Amazon

Verified email at amazon.com - Homepage

Reinforcement Learning AI Alignment Optimization


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Dive into deep learning A Zhang, ZC Lipton, M Li, AJ Smola arXiv preprint arXiv:2106.11342, 2021	1072	2021
Hybrid code networks: practical and efficient end-to-end dialog control with supervised and reinforcement learning JD Williams, K Asadi, G Zweig arXiv preprint arXiv:1702.03274, 2017	405	2017
An Alternative Softmax Operator for Reinforcement Learning K Asadi, ML Littman Proceedings of the 34th International Conference on Machine Learning, 243-252, 2017	221	2017
Lipschitz Continuity in Model-based Reinforcement Learning K Asadi, D Misra, ML Littman Proceedings of the 35th International Conference on Machine Learning, 2018	168	2018
Deepmellow: removing the need for a target network in deep Q-learning S Kim, K Asadi, M Littman, G Konidaris Proceedings of the Twenty Eighth International Joint Conference on …, 2019	76*	2019
State abstraction as compression in apprenticeship learning D Abel, D Arumugam, K Asadi, Y Jinnai, ML Littman, LLS Wong Proceedings of the AAAI Conference on Artificial Intelligence 33, 3134-3142, 2019	58	2019
Combating the Compounding-Error Problem with a Multi-step Model K Asadi, D Misra, S Kim, ML Littman arXiv preprint arXiv:1905.13320, 2019	55	2019
Lipschitz lifelong reinforcement learning E Lecarpentier, D Abel, K Asadi, Y Jinnai, E Rachelson, ML Littman Proceedings of the AAAI Conference on Artificial Intelligence 35 (9), 8270-8278, 2021	36	2021
Mean Actor Critic K Asadi, C Allen, M Roderick, A Mohamed, G Konidaris, M Littman arXiv preprint arXiv:1709.00503, 2017	35*	2017
Continuous doubly constrained batch reinforcement learning R Fakoor, J Mueller, K Asadi, P Chaudhari, AJ Smola arXiv preprint arXiv:2102.09225, 2021	28	2021
Deep radial-basis value functions for continuous control K Asadi, N Parikh, RE Parr, GD Konidaris, ML Littman Proceedings of the AAAI Conference on Artificial Intelligence, 2021	27*	2021
Sample-efficient Reinforcement Learning for Dialog Control K Asadi, JD Williams arXiv preprint arXiv:1612.06000, 2016	25	2016
Strengths, weaknesses, and combinations of model-based and model-free reinforcement learning K Asadi Department of Computing Science University of Alberta, 2015	14	2015
Mitigating Planner Overfitting in Model-Based Reinforcement Learning D Arumugam, D Abel, K Asadi, N Gopalan, C Grimm, JK Lee, L Lehnert, ... arXiv preprint arXiv:1812.01129, 2018	13	2018
Towards a Simple Approach to Multi-step Model-based Reinforcement Learning K Asadi, E Cater, D Misra, ML Littman arXiv preprint arXiv:1811.00128, 2018	13	2018
Equivalence between wasserstein and value-aware model-based reinforcement learning K Asadi, E Cater, D Misra, ML Littman FAIM Workshop on Prediction and Generative Modeling in Reinforcement Learning 3, 2018	13*	2018
Resetting the optimizer in deep RL: An empirical study K Asadi, R Fakoor, S Sabach Advances in Neural Information Processing Systems 36, 2023	9	2023
Fair E3: Efficient welfare-centric fair reinforcement learning C Cousins, K Asadi, ML Littman 5th Multidisciplinary Conference on Reinforcement Learning and Decision …, 2022	6	2022
Learning State Abstractions for Transfer in Continuous Control K Asadi, D Abel, ML Littman arXiv preprint arXiv:2002.05518, 2020	6	2020
TAIL: Task-specific Adapters for Imitation Learning with Large Pretrained Models Z Liu, J Zhang, K Asadi, Y Liu, D Zhao, S Sabach, R Fakoor arXiv preprint arXiv:2310.05905, 2023	4	2023

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors