Shivaram Kalyanakrishnan
Shivaram Kalyanakrishnan
E-mail confirmado em cse.iitb.ac.in - Página inicial
Título
Citado por
Citado por
Ano
Artificial intelligence and life in 2030: the one hundred year study on artificial intelligence
P Stone, R Brooks, E Brynjolfsson, R Calo, O Etzioni, G Hager, ...
Stanford University, 2016
498*2016
PAC subset selection in stochastic multi-armed bandits.
S Kalyanakrishnan, A Tewari, P Auer, P Stone
ICML 12, 655-662, 2012
2742012
RoboCup 2006: Robot Soccer World Cup X
DG SORRENTI
Springer, 2007
136*2007
Information complexity in bandit subset selection
E Kaufmann, S Kalyanakrishnan
Conference on Learning Theory, 228-251, 2013
1322013
Batch reinforcement learning in a complex domain
S Kalyanakrishnan, P Stone
Proceedings of the 6th international joint conference on Autonomous agents …, 2007
972007
Efficient selection of multiple bandit arms: Theory and practice
S Kalyanakrishnan, P Stone
ICML, 2010
962010
Half field offense: An environment for multiagent learning and ad hoc teamwork
M Hausknecht, P Mupparaju, S Subramanian, S Kalyanakrishnan, ...
AAMAS Adaptive Learning Agents (ALA) Workshop, 2016
622016
On optimizing interdependent skills: a case study in simulated 3D humanoid robot soccer.
D Urieli, P MacAlpine, S Kalyanakrishnan, Y Bentor, P Stone
AAMAS 11, 769, 2011
532011
UT Austin Villa 2011: a champion agent in the RoboCup 3D soccer simulation competition.
P MacAlpine, D Urieli, S Barrett, S Kalyanakrishnan, F Barrera, ...
AAMAS, 129-136, 2012
522012
Learning to predict humanoid fall
S Kalyanakrishnan, A Goswami
International Journal of Humanoid Robotics 8 (02), 245-273, 2011
472011
Direction-changing fall control of humanoid robots: theory and experiments
A Goswami, S Yun, U Nagarajan, SH Lee, KK Yin, S Kalyanakrishnan
Autonomous Robots 36 (3), 199-223, 2014
382014
RoboCup 2009: Robot Soccer World Cup XIII
S Kalyanakrishnan, P Stone, J Baltes
Lecture Notes in Computer Science 5949, 153-165, 2010
37*2010
An empirical analysis of value function-based and policy search reinforcement learning
S Kalyanakrishnan, P Stone
Proceedings of The 8th International Conference on Autonomous Agents and …, 2009
352009
Machine learning approach for predicting humanoid robot fall
A Goswami, S Kalyanakrishnan
US Patent 8,554,370, 2013
272013
UT Austin Villa 2011: 3D Simulation Team Report
P MacAlpine, D Urieli, S Barrett, F Barrera, A Lopez-Mobilia, V Vu, ...
University of Texas at Austin Austin United States, 2011
262011
Characterizing reinforcement learning methods through parameterized learning problems
S Kalyanakrishnan, P Stone
Machine Learning 84 (1), 205-247, 2011
222011
PAC identification of a bandit arm relative to a reward quantile
AR Chaudhuri, S Kalyanakrishnan
Thirty-First AAAI Conference on Artificial Intelligence, 2017
212017
Opportunities and challenges for artificial intelligence in India
S Kalyanakrishnan, RA Panicker, S Natarajan, S Rao
Proceedings of the 2018 AAAI/ACM conference on AI, Ethics, and Society, 164-170, 2018
182018
Learning methods for sequential decision making with imperfect representations
S Kalyanakrishnan
122011
Model-based reinforcement learning in a complex domain
S Kalyanakrishnan, P Stone, Y Liu
Robot Soccer World Cup, 171-183, 2007
112007
O sistema não pode executar a operação agora. Tente novamente mais tarde.
Artigos 1–20