|Unifying count-based exploration and intrinsic motivation|
M Bellemare, S Srinivasan, G Ostrovski, T Schaul, D Saxton, R Munos
Advances in Neural Information Processing Systems, 1471-1479, 2016
|Emergence of locomotion behaviours in rich environments|
N Heess, S Sriram, J Lemmon, J Merel, G Wayne, Y Tassa, T Erez, ...
arXiv preprint arXiv:1707.02286, 2017
|Neural episodic control|
A Pritzel, B Uria, S Srinivasan, AP Badia, O Vinyals, D Hassabis, ...
Proceedings of the 34th International Conference on Machine Learning-Volume …, 2017
|Actor-critic policy optimization in partially observable multiagent environments|
S Srinivasan, M Lanctot, V Zambaldi, J Pérolat, K Tuyls, R Munos, ...
Advances in Neural Information Processing Systems, 3422-3435, 2018
|Segmenting web-domains and hashtags using length specific models|
S Srinivasan, S Bhattacharya, R Chakraborty
Proceedings of the 21st ACM international conference on Information and …, 2012
|Domain-independent optimistic initialization for reinforcement learning|
MC Machado, S Srinivasan, M Bowling
Workshops at the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015
|Improving exploration in UCT using local manifolds|
S Srinivasan, E Talvitie, M Bowling
Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015
|Emergence of locomotion behaviours in rich environments (2017)|
N Heess, TB Dhruva, S Sriram, J Lemmon, J Merel, G Wayne, Y Tassa, ...
arXiv preprint arXiv:1707.02286, 2017
|Learning to tokenize web domains|
S Srinivasan, S Bhattacharya
Proceedings of the 20th international conference companion on World wide web …, 2011
|Learning Markov Networks with Bounded Inference Complexity|
UD Gupta, S Sriram, S Sharma, R Greiner