Learning to learn by gradient descent by gradient descent M Andrychowicz, M Denil, S Gomez, MW Hoffman, D Pfau, T Schaul, ... Advances in neural information processing systems 29, 2016 | 1521 | 2016 |
Hindsight experience replay M Andrychowicz, F Wolski, A Ray, J Schneider, R Fong, P Welinder, ... Advances in neural information processing systems 30, 2017 | 1517 | 2017 |
Learning dexterous in-hand manipulation OAIM Andrychowicz, B Baker, M Chociej, R Jozefowicz, B McGrew, ... The International Journal of Robotics Research 39 (1), 3-20, 2020 | 896 | 2020 |
Sim-to-real transfer of robotic control with dynamics randomization XB Peng, M Andrychowicz, W Zaremba, P Abbeel 2018 IEEE international conference on robotics and automation (ICRA), 3803-3810, 2018 | 804 | 2018 |
One-shot imitation learning Y Duan, M Andrychowicz, B Stadie, OAI Jonathan Ho, J Schneider, ... Advances in neural information processing systems 30, 2017 | 552 | 2017 |
Secure multiparty computations on bitcoin M Andrychowicz, S Dziembowski, D Malinowski, L Mazurek 2014 IEEE Symposium on Security and Privacy, 443-458, 2014 | 486 | 2014 |
Overcoming exploration in reinforcement learning with demonstrations A Nair, B McGrew, M Andrychowicz, W Zaremba, P Abbeel 2018 IEEE international conference on robotics and automation (ICRA), 6292-6299, 2018 | 481 | 2018 |
Parameter space noise for exploration M Plappert, R Houthooft, P Dhariwal, S Sidor, RY Chen, X Chen, T Asfour, ... arXiv preprint arXiv:1706.01905, 2017 | 474 | 2017 |
Solving rubik's cube with a robot hand I Akkaya, M Andrychowicz, M Chociej, M Litwin, B McGrew, A Petron, ... arXiv preprint arXiv:1910.07113, 2019 | 391 | 2019 |
Multi-goal reinforcement learning: Challenging robotics environments and request for research M Plappert, M Andrychowicz, A Ray, B McGrew, B Baker, G Powell, ... arXiv preprint arXiv:1802.09464, 2018 | 283 | 2018 |
Asymmetric actor critic for image-based robot learning L Pinto, M Andrychowicz, P Welinder, W Zaremba, P Abbeel arXiv preprint arXiv:1710.06542, 2017 | 221 | 2017 |
Fair two-party computations via bitcoin deposits M Andrychowicz, S Dziembowski, D Malinowski, Ł Mazurek International Conference on Financial Cryptography and Data Security, 105-121, 2014 | 182 | 2014 |
Neural random-access machines K Kurach, M Andrychowicz, I Sutskever arXiv preprint arXiv:1511.06392, 2015 | 158 | 2015 |
Domain randomization and generative models for robotic grasping J Tobin, L Biewald, R Duan, M Andrychowicz, A Handa, V Kumar, ... 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2018 | 119 | 2018 |
Solving Rubik's Cube with a Robot Hand IA OpenAI, M Andrychowicz, M Chociej, M Litwin, B McGrew, A Petron, ... | 115 | 2019 |
On the malleability of bitcoin transactions M Andrychowicz, S Dziembowski, D Malinowski, Ł Mazurek International Conference on Financial Cryptography and Data Security, 1-18, 2015 | 86 | 2015 |
What matters in on-policy reinforcement learning? a large-scale empirical study M Andrychowicz, A Raichuk, P Stańczyk, M Orsini, S Girgin, R Marinier, ... arXiv preprint arXiv:2006.05990, 2020 | 66 | 2020 |
Pow-based distributed cryptography with no trusted setup M Andrychowicz, S Dziembowski Annual Cryptology Conference, 379-399, 2015 | 64 | 2015 |
Modeling bitcoin contracts by timed automata M Andrychowicz, S Dziembowski, D Malinowski, Ł Mazurek International Conference on Formal Modeling and Analysis of Timed Systems, 7-22, 2014 | 50 | 2014 |
Learning efficient algorithms with hierarchical attentive memory M Andrychowicz, K Kurach arXiv preprint arXiv:1602.03218, 2016 | 49 | 2016 |