Markov decision processes with arbitrary reward processes JY Yu, S Mannor, N Shimkin Mathematics of Operations Research 34 (3), 737-757, 2009 | 134 | 2009 |
Online Learning with Sample Path Constraints. S Mannor, JN Tsitsiklis, JY Yu Journal of Machine Learning Research 10 (3), 2009 | 132 | 2009 |
Piecewise-stationary bandit problems with side observations JY Yu, S Mannor Proceedings of the 26th annual international conference on machine learning …, 2009 | 117 | 2009 |
Unimodal Bandits. JY Yu, S Mannor ICML, 41-48, 2011 | 110 | 2011 |
A reinforcement learning technique for optimizing downlink scheduling in an energy-limited vehicular network RF Atallah, CM Assi, JY Yu IEEE Transactions on Vehicular Technology 66 (6), 4592-4601, 2016 | 98 | 2016 |
Lipschitz bandits without the lipschitz constant S Bubeck, G Stoltz, JY Yu Algorithmic Learning Theory: 22nd International Conference, ALT 2011, Espoo …, 2011 | 92 | 2011 |
Online learning in Markov decision processes with arbitrarily changing rewards and transitions JY Yu, S Mannor 2009 international conference on game theory for networks, 314-322, 2009 | 54 | 2009 |
On the design of campus parking systems with QoS guarantees W Griggs, JY Yu, F Wirth, F Häusler, R Shorten IEEE Transactions on Intelligent Transportation Systems 17 (5), 1428-1437, 2015 | 53 | 2015 |
Sample Complexity of Risk-Averse Bandit-Arm Selection. JY Yu, E Nikolova IJCAI, 2576-2582, 2013 | 48 | 2013 |
Arbitrarily modulated Markov decision processes JY Yu, S Mannor Proceedings of the 48h IEEE Conference on Decision and Control (CDC) held …, 2009 | 45 | 2009 |
Distributed parking space detection, characterization, advertisement, and enforcement RL Cogill, O Gallay, C Lee, Z Nabi, M Rufli, R Shorten, T Tchrakian, ... US Patent 9,601,018, 2017 | 37 | 2017 |
Data-driven distributionally robust polynomial optimization M Mevissen, E Ragnoli, JY Yu Advances in Neural Information Processing Systems 26, 2013 | 32 | 2013 |
Reward modeling for mitigating toxicity in transformer-based language models F Faal, K Schmitt, JY Yu Applied Intelligence 53 (7), 8421-8435, 2023 | 28 | 2023 |
Reinforcement mechanism design for electric vehicle demand response in microgrid charging stations L Hou, S Ma, J Yan, C Wang, JY Yu 2020 International Joint Conference on Neural Networks (IJCNN), 1-8, 2020 | 22 | 2020 |
Nonhomogeneous place-dependent Markov chains, unsynchronised AIMD, and network utility maximization F Wirth, S Stuedli, JY Yu, M Corless, R Shorten arXiv preprint arXiv:1404.5064, 2014 | 22 | 2014 |
Mean field equilibria of multi armed bandit games R Gummadi, R Johari, JY Yu 2012 50th Annual Allerton Conference on Communication, Control, and …, 2012 | 21 | 2012 |
Mean field analysis of multi-armed bandit games R Gummadi, R Johari, S Schmit, JY Yu Available at SSRN 2045842, 2013 | 20 | 2013 |
A price-based iterative double auction for charger sharing markets J Gao, T Wong, C Wang, JY Yu IEEE Transactions on Intelligent Transportation Systems 23 (6), 5116-5127, 2021 | 19 | 2021 |
Nonhomogeneous place-dependent Markov chains, unsynchronised AIMD, and optimisation FR Wirth, S Stüdli, JY Yu, M Corless, R Shorten Journal of the ACM (JACM) 66 (4), 1-37, 2019 | 18 | 2019 |
Online Learning with Expert Advice and Finite-Horizon Constraints. B Kveton, JY Yu, G Theocharous, S Mannor AAAI, 331-336, 2008 | 18 | 2008 |