Follow
Jia Yuan Yu
Jia Yuan Yu
Verified email at amazon.com
Title
Cited by
Cited by
Year
Markov decision processes with arbitrary reward processes
JY Yu, S Mannor, N Shimkin
Mathematics of Operations Research 34 (3), 737-757, 2009
1282009
Online Learning with Sample Path Constraints.
S Mannor, JN Tsitsiklis, JY Yu
Journal of Machine Learning Research 10 (3), 2009
1222009
Piecewise-stationary bandit problems with side observations
JY Yu, S Mannor
Proceedings of the 26th annual international conference on machine learning …, 2009
1082009
Unimodal Bandits.
JY Yu, S Mannor
ICML, 41-48, 2011
992011
A reinforcement learning technique for optimizing downlink scheduling in an energy-limited vehicular network
RF Atallah, CM Assi, JY Yu
IEEE Transactions on Vehicular Technology 66 (6), 4592-4601, 2016
892016
Lipschitz bandits without the lipschitz constant
S Bubeck, G Stoltz, JY Yu
Algorithmic Learning Theory: 22nd International Conference, ALT 2011, Espoo …, 2011
862011
Online learning in Markov decision processes with arbitrarily changing rewards and transitions
JY Yu, S Mannor
2009 international conference on game theory for networks, 314-322, 2009
512009
On the design of campus parking systems with QoS guarantees
W Griggs, JY Yu, F Wirth, F Häusler, R Shorten
IEEE Transactions on Intelligent Transportation Systems 17 (5), 1428-1437, 2015
502015
Sample Complexity of Risk-Averse Bandit-Arm Selection.
JY Yu, E Nikolova
IJCAI, 2576-2582, 2013
482013
Arbitrarily modulated Markov decision processes
JY Yu, S Mannor
Proceedings of the 48h IEEE Conference on Decision and Control (CDC) held …, 2009
442009
Distributed parking space detection, characterization, advertisement, and enforcement
RL Cogill, O Gallay, C Lee, Z Nabi, M Rufli, R Shorten, T Tchrakian, ...
US Patent 9,601,018, 2017
352017
Data-driven distributionally robust polynomial optimization
M Mevissen, E Ragnoli, JY Yu
Advances in Neural Information Processing Systems 26, 2013
302013
Nonhomogeneous place-dependent Markov chains, unsynchronised AIMD, and network utility maximization
F Wirth, S Stuedli, JY Yu, M Corless, R Shorten
arXiv preprint arXiv:1404.5064, 2014
212014
Mean field equilibria of multi armed bandit games
R Gummadi, R Johari, JY Yu
2012 50th Annual Allerton Conference on Communication, Control, and …, 2012
212012
Reinforcement mechanism design for electric vehicle demand response in microgrid charging stations
L Hou, S Ma, J Yan, C Wang, JY Yu
2020 International Joint Conference on Neural Networks (IJCNN), 1-8, 2020
182020
Mean field analysis of multi-armed bandit games
R Gummadi, R Johari, S Schmit, JY Yu
Available at SSRN 2045842, 2013
182013
Reward modeling for mitigating toxicity in transformer-based language models
F Faal, K Schmitt, JY Yu
Applied Intelligence 53 (7), 8421-8435, 2023
172023
Nonhomogeneous place-dependent Markov chains, unsynchronised AIMD, and optimisation
FR Wirth, S Stüdli, JY Yu, M Corless, R Shorten
Journal of the ACM (JACM) 66 (4), 1-37, 2019
172019
Communication-efficient distributed multi-resource allocation
SE Alam, R Shorten, F Wirth, JY Yu
2018 IEEE International Smart Cities Conference (ISC2), 1-8, 2018
172018
Online Learning with Expert Advice and Finite-Horizon Constraints.
B Kveton, JY Yu, G Theocharous, S Mannor
AAAI, 331-336, 2008
172008
The system can't perform the operation now. Try again later.
Articles 1–20