Jia Yuan Yu
Verified email at amazon.com
Title
Cited by
Year
Markov decision processes with arbitrary reward processes
JY Yu, S Mannor, N Shimkin
Mathematics of Operations Research 34 (3), 737-757, 2009
Cited by 134, year 2009
Online Learning with Sample Path Constraints.
S Mannor, JN Tsitsiklis, JY Yu
Journal of Machine Learning Research 10 (3), 2009
Cited by 132, year 2009
Piecewise-stationary bandit problems with side observations
JY Yu, S Mannor
Proceedings of the 26th annual international conference on machine learning …, 2009
Cited by 117, year 2009
Unimodal Bandits.
JY Yu, S Mannor
ICML, 41-48, 2011
Cited by 110, year 2011
A reinforcement learning technique for optimizing downlink scheduling in an energy-limited vehicular network
RF Atallah, CM Assi, JY Yu
IEEE Transactions on Vehicular Technology 66 (6), 4592-4601, 2016
Cited by 98, year 2016
Lipschitz bandits without the Lipschitz constant
S Bubeck, G Stoltz, JY Yu
Algorithmic Learning Theory: 22nd International Conference, ALT 2011, Espoo …, 2011
Cited by 92, year 2011
Online learning in Markov decision processes with arbitrarily changing rewards and transitions
JY Yu, S Mannor
2009 international conference on game theory for networks, 314-322, 2009
Cited by 54, year 2009
On the design of campus parking systems with QoS guarantees
W Griggs, JY Yu, F Wirth, F Häusler, R Shorten
IEEE Transactions on Intelligent Transportation Systems 17 (5), 1428-1437, 2015
Cited by 53, year 2015
Sample Complexity of Risk-Averse Bandit-Arm Selection.
JY Yu, E Nikolova
IJCAI, 2576-2582, 2013
Cited by 48, year 2013
Arbitrarily modulated Markov decision processes
JY Yu, S Mannor
Proceedings of the 48th IEEE Conference on Decision and Control (CDC) held …, 2009
Cited by 45, year 2009
Distributed parking space detection, characterization, advertisement, and enforcement
RL Cogill, O Gallay, C Lee, Z Nabi, M Rufli, R Shorten, T Tchrakian, ...
US Patent 9,601,018, 2017
Cited by 37, year 2017
Data-driven distributionally robust polynomial optimization
M Mevissen, E Ragnoli, JY Yu
Advances in Neural Information Processing Systems 26, 2013
Cited by 32, year 2013
Reward modeling for mitigating toxicity in transformer-based language models
F Faal, K Schmitt, JY Yu
Applied Intelligence 53 (7), 8421-8435, 2023
Cited by 28, year 2023
Reinforcement mechanism design for electric vehicle demand response in microgrid charging stations
L Hou, S Ma, J Yan, C Wang, JY Yu
2020 International Joint Conference on Neural Networks (IJCNN), 1-8, 2020
Cited by 22, year 2020
Nonhomogeneous place-dependent Markov chains, unsynchronised AIMD, and network utility maximization
F Wirth, S Stuedli, JY Yu, M Corless, R Shorten
arXiv preprint arXiv:1404.5064, 2014
Cited by 22, year 2014
Mean field equilibria of multi armed bandit games
R Gummadi, R Johari, JY Yu
2012 50th Annual Allerton Conference on Communication, Control, and …, 2012
Cited by 21, year 2012
Mean field analysis of multi-armed bandit games
R Gummadi, R Johari, S Schmit, JY Yu
Available at SSRN 2045842, 2013
Cited by 20, year 2013
A price-based iterative double auction for charger sharing markets
J Gao, T Wong, C Wang, JY Yu
IEEE Transactions on Intelligent Transportation Systems 23 (6), 5116-5127, 2021
Cited by 19, year 2021
Nonhomogeneous place-dependent Markov chains, unsynchronised AIMD, and optimisation
FR Wirth, S Stüdli, JY Yu, M Corless, R Shorten
Journal of the ACM (JACM) 66 (4), 1-37, 2019
Cited by 18, year 2019
Online Learning with Expert Advice and Finite-Horizon Constraints.
B Kveton, JY Yu, G Theocharous, S Mannor
AAAI, 331-336, 2008
Cited by 18, year 2008