Jia Yuan Yu

Cited by

	All	Since 2019
Citations	1333	874
h-index	17	16
i10-index	32	24

200

100

150

200920102011201220132014201520162017201820192020202120222023202415 16 16 25 35 48 53 60 79 104 139 177 179 152 185 41

Public access

View all

17 articles

5 articles

available

not available

Based on funding mandates

Jia Yuan Yu

Amazon

Verified email at amazon.com


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Markov decision processes with arbitrary reward processes JY Yu, S Mannor, N Shimkin Mathematics of Operations Research 34 (3), 737-757, 2009	128	2009
Online Learning with Sample Path Constraints. S Mannor, JN Tsitsiklis, JY Yu Journal of Machine Learning Research 10 (3), 2009	122	2009
Piecewise-stationary bandit problems with side observations JY Yu, S Mannor Proceedings of the 26th annual international conference on machine learning …, 2009	108	2009
Unimodal Bandits. JY Yu, S Mannor ICML, 41-48, 2011	99	2011
A reinforcement learning technique for optimizing downlink scheduling in an energy-limited vehicular network RF Atallah, CM Assi, JY Yu IEEE Transactions on Vehicular Technology 66 (6), 4592-4601, 2016	89	2016
Lipschitz bandits without the lipschitz constant S Bubeck, G Stoltz, JY Yu Algorithmic Learning Theory: 22nd International Conference, ALT 2011, Espoo …, 2011	86	2011
Online learning in Markov decision processes with arbitrarily changing rewards and transitions JY Yu, S Mannor 2009 international conference on game theory for networks, 314-322, 2009	51	2009
On the design of campus parking systems with QoS guarantees W Griggs, JY Yu, F Wirth, F Häusler, R Shorten IEEE Transactions on Intelligent Transportation Systems 17 (5), 1428-1437, 2015	50	2015
Sample Complexity of Risk-Averse Bandit-Arm Selection. JY Yu, E Nikolova IJCAI, 2576-2582, 2013	48	2013
Arbitrarily modulated Markov decision processes JY Yu, S Mannor Proceedings of the 48h IEEE Conference on Decision and Control (CDC) held …, 2009	44	2009
Distributed parking space detection, characterization, advertisement, and enforcement RL Cogill, O Gallay, C Lee, Z Nabi, M Rufli, R Shorten, T Tchrakian, ... US Patent 9,601,018, 2017	35	2017
Data-driven distributionally robust polynomial optimization M Mevissen, E Ragnoli, JY Yu Advances in Neural Information Processing Systems 26, 2013	30	2013
Nonhomogeneous place-dependent Markov chains, unsynchronised AIMD, and network utility maximization F Wirth, S Stuedli, JY Yu, M Corless, R Shorten arXiv preprint arXiv:1404.5064, 2014	21	2014
Mean field equilibria of multi armed bandit games R Gummadi, R Johari, JY Yu 2012 50th Annual Allerton Conference on Communication, Control, and …, 2012	21	2012
Reinforcement mechanism design for electric vehicle demand response in microgrid charging stations L Hou, S Ma, J Yan, C Wang, JY Yu 2020 International Joint Conference on Neural Networks (IJCNN), 1-8, 2020	18	2020
Mean field analysis of multi-armed bandit games R Gummadi, R Johari, S Schmit, JY Yu Available at SSRN 2045842, 2013	18	2013
Reward modeling for mitigating toxicity in transformer-based language models F Faal, K Schmitt, JY Yu Applied Intelligence 53 (7), 8421-8435, 2023	17	2023
Nonhomogeneous place-dependent Markov chains, unsynchronised AIMD, and optimisation FR Wirth, S Stüdli, JY Yu, M Corless, R Shorten Journal of the ACM (JACM) 66 (4), 1-37, 2019	17	2019
Communication-efficient distributed multi-resource allocation SE Alam, R Shorten, F Wirth, JY Yu 2018 IEEE International Smart Cities Conference (ISC2), 1-8, 2018	17	2018
Online Learning with Expert Advice and Finite-Horizon Constraints. B Kveton, JY Yu, G Theocharous, S Mannor AAAI, 331-336, 2008	17	2008

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by