Follow
Ryan D'Orazio
Ryan D'Orazio
PhD Student at MILA
Verified email at mila.quebec - Homepage
Title
Cited by
Cited by
Year
Hindsight and sequential rationality of correlated play
D Morrill, R D'Orazio, R Sarfati, M Lanctot, JR Wright, AR Greenwald, ...
Proceedings of the AAAI Conference on Artificial Intelligence 35 (6), 5584-5594, 2021
142021
Efficient deviation types and learning for hindsight rationality in extensive-form games
D Morrill, R D’Orazio, M Lanctot, JR Wright, M Bowling, AR Greenwald
International Conference on Machine Learning, 7818-7828, 2021
112021
Solving common-payoff games with approximate policy iteration
S Sokota, E Lockhart, F Timbers, E Davoodi, R D'Orazio, N Burch, ...
Proceedings of the AAAI Conference on Artificial Intelligence 35 (11), 9695-9703, 2021
92021
Stochastic mirror descent: Convergence analysis and adaptive variants via the mirror stochastic Polyak stepsize
R D'Orazio, N Loizou, I Laradji, I Mitliagkas
arXiv preprint arXiv:2110.15412, 2021
72021
Alternative Function Approximation Parameterizations for Solving Games: An Analysis of -Regression Counterfactual Regret Minimization
R D'Orazio, D Morrill, JR Wright, M Bowling
arXiv preprint arXiv:1912.02967, 2019
52019
Regret Minimization with Function Approximation in Extensive-Form Games
R D'Orazio
42020
Simultaneous prediction intervals for patient-specific survival curves
S Sokota, R D'Orazio, K Javed, H Haider, R Greiner
arXiv preprint arXiv:1906.10780, 2019
42019
Optimistic and adaptive lagrangian hedging
R D'Orazio, R Huang
arXiv preprint arXiv:2101.09603, 2021
22021
Bounds for approximate regret-matching algorithms
R D'Orazio, D Morrill, JR Wright
arXiv preprint arXiv:1910.01706, 2019
12019
A Unified Approach to Reinforcement Learning, Quantal Response Equilibria, and Two-Player Zero-Sum Games
S Sokota, R D'Orazio, JZ Kolter, N Loizou, M Lanctot, I Mitliagkas, ...
arXiv preprint arXiv:2206.05825, 2022
2022
Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games: Corrections
D Morrill, R D'Orazio, M Lanctot, JR Wright, M Bowling, AR Greenwald
arXiv preprint arXiv:2205.12031, 2022
2022
Simultaneous Prediction Intervals for Patient-Specific Survival Curves Download PDF
S Sokota, R D'Orazio, K Javed, H Haider, R Greiner
Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games Supplementary
D Morrill, R D’Orazio, M Lanctot, JR Wright, M Bowling, AR Greenwald
The system can't perform the operation now. Try again later.
Articles 1–13