Follow
Yuu Jinnai
Yuu Jinnai
Other names佑 陣内
CyberAgent, Inc.
Verified email at cyberagent.co.jp - Homepage
Title
Cited by
Cited by
Year
AlphaX: eXploring Neural Architectures with Deep Neural Networks and Monte Carlo Tree Search
L Wang, Y Zhao, Y Jinnai, Y Tian, R Fonseca
Proceedings of the AAAI Conference on Artificial Intelligence, 2018
144*2018
Policy and value transfer in lifelong reinforcement learning
D Abel*, Y Jinnai*, SY Guo, G Konidaris, M Littman
International Conference on Machine Learning, 20-29, 2018
992018
State abstraction as compression in apprenticeship learning
D Abel, D Arumugam, K Asadi, Y Jinnai, ML Littman, LLS Wong
Proceedings of the AAAI Conference on Artificial Intelligence 33 (01), 3134-3142, 2019
572019
Discovering options for exploration by minimizing cover time
Y Jinnai, JW Park, D Abel, G Konidaris
Proceedings of the 36th International Conference on Machine Learning, 2019
572019
Exploration in reinforcement learning with deep covering options
Y Jinnai, JW Park, MC Machado, G Konidaris
International Conference on Learning Representations, 2020
522020
Finding Options that Minimize Planning Time
Y Jinnai, D Abel, DE Hershkowitz, M Littman, G Konidaris
Proceedings of the 36th International Conference on Machine Learning, 2018
412018
Lipschitz lifelong reinforcement learning
E Lecarpentier, D Abel, K Asadi, Y Jinnai, E Rachelson, ML Littman
Proceedings of the AAAI Conference on Artificial Intelligence 35 (9), 8270-8278, 2021
362021
Learning to Prune Dominated Action Sequences in Online Black-box Planning
Y Jinnai, A Fukunaga
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017
212017
Parallel a* for state-space search
A Fukunaga, A Botea, Y Jinnai, A Kishimoto
Handbook of Parallel Constraint Reasoning, 419-455, 2018
20*2018
Abstract Zobrist Hashing: An Efficient Work Distribution Method for Parallel Best-First Search
Y Jinnai, A Fukunaga
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence …, 2016
172016
Automated Creation of Efficient Work Distribution Functions for Parallel Best-First Search
Y Jinnai, A Fukunaga
International Conference on Automated Planning and Scheduling, 2016
62016
On hash-based work distribution methods for parallel best-first search
Y Jinnai, A Fukunaga
Journal of Artificial Intelligence Research 60, 491-548, 2017
52017
On the depth between beam search and exhaustive search for text generation
Y Jinnai, T Morimura, U Honda
arXiv preprint arXiv:2308.13696, 2023
32023
Filtered Direct Preference Optimization
T Morimura, M Sakamoto, Y Jinnai, K Abe, K Ariu
arXiv preprint arXiv:2404.13846, 2024
22024
Regularized Best-of-N Sampling to Mitigate Reward Hacking for Language Model Alignment
Y Jinnai, T Morimura, K Ariu, K Abe
arXiv preprint arXiv:2404.01054, 2024
22024
Blind signal separation for fast ultrasound computed tomography
T Noda, Y Jinnai, N Tomii, T Azuma
arXiv preprint arXiv:2304.14424, 2023
22023
Generating Diverse and High-Quality Texts by Minimum Bayes Risk Decoding
Y Jinnai, U Honda, T Morimura, P Zhang
arXiv preprint arXiv:2401.05054, 2024
12024
Hyperparameter-Free Approach for Faster Minimum Bayes Risk Decoding
Y Jinnai, K Ariu
arXiv preprint arXiv:2401.02749, 2024
12024
Model-Based Minimum Bayes Risk Decoding for Text Generation
Y Jinnai, T Morimura, U Honda, K Ariu, K Abe
Forty-first International Conference on Machine Learning, 2024
1*2024
Skill Discovery with Well-Defined Objectives
Y Jinnai, D Abel, JW Park, DE Hershkowitz, M Littman, G Konidaris
Structure & Priors in Reinforcement Learning (SPiRL) at ICLR 2019, 2019
12019
The system can't perform the operation now. Try again later.
Articles 1–20