Follow
Yuu Jinnai
Yuu Jinnai
Other names佑 陣内
CyberAgent, Inc.
Verified email at cyberagent.co.jp - Homepage
Title
Cited by
Cited by
Year
AlphaX: eXploring Neural Architectures with Deep Neural Networks and Monte Carlo Tree Search
L Wang, Y Zhao, Y Jinnai, Y Tian, R Fonseca
Proceedings of the AAAI Conference on Artificial Intelligence, 2018
153*2018
Policy and value transfer in lifelong reinforcement learning
D Abel*, Y Jinnai*, SY Guo, G Konidaris, M Littman
International Conference on Machine Learning, 20-29, 2018
1012018
State abstraction as compression in apprenticeship learning
D Abel, D Arumugam, K Asadi, Y Jinnai, ML Littman, LLS Wong
Proceedings of the AAAI Conference on Artificial Intelligence 33 (01), 3134-3142, 2019
602019
Discovering options for exploration by minimizing cover time
Y Jinnai, JW Park, D Abel, G Konidaris
Proceedings of the 36th International Conference on Machine Learning, 2019
592019
Exploration in reinforcement learning with deep covering options
Y Jinnai, JW Park, MC Machado, G Konidaris
International Conference on Learning Representations, 2020
532020
Finding Options that Minimize Planning Time
Y Jinnai, D Abel, DE Hershkowitz, M Littman, G Konidaris
Proceedings of the 36th International Conference on Machine Learning, 2019
432019
Lipschitz lifelong reinforcement learning
E Lecarpentier, D Abel, K Asadi, Y Jinnai, E Rachelson, ML Littman
Proceedings of the AAAI Conference on Artificial Intelligence 35 (9), 8270-8278, 2021
392021
Learning to Prune Dominated Action Sequences in Online Black-box Planning
Y Jinnai, A Fukunaga
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017
212017
Parallel a* for state-space search
A Fukunaga, A Botea, Y Jinnai, A Kishimoto
Handbook of Parallel Constraint Reasoning, 419-455, 2018
20*2018
Abstract Zobrist Hashing: An Efficient Work Distribution Method for Parallel Best-First Search
Y Jinnai, A Fukunaga
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence …, 2016
182016
Filtered Direct Preference Optimization
T Morimura, M Sakamoto, Y Jinnai, K Abe, K Ariu
Proceedings of the 2024 Conference on Empirical Methods in Natural Language …, 2024
72024
Automated Creation of Efficient Work Distribution Functions for Parallel Best-First Search
Y Jinnai, A Fukunaga
International Conference on Automated Planning and Scheduling, 2016
62016
On hash-based work distribution methods for parallel best-first search
Y Jinnai, A Fukunaga
Journal of Artificial Intelligence Research 60, 491-548, 2017
52017
On the depth between beam search and exhaustive search for text generation
Y Jinnai, T Morimura, U Honda
arXiv preprint arXiv:2308.13696, 2023
42023
Blind signal separation for fast ultrasound computed tomography
T Noda, Y Jinnai, N Tomii, T Azuma
arXiv preprint arXiv:2304.14424, 2023
42023
Regularized Best-of-N Sampling to Mitigate Reward Hacking for Language Model Alignment
Y Jinnai, T Morimura, K Ariu, K Abe
arXiv preprint arXiv:2404.01054, 2024
32024
Generating Diverse and High-Quality Texts by Minimum Bayes Risk Decoding
Y Jinnai, U Honda, T Morimura, P Zhang
Findings of the Association for Computational Linguistics ACL 2024, 8494–8525, 2024
22024
Model-Based Minimum Bayes Risk Decoding for Text Generation
Y Jinnai, T Morimura, U Honda, K Ariu, K Abe
Proceedings of the 41st International Conference on Machine Learning 235 …, 2024
2*2024
Does Cross-Cultural Alignment Change the Commonsense Morality of Language Models?
Y Jinnai
Proceedings of the 2nd Workshop on Cross-Cultural Considerations in NLP, 48--64, 2024
12024
Hyperparameter-Free Approach for Faster Minimum Bayes Risk Decoding
Y Jinnai, K Ariu
Findings of the Association for Computational Linguistics ACL 2024, 8547–8566, 2024
12024
The system can't perform the operation now. Try again later.
Articles 1–20