Follow
Merwan Barlier
Merwan Barlier
Verified email at huawei.com
Title
Cited by
Cited by
Year
Transfer reinforcement learning with shared dynamics
R Laroche, M Barlier
Proceedings of the AAAI conference on artificial intelligence 31 (1), 2017
592017
Human-machine dialogue as a stochastic game
M Barlier, J Perolat, R Laroche, O Pietquin
16th Annual SIGdial Meeting on Discourse and Dialogue (SIGDIAL 2015), 2015
312015
Training dialogue systems with human advice
M Barlier, R Laroche, O Pietquin
AAMAS 2018-the 17th International Conference on Autonomous Agents and …, 2018
52018
A simple and efficient smoothing method for faster optimization and local exploration
K Scaman, L Dos Santos, M Barlier, I Colin
Advances in Neural Information Processing Systems 33, 6503-6513, 2020
42020
A stochastic model for computer-aided human-human dialogue
M Barlier, R Laroche, O Pietquin
Interspeech 2016 2016, 2051-2055, 2016
42016
Density estimation for conservative q-Learning
P Daoudi, L Dos Santos, M Barlier, A Virmaux
ICLR 2022 Workshop on Generalizable Policy Learning in Physical World, 2022
32022
Human-machine dialogue as a stochastic game
B Merwan, P Julien, L Romain, P Olivier
16th Annual SIGdial Meeting on Discourse and Dialogue (SIGDIAL 2015), 2015
32015
Density estimation for conservative q-learning, 2022
P Daoudi, M Barlier, LD Santos, A Virmaux
URL https://openreview. net/forum, 0
3
Multi-agent best arm identification with private communications
A Rio, M Barlier, I Colin, M Soare
International Conference on Machine Learning, 29082-29102, 2023
22023
Learning dialogue dynamics with the method of moments
M Barlier, R Laroche, O Pietquin
2016 IEEE Spoken Language Technology Workshop (SLT), 98-105, 2016
22016
Enhancing reinforcement learning agents with local guides
P Daoudi, B Robu, C Prieur, LD Santos, M Barlier
arXiv preprint arXiv:2402.13930, 2024
12024
Improving a Proportional Integral Controller with Reinforcement Learning on a Throttle Valve Benchmark
P Daoudi, B Mavkov, B Robu, C Prieur, E Witrant, M Barlier, LD Santos
arXiv preprint arXiv:2402.13654, 2024
2024
Differentially Private Model-Based Offline Reinforcement Learning
A Rio, M Barlier, I Colin, A Thomas
arXiv preprint arXiv:2402.05525, 2024
2024
A Trust Region Approach for Few-Shot Sim-to-Real Reinforcement Learning
P Daoudi, C Prieur, B Robu, M Barlier, LD Santos
arXiv preprint arXiv:2312.15474, 2023
2023
Clustered Multi-Agent Linear Bandits
H Cherkaoui, M Barlier, I Colin
arXiv preprint arXiv:2309.08710, 2023
2023
Price of Safety in Linear Best Arm Identification
X Shang, I Colin, M Barlier, H Cherkaoui
arXiv preprint arXiv:2309.08709, 2023
2023
Method and system for a controller
L Dos Santos, M Barlier, K Balazs, I Colin
US Patent App. 18/065,800, 2023
2023
Sur le rôle de l’être humain dans le dialogue humain/machine
M Barlier
Université de lille, 2018
2018
Wireless Parameter Tuning with Clustering Multi-Agents in Linear Stochastic Bandit
H Cherkaoui, M Barlier, I Colin
The system can't perform the operation now. Try again later.
Articles 1–19