People
Akifumi Wachi Chief Researcher, LY Research
Akifumi Wachi is a research scientist at LY Research. His research interests lie primarily in reinforcement learning, and span the entire theory-to-application spectrum from fundamental advances to deployment in real-world systems. Especially, he is interested in how a policy should (and can) be trained and deployed in safety-critical problems. See https://akifumi-wachi-4.github.io/website/ (external link) for details.
Publications
-
- CONFERENCE (INTERNATIONAL)
- Flipping-based Policy for Chance-Constrained Markov Decision Processes
- Xun Shen (Osaka University), Shuo Jiang (Osaka University), Akifumi Wachi, Kazumune Hashimoto (Osaka University), Sebastien Gros (Norwegian University of Science and Technology)
- The 38th Annual Conference on Neural Information Processing Systems
- December 13, 2024
-
- CONFERENCE (INTERNATIONAL)
- Stepwise Alignment for Constrained Language Model Policy Optimization
- Akifumi Wachi, Thien Q. Tran, Rei Sato, Takumi Tanabe, Youhei Akimoto (University of Tsukuba)
- The 38th Annual Conference on Neural Information Processing Systems
- December 11, 2024
-
- CONFERENCE (INTERNATIONAL)
- A Survey of Constraint Formulations in Safe Reinforcement Learning
- Akifumi Wachi, Xun Shen (Osaka University), Yanan Sui (Tsinghua University)
- The 33rd International Joint Conference on Artificial Intelligence
- August 03, 2024
-
- CONFERENCE (INTERNATIONAL)
- Safe Reinforcement Learning Using Model Predictive Control with Probabilistic Control Barrier Function
- Xun Shen (Osaka University), Akifumi Wachi, Wataru Hashimoto (Osaka University), Kazumune Hashimoto (Osaka University), Shigemasa Takai (Osaka University)
- 2024 American Control Conference
- July 10, 2024
-
- CONFERENCE (INTERNATIONAL)
- Long-term Safe Reinforcement Learning with Binary Feedback
- Akifumi Wachi, Wataru Hashimoto (Osaka University), Kazumune Hashimoto (Osaka University)
- Thirty-Eighth AAAI Conference on Artificial Intelligence
- March 24, 2024
-
- WORKSHOP (INTERNATIONAL)
- Verbosity Bias in Preference Labeling by Large Language Models
- Keita Saito (University of Tsukuba), Akifumi Wachi, Koki Wataoka, Youhei Akimoto (University of Tsukuba)
- Workshop on Instruction Tuning and Instruction Following at NeurIPS 2023.
- December 16, 2023
-
- CONFERENCE (INTERNATIONAL)
- Safe Exploration in Reinforcement Learning: A Generalized Formulation and Algorithms
- Akifumi Wachi, Wataru Hashimoto (Osaka University), Xun Shen (Osaka University), Kazumune Hashimoto (Osaka University)
- The 37th Conference on Neural Information Processing Systems
- December 13, 2023