LINEヤフーの研究開発

JP
EN

Publications

Trustworthy AI

その他 (国際)

A Relative-Budget Theory for Reinforcement Learning with Verifiable Rewards in Large Language Model Reasoning

Akifumi Wachi, Hirota Kinoshita (Toyota Technological Institute at Chicago), Shokichi Takakura, Rei Higuchi (University of Tokyo/RIKEN AIP), Taiji Suzuki (University of Tokyo/RIKEN AIP)

arXiv.org (arXiv)

2026.2.2
カンファレンス (国際)

A Provable Approach for End-to-End Safe Reinforcement Learning

Akifumi Wachi, Kohei Miyaguchi, Takumi Tanabe, Rei Sato, Youhei Akimoto (University of Tsukuba, RIKEN AIP)

The Thirty-Ninth Annual Conference on Neural Information Processing Systems (NeurIPS 2025)

2025.12.5
カンファレンス (国際)

Offline Guarded Safe Reinforcement Learning for Medical Treatment Optimization Strategies

Runze Yan (Emory University), Xun Shen (Tokyo University of Agriculture and Technology), Akifumi Wachi, Sebastien Gros (Norwegian University of Science and Technology), Anni Zhao (Emory University), Xiao Hu (Emory University)

The Thirty-Ninth Annual Conference on Neural Information Processing Systems (NeurIPS 2025)

2025.12.3
その他 (国際)

Vulnerability Mitigation for Safety-Aligned Language Models via Debiasing

Thien Q. Tran, Akifumi Wachi, Rei Sato, Takumi Tanabe, Youhei AKimoto (University of Tsukuba, RIKEN AIP)

arXiv.org (arXiv)

2025.2.4
カンファレンス (国際)

Flipping-based Policy for Chance-Constrained Markov Decision Processes

Xun Shen (Osaka University), Shuo Jiang (Osaka University), Akifumi Wachi, Kazumune Hashimoto (Osaka University), Sebastien Gros (Norwegian University of Science and Technology)

The 38th Annual Conference on Neural Information Processing Systems (NeurIPS 2024)

2024.12.13
カンファレンス (国際)

Stepwise Alignment for Constrained Language Model Policy Optimization

Akifumi Wachi, Thien Q. Tran, Rei Sato, Takumi Tanabe, Youhei Akimoto (University of Tsukuba)

The 38th Annual Conference on Neural Information Processing Systems (NeurIPS 2024)

2024.12.11
カンファレンス (国際)

A Survey of Constraint Formulations in Safe Reinforcement Learning

Akifumi Wachi, Xun Shen (Osaka University), Yanan Sui (Tsinghua University)

The 33rd International Joint Conference on Artificial Intelligence (IJCAI 2024)

2024.8.3
カンファレンス (国際)

Safe Reinforcement Learning Using Model Predictive Control with Probabilistic Control Barrier Function

Xun Shen (Osaka University), Akifumi Wachi, Wataru Hashimoto (Osaka University), Kazumune Hashimoto (Osaka University), Shigemasa Takai (Osaka University)

2024 American Control Conference (ACC 2024)

2024.7.10
カンファレンス (国際)

Watermark-embedded Adversarial Examples for Copyright Protection against Diffusion Models

Peifei Zhu, Tsubasa Takahashi, Hirokatsu Kataoka

The IEEE/CVF Conference on Computer Vision and Pattern Recognition 2024 (CVPR 2024)

2024.6.17
カンファレンス (国際)

Long-term Safe Reinforcement Learning with Binary Feedback

Akifumi Wachi, Wataru Hashimoto (Osaka University), Kazumune Hashimoto (Osaka University)

Thirty-Eighth AAAI Conference on Artificial Intelligence (AAAI-24)

2024.3.24
カンファレンス (国際)

Understanding Likelihood of Normalizing Flow and Image Complexity through the Lens of Out-of-Distribution Detection

Genki Osada, Tsubasa Takahashi, Takashi Nishide (University of Tsukuba)

Thirty-Eighth AAAI Conference on Artificial Intelligence (AAAI-24)

2024.3.24
その他 (国内)

Constitutional AI におけるセーフティアラインメントの改善

綿岡晃輝, Thien Q. Tran, 前田若菜, 髙橋翼

言語処理学会第30回年次大会 (NLP2024)

2024.3.4
その他 (国内)

対話モデルに対する敵対的プロンプトの効率的な最適化

矢野一樹 (東北大学), 綿岡晃輝, Thien Q. Tran, 髙橋翼, Seng Pei Liew, 鈴木潤 (東北大学/理化学研究所)

言語処理学会第30回年次大会 (NLP2024)

2024.3.4

1