LINEヤフーの研究開発

JP
EN

Publications

Trustworthy AI

その他 (国際)

A Provable Approach for End-to-End Safe Reinforcement Learning

Akifumi Wachi, Kohei Miyaguchi, Takumi Tanabe, Rei Sato, Youhei Akimoto (University of Tsukuba)

arXiv.org (arXiv)

2025.5.28
その他 (国際)

Vulnerability Mitigation for Safety-Aligned Language Models via Debiasing

Thien Q. Tran, Akifumi Wachi, Rei Sato, Takumi Tanabe, Youhei AKimoto (University of Tsukuba, RIKEN AIP)

arXiv.org (arXiv)

2025.2.4
カンファレンス (国際)

Flipping-based Policy for Chance-Constrained Markov Decision Processes

Xun Shen (Osaka University), Shuo Jiang (Osaka University), Akifumi Wachi, Kazumune Hashimoto (Osaka University), Sebastien Gros (Norwegian University of Science and Technology)

The 38th Annual Conference on Neural Information Processing Systems (NeurIPS 2024)

2024.12.13
カンファレンス (国際)

Stepwise Alignment for Constrained Language Model Policy Optimization

Akifumi Wachi, Thien Q. Tran, Rei Sato, Takumi Tanabe, Youhei Akimoto (University of Tsukuba)

The 38th Annual Conference on Neural Information Processing Systems (NeurIPS 2024)

2024.12.11
カンファレンス (国際)

A Survey of Constraint Formulations in Safe Reinforcement Learning

Akifumi Wachi, Xun Shen (Osaka University), Yanan Sui (Tsinghua University)

The 33rd International Joint Conference on Artificial Intelligence (IJCAI 2024)

2024.8.3
カンファレンス (国際)

Safe Reinforcement Learning Using Model Predictive Control with Probabilistic Control Barrier Function

Xun Shen (Osaka University), Akifumi Wachi, Wataru Hashimoto (Osaka University), Kazumune Hashimoto (Osaka University), Shigemasa Takai (Osaka University)

2024 American Control Conference (ACC 2024)

2024.7.10
カンファレンス (国際)

Watermark-embedded Adversarial Examples for Copyright Protection against Diffusion Models

Peifei Zhu, Tsubasa Takahashi, Hirokatsu Kataoka

The IEEE/CVF Conference on Computer Vision and Pattern Recognition 2024 (CVPR 2024)

2024.6.17
カンファレンス (国際)

Long-term Safe Reinforcement Learning with Binary Feedback

Akifumi Wachi, Wataru Hashimoto (Osaka University), Kazumune Hashimoto (Osaka University)

Thirty-Eighth AAAI Conference on Artificial Intelligence (AAAI-24)

2024.3.24
カンファレンス (国際)

Understanding Likelihood of Normalizing Flow and Image Complexity through the Lens of Out-of-Distribution Detection

Genki Osada, Tsubasa Takahashi, Takashi Nishide (University of Tsukuba)

Thirty-Eighth AAAI Conference on Artificial Intelligence (AAAI-24)

2024.3.24
その他 (国内)

Constitutional AI におけるセーフティアラインメントの改善

綿岡晃輝, Thien Q. Tran, 前田若菜, 髙橋翼

言語処理学会第30回年次大会 (NLP2024)

2024.3.4
その他 (国内)

対話モデルに対する敵対的プロンプトの効率的な最適化

矢野一樹 (東北大学), 綿岡晃輝, Thien Q. Tran, 髙橋翼, Seng Pei Liew, 鈴木潤 (東北大学/理化学研究所)

言語処理学会第30回年次大会 (NLP2024)

2024.3.4

1