Trustworthy AI
-
- カンファレンス (国際)
- A Survey of Constraint Formulations in Safe Reinforcement Learning
- Akifumi Wachi, Xun Shen (Osaka University), Yanan Sui (Tsinghua University)
- The 33rd International Joint Conference on Artificial Intelligence (IJCAI 2024)
- 2024.8.3
-
- カンファレンス (国際)
- Safe Reinforcement Learning Using Model Predictive Control with Probabilistic Control Barrier Function
- Xun Shen (Osaka University), Akifumi Wachi, Wataru Hashimoto (Osaka University), Kazumune Hashimoto (Osaka University), Shigemasa Takai (Osaka University)
- 2024 American Control Conference (ACC 2024)
- 2024.7.10
-
- カンファレンス (国際)
- Watermark-embedded Adversarial Examples for Copyright Protection against Diffusion Models
- Peifei Zhu, Tsubasa Takahashi, Hirokatsu Kataoka
- The IEEE/CVF Conference on Computer Vision and Pattern Recognition 2024 (CVPR 2024)
- 2024.6.17
-
- カンファレンス (国際)
- Long-term Safe Reinforcement Learning with Binary Feedback
- Akifumi Wachi, Wataru Hashimoto (Osaka University), Kazumune Hashimoto (Osaka University)
- Thirty-Eighth AAAI Conference on Artificial Intelligence (AAAI-24)
- 2024.3.24
-
- カンファレンス (国際)
- Understanding Likelihood of Normalizing Flow and Image Complexity through the Lens of Out-of-Distribution Detection
- Genki Osada, Tsubasa Takahashi, Takashi Nishide (University of Tsukuba)
- Thirty-Eighth AAAI Conference on Artificial Intelligence (AAAI-24)
- 2024.3.24
-
- その他 (国内)
- Constitutional AI におけるセーフティアラインメントの改善
- 綿岡 晃輝, Thien Q. Tran, 前田 若菜, 髙橋 翼
- 言語処理学会第30回年次大会 (NLP2024)
- 2024.3.4
-
- その他 (国内)
- 対話モデルに対する敵対的プロンプトの効率的な最適化
- 矢野 一樹 (東北大学), 綿岡 晃輝, Thien Q. Tran, 髙橋 翼, Seng Pei Liew, 鈴木 潤 (東北大学/理化学研究所)
- 言語処理学会第30回年次大会 (NLP2024)
- 2024.3.4