Trustworthy AI
-
- CONFERENCE (INTERNATIONAL)
- A Survey of Constraint Formulations in Safe Reinforcement Learning
- Akifumi Wachi, Xun Shen (Osaka University), Yanan Sui (Tsinghua University)
- The 33rd International Joint Conference on Artificial Intelligence (IJCAI 2024)
- August 03, 2024
-
- CONFERENCE (INTERNATIONAL)
- Safe Reinforcement Learning Using Model Predictive Control with Probabilistic Control Barrier Function
- Xun Shen (Osaka University), Akifumi Wachi, Wataru Hashimoto (Osaka University), Kazumune Hashimoto (Osaka University), Shigemasa Takai (Osaka University)
- 2024 American Control Conference (ACC 2024)
- July 10, 2024
-
- CONFERENCE (INTERNATIONAL)
- Watermark-embedded Adversarial Examples for Copyright Protection against Diffusion Models
- Peifei Zhu, Tsubasa Takahashi, Hirokatsu Kataoka
- The IEEE/CVF Conference on Computer Vision and Pattern Recognition 2024 (CVPR 2024)
- June 17, 2024
-
- CONFERENCE (INTERNATIONAL)
- Long-term Safe Reinforcement Learning with Binary Feedback
- Akifumi Wachi, Wataru Hashimoto (Osaka University), Kazumune Hashimoto (Osaka University)
- Thirty-Eighth AAAI Conference on Artificial Intelligence (AAAI-24)
- March 24, 2024
-
- CONFERENCE (INTERNATIONAL)
- Understanding Likelihood of Normalizing Flow and Image Complexity through the Lens of Out-of-Distribution Detection
- Genki Osada, Tsubasa Takahashi, Takashi Nishide (University of Tsukuba)
- Thirty-Eighth AAAI Conference on Artificial Intelligence (AAAI-24)
- March 24, 2024
-
- OTHERS (DOMESTIC)
- Constitutional AI におけるセーフティアラインメントの改善
- 綿岡 晃輝, Thien Q. Tran, 前田 若菜, 髙橋 翼
- 言語処理学会第30回年次大会 (NLP2024)
- March 04, 2024
-
- OTHERS (DOMESTIC)
- 対話モデルに対する敵対的プロンプトの効率的な最適化
- 矢野 一樹 (東北大学), 綿岡 晃輝, Thien Q. Tran, 髙橋 翼, Seng Pei Liew, 鈴木 潤 (東北大学/理化学研究所)
- 言語処理学会第30回年次大会 (NLP2024)
- March 04, 2024