News
NeurIPS 2023 に論文が2本採択されました
2023.10.2
以下の論文がNeural Information Processing Systems (NeurIPS) (外部サイト) で採択されました。
Safe Exploration in Reinforcement Learning: A Generalized Formulation and Algorithms
Akifumi Wachi, Wataru Hashimoto (Osaka University), Xun Shen (Osaka University), Kazumune Hashimoto (Osaka University)
Direct Preference-based Policy Optimization without Reward Modeling
Gaon An (Seoul National University), Junhyeok Lee (Seoul National University), Xingdong Zuo (NAVER), Norio Kosaka, Kyung-Min Kim (NAVER), Hyun Oh Song (Seoul National University)