News

NeurIPS 2023 に論文が2本採択されました

2023.10.2

以下の論文がNeural Information Processing Systems (NeurIPS) (外部サイト) で採択されました。

Safe Exploration in Reinforcement Learning: A Generalized Formulation and Algorithms
Akifumi Wachi, Wataru Hashimoto (Osaka University), Xun Shen (Osaka University), Kazumune Hashimoto (Osaka University)

Direct Preference-based Policy Optimization without Reward Modeling
Gaon An (Seoul National University), Junhyeok Lee (Seoul National University), Xingdong Zuo (NAVER), Norio Kosaka, Kyung-Min Kim (NAVER), Hyun Oh Song (Seoul National University)