News
Four Papers Accepted at NeurIPS 2024
2024.9.26
The following papers have been accepted at The Thirty-Eighth Annual Conference on Neural Information Processing Systems (NeurIPS 2024).
No-regret Bandit Exploration based on Soft Tree Ensemble Model 
Shogo Iwazaki, Shinya Suzumura 
Local Curvature Smoothing with Stein's Identity for Efficient Score Matching 
Genki Osada, Makoto Shing (Sakana AI), Takashi Nishide (University of Tsukuba) 
Stepwise Alignment for Constrained Language Model Policy Optimization 
Akifumi Wachi, Thien Q. Tran, Rei Sato, Takumi Tanabe, Youhei Akimoto (University of Tsukuba) 
Flipping-based Policy for Chance-Constrained Markov Decision Processes 
Xun Shen (Osaka University), Shuo Jiang (Osaka University), Akifumi Wachi, Kazumune Hashimoto (Osaka University), Sebastien Gros (Norwegian University of Science and Technology)