News

NeurIPS 2024に論文が4本採択されました

2024.9.26

以下の論文がThe Thirty-Eighth Annual Conference on Neural Information Processing Systems (NeurIPS 2024) (外部サイト) で採択されました。

No-regret Bandit Exploration based on Soft Tree Ensemble Model
Shogo Iwazaki, Shinya Suzumura

Local Curvature Smoothing with Stein's Identity for Efficient Score Matching
Genki Osada, Makoto Shing (Sakana AI), Takashi Nishide (University of Tsukuba)

Stepwise Alignment for Constrained Language Model Policy Optimization
Akifumi Wachi, Thien Q. Tran, Rei Sato, Takumi Tanabe, Youhei Akimoto (University of Tsukuba)

Flipping-based Policy for Chance-Constrained Markov Decision Processes
Xun Shen (Osaka University), Shuo Jiang (Osaka University), Akifumi Wachi, Kazumune Hashimoto (Osaka University), Sebastien Gros (Norwegian University of Science and Technology)