News
4 papers accepted at NeurIPS 2024
September 26, 2024
Our papers have been accepted at The Thirty-Eighth Annual Conference on Neural Information Processing Systems (NeurIPS 2024) (external link).
No-regret Bandit Exploration based on Soft Tree Ensemble Model
Shogo Iwazaki, Shinya Suzumura
Local Curvature Smoothing with Stein's Identity for Efficient Score Matching
Genki Osada, Makoto Shing (Sakana AI), Takashi Nishide (University of Tsukuba)
Stepwise Alignment for Constrained Language Model Policy Optimization
Akifumi Wachi, Thien Q. Tran, Rei Sato, Takumi Tanabe, Youhei Akimoto (University of Tsukuba)
Flipping-based Policy for Chance-Constrained Markov Decision Processes
Xun Shen (Osaka University), Shuo Jiang (Osaka University), Akifumi Wachi, Kazumune Hashimoto (Osaka University), Sebastien Gros (Norwegian University of Science and Technology)