firstbacksecondback
90 Results
Poster
|
Wed 11:00 |
Decision Mamba: A Multi-Grained State Space Model with Self-Evolution Regularization for Offline RL Qi Lv · Xiang Deng · Gongwei Chen · MICHAEL YU WANG · Liqiang Nie |
|
Workshop
|
SAC-GLAM: Improving Online RL for LLM agents with Soft Actor-Critic and Hindsight Relabeling Loris Gaven · Thomas Carta · Clément ROMAC · Olivier Sigaud · Sylvain Lamprier · Pierre-Yves Oudeyer |
||
Poster
|
Wed 11:00 |
Diffusion-Reward Adversarial Imitation Learning Chun-Mao Lai · Hsiang-Chun Wang · Ping-Chun Hsieh · Frank Wang · Min-Hung Chen · Shao-Hua Sun |
|
Poster
|
Wed 11:00 |
A Simulation Benchmark for Autonomous Racing with Large-Scale Human Data Adrian Remonda · Nicklas Hansen · Ayoub Raji · Nicola Musiu · Marko Bertogna · Eduardo Veas · Xiaolong Wang |
|
Poster
|
Wed 16:30 |
Multi-turn Reinforcement Learning with Preference Human Feedback Lior Shani · Aviv Rosenberg · Asaf Cassel · Oran Lang · Daniele Calandriello · Avital Zipori · Hila Noga · Orgad Keller · Bilal Piot · Idan Szpektor · Avinatan Hassidim · Yossi Matias · Remi Munos |
|
Poster
|
Wed 11:00 |
Distributional Preference Alignment of LLMs via Optimal Transport Igor Melnyk · Youssef Mroueh · Brian Belgodere · Mattia Rigotti · Apoorva Nitsure · Mikhail Yurochkin · Kristjan Greenewald · Jiri Navratil · Jarret Ross |