Poster | Thu 16:30
Distributional Successor Features Enable Zero-Shot Policy Optimization
Chuning Zhu · Xinqi Wang · Tyler Han · Simon Du · Abhishek Gupta

Workshop
Policy optimization to align the validity, coherence and efficiency of reasoning agents in multi-turn dialogues
Jeremy Curuksu

Poster | Wed 11:00
Policy Optimization for Robust Average Reward MDPs
Zhongchang Sun · Sihong He · Fei Miao · Shaofeng Zou

Poster | Thu 11:00
Warm-up Free Policy Optimization: Improved Regret in Linear Markov Decision Processes
Asaf Cassel · Aviv Rosenberg

Poster | Wed 16:30
Occupancy-based Policy Gradient: Estimation, Convergence, and Optimality
Audrey Huang · Nan Jiang

Poster | Thu 11:00
Graph Diffusion Policy Optimization
Yijing Liu · Chao Du · Tianyu Pang · Chongxuan LI · Min Lin · Wei Chen

Poster
Learning the Optimal Policy for Balancing Short-Term and Long-Term Rewards
Qinwei Yang · Xueqing Liu · Yan Zeng · Ruocheng Guo · Yang Liu · Peng Wu

Workshop
P3O: Pessimistic Preference-based Policy Optimization for Robust Alignment from Preferences
Dhawal Gupta · Christoph Dann · Alekh Agarwal

Workshop
Recursive Nested Filtering for Efficient Amortized Bayesian Experimental Design
Sahel Mohammad Iqbal · Hany Abdulsamad · Sara Perez-Vieites · Simo Sarkka · Adrien Corenflos