Poster
|
Wed 16:30
|
Last-Iterate Global Convergence of Policy Gradients for Constrained Reinforcement Learning
Alessandro Montenegro · Marco Mussi · Matteo Papini · Alberto Maria Metelli
|
|
Poster
|
Thu 16:30
|
Uniform Last-Iterate Guarantee for Bandits and Reinforcement Learning
Junyan Liu · Yunfan Li · Ruosong Wang · Lin Yang
|
|
Poster
|
Fri 11:00
|
Fast Last-Iterate Convergence of Learning in Games Requires Forgetful Algorithms
Yang Cai · Gabriele Farina · Julien Grand-Clément · Christian Kroer · Chung-Wei Lee · Haipeng Luo · Weiqiang Zheng
|
|
Poster
|
Fri 11:00
|
Exploiting the Replay Memory Before Exploring the Environment: Enhancing Reinforcement Learning Through Empirical MDP Iteration
Hongming Zhang · Chenjun Xiao · Chao Gao · Han Wang · Bo Xu · Martin Müller
|
|
Workshop
|
Sun 9:50
|
Lightning Talk: Monte Carlo Tree Search Boosts Reasoning via Iterative Preference Learning
|
|
Poster
|
|
DeltaDEQ: Exploiting Heterogeneous Convergence for Accelerating Deep Equilibrium Iterations
Zuowen Wang · Longbiao Cheng · Pehuen Moure · Niklas Hahn · Shih-Chii Liu
|
|
Poster
|
Wed 16:30
|
REBORN: Reinforcement-Learned Boundary Segmentation with Iterative Training for Unsupervised ASR
Liang-Hsuan Tseng · En-Pei Hu · Cheng-Han Chiang · Yuan Tseng · Hung-yi Lee · Lin-shan Lee · Shao-Hua Sun
|
|
Poster
|
Fri 11:00
|
Online Iterative Reinforcement Learning from Human Feedback with General Preference Model
Chenlu Ye · Wei Xiong · Yuheng Zhang · Hanze Dong · Nan Jiang · Tong Zhang
|
|
Poster
|
Thu 16:30
|
Learning to Reason Iteratively and Parallelly for Complex Visual Reasoning Scenarios
Shantanu Jaiswal · Debaditya Roy · Basura Fernando · Cheston Tan
|
|
Poster
|
Fri 16:30
|
Iteratively Refined Behavior Regularization for Offline Reinforcement Learning
Yi Ma · Jianye Hao · Xiaohan Hu · Yan Zheng · Chenjun Xiao
|
|
Poster
|
Thu 16:30
|
OptEx: Expediting First-Order Optimization with Approximately Parallelized Iterations
Yao Shu · Jiongfeng Fang · Ying He · Fei Yu
|
|
Poster
|
Fri 11:00
|
UQ-Guided Hyperparameter Optimization for Iterative Learners
Jiesong Liu · Feng Zhang · Jiawei Guan · Xipeng Shen
|
|