Skip to yearly menu bar Skip to main content


Search All 2024 Events
 

17 Results

<<   <   Page 1 of 2   >   >>
Poster
Wed 16:30 Last-Iterate Global Convergence of Policy Gradients for Constrained Reinforcement Learning
Alessandro Montenegro · Marco Mussi · Matteo Papini · Alberto Maria Metelli
Poster
Thu 16:30 Uniform Last-Iterate Guarantee for Bandits and Reinforcement Learning
Junyan Liu · Yunfan Li · Ruosong Wang · Lin Yang
Poster
Fri 11:00 Fast Last-Iterate Convergence of Learning in Games Requires Forgetful Algorithms
Yang Cai · Gabriele Farina · Julien Grand-Clément · Christian Kroer · Chung-Wei Lee · Haipeng Luo · Weiqiang Zheng
Poster
Fri 11:00 Exploiting the Replay Memory Before Exploring the Environment: Enhancing Reinforcement Learning Through Empirical MDP Iteration
Hongming Zhang · Chenjun Xiao · Chao Gao · Han Wang · bo xu · Martin Müller
Workshop
Sun 9:50 Lightning Talk: Monte Carlo Tree Search Boosts Reasoning via Iterative Preference Learning
Poster
DeltaDEQ: Exploiting Heterogeneous Convergence for Accelerating Deep Equilibrium Iterations
Zuowen Wang · Longbiao Cheng · Pehuen Moure · Niklas Hahn · Shih-Chii Liu
Poster
Wed 16:30 REBORN: Reinforcement-Learned Boundary Segmentation with Iterative Training for Unsupervised ASR
Liang-Hsuan Tseng · En-Pei Hu · Cheng-Han Chiang · Yuan Tseng · Hung-yi Lee · Lin-shan Lee · Shao-Hua Sun
Poster
Fri 11:00 Online Iterative Reinforcement Learning from Human Feedback with General Preference Model
Chenlu Ye · Wei Xiong · Yuheng Zhang · Hanze Dong · Nan Jiang · Tong Zhang
Poster
Thu 16:30 Learning to Reason Iteratively and Parallelly for Complex Visual Reasoning Scenarios
Shantanu Jaiswal · Debaditya Roy · Basura Fernando · Cheston Tan
Poster
Fri 16:30 Iteratively Refined Behavior Regularization for Offline Reinforcement Learning
Yi Ma · Jianye Hao · Xiaohan Hu · YAN ZHENG · Chenjun Xiao
Poster
Thu 16:30 OptEx: Expediting First-Order Optimization with Approximately Parallelized Iterations
Yao Shu · Jiongfeng Fang · Ying He · Fei Yu
Poster
Fri 11:00 UQ-Guided Hyperparameter Optimization for Iterative Learners
Jiesong Liu · Feng Zhang · Jiawei Guan · Xipeng Shen