firstbacksecondback
312 Results
Poster
|
Optimistic Critic Reconstruction and Constrained Fine-Tuning for General Offline-to-Online RL Qin-Wen Luo · Ming-Kun Xie · Yewen Wang · Sheng-Jun Huang |
||
Workshop
|
Improving Fine-Tuning with Latent Cluster Correction Cédric Thanh |
||
Poster
|
Fri 16:30 |
Coevolving with the Other You: Fine-Tuning LLM with Sequential Cooperative Multi-Agent Reinforcement Learning Hao Ma · Tianyi Hu · Zhiqiang Pu · Liu Boyin · Xiaolin Ai · Yanyan Liang · Min Chen |
|
Workshop
|
Teaching LLMs How To Learn with Contextual Fine-Tuning Younwoo Choi · Muhammad Adil Asif · Ziwen Han · John Willes · Rahul Krishnan |
||
Poster
|
Wed 11:00 |
RaVL: Discovering and Mitigating Spurious Correlations in Fine-Tuned Vision-Language Models Maya Varma · Jean-Benoit Delbrouck · Zhihong Chen · Akshay Chaudhari · Curtis Langlotz |
|
Workshop
|
Sat 12:00 |
CPP-UT-Bench: Can LLMs Write Complex Unit Tests in C++? Vaishnavi Bhargava · Rajat Ghosh · Debojyoti Dutta |
|
Workshop
|
Fine-tuning LLM Agents with Retrospective In-Context Online Learning Wen-Tse Chen · Jiayu Chen · Fahim Tajwar · Hao Zhu · Xintong Duan · Ruslan Salakhutdinov · Jeff Schneider |
||
Workshop
|
Sat 14:20 |
Fine-tuning LLM Agents with Retrospective In-Context Online Learning Wen-Tse Chen · Jiayu Chen · Fahim Tajwar · Hao Zhu · Xintong Duan · Ruslan Salakhutdinov · Jeff Schneider |
|
Poster
|
Thu 11:00 |
BackdoorAlign: Mitigating Fine-tuning based Jailbreak Attack with Backdoor Enhanced Safety Alignment Jiongxiao Wang · Jiazhao LI · Yiquan Li · Xiangyu Qi · Junjie Hu · Sharon Li · Patrick McDaniel · Muhao Chen · Bo Li · Chaowei Xiao |
|
Workshop
|
Fine-Tuning a Time Series Foundation Model with Wasserstein Loss Andrei Chernov |
||
Workshop
|
Sat 10:30 |
Efficiently Learning at Test-Time: Active Fine-Tuning of LLMs Jonas Hübotter · Sascha Bongni · Ido Hakimi · Andreas Krause |
|
Workshop
|
Efficiently Learning at Test-Time: Active Fine-Tuning of LLMs Jonas Hübotter · Sascha Bongni · Ido Hakimi · Andreas Krause |