Who Gets the Reward & Who Gets the Blame? Evaluation-Aligned Post-Training for Multi-LLM Agents

Chih-Hsuan Yang ⋅ Tanwi Mallick ⋅ Ian Foster ⋅ Amal Gueroudji ⋅ Rajeev Thakur

Abstract
