Skip to yearly menu bar Skip to main content


Where did the Reasoning Go Wrong? A Benchmark of Puzzle-Based Visual Tasks with Error Detection

Yusu Qian ⋅ Cheng Wan ⋅ Chao Jia ⋅ Yinfei Yang ⋅ Qingyu Zhao ⋅ Zhe Gan

Abstract

Chat is not available.