Skip to yearly menu bar Skip to main content


Trust but Verify: Reliable VLM evaluation in-the-wild with program synthesis

Viraj Uday Prabhu ⋅ Senthil Purushwalkam ⋅ Jieyu Zhang ⋅ An Yan ⋅ Caiming Xiong ⋅ Ran Xu

Abstract

Chat is not available.