SocialMaze: A Benchmark for Evaluating Social Reasoning in Large Language Models

Zixiang Xu ⋅ Yanbo Wang ⋅ Yue Huang ⋅ Haomin Zhuang ⋅ Yujun Zhou ⋅ Jiayi Ye ⋅ Zirui Song ⋅ Lang Gao ⋅ Chenxi Wang ⋅ Zhaorun Chen ⋅ Sixian Li ⋅ Wang Pan ⋅ Yue Zhao ⋅ Xiangliang Zhang ⋅ Xiuying Chen

Abstract
