SocialMaze: A Benchmark for Evaluating Social Reasoning in Large Language Models

Zixiang Xu · Yanbo Wang · Yue Huang · Haomin Zhuang · Yujun Zhou · Jiayi Ye · Zirui Song · Lang Gao · Chenxi Wang · Zhaorun Chen · Sixian Li · Wang Pan · Yue Zhao · Xiangliang Zhang · Xiuying Chen

Abstract
