

Benchmark Self-Evolving: A Multi-Agent Framework for Dynamic LLM Evaluation

Siyuan Wang ⋅ Zhuohan Long ⋅ Zhihao Fan ⋅ Xuanjing Huang ⋅ Zhongyu Wei

Abstract
