firstbacksecondback
2 Results
Poster
|
Tue 8:45 |
Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena Lianmin Zheng · Wei-Lin Chiang · Ying Sheng · Siyuan Zhuang · Zhanghao Wu · Yonghao Zhuang · Zi Lin · Zhuohan Li · Dacheng Li · Eric Xing · Hao Zhang · Joseph Gonzalez · Ion Stoica |
|
Workshop
|
Prometheus: Inducing Evaluation Capability in Language Models Seungone Kim · Jamin Shin · Yejin Cho · Joel Jang · Shayne Longpre · Hwaran Lee · Sangdoo Yun · Seongjin Shin · Sungdong Kim · James Thorne · Minjoon Seo |