88 Results
Poster | Tue 8:45 | Holistic Evaluation of Text-to-Image Models | Tony Lee · Michihiro Yasunaga · Chenlin Meng · Yifan Mai · Joon Sung Park · Agrim Gupta · Yunzhi Zhang · Deepak Narayanan · Hannah Teufel · Marco Bellagente · Minguk Kang · Taesung Park · Jure Leskovec · Jun-Yan Zhu · Fei-Fei Li · Jiajun Wu · Stefano Ermon · Percy Liang
Workshop | | MathVista: Evaluating Mathematical Reasoning of Foundation Models in Visual Contexts | Pan Lu · Hritik Bansal · Tanglin Xia · Jiacheng Liu · Chunyuan Li · Hannaneh Hajishirzi · Hao Cheng · Kai-Wei Chang · Michel Galley · Jianfeng Gao
Workshop | | FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation | Sewon Min · Kalpesh Krishna · Xinxi Lyu · Mike Lewis · Scott Yih · Pang Wei Koh · Mohit Iyyer · Luke Zettlemoyer · Hannaneh Hajishirzi
Workshop | | Prometheus: Inducing Evaluation Capability in Language Models | Seungone Kim · Jamin Shin · Yejin Cho · Joel Jang · Shayne Longpre · Hwaran Lee · Sangdoo Yun · Seongjin Shin · Sungdong Kim · James Thorne · Minjoon Seo