Skip to yearly menu bar Skip to main content


Poster

Establishing Best Practices in Building Rigorous Agentic Benchmarks

Yuxuan Zhu ⋅ Tengjun Jin ⋅ Yada Pruksachatkun ⋅ Andy Zhang ⋅ Shu Liu ⋅ Sasha Cui ⋅ Sayash Kapoor ⋅ Shayne Longpre ⋅ Kevin Meng ⋅ Rebecca Weiss ⋅ Fazl Barez ⋅ Rahul Gupta ⋅ Jwala Dhamala ⋅ Jacob Merizian ⋅ Mario Giulianelli ⋅ Harry Coppock ⋅ Cozmin Ududec ⋅ Antony Kellermann ⋅ Jasjeet Sekhon ⋅ Jacob Steinhardt ⋅ Sarah Schwettmann ⋅ Arvind Narayanan ⋅ Matei A Zaharia ⋅ Ion Stoica ⋅ Percy Liang ⋅ Daniel Kang
2025 Poster

Abstract

Video

Chat is not available.