Skip to yearly menu bar Skip to main content


Quantifying Variance in Evaluation Benchmarks

Lovish Madaan ⋅ Aaditya Singh ⋅ Rylan Schaeffer ⋅ Andrew Poulton ⋅ Sanmi Koyejo ⋅ Pontus Lars Erik Saito Stenetorp ⋅ Sharan Narang ⋅ Dieuwke Hupkes

Abstract

Chat is not available.