91 Results
Workshop | Sat 7:30 | Human Evaluation of Generative Models | Divyansh Kaushik · Jennifer Hsia · Jessica Huynh · Yonadav Shavit · Samuel Bowman · Ting-Hao Huang · Douwe Kiela · Zachary Lipton · Eric Michael Smith
Workshop | Sat 11:50 | Human Evaluation of Text-to-Image Models on a Multi-Task Benchmark | Vitali Petsiuk · Alexander E. Siemenn · Saisamrit Surbehera · Qi Qi Chin · Keith Tyser · Gregory Hunter · Arvind Raghavan · Yann Hicke · Bryan Plummer · Ori Kerret · Tonio Buonassisi · Kate Saenko · Armando Solar-Lezama · Iddo Drori
Workshop | | Beyond Decision Recommendations: Stop Putting Machine Learning First and Design Human-Centered AI for Decision Support | Zana Bucinca · Alexandra Chouldechova · Jennifer Wortman Vaughan · Krzysztof Z Gajos
Workshop | | Mastering the Game of No-Press Diplomacy via Human-Regularized Reinforcement Learning and Planning | Anton Bakhtin · David Wu · Adam Lerer · Jonathan Gray · Athul Jacob · Gabriele Farina · Alexander Miller · Noam Brown
Poster | Thu 9:00 | Nocturne: a scalable driving benchmark for bringing multi-agent learning one step closer to the real world | Eugene Vinitsky · Nathan Lichtlé · Xiaomeng Yang · Brandon Amos · Jakob Foerster
Workshop | Sat 8:35 | Towards Credible Human Evaluation of Open-Domain Dialog Systems Using Interactive Setup | Sijia Liu · Patrick Lange · Behnam Hedayatnia · Alexandros Papangelis · Di Jin · Andrew Wirth · Yang Liu · Dilek Hakkani-Tur
Poster | Tue 9:00 | Fine-tuning language models to find agreement among humans with diverse preferences | Michiel Bakker · Martin Chadwick · Hannah Sheahan · Michael Tessler · Lucy Campbell-Gillingham · Jan Balaguer · Nat McAleese · Amelia Glaese · John Aslanides · Matt Botvinick · Christopher Summerfield