firstbacksecondback
188 Results
Workshop
|
Sat 9:45 |
Evaluating Peripheral Vision as an Input Transformation to Understand Object Detection Model Behavior Anne Harrington · Vasha DuTell · Mark Hamilton · Ayush Tewari · Simon Stent · Bill Freeman · Ruth Rosenholtz |
|
Workshop
|
From Text to Tactic: Evaluating LLMs Playing the Game of Avalon Jonathan Light · Min Cai · Sheng Shen · Ziniu Hu |
||
Workshop
|
NLPBench: Evaluating Large Language Models on Solving NLP Problems Linxin Song · Jieyu Zhang · Lechao Cheng · Pengyuan Zhou · Tianyi Zhou · Zihui Li |
||
Workshop
|
Carpe Diem: On the Evaluation of World Knowledge in Lifelong Language Models Yujin Kim · Jaehong Yoon · Seonghyeon Ye · Sung Ju Hwang · Se-Young Yun |
||
Workshop
|
Prometheus: Inducing Evaluation Capability in Language Models Seungone Kim · Jamin Shin · Yejin Cho · Joel Jang · Shayne Longpre · Hwaran Lee · Sangdoo Yun · Seongjin Shin · Sungdong Kim · James Thorne · Minjoon Seo |
||
Workshop
|
Evaluating the Utility of Model Explanations for Model Development Shawn Im · Jacob Andreas · Yilun Zhou |
||
Workshop
|
Trick or treat? Evaluating stability strategies in graph network-based simulators Omer Rochman Sharabi · Gilles Louppe |
||
Workshop
|
Zero-shot Conversational Summarization Evaluations with small Large Language Models Ramesh Manuvinakurike · Saurav Sahay · Sangeeta Manepalli · Lama Nachman |
||
Workshop
|
Evaluating Zero-Shot Scoring for In Vitro Antibody Binding Prediction with Experimental Validation Divya Nori · Simon Mathis · Amir Shanehsazzadeh |
||
Workshop
|
Evaluation of Representational Similarity Scores Across Human Visual Cortex Francisco Acosta · Colin Conwell · David Klindt · Nina Miolane |
||
Workshop
|
Evaluating Physically Motivated Loss Functions for Photometric Redshift Estimation Andrew Engel · Jan Strube |
||
Workshop
|
Evaluating AI-guided Design for Scientific Discovery Michael Pekala · Elizabeth Pogue · Alexander New · Gregory Bassen · Janna Domenico · Tyrel McQueen · Christopher Stiles |