firstbacksecondback
188 Results
Affinity Workshop
|
Unraveling the Effects of Age-Based Distribution Shifts on Medical Image Classifiers Kumail Alhamoud · Yasir Ghunaim · Motasem Alfarra · Philip Torr · Tom Hartvigsen · Bernard Ghanem · Adel Bibi · Marzyeh Ghassemi |
||
Workshop
|
Evaluating Zero-Shot Scoring for In Vitro Antibody Binding Prediction with Experimental Validation Divya Nori · Simon Mathis · Amir Shanehsazzadeh |
||
Workshop
|
Sat 12:01 |
FRUNI and FTREE synthetic knowledge graphs for evaluating explainability Pablo Sanchez-Martin · Tarek R. Besold · Priyadarshini Kumari |
|
Workshop
|
Fri 12:50 |
#35: Cross-cultural differences in evaluating offensive language and the role of moral foundations Aida Mostafazadeh Davani · Mark Díaz · Vinodkumar Prabhakaran |
|
Workshop
|
MathVista: Evaluating Mathematical Reasoning of Foundation Models in Visual Contexts Pan Lu · Hritik Bansal · Tanglin Xia · Jiacheng Liu · Chunyuan Li · Hannaneh Hajishirzi · Hao Cheng · Kai-Wei Chang · Michel Galley · Jianfeng Gao |
||
Workshop
|
A collection of principles for guiding and evaluating large language models Konstantin Hebenstreit · Robert Praas · Matthias Samwald |
||
Workshop
|
Evaluating Zero-Shot Scoring for In Vitro Antibody Binding Prediction with Experimental Validation Divya Nori · Simon Mathis · Amir Shanehsazzadeh |
||
Workshop
|
Evaluating AI-guided Design for Scientific Discovery Michael Pekala · Elizabeth Pogue · Alexander New · Gregory Bassen · Janna Domenico · Tyrel McQueen · Christopher Stiles |
||
Workshop
|
Fri 12:50 |
#38: Off The Rails: Procedural Dilemma Generation for Moral Reasoning Jan-Philipp Fraenken · Ayesha Khawaja · Kanishk Gandhi · Jared Moore · Noah Goodman · Tobias Gerstenberg |
|
Workshop
|
Fri 12:50 |
#39: Western, Religious or Spiritual: An Evaluation of Moral Justification in Large Language Models Eyup E. Kucuk · Muhammed Koçyiğit |
|
Workshop
|
ConceptBed: Evaluating Concept Learning Abilities of Text-to-Image Diffusion Models Maitreya Patel · Tejas Gokhale · Chitta Baral · 'YZ' Yezhou Yang |
||
Workshop
|
Trustworthy LLMs: a Survey and Guideline for Evaluating Large Language Models' Alignment Yang Liu · Yuanshun (Kevin) Yao · Jean-Francois Ton · Xiaoying Zhang · Ruocheng Guo · Hao Cheng · Yegor Klochkov · Muhammad Faaiz Taufiq · Hang Li |