188 Results
Poster
|
Wed 8:45 |
Truncating Trajectories in Monte Carlo Policy Evaluation: an Adaptive Approach Riccardo Poiani · Nicole Nobili · Alberto Maria Metelli · Marcello Restelli |
|
Poster
|
Thu 15:00 |
Can You Rely on Your Model Evaluation? Improving Model Evaluation with Synthetic Test Data Boris van Breugel · Nabeel Seedat · Fergus Imrie · Mihaela van der Schaar |
|
Poster
|
Wed 8:45 |
Future-Dependent Value-Based Off-Policy Evaluation in POMDPs Masatoshi Uehara · Haruka Kiyohara · Andrew Bennett · Victor Chernozhukov · Nan Jiang · Nathan Kallus · Chengchun Shi · Wen Sun |
|
Poster
|
Thu 8:45 |
Feature Likelihood Divergence: Evaluating the Generalization of Generative Models Using Samples Marco Jiralerspong · Joey Bose · Ian Gemp · Chongli Qin · Yoram Bachrach · Gauthier Gidel |
|
Poster
|
Thu 8:45 |
Distributional Policy Evaluation: a Maximum Entropy approach to Representation Learning Riccardo Zamboni · Alberto Maria Metelli · Marcello Restelli |
|
Poster
|
Wed 15:00 |
Optimal and Fair Encouragement Policy Evaluation and Learning Angela Zhou |
|
Poster
|
Thu 8:45 |
Marginal Density Ratio for Off-Policy Evaluation in Contextual Bandits Muhammad Faaiz Taufiq · Arnaud Doucet · Rob Cornish · Jean-Francois Ton |
|
Poster
|
Thu 15:00 |
DICES Dataset: Diversity in Conversational AI Evaluation for Safety Lora Aroyo · Alex Taylor · Mark Díaz · Christopher Homan · Alicia Parrish · Gregory Serapio-García · Vinodkumar Prabhakaran · Ding Wang |
|
Poster
|
Thu 15:00 |
Quantus: An Explainable AI Toolkit for Responsible Evaluation of Neural Network Explanations and Beyond Anna Hedström · Leander Weber · Daniel Krakowczyk · Dilyara Bareeva · Franz Motzkus · Wojciech Samek · Sebastian Lapuschkin · Marina Höhne |
|
Poster
|
Thu 15:00 |
SPACE: Single-round Participant Amalgamation for Contribution Evaluation in Federated Learning Yi-Chung Chen · Hsi-Wen Chen · Shun-Gui Wang · Ming-Syan Chen |
|
Poster
|
Tue 15:15 |
Is Your Code Generated by ChatGPT Really Correct? Rigorous Evaluation of Large Language Models for Code Generation Jiawei Liu · Chunqiu Steven Xia · Yuyao Wang · Lingming Zhang |
|
Poster
|
Tue 8:45 |
Stable Bias: Evaluating Societal Representations in Diffusion Models Sasha Luccioni · Christopher Akiki · Margaret Mitchell · Yacine Jernite |