Workshop
|
MoAT: Meta-Evaluation of Anti-Malware Trustworthiness Sharon Lin · Marc Fyrbiak · Christof Paar |
||
Workshop
|
Re-Evaluating Chemical Synthesis Planning Algorithms Austin Tripp · Krzysztof Maziarz · Sarah Lewis · Guoqing Liu · Marwin Segler |
||
Poster
|
Tue 9:00 |
Turning the Tables: Biased, Imbalanced, Dynamic Tabular Datasets for ML Evaluation Sérgio Jesus · José Pombal · Duarte Alves · André Cruz · Pedro Saleiro · Rita Ribeiro · João Gama · Pedro Bizarro |
|
Poster
|
Wed 9:00 |
Characteristics of Harmful Text: Towards Rigorous Benchmarking of Language Models Maribeth Rauh · John Mellor · Jonathan Uesato · Po-Sen Huang · Johannes Welbl · Laura Weidinger · Sumanth Dathathri · Amelia Glaese · Geoffrey Irving · Iason Gabriel · William Isaac · Lisa Anne Hendricks |
|
Poster
|
Wed 9:00 |
Towards Better Evaluation for Dynamic Link Prediction Farimah Poursafaei · Shenyang Huang · Kellin Pelrine · Reihaneh Rabbany |
|
Poster
|
Tue 9:00 |
MTNeuro: A Benchmark for Evaluating Representations of Brain Structure Across Multiple Levels of Abstraction Jorge Quesada · Lakshmi Sathidevi · Ran Liu · Nauman Ahad · Joy Jackson · Mehdi Azabou · Jingyun Xiao · Christopher Liding · Matthew Jin · Carolina Urzay · William Gray-Roncal · Erik Johnson · Eva Dyer |
|
Poster
|
Thu 14:00 |
ELEVATER: A Benchmark and Toolkit for Evaluating Language-Augmented Visual Models Chunyuan Li · Haotian Liu · Liunian Li · Pengchuan Zhang · Jyoti Aneja · Jianwei Yang · Ping Jin · Houdong Hu · Zicheng Liu · Yong Jae Lee · Jianfeng Gao |
|
Poster
|
Tue 14:00 |
SafeBench: A Benchmarking Platform for Safety Evaluation of Autonomous Vehicles Chejian Xu · Wenhao Ding · Weijie Lyu · ZUXIN LIU · Shuai Wang · Yihan He · Hanjiang Hu · DING ZHAO · Bo Li |
|
Poster
|
Wed 14:00 |
OpenXAI: Towards a Transparent Evaluation of Model Explanations Chirag Agarwal · Satyapriya Krishna · Eshika Saxena · Martin Pawelczyk · Nari Johnson · Isha Puri · Marinka Zitnik · Himabindu Lakkaraju |
|
Poster
|
A Unified Evaluation of Textual Backdoor Learning: Frameworks and Benchmarks Ganqu Cui · Lifan Yuan · Bingxiang He · Yangyi Chen · Zhiyuan Liu · Maosong Sun |
||
Poster
|
Tue 9:00 |
Evaluating Latent Space Robustness and Uncertainty of EEG-ML Models under Realistic Distribution Shifts Neeraj Wagh · Jionghao Wei · Samarth Rawal · Brent M Berry · Yogatheesan Varatharajah |
|
Poster
|
Thu 9:00 |
FACT: Learning Governing Abstractions Behind Integer Sequences Peter Belcak · Ard Kastrati · Flavio Schenker · Roger Wattenhofer |