Workshop
|
|
Evaluating Machine Learning Models with NERO: Non-Equivariance Revealed on Orbits
Zhuokai Zhao · Takumi Matsuzawa · William Irvine · Michael Maire · Gordon Kindlmann
|
|
Poster
|
Wed 16:30
|
Evaluating Copyright Takedown Methods for Language Models
Boyi Wei · Weijia Shi · Yangsibo Huang · Noah Smith · Chiyuan Zhang · Luke Zettlemoyer · Kai Li · Peter Henderson
|
|
Poster
|
Wed 11:00
|
A Benchmark Suite for Evaluating Neural Mutual Information Estimators on Unstructured Datasets
Kyungeun Lee · Wonjong Rhee
|
|
Workshop
|
|
Towards Fair RAG: On the Impact of Fair Ranking in Retrieval-Augmented Generation
To Eun Kim · Fernando Diaz
|
|
Workshop
|
|
Declare and Justify: Explicit assumptions in AI evaluations are necessary for effective regulation
Peter Barnett · Lisa Thiergart
|
|
Poster
|
Fri 11:00
|
LexEval: A Comprehensive Chinese Legal Benchmark for Evaluating Large Language Models
Haitao Li · You Chen · Qingyao Ai · Yueyue Wu · Ruizhe Zhang · Yiqun Liu
|
|
Workshop
|
|
Critical human-AI use scenarios and interaction modes for societal impact evaluations
Lujain Ibrahim · Saffron Huang · Lama Ahmad · Markus Anderljung
|
|
Poster
|
Thu 16:30
|
Rethinking Human Evaluation Protocol for Text-to-Video Models: Enhancing Reliability, Reproducibility, and Practicality
Tianle Zhang · Langtian Ma · Yuchen Yan · Yuchen Zhang · Yue Yang · Ziyao Guo · Wenqi Shao · Kai Wang · Yang You · Yu Qiao · Ping Luo · Kaipeng Zhang
|
|
Workshop
|
|
Making Climate AI Systems Past and Future Aware to Better Evaluate Climate Change Policies
Riya . · Sudhakar Singh
|
|
Poster
|
Wed 11:00
|
Rethinking The Training And Evaluation of Rich-Context Layout-to-Image Generation
Jiaxin Cheng · Zixu Zhao · Tong He · Tianjun Xiao · Zheng Zhang · Yicong Zhou
|
|
Workshop
|
Sat 13:00
|
Composers’ Evaluations of an AI Music Tool: Insights for Human-Centered Design
Eleanor Row · George Fazekas
|
|
Poster
|
Fri 11:00
|
InterpBench: Semi-Synthetic Transformers for Evaluating Mechanistic Interpretability Techniques
Rohan Gupta · Iván Arcuschin Moreno · Thomas Kwa · Adrià Garriga-Alonso
|
|