Search All 2024 Events
 

113 Results

Page 3 of 10
Workshop
Evaluating Machine Learning Models with NERO: Non-Equivariance Revealed on Orbits
Zhuokai Zhao · Takumi Matsuzawa · William Irvine · Michael Maire · Gordon Kindlmann
Poster
Wed 16:30 Evaluating Copyright Takedown Methods for Language Models
Boyi Wei · Weijia Shi · Yangsibo Huang · Noah Smith · Chiyuan Zhang · Luke Zettlemoyer · Kai Li · Peter Henderson
Poster
Wed 11:00 A Benchmark Suite for Evaluating Neural Mutual Information Estimators on Unstructured Datasets
Kyungeun Lee · Wonjong Rhee
Workshop
Towards Fair RAG: On the Impact of Fair Ranking in Retrieval-Augmented Generation
To Eun Kim · Fernando Diaz
Workshop
Declare and Justify: Explicit assumptions in AI evaluations are necessary for effective regulation
Peter Barnett · Lisa Thiergart
Poster
Fri 11:00 LexEval: A Comprehensive Chinese Legal Benchmark for Evaluating Large Language Models
Haitao Li · You Chen · Qingyao Ai · Yueyue Wu · Ruizhe Zhang · Yiqun Liu
Workshop
Critical human-AI use scenarios and interaction modes for societal impact evaluations
Lujain Ibrahim · Saffron Huang · Lama Ahmad · Markus Anderljung
Poster
Thu 16:30 Rethinking Human Evaluation Protocol for Text-to-Video Models: Enhancing Reliability, Reproducibility, and Practicality
Tianle Zhang · Langtian Ma · Yuchen Yan · Yuchen Zhang · Yue Yang · Ziyao Guo · Wenqi Shao · Kai Wang · Yang You · Yu Qiao · Ping Luo · Kaipeng Zhang
Workshop
Making Climate AI Systems Past and Future Aware to Better Evaluate Climate Change Policies
Riya · Sudhakar Singh
Poster
Wed 11:00 Rethinking The Training And Evaluation of Rich-Context Layout-to-Image Generation
Jiaxin Cheng · Zixu Zhao · Tong He · Tianjun Xiao · Zheng Zhang · Yicong Zhou
Workshop
Sat 13:00 Composers’ Evaluations of an AI Music Tool: Insights for Human-Centered Design
Eleanor Row · George Fazekas
Poster
Fri 11:00 InterpBench: Semi-Synthetic Transformers for Evaluating Mechanistic Interpretability Techniques
Rohan Gupta · Iván Arcuschin Moreno · Thomas Kwa · Adrià Garriga-Alonso