Poster
|
Thu 16:30
|
Value Imprint: A Technique for Auditing the Human Values Embedded in RLHF Datasets
Ike Obi · Rohan Pant · Srishti Shekhar Agrawal · Maham Ghazanfar · Aaron Basiletti
|
|
Workshop
|
Sun 11:23
|
Spotlight 10
Andrew Dumit
|
|
Poster
|
Fri 11:00
|
OVT-B: A New Large-Scale Benchmark for Open-Vocabulary Multi-Object Tracking
Haiji Liang · Ruize Han
|
|
Poster
|
Fri 11:00
|
Fit for our purpose, not yours: Benchmark for a low-resource, Indigenous language
Suzanne Duncan · Gianna Leoni · Lee Steven · Keoni K Mahelona · Peter Lucas K Jones
|
|
Poster
|
Fri 16:30
|
A New Multi-Source Light Detection Benchmark and Semi-Supervised Focal Light Detection
Jae-Yong Baek · Yong-Sang Yoo · Seung-Hwan Bae
|
|
Poster
|
Fri 16:30
|
NewTerm: Benchmarking Real-Time New Terms for Large Language Models with Annual Updates
Hexuan Deng · Wenxiang Jiao · Xuebo Liu · Min Zhang · Zhaopeng Tu
|
|
Poster
|
Thu 16:30
|
VRSBench: A Versatile Vision-Language Benchmark Dataset for Remote Sensing Image Understanding
Xiang Li · Jian Ding · Mohamed Elhoseiny
|
|
Poster
|
Wed 11:00
|
A Benchmark Suite for Evaluating Neural Mutual Information Estimators on Unstructured Datasets
Kyungeun Lee · Wonjong Rhee
|
|
Poster
|
Wed 16:30
|
SS3DM: Benchmarking Street-View Surface Reconstruction with a Synthetic 3D Mesh Dataset
Yubin Hu · Kairui Wen · Heng Zhou · Xiaoyang Guo · Yong-jin Liu
|
|
Poster
|
Thu 11:00
|
APEBench: A Benchmark for Autoregressive Neural Emulators of PDEs
Felix Koehler · Simon Niedermayr · rüdiger westermann · Nils Thuerey
|
|
Poster
|
|
UDA: A Benchmark Suite for Retrieval Augmented Generation in Real-World Document Analysis
Yulong Hui · YAO LU · Huanchen Zhang
|
|
Poster
|
Fri 11:00
|
ERBench: An Entity-Relationship based Automatically Verifiable Hallucination Benchmark for Large Language Models
Jio Oh · Soyeon Kim · Junseok Seo · Jindong Wang · Ruochen Xu · Xing Xie · Steven Whang
|
|