firstbacksecondback
332 Results
Poster
|
Fri 11:00 |
Expecting The Unexpected: Towards Broad Out-Of-Distribution Detection Charles Guille-Escuret · Pierre-André Noël · Ioannis Mitliagkas · David Vazquez · Joao Monteiro |
|
Poster
|
Wed 11:00 |
Semi-Truths: A Large-Scale Dataset of AI-Augmented Images for Evaluating Robustness of AI-Generated Image detectors Anisha Pal · Julia Kruk · Mansi Phute · Manognya Bhattaram · Diyi Yang · Duen Horng Chau · Judy Hoffman |
|
Poster
|
Thu 11:00 |
Benchmarking Estimators for Natural Experiments: A Novel Dataset and a Doubly Robust Algorithm R. Teal Witter · Christopher Musco |
|
Poster
|
Wed 16:30 |
MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark Yubo Wang · Xueguang Ma · Ge Zhang · Yuansheng Ni · Abhranil Chandra · Shiguang Guo · Weiming Ren · Aaran Arulraj · Xuan He · Ziyan Jiang · Tianle Li · Max KU · Kai Wang · Alex Zhuang · Rongqi Fan · Xiang Yue · Wenhu Chen |
|
Poster
|
Thu 16:30 |
A Retrospective on the Robot Air Hockey Challenge: Benchmarking Robust, Reliable, and Safe Learning Techniques for Real-world Robotics Puze Liu · Jonas Günster · Niklas Funk · Simon Gröger · Dong Chen · Haitham Bou Ammar · Julius Jankowski · Ante Marić · Sylvain Calinon · Andrej Orsula · Miguel Olivares · Hongyi Zhou · Rudolf Lioutikov · Gerhard Neumann · Amarildo Likmeta · Amirhossein Zhalehmehrabi · Thomas Bonenfant · Marcello Restelli · Davide Tateo · Ziyuan Liu · Jan Peters |
|
Poster
|
AudioMarkBench: Benchmarking Robustness of Audio Watermarking Hongbin Liu · Moyang Guo · Zhengyuan Jiang · Lun Wang · Neil Gong |
||
Poster
|
Wed 16:30 |
Noisy Ostracods: A Fine-Grained, Imbalanced Real-World Dataset for Benchmarking Robust Machine Learning and Label Correction Methods Jiamian Hu · Hong Yuanyuan · Yihua Chen · He Wang · Moriaki Yasuhara |
|
Poster
|
Fri 11:00 |
SETLEXSEM CHALLENGE: Using Set Operations to Evaluate the Lexical and Semantic Robustness of Language Models Nicholas Dronen · Bardiya Akhbari · Manish Digambar Gawali |
|
Poster
|
Thu 16:30 |
TabularBench: Benchmarking Adversarial Robustness for Tabular Deep Learning in Real-world Use-cases Thibault Simonetto · Salah GHAMIZI · Maxime Cordy |
|
Poster
|
Thu 16:30 |
The Group Robustness is in the Details: Revisiting Finetuning under Spurious Correlations Tyler LaBonte · John Hill · Xinchen Zhang · Vidya Muthukumar · Abhishek Kumar |
|
Poster
|
Fri 11:00 |
RoME: A Robust Mixed-Effects Bandit Algorithm for Optimizing Mobile Health Interventions Easton Huch · Jieru Shi · Madeline R Abbott · Jessica Golbus · Alexander Moreno · Walter Dempsey |
|
Affinity Event
|
The Queer Algorithm Guillaume Chevillon |