Skip to yearly menu bar Skip to main content


Search All 2024 Events
 

332 Results

<<   <   Page 4 of 28   >   >>
Poster
Fri 11:00 Expecting The Unexpected: Towards Broad Out-Of-Distribution Detection
Charles Guille-Escuret · Pierre-André Noël · Ioannis Mitliagkas · David Vazquez · Joao Monteiro
Poster
Wed 11:00 Semi-Truths: A Large-Scale Dataset of AI-Augmented Images for Evaluating Robustness of AI-Generated Image detectors
Anisha Pal · Julia Kruk · Mansi Phute · Manognya Bhattaram · Diyi Yang · Duen Horng Chau · Judy Hoffman
Poster
Thu 11:00 Benchmarking Estimators for Natural Experiments: A Novel Dataset and a Doubly Robust Algorithm
R. Teal Witter · Christopher Musco
Poster
Wed 16:30 MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark
Yubo Wang · Xueguang Ma · Ge Zhang · Yuansheng Ni · Abhranil Chandra · Shiguang Guo · Weiming Ren · Aaran Arulraj · Xuan He · Ziyan Jiang · Tianle Li · Max KU · Kai Wang · Alex Zhuang · Rongqi Fan · Xiang Yue · Wenhu Chen
Poster
Thu 16:30 A Retrospective on the Robot Air Hockey Challenge: Benchmarking Robust, Reliable, and Safe Learning Techniques for Real-world Robotics
Puze Liu · Jonas Günster · Niklas Funk · Simon Gröger · Dong Chen · Haitham Bou Ammar · Julius Jankowski · Ante Marić · Sylvain Calinon · Andrej Orsula · Miguel Olivares · Hongyi Zhou · Rudolf Lioutikov · Gerhard Neumann · Amarildo Likmeta · Amirhossein Zhalehmehrabi · Thomas Bonenfant · Marcello Restelli · Davide Tateo · Ziyuan Liu · Jan Peters
Poster
AudioMarkBench: Benchmarking Robustness of Audio Watermarking
Hongbin Liu · Moyang Guo · Zhengyuan Jiang · Lun Wang · Neil Gong
Poster
Wed 16:30 Noisy Ostracods: A Fine-Grained, Imbalanced Real-World Dataset for Benchmarking Robust Machine Learning and Label Correction Methods
Jiamian Hu · Hong Yuanyuan · Yihua Chen · He Wang · Moriaki Yasuhara
Poster
Fri 11:00 SETLEXSEM CHALLENGE: Using Set Operations to Evaluate the Lexical and Semantic Robustness of Language Models
Nicholas Dronen · Bardiya Akhbari · Manish Digambar Gawali
Poster
Thu 16:30 TabularBench: Benchmarking Adversarial Robustness for Tabular Deep Learning in Real-world Use-cases
Thibault Simonetto · Salah GHAMIZI · Maxime Cordy
Poster
Thu 16:30 The Group Robustness is in the Details: Revisiting Finetuning under Spurious Correlations
Tyler LaBonte · John Hill · Xinchen Zhang · Vidya Muthukumar · Abhishek Kumar
Poster
Fri 11:00 RoME: A Robust Mixed-Effects Bandit Algorithm for Optimizing Mobile Health Interventions
Easton Huch · Jieru Shi · Madeline R Abbott · Jessica Golbus · Alexander Moreno · Walter Dempsey
Affinity Event
The Queer Algorithm
Guillaume Chevillon