Poster
|
Wed 16:30
|
Is Cross-validation the Gold Standard to Estimate Out-of-sample Model Performance?
Garud Iyengar · Henry Lam · Tianyu Wang
|
|
Poster
|
Thu 16:30
|
Active, anytime-valid risk controlling prediction sets
Ziyu Xu · Nikos Karampatziakis · Paul Mineiro
|
|
Poster
|
Wed 11:00
|
Marginal Causal Flows for Validation and Inference
Daniel de Vassimon Manela · Laura Battaglia · Robin Evans
|
|
Poster
|
Wed 11:00
|
SWT-Bench: Testing and Validating Real-World Bug-Fixes with Code Agents
Niels Mündler · Mark Müller · Jingxuan He · Martin Vechev
|
|
Poster
|
Wed 11:00
|
Validating Climate Models with Spherical Convolutional Wasserstein Distance
Robert Garrett · Trevor Harris · Zhuo Wang · Bo Li
|
|
Poster
|
Wed 16:30
|
Externally Valid Policy Evaluation from Randomized Trials Using Additional Observational Data
Sofia Ek · Dave Zachariah
|
|
Poster
|
Wed 11:00
|
Large language model validity via enhanced conformal prediction methods
John Cherian · Isaac Gibbs · Emmanuel Candes
|
|
Affinity Event
|
|
Armadillo: Robust Secure Aggregation for Federated Learning with Input Validation
Yiping Ma · Yue Guo · Harish Karthikeyan · Antigoni Polychroniadou
|
|
Poster
|
Fri 11:00
|
STL: Still Tricky Logic (for System Validation, Even When Showing Your Work)
Isabelle Hurley · Rohan Paleja · Ashley Suh · Jaime D Pena · Ho Chit Siu
|
|
Expo Talk Panel
|
Tue 8:30
|
AI Verification & Validation: Trends, Applications, and Challenges
Lucas Garcia · Darren Cofer
|
|
Poster
|
Wed 16:30
|
Distribution Learning with Valid Outputs Beyond the Worst-Case
Nicholas Rittler · Kamalika Chaudhuri
|
|
Affinity Event
|
|
Evaluating Generative AI for Scenario Variation in Automated Driving Validation
Manasa Mariam Mammen · Zafer Kayatas · Eva Zimmermann · Pavel Nedvědický
|
|