5 Results
Workshop | | Agentic Anomaly Detection for Shipping | Alexander Timms · Abigail Langbridge · Fearghal O'Donncha
Workshop | | FEABench: Evaluating Language Models on Real World Physics Reasoning Ability | Nayantara Mudur · Hao Cui · Subhashini Venugopalan · Paul Raccuglia · Michael Brenner · Peter Norgaard
Workshop | Sat 12:00 | Monty Hall and Score Optimization in Conformal Prediction to Improve LLMs for MCQs | Harit Vishwakarma · Alan Mishler · Thomas Cook · Niccolo Dalmasso · Natraj Raman · Sumitra Ganesh
Workshop | | Improving Decision-Making in Open-World Agents with Conformal Prediction and Monty Hall | Harit Vishwakarma · Alan Mishler · Thomas Cook · Niccolo Dalmasso · Natraj Raman · Sumitra Ganesh
Poster | Thu 11:00 | AvaTaR: Optimizing LLM Agents for Tool Usage via Contrastive Reasoning | Shirley Wu · Shiyu Zhao · Qian Huang · Kexin Huang · Michihiro Yasunaga · Kaidi Cao · Vassilis Ioannidis · Karthik Subbian · Jure Leskovec · James Zou