firstbacksecondback
5 Results
Workshop
|
SEAL: Suite for Evaluating API-use of LLMs Woojeong Kim · Ashish Jagmohan · Aditya Vempaty |
||
Workshop
|
HAMMR : HierArchical MultiModal React agents for generic VQA Lluis Castrejon · Thomas Mensink · Howard Zhou · Vittorio Ferrari · Andre Araujo · Jasper Uijlings |
||
Poster
|
SelectIT: Selective Instruction Tuning for LLMs via Uncertainty-Aware Self-Reflection Liangxin Liu · Xuebo Liu · Derek Wong · Dongfang Li · Ziyi Wang · Baotian Hu · Min Zhang |
||
Poster
|
Thu 16:30 |
WildGuard: Open One-stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs Seungju Han · Kavel Rao · Allyson Ettinger · Liwei Jiang · Bill Yuchen Lin · Nathan Lambert · Yejin Choi · Nouha Dziri |
|
Workshop
|
Sat 12:00 |
Monty Hall and Score Optimization in Conformal Prediction to Improve LLMs for MCQs Harit Vishwakarma · Alan Mishler · Thomas Cook · Niccolo Dalmasso · Natraj Raman · Sumitra Ganesh |