Workshop
|
Sat 15:45
|
ReFeR: A Hierarchical Framework of Models as Evaluative and Reasoning Agents
Yaswanth Narsupalli · Abhranil Chandra · Sreevatsa Muppirala · Manish Gupta · Pawan Goyal
|
|
Workshop
|
Sat 12:00
|
A Step Towards Mixture of Grader: Statistical Analysis of Existing Automatic Evaluation Metrics
Yun Joon Soh · Jishen Zhao
|
|
Workshop
|
|
Multimodal Auto Validation For Self-Refinement in Web Agents
Ruhana Azam · Tamer Abuelsaad · Aditya Vempaty · Ashish Jagmohan
|
|
Workshop
|
Sat 15:45
|
Towards LLM-guided Efficient and Interpretable Multi-linear Tensor Network Rank Selection
Giorgos Iacovides · Wuyang Zhou · Danilo Mandic
|
|
Poster
|
Thu 11:00
|
IQA-EVAL: Automatic Evaluation of Human-Model Interactive Question Answering
Ruosen Li · Ruochen Li · Barry Wang · Xinya Du
|
|
Poster
|
Fri 11:00
|
OPERA: Automatic Offline Policy Evaluation with Re-weighted Aggregates of Multiple Estimators
Allen Nie · Yash Chandak · Christina Yuan · Anirudhan Badrinath · Yannis Flet-Berliac · Emma Brunskill
|
|
Workshop
|
Sat 12:00
|
Advancing Conversational Psychotherapy: Integrating Privacy, Dual-Memory, and Domain Expertise with Large Language Models
XiuYu Zhang · Zening Luo
|
|
Workshop
|
Sat 12:00
|
CLUE: Concept-Level Uncertainty Estimation for Large Language Models
Yu-Hsiang Wang · Andrew Bai · Che-Ping Tsai · Cho-Jui Hsieh
|
|