NeurIPS 2024

Skip to yearly menu bar Skip to main content

8 Results

Workshop	Sat 15:45	ReFeR: A Hierarchical Framework of Models as Evaluative and Reasoning Agents Yaswanth Narsupalli · Abhranil Chandra · Sreevatsa Muppirala · Manish Gupta · Pawan Goyal
Workshop	Sat 12:00	A STEP TOWARDS MIXTURE OF GRADER: STATISTICAL ANALYSIS OF EXISTING AUTOMATIC EVALUATION METRICS Yun Joon Soh · Jishen Zhao
Workshop		Multimodal Auto Validation For Self-Refinement in Web Agents Ruhana Azam · Tamer Abuelsaad · Aditya Vempaty · Ashish Jagmohan
Workshop	Sat 15:45	Towards LLM-guided Efficient and Interpretable Multi-linear Tensor Network Rank Selection Giorgos Iacovides · Wuyang Zhou · Danilo Mandic
Poster	Thu 11:00	IQA-EVAL: Automatic Evaluation of Human-Model Interactive Question Answering Ruosen Li · Ruochen Li · Barry Wang · Xinya Du
Poster	Fri 11:00	OPERA: Automatic Offline Policy Evaluation with Re-weighted Aggregates of Multiple Estimators Allen Nie · Yash Chandak · Christina Yuan · Anirudhan Badrinath · Yannis Flet-Berliac · Emma Brunskill
Workshop	Sat 12:00	Advancing Conversational Psychotherapy: Integrating Privacy, Dual-Memory, and Domain Expertise with Large Language Models XiuYu Zhang · Zening Luo
Workshop	Sat 12:00	CLUE: Concept-Level Uncertainty Estimation for Large Language Models Yu-Hsiang Wang · Andrew Bai · Che-Ping Tsai · Cho-Jui Hsieh