340 Results
Poster
|
Thu 11:00 |
WikiContradict: A Benchmark for Evaluating LLMs on Real-World Knowledge Conflicts from Wikipedia Yufang Hou · Alessandra Pascale · Javier Carnerero-Cano · Tigran Tchrakian · Radu Marinescu · Elizabeth Daly · Inkit Padhi · Prasanna Sattigeri |
|
Poster
|
Thu 16:30 |
CTIBench: A Benchmark for Evaluating LLMs in Cyber Threat Intelligence Md Tanvirul Alam · Dipkamal Bhusal · Le Nguyen · Nidhi Rastogi |
|
Poster
|
Fri 16:30 |
Trace is the Next AutoDiff: Generative Optimization with Rich Feedback, Execution Traces, and LLMs Ching-An Cheng · Allen Nie · Adith Swaminathan |
|
Expo Demonstration
|
Tue 15:00 |
Moving beyond chat: Enabling LLMs with intrinsic functions that give fine-grained control in application development Luis Lastras · Kristjan Greenewald · Nathalie Baracaldo |
|
Poster
|
Fri 11:00 |
SocialGPT: Prompting LLMs for Social Relation Reasoning via Greedy Segment Optimization Wanhua Li · Zibin Meng · Jiawei Zhou · Donglai Wei · Chuang Gan · Hanspeter Pfister |
|
Poster
|
InfLLM: Training-Free Long-Context Extrapolation for LLMs with an Efficient Context Memory Chaojun Xiao · Pengle Zhang · Xu Han · Guangxuan Xiao · Yankai Lin · Zhengyan Zhang · Zhiyuan Liu · Maosong Sun |
|
Poster
|
Thu 16:30 |
QuaRot: Outlier-Free 4-Bit Inference in Rotated LLMs Saleh Ashkboos · Amirkeivan Mohtashami · Maximilian Croci · Bo Li · Pashmina Cameron · Martin Jaggi · Dan Alistarh · Torsten Hoefler · James Hensman |
|
Poster
|
Wed 16:30 |
MR-Ben: A Meta-Reasoning Benchmark for Evaluating System-2 Thinking in LLMs Zhongshen Zeng · Yinhong Liu · Yingjia Wan · Jingyao Li · Pengguang Chen · Jianbo Dai · Yuxuan Yao · Rongwu Xu · Zehan Qi · Wanru Zhao · Linling Shen · Jianqiao Lu · Haochen Tan · Yukang Chen · Hao Zhang · Zhan Shi · Bailin Wang · Zhijiang Guo · Jiaya Jia |
|
Poster
|
Wed 16:30 |
Embodied Agent Interface: Benchmarking LLMs for Embodied Decision Making Manling Li · Shiyu Zhao · Qineng Wang · Kangrui Wang · Yu Zhou · Sanjana Srivastava · Cem Gokmen · Tony Lee · Erran Li Li · Ruohan Zhang · Weiyu Liu · Percy Liang · Fei-Fei Li · Jiayuan Mao · Jiajun Wu |
|
Oral
|
Wed 15:50 |
Embodied Agent Interface: Benchmarking LLMs for Embodied Decision Making Manling Li · Shiyu Zhao · Qineng Wang · Kangrui Wang · Yu Zhou · Sanjana Srivastava · Cem Gokmen · Tony Lee · Erran Li Li · Ruohan Zhang · Weiyu Liu · Percy Liang · Fei-Fei Li · Jiayuan Mao · Jiajun Wu |
|
Poster
|
Thu 11:00 |
Time-Reversal Provides Unsupervised Feedback to LLMs Yerram Varun · Rahul Madhavan · Sravanti Addepalli · Arun Suggala · Karthikeyan Shanmugam · Prateek Jain |
|
Poster
|
Fri 16:30 |
QuanTA: Efficient High-Rank Fine-Tuning of LLMs with Quantum-Informed Tensor Adaptation Zhuo Chen · Rumen Dangovski · Charlotte Loh · Owen Dugan · Di Luo · Marin Soljacic |