378 Results
Type | Time | Title | Authors
Workshop | | GRE Score: Generative Risk Evaluation for Large Language Models | ZAITANG LI · Mohamed Mouhajir · Pin-Yu Chen · Tsung-Yi Ho
Workshop | | A Systematic Evaluation of Decoding-Free Generative Candidate Selection Methods | Mingyu Derek Ma · Yanna Ding · Zijie Huang · Jianxi Gao · Yizhou Sun · Wei Wang
Workshop | | SharedContextBench: How Lossy are Long-context Methods in KV Cache Reuse | Yucheng LI · Huiqiang Jiang · Qianhui Wu · Xufang Luo · Surin Ahn · Chengruidong Zhang · Amir Abdi · Dongsheng Li · Jianfeng Gao · Yuqing Yang · Lili Qiu
Workshop | | Benchmark to Audit LLM Generated Clinical Notes for Disparities Arising from Biases and Stereotypes | Hongyu Cai · Swetasudha Panda · Naveen Jafer Nizar · Qinlan Shen · Daeja Oxendine · Sumana Srivatsa · Krishnaram Kenthapadi
Poster | Wed 11:00 | MARVEL: Multidimensional Abstraction and Reasoning through Visual Evaluation and Learning | Yifan Jiang · jiarui zhang · Kexuan Sun · Zhivar Sourati · Kian Ahrabian · Kaixin Ma · Filip Ilievski · Jay Pujara
Workshop | | LLMs Infer Protected Attributes Beyond Proxy Features | Dimitri Staufer