Workshop
|
|
Safe and Sound: Evaluating Language Models for Bias Mitigation and Understanding
Shaina Raza · Deval Pandya · Shardul Ghuge · Nifemi
|
|
Workshop
|
|
MMLU-Pro+: Evaluating Higher-Order Reasoning and Shortcut Learning in LLMs
Saeid Asgari · Aliasghar Khani · Amir Khasahmadi
|
|
Workshop
|
Sat 14:45
|
Contributed talk: Evaluating Gender Bias Transfer between Pre-trained and Prompt Adapted Language Models
Natalie Mackraz
|
|
Poster
|
Thu 11:00
|
Bias and Volatility: A Statistical Framework for Evaluating Large Language Model's Stereotypes and the Associated Generation Inconsistency
Yiran Liu · Ke Yang · Zehan Qi · Xiao Liu · Yang Yu · Cheng Xiang Zhai
|
|
Workshop
|
Sat 17:27
|
Better Bias Benchmarking of Language Models via Multi-factor Analysis
Hannah Powers · Ioana Baldini · Dennis Wei · Kristin P Bennett
|
|
Workshop
|
|
LLMs Infer Protected Attributes Beyond Proxy Features
Dimitri Staufer
|
|
Workshop
|
|
Evaluating Gender Bias Transfer between Pre-trained and Prompt Adapted Language Models
Nivedha Sivakumar · Natalie Mackraz · Samira Khorshidi · Krishna Patel · Barry-John Theobald · Luca Zappella · Nicholas Apostoloff
|
|
Workshop
|
Sat 15:45
|
Benchmark Self-Evolving: A Multi-Agent Framework for Dynamic LLM Evaluation
Siyuan Wang · Zhuohan Long · Zhihao Fan · Xuanjing Huang · Zhongyu Wei
|
|
Workshop
|
|
Q-Morality: Quantum-Enhanced ActAdd-Guided Bias Reduction in LLMs
Shardul Kulkarni
|
|
Workshop
|
|
CEB: Compositional Evaluation Benchmark for Fairness in Large Language Models
Song Wang · Peng Wang · Tong Zhou · Yushun Dong · Zhen Tan · Jundong Li
|
|
Workshop
|
|
SocialStigmaQA Spanish and Japanese - Towards Multicultural Adaptation of Social Bias Benchmarks
Clara Higuera-Cabañes · Ryo Iwaki · Beñat San Sebastian · Rosario Uceda-Sosa · Manish Nagireddy · Hiroshi Kanayama · Mikio Takeuchi · Gakuto Kurata · Karthikeyan Natesan Ramamurthy
|
|