Poster

WikiContradict: A Benchmark for Evaluating LLMs on Real-World Knowledge Conflicts from Wikipedia

Yufang Hou ⋅ Alessandra Pascale ⋅ Javier Carnerero-Cano ⋅ Tigran Tchrakian ⋅ Radu Marinescu ⋅ Elizabeth Daly ⋅ Inkit Padhi ⋅ Prasanna Sattigeri
2024 Poster
