Automatic Hallucination Assessment for Aligned Large Language Models via Transferable Adversarial Attacks

Xiaodong Yu · Hao Cheng · Xiaodong Liu · Dan Roth · Jianfeng Gao

Abstract
