Skip to yearly menu bar Skip to main content


Poster
in
Workshop: Reliable ML from Unreliable Data
Sat, Dec 6, 2025 • 11:00 AM – 12:00 PM PST

Disarming Strategic Text: Span-Aware Counterfactuals for Robust Content Moderation

Hardik Meisheri · Zaid Hassan · Karthik Sankaranarayanan

Abstract

Chat is not available.