Skip to yearly menu bar Skip to main content


Poster

SEC-bench: Automated Benchmarking of LLM Agents on Real-World Software Security Tasks

Hwiwon Lee ⋅ Ziqi Zhang ⋅ Hanxiao Lu ⋅ LINGMING ZHANG
2025 Poster

Abstract

Video

Chat is not available.