Skip to yearly menu bar Skip to main content


Poster

NYU CTF Bench: A Scalable Open-Source Benchmark Dataset for Evaluating LLMs in Offensive Security

Minghao Shao ⋅ Sofija Jancheska ⋅ Meet Udeshi ⋅ Brendan Dolan-Gavitt ⋅ haoran xi ⋅ Kimberly Milner ⋅ Boyuan Chen ⋅ Max Yin ⋅ Siddharth Garg ⋅ Prashanth Krishnamurthy ⋅ Farshad Khorrami ⋅ Ramesh Karri ⋅ Muhammad Shafique
2024 Poster

Abstract

Video

Chat is not available.