firstbacksecondback
2 Results
Workshop
|
VerMCTS: Synthesizing Multi-Step Programs using a Verifier, a Large Language Model, and Tree Search David Brandfonbrener · Simon Henniger · Sibi Raja · Tarun Prasad · Chloe Loughridge · Federico Cassano · Sabrina Hu · Jianang Yang · William Byrd · Robert Zinkov · Nada Amin |
||
Workshop
|
DafnyBench: A Benchmark for Formal Software Verification Chloe Loughridge · Qinyi Sun · Seth Ahrenbach · Federico Cassano · Chuyue (Livia) Sun · Ying Sheng · Anish Mudide · Md Rakib Hossain Misu · Nada Amin · Max Tegmark |