CVE-Bench: testing LLM agents on real-world vulnerability patches

by logickkk1 | View on Hacker News