OpenAI and Paradigm Launch EVMbench to Test AI Agents Against Smart Contract Vulnerabilities
TLDR: EVMbench draws from 120 high-severity vulnerabilities curated across 40 real-world smart contract audits. GPT-5.3-Codex scored 72.2% in exploit mode, far outperforming GPT-5, which only...












