JUNE 2026
Announcement_14
Our benchmark BenchCAD was used by Anthropic in their official Claude Fable 5 & Claude Mythos 5 System Card: a dedicated section (§8.16.4, pp. 282–283) evaluates their frontier models on BenchCAD’s Vision2Code task — with two figures and a Python-tools ablation.
From Anthropic’s Claude Fable 5 & Claude Mythos 5 System Card (§8.16.4).