Haozhe Zhang
research
an archive of posts in this category
May 11, 2026
BenchCAD — evaluating LLMs on the part of code where output is physical