In the paper, Xue et al. propose the ZK-Eval evaluation framework and the ZK-Coder enhancement framework to assess and improve LLM capabilities in ZK code generation, significantly boosting the correctness of Circom and Noir programs. Xue等人在论文中提出了ZK-Eval评估框架和ZK-Coder增强框架,用于评估和改进LLM在ZK代码生成中的能力,显著提升Circom和Noir程序的正确率。
Notes
ZKP programming is hard and error-prone; LLMs lack systematic evaluation.
ZK-Eval benchmark covers language, gadget, and end-to-end generation.
LLMs handle syntax well but fail on gadgets and semantics; Circom is harder.
ZK-Coder uses sketching, retrieval, and repair to improve reliability.
Success rates on Circom/Noir rise from <30% to 80–90%.
Repair loop is crucial; challenges remain in efficiency and data scarcity.