
v1v2 (latest)
ProjectEval: A Benchmark for Programming Agents Automated Evaluation on Project-Level Code Generation
Author Contacts:
Papers citing "ProjectEval: A Benchmark for Programming Agents Automated Evaluation on Project-Level Code Generation"
24 / 24 papers shown
Title |
---|
![]() DS-1000: A Natural and Reliable Benchmark for Data Science Code
Generation Yuhang Lai Chengxi Li Yiming Wang Tianyi Zhang Ruiqi Zhong Luke Zettlemoyer Scott Yih Daniel Fried Si-yi Wang Tao Yu |