GSO: Challenging Software Optimization Tasks for Evaluating SWE-Agents
Papers citing "GSO: Challenging Software Optimization Tasks for Evaluating SWE-Agents"
Title |
---|
DS-1000: A Natural and Reliable Benchmark for Data Science Code Generation. Yuhang Lai, Chengxi Li, Yiming Wang, Tianyi Zhang, Ruiqi Zhong, Luke Zettlemoyer, Scott Yih, Daniel Fried, Si-yi Wang, Tao Yu |