Title |
---|
DebugBench: Evaluating Debugging Capability of Large Language Models. Runchu Tian, Yining Ye, Yujia Qin, Xin Cong, Yankai Lin, ... Yesai Wu, Haotian Hui, Weichuan Liu, Zhiyuan Liu, Maosong Sun |
Lemur: Harmonizing Natural Language and Code for Language Agents. Yiheng Xu, Hongjin Su, Chen Xing, Boyu Mi, Qian Liu, ... Siheng Zhao, Lingpeng Kong, Bailin Wang, Caiming Xiong, Tao Yu |