Title |
---|
![]() DebugBench: Evaluating Debugging Capability of Large Language Models Runchu Tian Yining Ye Yujia Qin Xin Cong Yankai Lin ...Yesai Wu Haotian Hui Weichuan Liu Zhiyuan Liu Maosong Sun |
![]() Has Your Pretrained Model Improved? A Multi-head Posterior Based
Approach Prince Osei Aboagye Yan Zheng Junpeng Wang Uday Singh Saini Xin Dai ...Yujie Fan Zhongfang Zhuang Shubham Jain Liang Wang Wei Zhang |
![]() G-LLaVA: Solving Geometric Problem with Multi-Modal Large Language Model Jiahui Gao Renjie Pi Jipeng Zhang Jiacheng Ye Wanjun Zhong ...Lanqing Hong Jianhua Han Hang Xu Zhenguo Li Lingpeng Kong |