Title |
---|
![]() AutoBench-V: Can Large Vision-Language Models Benchmark Themselves? Han Bao Yue Huang Yanbo Wang Jiayi Ye Xiangqi Wang Preslav Nakov Mohamed Elhoseiny Wei Wei Mohamed Elhoseiny Xiangliang Zhang |
![]() Justice or Prejudice? Quantifying Biases in LLM-as-a-Judge Jiayi Ye Yanbo Wang Yue Huang Dongping Chen Qihui Zhang ...Werner Geyer Chao Huang Pin-Yu Chen Nitesh Chawla Xiangliang Zhang |
![]() Qwen2.5-Coder Technical Report Binyuan Hui Jian Yang Zeyu Cui Jiaxi Yang Dayiheng Liu ...Fei Huang Xingzhang Ren Xuancheng Ren Jingren Zhou Junyang Lin |
![]() AugGPT: Leveraging ChatGPT for Text Data Augmentation Haixing Dai Zheng Liu Wenxiong Liao Xiaoke Huang Yihan Cao ...Lichao Sun Quanzheng Li Dinggang Shen Tianming Liu Xiang Li |
![]() InCoder: A Generative Model for Code Infilling and Synthesis Daniel Fried Armen Aghajanyan Jessy Lin Sida I. Wang Eric Wallace Freda Shi Ruiqi Zhong Wen-tau Yih Luke Zettlemoyer M. Lewis |