2410.07331
Cited By
DA-Code: Agent Data Science Code Generation Benchmark for Large Language Models
9 October 2024
Yiming Huang
Jianwen Luo
Yan Yu
Yitong Zhang
Fangyu Lei
Yifan Wei
Shizhu He
Lifu Huang
Xiao Liu
Jun Zhao
Kang Liu
ELM
ALM
AI4CE
Papers citing "DA-Code: Agent Data Science Code Generation Benchmark for Large Language Models"
3 / 3 papers shown
Title
MLRC-Bench: Can Language Agents Solve Machine Learning Research Challenges?
Yunxiang Zhang
Muhammad Khalifa
Shitanshu Bhushan
Grant D Murphy
Lajanugen Logeswaran
Jaekyeom Kim
Moontae Lee
Honglak Lee
Lu Wang
LLMAG
ELM
64
0
0
13 Apr 2025
BixBench: a Comprehensive Benchmark for LLM-based Agents in Computational Biology
Ludovico Mitchener
Jon M. Laurent
Benjamin Tenmann
Siddharth Narayanan
Geemi P Wellawatte
A. White
Lorenzo Sani
Samuel G. Rodriques
LLMAG
LM&MA
ELM
64
4
0
28 Feb 2025
Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows
Fangyu Lei
Jixuan Chen
Yuxiao Ye
Ruisheng Cao
Dongchan Shin
...
Caiming Xiong
Ruoxi Sun
Qian Liu
Sida I. Wang
Tao Yu
LMTD
85
21
0
12 Nov 2024