HumanEval Pro and MBPP Pro: Evaluating Large Language Models on Self-invoking Code Generation

Papers citing "HumanEval Pro and MBPP Pro: Evaluating Large Language Models on Self-invoking Code Generation"