Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2401.15927
Cited By
E-EVAL: A Comprehensive Chinese K-12 Education Evaluation Benchmark for Large Language Models
29 January 2024
Jinchang Hou
Chang Ao
Haihong Wu
Xiangtao Kong
Zhigang Zheng
Daijia Tang
Chengming Li
Xiping Hu
Ruifeng Xu
Shiwen Ni
Min Yang
AI4Ed
ELM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"E-EVAL: A Comprehensive Chinese K-12 Education Evaluation Benchmark for Large Language Models"
9 / 9 papers shown
Title
CPG-EVAL: A Multi-Tiered Benchmark for Evaluating the Chinese Pedagogical Grammar Competence of Large Language Models
Dong Wang
ELM
33
0
0
17 Apr 2025
UGPhysics: A Comprehensive Benchmark for Undergraduate Physics Reasoning with Large Language Models
Xin Xu
Qiyun Xu
Tong Xiao
Tianhao Chen
Yuchen Yan
Jiaxin Zhang
Shizhe Diao
Can Yang
Yang Wang
ELM
LRM
AI4CE
113
4
0
01 Feb 2025
IPEval: A Bilingual Intellectual Property Agency Consultation Evaluation Benchmark for Large Language Models
Qiyao Wang
Jianguo Huang
Shule Lu
Yuan Lin
Kan Xu
Liang Yang
Hongfei Lin
ELM
32
0
0
18 Jun 2024
How Easily do Irrelevant Inputs Skew the Responses of Large Language Models?
Siye Wu
Jian Xie
Jiangjie Chen
Tinghui Zhu
Kai Zhang
Yanghua Xiao
KELM
48
20
0
04 Apr 2024
GLM-130B: An Open Bilingual Pre-trained Model
Aohan Zeng
Xiao Liu
Zhengxiao Du
Zihan Wang
Hanyu Lai
...
Jidong Zhai
Wenguang Chen
Peng Zhang
Yuxiao Dong
Jie Tang
BDL
LRM
253
1,073
0
05 Oct 2022
Large Language Models are Zero-Shot Reasoners
Takeshi Kojima
S. Gu
Machel Reid
Yutaka Matsuo
Yusuke Iwasawa
ReLM
LRM
328
4,139
0
24 May 2022
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&Ro
LRM
AI4CE
ReLM
416
8,650
0
28 Jan 2022
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
Alex Jinpeng Wang
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
299
6,984
0
20 Apr 2018
Teaching Machines to Read and Comprehend
Karl Moritz Hermann
Tomás Kociský
Edward Grefenstette
L. Espeholt
W. Kay
Mustafa Suleyman
Phil Blunsom
211
3,513
0
10 Jun 2015
1