2402.12193
Cited By
v1
v2 (latest)
A Chinese Dataset for Evaluating the Safeguards in Large Language Models
19 February 2024
Yuxia Wang, Zenan Zhai, Haonan Li, Xudong Han, Lizhi Lin, Zhenxuan Zhang, Jingru Zhao, Preslav Nakov, Timothy Baldwin
Github (248★)
Papers citing "A Chinese Dataset for Evaluating the Safeguards in Large Language Models"
5 / 5 papers shown
AnswerCarefully: A Dataset for Improving the Safety of Japanese LLM Output
Hisami Suzuki, Satoru Katsumata, Takashi Kodama, Tetsuro Takahashi, Kouta Nakayama, Satoshi Sekine
52
0
0
03 Jun 2025
NurValues: Real-World Nursing Values Evaluation for Large Language Models in Clinical Context
Ben Yao, Qiuchi Li, Yazhou Zhang, Siyu Yang, Bohan Zhang, Prayag Tiwari, Jing Qin
111
0
0
13 May 2025
JailBench: A Comprehensive Chinese Security Assessment Benchmark for Large Language Models
Shuyi Liu, Simiao Cui, Haoran Bu, Yuming Shang, Xi Zhang
ELM
83
1
0
26 Feb 2025
ChineseSafe: A Chinese Benchmark for Evaluating Safety in Large Language Models
Han Zhang, Hongfu Gao, Qiang Hu, Guanhua Chen, L. Yang, Bingyi Jing, Hongxin Wei, Bing Wang, Haifeng Bai, Lei Yang
AILaw
ELM
136
4
0
24 Oct 2024
Raising the Bar: Investigating the Values of Large Language Models via Generative Evolving Testing
Han Jiang, Xiaoyuan Yi, Zhihua Wei, Ziang Xiao, Shu Wang, Xing Xie
ELM
ALM
160
8
0
20 Jun 2024