Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2503.16529
Cited By
v1
v2 (latest)
Safety Evaluation and Enhancement of DeepSeek Models in Chinese Contexts
18 March 2025
Wenjing Zhang
Xuejiao Lei
Zhaoxiang Liu
Limin Han
Jiaojiao Zhao
Junting Guo
Zhenhong Long
Shu Yang
Meijuan An
Beibei Huang
Rongjia Du
Ning Wang
Kai Wang
Shiguo Lian
ELM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Safety Evaluation and Enhancement of DeepSeek Models in Chinese Contexts"
10 / 10 papers shown
Title
The Tower of Babel Revisited: Multilingual Jailbreak Prompts on Closed-Source Large Language Models
Linghan Huang
Haolin Jin
Zhaoge Bi
Pengyue Yang
Peizhou Zhao
Taozhao Chen
Xiongfei Wu
Lei Ma
Huaming Chen
AAML
59
0
0
18 May 2025
Safety Evaluation of DeepSeek Models in Chinese Contexts
Wenjing Zhang
Xuejiao Lei
Zhaoxiang Liu
Rongjia Du
Zhenhong Long
...
Jiaojiao Zhao
Minjie Hua
Chaoyang Ma
Kai Wang
Kai Wang
ELM
209
9
0
16 Feb 2025
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
DeepSeek-AI
Daya Guo
Dejian Yang
Haowei Zhang
Junxiao Song
...
Shiyu Wang
S. Yu
Shunfeng Zhou
Shuting Pan
S.S. Li
ReLM
VLM
OffRL
AI4TS
LRM
384
2,022
0
22 Jan 2025
CHiSafetyBench: A Chinese Hierarchical Safety Benchmark for Large Language Models
Wenjing Zhang
Xuejiao Lei
Zhaoxiang Liu
Meijuan An
Bikun Yang
Kaikai Zhao
Kai Wang
Shiguo Lian
ELM
99
8
0
14 Jun 2024
LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code
Naman Jain
King Han
Alex Gu
Wen-Ding Li
Fanjia Yan
Tianjun Zhang
Sida I. Wang
Armando Solar-Lezama
Koushik Sen
Ion Stoica
ELM
148
448
0
12 Mar 2024
HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal
Mantas Mazeika
Long Phan
Xuwang Yin
Andy Zou
Zifan Wang
...
Nathaniel Li
Steven Basart
Bo Li
David A. Forsyth
Dan Hendrycks
AAML
112
419
0
06 Feb 2024
Large Language Models for Generative Information Extraction: A Survey
Derong Xu
Wei-neng Chen
Wenjun Peng
Chao Zhang
Tong Xu
Xiangyu Zhao
Xian Wu
Yefeng Zheng
Yang Wang
Enhong Chen
156
170
0
29 Dec 2023
GPQA: A Graduate-Level Google-Proof Q&A Benchmark
David Rein
Betty Li Hou
Asa Cooper Stickland
Jackson Petty
Richard Yuanzhe Pang
Julien Dirani
Julian Michael
Samuel R. Bowman
AI4MH
ELM
148
737
0
20 Nov 2023
Let's Verify Step by Step
Hunter Lightman
V. Kosaraju
Yura Burda
Harrison Edwards
Bowen Baker
Teddy Lee
Jan Leike
John Schulman
Ilya Sutskever
K. Cobbe
ALM
OffRL
LRM
213
1,241
0
31 May 2023
GPT-4 Technical Report
OpenAI OpenAI
OpenAI Josh Achiam
Steven Adler
Sandhini Agarwal
Lama Ahmad
...
Shengjia Zhao
Tianhao Zheng
Juntang Zhuang
William Zhuk
Barret Zoph
LLMAG
MLLM
1.6K
14,828
0
15 Mar 2023
1