2411.03814
Cited By
MRJ-Agent: An Effective Jailbreak Agent for Multi-Round Dialogue
8 January 2025
Fengxiang Wang, Ranjie Duan, Peng Xiao, Xiaojun Jia, Shiji Zhao, Chongwen Wang, YueFeng Chen, Hang Su, Jialing Tao, Hui Xue, Jun Zhu
LLMAG
Papers citing "MRJ-Agent: An Effective Jailbreak Agent for Multi-Round Dialogue"
3 / 3 papers shown
Cannot See the Forest for the Trees: Invoking Heuristics and Biases to Elicit Irrational Choices of LLMs
Haoming Yang, Ke Ma, Xiaojun Jia, Yingfei Sun, Qianqian Xu, Q. Huang
AAML
159
0
0
03 May 2025
Steering Dialogue Dynamics for Robustness against Multi-turn Jailbreaking Attacks
Hanjiang Hu, Alexander Robey, Changliu Liu
AAML
LLMSV
47
1
0
28 Feb 2025
AI Agents Under Threat: A Survey of Key Security Challenges and Future Pathways
Zehang Deng, Yongjian Guo, Changzhou Han, Wanlun Ma, Junwu Xiong, Sheng Wen, Yang Xiang
44
23
0
04 Jun 2024