Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2402.01586
Cited By
v1
v2 (latest)
TrustAgent: Towards Safe and Trustworthy LLM-based Agents through Agent Constitution
2 February 2024
Wenyue Hua
Xianjun Yang
Zelong Li
Cheng Wei
Yongfeng Zhang
LLMAG
Re-assign community
ArXiv (abs)
PDF
HTML
Github (41★)
Papers citing
"TrustAgent: Towards Safe and Trustworthy LLM-based Agents through Agent Constitution"
8 / 8 papers shown
Title
SoK: The Privacy Paradox of Large Language Models: Advancements, Privacy Risks, and Mitigation
Yashothara Shanmugarasa
Ming Ding
M. Chamikara
Thierry Rakotoarivelo
PILM
AILaw
76
0
0
15 Jun 2025
Application-Driven Value Alignment in Agentic AI Systems: Survey and Perspectives
Wei Zeng
Hengshu Zhu
Chuan Qin
Han Wu
Yihang Cheng
...
Xiaowei Jin
Yinuo Shen
Zhenxing Wang
Feimin Zhong
Hui Xiong
AI4TS
65
0
0
11 Jun 2025
Comprehensive Vulnerability Analysis is Necessary for Trustworthy LLM-MAS
Pengfei He
Yue Xing
Shen Dong
Juanhui Li
Zhenwei Dai
...
Hui Liu
Han Xu
Zhen Xiang
Charu C. Aggarwal
Hui Liu
LLMAG
70
0
0
02 Jun 2025
Sentient Agent as a Judge: Evaluating Higher-Order Social Cognition in Large Language Models
Bang Zhang
Ruotian Ma
Qingxuan Jiang
Peisong Wang
Jiaqi Chen
...
Fanghua Ye
Jian Li
Yifan Yang
Zhaopeng Tu
Xiaolong Li
LLMAG
ELM
ALM
254
0
1
01 May 2025
RAG LLMs are Not Safer: A Safety Analysis of Retrieval-Augmented Generation for Large Language Models
Bang An
Shiyue Zhang
Mark Dredze
156
5
0
25 Apr 2025
Stay Focused: Problem Drift in Multi-Agent Debate
Jonas Becker
Lars Benedikt Kaesberg
Andreas Stephan
Jan Philip Wahle
Terry Ruas
Bela Gipp
143
2
0
26 Feb 2025
Agent Security Bench (ASB): Formalizing and Benchmarking Attacks and Defenses in LLM-based Agents
H. Zhang
Jingyuan Huang
Kai Mei
Yifei Yao
Zhenting Wang
Chenlu Zhan
Hongwei Wang
Yongfeng Zhang
AAML
LLMAG
ELM
203
40
0
03 Oct 2024
A Survey of Language-Based Communication in Robotics
William Hunt
Sarvapali D. Ramchurn
Mohammad D. Soorati
LM&Ro
248
13
0
06 Jun 2024
1