Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2405.04325
Cited By
Deception in Reinforced Autonomous Agents: The Unconventional Rabbit Hat Trick in Legislation
7 May 2024
Atharvan Dogra
Ameet Deshpande
John Nay
Tanmay Rajpurohit
Ashwin Kalyan
Balaraman Ravindran
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Deception in Reinforced Autonomous Agents: The Unconventional Rabbit Hat Trick in Legislation"
5 / 5 papers shown
Title
CRITIC: Large Language Models Can Self-Correct with Tool-Interactive Critiquing
Zhibin Gou
Zhihong Shao
Yeyun Gong
Yelong Shen
Yujiu Yang
Nan Duan
Weizhu Chen
KELM
LRM
79
394
0
19 May 2023
The Internal State of an LLM Knows When It's Lying
A. Azaria
Tom Michael Mitchell
HILM
275
344
0
26 Apr 2023
Toxicity in ChatGPT: Analyzing Persona-assigned Language Models
Ameet Deshpande
Vishvak Murahari
Tanmay Rajpurohit
Ashwin Kalyan
Karthik Narasimhan
LM&MA
LLMAG
75
369
0
11 Apr 2023
Large Language Models Can Be Used to Estimate the Latent Positions of Politicians
Patrick Y. Wu
Jonathan Nagler
Joshua A. Tucker
Solomon Messing
163
28
0
21 Mar 2023
The Surprising Creativity of Digital Evolution: A Collection of Anecdotes from the Evolutionary Computation and Artificial Life Research Communities
Joel Lehman
Jeff Clune
D. Misevic
C. Adami
L. Altenberg
...
Danesh Tarapore
S. Thibault
Westley Weimer
R. Watson
Jason Yosinksi
119
282
0
09 Mar 2018
1