Deception in Reinforced Autonomous Agents: The Unconventional Rabbit Hat Trick in Legislation

7 May 2024

Papers citing "Deception in Reinforced Autonomous Agents: The Unconventional Rabbit Hat Trick in Legislation"


Title
CRITIC: Large Language Models Can Self-Correct with Tool-Interactive Critiquing; Zhibin Gou, Zhihong Shao, Yeyun Gong, Yelong Shen, Yujiu Yang, Nan Duan, Weizhu Chen; KELM, LRM; 79 394 0; 19 May 2023
The Internal State of an LLM Knows When It's Lying; A. Azaria, Tom Michael Mitchell; HILM; 275 344 0; 26 Apr 2023
Toxicity in ChatGPT: Analyzing Persona-assigned Language Models; Ameet Deshpande, Vishvak Murahari, Tanmay Rajpurohit, Ashwin Kalyan, Karthik Narasimhan; LM&MA, LLMAG; 75 369 0; 11 Apr 2023
Large Language Models Can Be Used to Estimate the Latent Positions of Politicians; Patrick Y. Wu, Jonathan Nagler, Joshua A. Tucker, Solomon Messing; 163 28 0; 21 Mar 2023
The Surprising Creativity of Digital Evolution: A Collection of Anecdotes from the Evolutionary Computation and Artificial Life Research Communities; Joel Lehman, Jeff Clune, D. Misevic, C. Adami, L. Altenberg, ..., Danesh Tarapore, S. Thibault, Westley Weimer, R. Watson, Jason Yosinski; 119 282 0; 09 Mar 2018