ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2110.13136
  4. Cited By
What Would Jiminy Cricket Do? Towards Agents That Behave Morally

What Would Jiminy Cricket Do? Towards Agents That Behave Morally

25 October 2021
Dan Hendrycks
Mantas Mazeika
Andy Zou
Sahil Patel
Christine Zhu
Jesus Navarro
D. Song
Yue Liu
Jacob Steinhardt
ArXivPDFHTML

Papers citing "What Would Jiminy Cricket Do? Towards Agents That Behave Morally"

25 / 25 papers shown
Title
The Odyssey of the Fittest: Can Agents Survive and Still Be Good?
The Odyssey of the Fittest: Can Agents Survive and Still Be Good?
Dylan Waldner
Risto Miikkulainen
69
0
0
08 Feb 2025
SafetyPrompts: a Systematic Review of Open Datasets for Evaluating and Improving Large Language Model Safety
SafetyPrompts: a Systematic Review of Open Datasets for Evaluating and Improving Large Language Model Safety
Paul Röttger
Fabio Pernisi
Bertie Vidgen
Dirk Hovy
ELM
KELM
82
33
0
08 Apr 2024
Unsolved Problems in ML Safety
Unsolved Problems in ML Safety
Dan Hendrycks
Nicholas Carlini
John Schulman
Jacob Steinhardt
211
282
0
28 Sep 2021
Open-Ended Learning Leads to Generally Capable Agents
Open-Ended Learning Leads to Generally Capable Agents
Open-Ended Learning Team
Adam Stooke
Anuj Mahajan
Catarina Barros
Charlie Deck
...
Nicolas Porcel
Roberta Raileanu
Steph Hughes-Fitt
Valentin Dalibard
Wojciech M. Czarnecki
62
184
0
27 Jul 2021
Offline Reinforcement Learning as One Big Sequence Modeling Problem
Offline Reinforcement Learning as One Big Sequence Modeling Problem
Michael Janner
Qiyang Li
Sergey Levine
OffRL
90
665
0
03 Jun 2021
Decision Transformer: Reinforcement Learning via Sequence Modeling
Decision Transformer: Reinforcement Learning via Sequence Modeling
Lili Chen
Kevin Lu
Aravind Rajeswaran
Kimin Lee
Aditya Grover
Michael Laskin
Pieter Abbeel
A. Srinivas
Igor Mordatch
OffRL
71
1,608
0
02 Jun 2021
Training Value-Aligned Reinforcement Learning Agents Using a Normative
  Prior
Training Value-Aligned Reinforcement Learning Agents Using a Normative Prior
Md Sultan al Nahian
Spencer Frazier
Brent Harrison
Mark O. Riedl
49
18
0
19 Apr 2021
Keep CALM and Explore: Language Models for Action Generation in
  Text-based Games
Keep CALM and Explore: Language Models for Action Generation in Text-based Games
Shunyu Yao
Rohan Rao
Matthew J. Hausknecht
Karthik Narasimhan
LLMAG
LM&Ro
34
128
0
06 Oct 2020
Aligning AI With Shared Human Values
Aligning AI With Shared Human Values
Dan Hendrycks
Collin Burns
Steven Basart
Andrew Critch
Jingkai Li
D. Song
Jacob Steinhardt
85
540
0
05 Aug 2020
How to Avoid Being Eaten by a Grue: Structured Exploration Strategies
  for Textual Worlds
How to Avoid Being Eaten by a Grue: Structured Exploration Strategies for Textual Worlds
Prithviraj Ammanabrolu
Ethan Tien
Matthew J. Hausknecht
Mark O. Riedl
LLMAG
48
50
0
12 Jun 2020
Avoiding Side Effects in Complex Environments
Avoiding Side Effects in Complex Environments
Alexander Matt Turner
Neale Ratzlaff
Prasad Tadepalli
35
34
0
11 Jun 2020
Learning Dynamic Belief Graphs to Generalize on Text-Based Games
Learning Dynamic Belief Graphs to Generalize on Text-Based Games
Ashutosh Adhikari
Xingdi Yuan
Marc-Alexandre Côté
M. Zelinka
Marc-Antoine Rondeau
Romain Laroche
Pascal Poupart
Jian Tang
Adam Trischler
William L. Hamilton
AI4CE
41
81
0
21 Feb 2020
Graph Constrained Reinforcement Learning for Natural Language Action
  Spaces
Graph Constrained Reinforcement Learning for Natural Language Action Spaces
Prithviraj Ammanabrolu
Matthew J. Hausknecht
AI4CE
LLMAG
34
129
0
23 Jan 2020
Learning Human Objectives by Evaluating Hypothetical Behavior
Learning Human Objectives by Evaluating Hypothetical Behavior
S. Reddy
Anca Dragan
Sergey Levine
Shane Legg
Jan Leike
25
75
0
05 Dec 2019
SafeLife 1.0: Exploring Side Effects in Complex Environments
SafeLife 1.0: Exploring Side Effects in Complex Environments
Carroll L. Wainwright
P. Eckersley
29
12
0
03 Dec 2019
Interactive Fiction Games: A Colossal Adventure
Interactive Fiction Games: A Colossal Adventure
Matthew J. Hausknecht
Prithviraj Ammanabrolu
Marc-Alexandre Côté
Xingdi Yuan
LLMAG
LM&Ro
AI4CE
28
196
0
11 Sep 2019
RoBERTa: A Robustly Optimized BERT Pretraining Approach
RoBERTa: A Robustly Optimized BERT Pretraining Approach
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
AIMat
346
24,160
0
26 Jul 2019
NAIL: A General Interactive Fiction Agent
NAIL: A General Interactive Fiction Agent
Matthew J. Hausknecht
Ricky Loynd
Greg Yang
Adith Swaminathan
Jason D. Williams
23
39
0
12 Feb 2019
Playing Text-Adventure Games with Graph-Based Deep Reinforcement
  Learning
Playing Text-Adventure Games with Graph-Based Deep Reinforcement Learning
Prithviraj Ammanabrolu
Mark O. Riedl
GNN
40
122
0
04 Dec 2018
The Text-Based Adventure AI Competition
The Text-Based Adventure AI Competition
Timothy Atkinson
Hendrik Baier
Tara Copplestone
Sam Devlin
J. Swan
9
23
0
03 Aug 2018
TextWorld: A Learning Environment for Text-based Games
TextWorld: A Learning Environment for Text-based Games
Marc-Alexandre Côté
Ákos Kádár
Xingdi Yuan
Ben A. Kybartas
Tavian Barnes
...
Matthew J. Hausknecht
Layla El Asri
Mahmoud Adada
Wendy Tay
Adam Trischler
LLMAG
20
365
0
29 Jun 2018
Datasheets for Datasets
Datasheets for Datasets
Timnit Gebru
Jamie Morgenstern
Briana Vecchione
Jennifer Wortman Vaughan
Hanna M. Wallach
Hal Daumé
Kate Crawford
205
2,158
0
23 Mar 2018
Constrained Policy Optimization
Constrained Policy Optimization
Joshua Achiam
David Held
Aviv Tamar
Pieter Abbeel
78
1,313
0
30 May 2017
Cooperative Inverse Reinforcement Learning
Cooperative Inverse Reinforcement Learning
Dylan Hadfield-Menell
Anca Dragan
Pieter Abbeel
Stuart J. Russell
43
643
0
09 Jun 2016
Deep Reinforcement Learning with a Natural Language Action Space
Deep Reinforcement Learning with a Natural Language Action Space
Ji He
Jianshu Chen
Xiaodong He
Jianfeng Gao
Lihong Li
Li Deng
Mari Ostendorf
61
245
0
14 Nov 2015
1