Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2402.05829
Cited By
Limitations of Agents Simulated by Predictive Models
8 February 2024
Raymond Douglas
Jacek Karwowski
Chan Bae
Andis Draguns
Victoria Krakovna
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Limitations of Agents Simulated by Predictive Models"
12 / 12 papers shown
Title
Reinforced Self-Training (ReST) for Language Modeling
Çağlar Gülçehre
T. Paine
S. Srinivasan
Ksenia Konyushkova
L. Weerts
...
Chenjie Gu
Wolfgang Macherey
Arnaud Doucet
Orhan Firat
Nando de Freitas
OffRL
123
308
0
17 Aug 2023
Direct Preference Optimization: Your Language Model is Secretly a Reward Model
Rafael Rafailov
Archit Sharma
E. Mitchell
Stefano Ermon
Christopher D. Manning
Chelsea Finn
ALM
389
4,163
0
29 May 2023
Measuring Progress on Scalable Oversight for Large Language Models
Sam Bowman
Jeeyoon Hyun
Ethan Perez
Edwin Chen
Craig Pettit
...
Tristan Hume
Yuntao Bai
Zac Hatfield-Dodds
Benjamin Mann
Jared Kaplan
ALM
ELM
79
132
0
04 Nov 2022
The Alignment Problem from a Deep Learning Perspective
Richard Ngo
Lawrence Chan
Sören Mindermann
119
192
0
30 Aug 2022
Causal Imitation Learning with Unobserved Confounders
Junzhe Zhang
D. Kumor
Elias Bareinboim
CML
75
76
0
12 Aug 2022
Is Power-Seeking AI an Existential Risk?
Joseph Carlsmith
ELM
67
87
0
16 Jun 2022
Shaking the foundations: delusions in sequence models for interaction and control
Pedro A. Ortega
M. Kunesch
Grégoire Delétang
Tim Genewein
Jordi Grau-Moya
...
Yutian Chen
Scott E. Reed
Marcus Hutter
Nando de Freitas
Shane Legg
80
64
0
20 Oct 2021
Dota 2 with Large Scale Deep Reinforcement Learning
OpenAI OpenAI
:
Christopher Berner
Greg Brockman
Brooke Chan
...
Szymon Sidor
Ilya Sutskever
Jie Tang
Filip Wolski
Susan Zhang
GNN
VLM
CLL
AI4CE
LRM
169
1,838
0
13 Dec 2019
Reinforcement Learning and Control as Probabilistic Inference: Tutorial and Review
Sergey Levine
AI4CE
BDL
91
674
0
02 May 2018
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
317
8,420
0
04 Jan 2018
Deep reinforcement learning from human preferences
Paul Christiano
Jan Leike
Tom B. Brown
Miljan Martic
Shane Legg
Dario Amodei
218
3,377
0
12 Jun 2017
Reinforcement Learning with Deep Energy-Based Policies
Tuomas Haarnoja
Haoran Tang
Pieter Abbeel
Sergey Levine
115
1,348
0
27 Feb 2017
1