Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1912.02877
Cited By
Training Agents using Upside-Down Reinforcement Learning
5 December 2019
R. Srivastava
Pranav Shyam
Filipe Wall Mutz
Wojciech Ja'skowski
Jürgen Schmidhuber
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Training Agents using Upside-Down Reinforcement Learning"
41 / 41 papers shown
Title
Beyond the Known: Decision Making with Counterfactual Reasoning Decision Transformer
Minh Hoang Nguyen
Linh Le Pham Van
Thommen George Karimpanal
Sunil Gupta
Hung Le
OffRL
LRM
37
0
0
14 May 2025
Predictive Coding for Decision Transformer
Tung M. Luu
Donghoon Lee
Chang D. Yoo
OffRL
66
2
0
04 Oct 2024
Gaussian Splatting to Real World Flight Navigation Transfer with Liquid Networks
Alex Quach
Makram Chahine
Alexander Amini
Ramin Hasani
Daniela Rus
3DGS
39
7
0
21 Jun 2024
Improving Reward-Conditioned Policies for Multi-Armed Bandits using Normalized Weight Functions
Kai Xu
Farid Tajaddodianfar
Ben Allison
21
0
0
16 Jun 2024
CtRL-Sim: Reactive and Controllable Driving Agents with Offline Reinforcement Learning
Luke Rowe
Roger Girgis
Anthony Gosselin
Bruno Carrez
Florian Golemo
Felix Heide
Liam Paull
Christopher Pal
46
4
0
29 Mar 2024
MORE-3S:Multimodal-based Offline Reinforcement Learning with Shared Semantic Spaces
Tianyu Zheng
Ge Zhang
Xingwei Qu
Ming Kuang
Stephen W. Huang
Zhaofeng He
OffRL
58
1
0
20 Feb 2024
Return-Aligned Decision Transformer
Tsunehiko Tanaka
Kenshi Abe
Kaito Ariu
Tetsuro Morimura
Edgar Simo-Serra
OffRL
69
1
0
06 Feb 2024
An Invitation to Deep Reinforcement Learning
Bernhard Jaeger
Andreas Geiger
OffRL
OOD
80
5
0
13 Dec 2023
A Tractable Inference Perspective of Offline RL
Xuejie Liu
Guy Van den Broeck
Mathias Niepert
Yitao Liang
OffRL
36
1
0
31 Oct 2023
Hundreds Guide Millions: Adaptive Offline Reinforcement Learning with Expert Guidance
Qisen Yang
Shenzhi Wang
Qihang Zhang
Gao Huang
Shiji Song
OffRL
OnRL
30
8
0
04 Sep 2023
Passive learning of active causal strategies in agents and language models
Andrew Kyle Lampinen
Stephanie C. Y. Chan
Ishita Dasgupta
A. Nam
Jane X. Wang
31
15
0
25 May 2023
Distance Weighted Supervised Learning for Offline Interaction Data
Joey Hejna
Jensen Gao
Dorsa Sadigh
OffRL
38
13
0
26 Apr 2023
Latent-Conditioned Policy Gradient for Multi-Objective Deep Reinforcement Learning
T. Kanazawa
Chetan Gupta
29
0
0
15 Mar 2023
Graph Decision Transformer
Shengchao Hu
Li Shen
Ya Zhang
Dacheng Tao
OffRL
36
15
0
07 Mar 2023
Anti-Exploration by Random Network Distillation
Alexander Nikulin
Vladislav Kurenkov
Denis Tarasov
Sergey Kolesnikov
38
24
0
31 Jan 2023
Multi-embodiment Legged Robot Control as a Sequence Modeling Problem
Chenyi Yu
Weinan Zhang
H. Lai
Zheng Tian
L. Kneip
Jun Wang
33
15
0
18 Dec 2022
Is Conditional Generative Modeling all you need for Decision-Making?
Anurag Ajay
Yilun Du
Abhi Gupta
J. Tenenbaum
Tommi Jaakkola
Pulkit Agrawal
DiffM
68
365
0
28 Nov 2022
Hypernetworks for Zero-shot Transfer in Reinforcement Learning
S. Rezaei-Shoshtari
Charlotte Morissette
F. Hogan
Gregory Dudek
David Meger
OffRL
17
14
0
28 Nov 2022
Control Transformer: Robot Navigation in Unknown Environments through PRM-Guided Return-Conditioned Sequence Modeling
Daniel Lawson
A. H. Qureshi
24
8
0
11 Nov 2022
Dichotomy of Control: Separating What You Can Control from What You Cannot
Mengjiao Yang
Dale Schuurmans
Pieter Abbeel
Ofir Nachum
OffRL
30
42
0
24 Oct 2022
Implicit Offline Reinforcement Learning via Supervised Learning
Alexandre Piché
Rafael Pardiñas
David Vazquez
Igor Mordatch
C. Pal
SSL
OffRL
29
4
0
21 Oct 2022
A Policy-Guided Imitation Approach for Offline Reinforcement Learning
Haoran Xu
Li Jiang
Jianxiong Li
Xianyuan Zhan
OffRL
26
62
0
15 Oct 2022
Reliable Conditioning of Behavioral Cloning for Offline Reinforcement Learning
Tung Nguyen
Qinqing Zheng
Aditya Grover
OffRL
34
6
0
11 Oct 2022
Q-learning Decision Transformer: Leveraging Dynamic Programming for Conditional Sequence Modelling in Offline RL
Taku Yamagata
Ahmed Khalil
Raúl Santos-Rodríguez
OffRL
160
72
0
08 Sep 2022
Hindsight Learning for MDPs with Exogenous Inputs
Sean R. Sinclair
Felipe Vieira Frujeri
Ching-An Cheng
Luke Marshall
Hugo Barbalho
Jingling Li
Jennifer Neville
Ishai Menache
Adith Swaminathan
18
23
0
13 Jul 2022
Goal-Conditioned Generators of Deep Policies
Francesco Faccio
Vincent Herrmann
Aditya A. Ramesh
Louis Kirsch
Jürgen Schmidhuber
OffRL
40
8
0
04 Jul 2022
Contrastive Learning as Goal-Conditioned Reinforcement Learning
Benjamin Eysenbach
Tianjun Zhang
Ruslan Salakhutdinov
Sergey Levine
SSL
OffRL
39
141
0
15 Jun 2022
Upside-Down Reinforcement Learning Can Diverge in Stochastic Environments With Episodic Resets
M. Strupl
Francesco Faccio
Dylan R. Ashley
Jürgen Schmidhuber
R. Srivastava
17
9
0
13 May 2022
Unsupervised Learning of Temporal Abstractions with Slot-based Transformers
Anand Gopalakrishnan
Kazuki Irie
Jürgen Schmidhuber
Sjoerd van Steenkiste
OffRL
26
16
0
25 Mar 2022
Learning Relative Return Policies With Upside-Down Reinforcement Learning
Dylan R. Ashley
Kai Arulkumaran
Jürgen Schmidhuber
R. Srivastava
OffRL
24
1
0
23 Feb 2022
Can Wikipedia Help Offline Reinforcement Learning?
Machel Reid
Yutaro Yamada
S. Gu
3DV
RALM
OffRL
140
95
0
28 Jan 2022
Goal-Conditioned Reinforcement Learning: Problems and Solutions
Minghuan Liu
Menghui Zhu
Weinan Zhang
35
133
0
20 Jan 2022
RvS: What is Essential for Offline RL via Supervised Learning?
Scott Emmons
Benjamin Eysenbach
Ilya Kostrikov
Sergey Levine
OffRL
31
170
0
20 Dec 2021
Teaching Autoregressive Language Models Complex Tasks By Demonstration
Gabriel Recchia
28
22
0
05 Sep 2021
Going Beyond Linear Transformers with Recurrent Fast Weight Programmers
Kazuki Irie
Imanol Schlag
Róbert Csordás
Jürgen Schmidhuber
33
57
0
11 Jun 2021
Simplifying Deep Reinforcement Learning via Self-Supervision
Daochen Zha
Kwei-Herng Lai
Kaixiong Zhou
Xia Hu
SSL
49
15
0
10 Jun 2021
Offline Reinforcement Learning as One Big Sequence Modeling Problem
Michael Janner
Qiyang Li
Sergey Levine
OffRL
68
651
0
03 Jun 2021
Rapid Exploration for Open-World Navigation with Latent Goal Models
Dhruv Shah
Benjamin Eysenbach
G. Kahn
Nicholas Rhinehart
Sergey Levine
32
70
0
12 Apr 2021
Foresee then Evaluate: Decomposing Value Estimation with Latent Future Prediction
Hongyao Tang
Jianye Hao
Guangyong Chen
Pengfei Chen
Chong Chen
Yaodong Yang
Lu Zhang
Wulong Liu
Zhaopeng Meng
OffRL
35
4
0
03 Mar 2021
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
OffRL
GP
343
1,968
0
04 May 2020
RUDDER: Return Decomposition for Delayed Rewards
Jose A. Arjona-Medina
Michael Gillhofer
Michael Widrich
Thomas Unterthiner
Johannes Brandstetter
Sepp Hochreiter
30
212
0
20 Jun 2018
1