1912.13465
Reward-Conditioned Policies
31 December 2019
Aviral Kumar
Xue Bin Peng
Sergey Levine
Papers citing
"Reward-Conditioned Policies"
30 / 30 papers shown
Temporal Logic Specification-Conditioned Decision Transformer for Offline Safe Reinforcement Learning
Zijian Guo
Weichao Zhou
Wenchao Li
OffRL
105
2
0
28 Jan 2025
Predictive Coding for Decision Transformer
Tung M. Luu
Donghoon Lee
Chang D. Yoo
OffRL
66
2
0
04 Oct 2024
Towards Aligning Language Models with Textual Feedback
Sauc Abadal Lloret
Shehzaad Dhuliawala
K. Murugesan
Mrinmaya Sachan
VLM
53
1
0
24 Jul 2024
Improving Reward-Conditioned Policies for Multi-Armed Bandits using Normalized Weight Functions
Kai Xu
Farid Tajaddodianfar
Ben Allison
26
0
0
16 Jun 2024
UCB-driven Utility Function Search for Multi-objective Reinforcement Learning
Yucheng Shi
Alexandros Agapitos
David Lynch
Giorgio Cruciata
Cengis Hasan
Hao Wang
Yayu Yao
Aleksandar Milenovic
44
0
0
01 May 2024
CtRL-Sim: Reactive and Controllable Driving Agents with Offline Reinforcement Learning
Luke Rowe
Roger Girgis
Anthony Gosselin
Bruno Carrez
Florian Golemo
Felix Heide
Liam Paull
Christopher Pal
53
4
0
29 Mar 2024
Closing the Gap between TD Learning and Supervised Learning -- A Generalisation Point of View
Raj Ghugare
Matthieu Geist
Glen Berseth
Benjamin Eysenbach
OffRL
40
14
0
20 Jan 2024
An Invitation to Deep Reinforcement Learning
Bernhard Jaeger
Andreas Geiger
OffRL
OOD
80
5
0
13 Dec 2023
A Tractable Inference Perspective of Offline RL
Xuejie Liu
Hoang Trung-Dung
Guy Van den Broeck
Yitao Liang
OffRL
36
1
0
31 Oct 2023
ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning
Chenxiao Gao
Chenyang Wu
Mingjun Cao
Rui Kong
Zongzhang Zhang
Yang Yu
OffRL
41
13
0
12 Sep 2023
Hundreds Guide Millions: Adaptive Offline Reinforcement Learning with Expert Guidance
Qisen Yang
Shenzhi Wang
Qihang Zhang
Gao Huang
Shiji Song
OffRL
OnRL
35
8
0
04 Sep 2023
Optimal Goal-Reaching Reinforcement Learning via Quasimetric Learning
Tongzhou Wang
Antonio Torralba
Phillip Isola
Amy Zhang
OffRL
39
34
0
03 Apr 2023
Latent-Conditioned Policy Gradient for Multi-Objective Deep Reinforcement Learning
T. Kanazawa
Chetan Gupta
34
0
0
15 Mar 2023
Graph Decision Transformer
Shengchao Hu
Li Shen
Ya Zhang
Dacheng Tao
OffRL
41
15
0
07 Mar 2023
Language Decision Transformers with Exponential Tilt for Interactive Text Environments
Nicolas Angelard-Gontier
Pau Rodríguez López
I. Laradji
David Vazquez
C. Pal
OffRL
39
1
0
10 Feb 2023
Is Conditional Generative Modeling all you need for Decision-Making?
Anurag Ajay
Yilun Du
Abhi Gupta
J. Tenenbaum
Tommi Jaakkola
Pulkit Agrawal
DiffM
71
367
0
28 Nov 2022
Control Transformer: Robot Navigation in Unknown Environments through PRM-Guided Return-Conditioned Sequence Modeling
Daniel Lawson
A. H. Qureshi
27
8
0
11 Nov 2022
Dichotomy of Control: Separating What You Can Control from What You Cannot
Mengjiao Yang
Dale Schuurmans
Pieter Abbeel
Ofir Nachum
OffRL
30
42
0
24 Oct 2022
Implicit Offline Reinforcement Learning via Supervised Learning
Alexandre Piché
Rafael Pardiñas
David Vazquez
Igor Mordatch
C. Pal
SSL
OffRL
29
4
0
21 Oct 2022
A Policy-Guided Imitation Approach for Offline Reinforcement Learning
Haoran Xu
Li Jiang
Jianxiong Li
Xianyuan Zhan
OffRL
31
62
0
15 Oct 2022
Human-AI Coordination via Human-Regularized Search and Learning
Hengyuan Hu
David J. Wu
Adam Lerer
Jakob N. Foerster
Noam Brown
24
7
0
11 Oct 2022
Q-learning Decision Transformer: Leveraging Dynamic Programming for Conditional Sequence Modelling in Offline RL
Taku Yamagata
Ahmed Khalil
Raúl Santos-Rodríguez
OffRL
160
72
0
08 Sep 2022
Phasic Self-Imitative Reduction for Sparse-Reward Goal-Conditioned Reinforcement Learning
Yunfei Li
Tian Gao
Jiaqi Yang
Huazhe Xu
Yi Wu
OffRL
44
22
0
24 Jun 2022
Learning Relative Return Policies With Upside-Down Reinforcement Learning
Dylan R. Ashley
Kai Arulkumaran
Jürgen Schmidhuber
R. Srivastava
OffRL
24
1
0
23 Feb 2022
Can Wikipedia Help Offline Reinforcement Learning?
Machel Reid
Yutaro Yamada
S. Gu
3DV
RALM
OffRL
140
95
0
28 Jan 2022
Goal-Conditioned Reinforcement Learning: Problems and Solutions
Minghuan Liu
Menghui Zhu
Weinan Zhang
40
133
0
20 Jan 2022
RvS: What is Essential for Offline RL via Supervised Learning?
Scott Emmons
Benjamin Eysenbach
Ilya Kostrikov
Sergey Levine
OffRL
31
170
0
20 Dec 2021
Offline Reinforcement Learning as One Big Sequence Modeling Problem
Michael Janner
Qiyang Li
Sergey Levine
OffRL
71
651
0
03 Jun 2021
Critic Regularized Regression
Ziyun Wang
Alexander Novikov
Konrad Zolna
Jost Tobias Springenberg
Scott E. Reed
...
Noah Y. Siegel
J. Merel
Çağlar Gülçehre
N. Heess
Nando de Freitas
OffRL
36
319
0
26 Jun 2020
Emergence of Locomotion Behaviours in Rich Environments
N. Heess
TB Dhruva
S. Sriram
Jay Lemmon
J. Merel
...
Tom Erez
Ziyun Wang
S. M. Ali Eslami
Martin Riedmiller
David Silver
143
928
0
07 Jul 2017