Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2106.01345
Cited By
Decision Transformer: Reinforcement Learning via Sequence Modeling
2 June 2021
Lili Chen
Kevin Lu
Aravind Rajeswaran
Kimin Lee
Aditya Grover
Michael Laskin
Pieter Abbeel
A. Srinivas
Igor Mordatch
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Decision Transformer: Reinforcement Learning via Sequence Modeling"
46 / 396 papers shown
Title
Bailando: 3D Dance Generation by Actor-Critic GPT with Choreographic Memory
Lian Siyao
Weijiang Yu
Tianpei Gu
Chunze Lin
Quan Wang
Chao Qian
Chen Change Loy
Ziwei Liu
SLR
42
184
0
24 Mar 2022
Near-optimal Offline Reinforcement Learning with Linear Representation: Leveraging Variance Information with Pessimism
Ming Yin
Yaqi Duan
Mengdi Wang
Yu-Xiang Wang
OffRL
34
66
0
11 Mar 2022
Policy Architectures for Compositional Generalization in Control
Allan Zhou
Vikash Kumar
Chelsea Finn
Aravind Rajeswaran
26
22
0
10 Mar 2022
LISA: Learning Interpretable Skill Abstractions from Language
Divyansh Garg
Skanda Vaidyanath
Kuno Kim
Jiaming Song
Stefano Ermon
LM&Ro
OffRL
156
29
0
28 Feb 2022
Consistent Dropout for Policy Gradient Reinforcement Learning
Matthew J. Hausknecht
Nolan Wagener
OffRL
27
10
0
23 Feb 2022
Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning
Chenjia Bai
Lingxiao Wang
Zhuoran Yang
Zhihong Deng
Animesh Garg
Peng Liu
Zhaoran Wang
OffRL
40
132
0
23 Feb 2022
Learning Relative Return Policies With Upside-Down Reinforcement Learning
Dylan R. Ashley
Kai Arulkumaran
Jürgen Schmidhuber
R. Srivastava
OffRL
24
1
0
23 Feb 2022
TransDreamer: Reinforcement Learning with Transformer World Models
Changgu Chen
Yi-Fu Wu
Jaesik Yoon
Sungjin Ahn
OffRL
32
91
0
19 Feb 2022
Flowformer: Linearizing Transformers with Conservation Flows
Haixu Wu
Jialong Wu
Jiehui Xu
Jianmin Wang
Mingsheng Long
14
90
0
13 Feb 2022
Supported Policy Optimization for Offline Reinforcement Learning
Jialong Wu
Haixu Wu
Zihan Qiu
Jianmin Wang
Mingsheng Long
OffRL
35
65
0
13 Feb 2022
Pre-Trained Language Models for Interactive Decision-Making
Shuang Li
Xavier Puig
Chris Paxton
Yilun Du
Clinton Jia Wang
...
Anima Anandkumar
Jacob Andreas
Igor Mordatch
Antonio Torralba
Yuke Zhu
LM&Ro
50
249
0
03 Feb 2022
Improving Sample Efficiency of Value Based Models Using Attention and Vision Transformers
Amir Ardalan Kalantari
Mohammad Amini
Sarath Chandar
Doina Precup
52
4
0
01 Feb 2022
Can Wikipedia Help Offline Reinforcement Learning?
Machel Reid
Yutaro Yamada
S. Gu
3DV
RALM
OffRL
140
95
0
28 Jan 2022
DNNFuser: Generative Pre-Trained Transformer as a Generalized Mapper for Layer Fusion in DNN Accelerators
Sheng-Chun Kao
Xiaoyu Huang
T. Krishna
AI4CE
35
9
0
26 Jan 2022
Goal-Conditioned Reinforcement Learning: Problems and Solutions
Minghuan Liu
Menghui Zhu
Weinan Zhang
35
133
0
20 Jan 2022
Look Closer: Bridging Egocentric and Third-Person Views with Transformers for Robotic Manipulation
Rishabh Jangir
Nicklas Hansen
Sambaran Ghosal
Mohit Jain
Xiaolong Wang
32
66
0
19 Jan 2022
RvS: What is Essential for Offline RL via Supervised Learning?
Scott Emmons
Benjamin Eysenbach
Ilya Kostrikov
Sergey Levine
OffRL
31
170
0
20 Dec 2021
Diaformer: Automatic Diagnosis via Symptoms Sequence Generation
Junying Chen
Dongfang Li
Qingcai Chen
Wenxiu Zhou
Xin Liu
MedIm
30
30
0
20 Dec 2021
Offline Pre-trained Multi-Agent Decision Transformer: One Big Sequence Model Tackles All SMAC Tasks
Linghui Meng
Muning Wen
Yaodong Yang
Chenyang Le
Xiyun Li
Weinan Zhang
Ying Wen
Haifeng Zhang
Jun Wang
Bo Xu
OffRL
28
38
0
06 Dec 2021
Quantile Filtered Imitation Learning
David Brandfonbrener
William F. Whitney
Rajesh Ranganath
Joan Bruna
33
6
0
02 Dec 2021
SWAT: Spatial Structure Within and Among Tokens
Kumara Kahatapitiya
Michael S. Ryoo
25
6
0
26 Nov 2021
CubeTR: Learning to Solve The Rubiks Cube Using Transformers
Mustafa Chasmai
ViT
37
1
0
11 Nov 2021
Transfer learning with causal counterfactual reasoning in Decision Transformers
Ayman Boustati
Hana Chockler
Daniel C. McNamee
CML
OffRL
LRM
21
9
0
27 Oct 2021
What Would Jiminy Cricket Do? Towards Agents That Behave Morally
Dan Hendrycks
Mantas Mazeika
Andy Zou
Sahil Patel
Christine Zhu
Jesus Navarro
D. Song
Bo-wen Li
Jacob Steinhardt
16
58
0
25 Oct 2021
Inductive Biases and Variable Creation in Self-Attention Mechanisms
Benjamin L. Edelman
Surbhi Goel
Sham Kakade
Cyril Zhang
27
117
0
19 Oct 2021
Offline Reinforcement Learning with Implicit Q-Learning
Ilya Kostrikov
Ashvin Nair
Sergey Levine
OffRL
214
848
0
12 Oct 2021
Relative Molecule Self-Attention Transformer
Lukasz Maziarka
Dawid Majchrowski
Tomasz Danel
Piotr Gaiñski
Jacek Tabor
Igor T. Podolak
Pawel M. Morkisz
Stanislaw Jastrzebski
MedIm
45
34
0
12 Oct 2021
A Closer Look at Advantage-Filtered Behavioral Cloning in High-Noise Datasets
J. E. Grigsby
Yanjun Qi
OffRL
29
5
0
10 Oct 2021
Pathologies in priors and inference for Bayesian transformers
Tristan Cinquin
Alexander Immer
Max Horn
Vincent Fortuin
UQCV
BDL
MedIm
34
9
0
08 Oct 2021
Offline RL With Resource Constrained Online Deployment
Jayanth Reddy Regatti
A. Deshmukh
Frank Cheng
Young Hun Jung
Abhishek Gupta
Ürün Dogan
OffRL
13
2
0
07 Oct 2021
An Offline Deep Reinforcement Learning for Maintenance Decision-Making
H. Khorasgani
Haiyan Wang
Chetan Gupta
Ahmed K. Farahat
KELM
OffRL
21
5
0
28 Sep 2021
Pix2seq: A Language Modeling Framework for Object Detection
Ting-Li Chen
Saurabh Saxena
Lala Li
David J. Fleet
Geoffrey E. Hinton
MLLM
ViT
VLM
244
344
0
22 Sep 2021
Evolving Decomposed Plasticity Rules for Information-Bottlenecked Meta-Learning
Fan Wang
Hao Tian
Haoyi Xiong
Hua Wu
Jie Fu
Yang Cao
Yu Kang
Haifeng Wang
AI4CE
15
3
0
08 Sep 2021
DeepAltTrip: Top-k Alternative Itineraries for Trip Recommendation
Syed Md. Mukit Rashid
Mohammed Eunus Ali
Muhammad Aamir Cheema
16
10
0
08 Sep 2021
Teaching Autoregressive Language Models Complex Tasks By Demonstration
Gabriel Recchia
26
22
0
05 Sep 2021
Boosting Search Engines with Interactive Agents
Leonard Adolphs
Benjamin Boerschinger
Christian Buck
Michelle Chen Huebscher
Massimiliano Ciaramita
...
Thomas Hofmann
Yannic Kilcher
Sascha Rothe
Pier Giuseppe Sessa
Lierni Sestorain Saralegui
LLMAG
26
24
0
01 Sep 2021
Pre-trained Language Models as Prior Knowledge for Playing Text-based Games
Ishika Singh
Gargi Singh
Ashutosh Modi
OffRL
AI4CE
27
29
0
18 Jul 2021
Behavioral Priors and Dynamics Models: Improving Performance and Domain Transfer in Offline RL
Catherine Cang
Aravind Rajeswaran
Pieter Abbeel
Michael Laskin
OffRL
32
29
0
16 Jun 2021
A Minimalist Approach to Offline Reinforcement Learning
Scott Fujimoto
S. Gu
OffRL
58
785
0
12 Jun 2021
Going Beyond Linear Transformers with Recurrent Fast Weight Programmers
Kazuki Irie
Imanol Schlag
Róbert Csordás
Jürgen Schmidhuber
33
57
0
11 Jun 2021
Offline Reinforcement Learning as One Big Sequence Modeling Problem
Michael Janner
Qiyang Li
Sergey Levine
OffRL
66
649
0
03 Jun 2021
Which transformer architecture fits my data? A vocabulary bottleneck in self-attention
Noam Wies
Yoav Levine
Daniel Jannai
Amnon Shashua
40
20
0
09 May 2021
Zero-Shot Text-to-Image Generation
Aditya A. Ramesh
Mikhail Pavlov
Gabriel Goh
Scott Gray
Chelsea Voss
Alec Radford
Mark Chen
Ilya Sutskever
VLM
255
4,805
0
24 Feb 2021
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
OffRL
GP
343
1,963
0
04 May 2020
Fine-Tuning Language Models from Human Preferences
Daniel M. Ziegler
Nisan Stiennon
Jeff Wu
Tom B. Brown
Alec Radford
Dario Amodei
Paul Christiano
G. Irving
ALM
301
1,616
0
18 Sep 2019
A Style-Based Generator Architecture for Generative Adversarial Networks
Tero Karras
S. Laine
Timo Aila
306
10,378
0
12 Dec 2018
Previous
1
2
3
4
5
6
7
8