ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2106.01345
  4. Cited By
Decision Transformer: Reinforcement Learning via Sequence Modeling

Decision Transformer: Reinforcement Learning via Sequence Modeling

2 June 2021
Lili Chen
Kevin Lu
Aravind Rajeswaran
Kimin Lee
Aditya Grover
Michael Laskin
Pieter Abbeel
A. Srinivas
Igor Mordatch
    OffRL
ArXivPDFHTML

Papers citing "Decision Transformer: Reinforcement Learning via Sequence Modeling"

50 / 390 papers shown
Title
Pathway to Future Symbiotic Creativity
Pathway to Future Symbiotic Creativity
Yi-Ting Guo
Qi-fei Liu
Jie Chen
Wei Xue
Jie Fu
...
Fernando Rosas
Jeffrey Shaw
Xing Wu
Jiji Zhang
Jianliang Xu
34
0
0
18 Aug 2022
Private, Efficient, and Accurate: Protecting Models Trained by
  Multi-party Learning with Differential Privacy
Private, Efficient, and Accurate: Protecting Models Trained by Multi-party Learning with Differential Privacy
Wenqiang Ruan
Ming Xu
Wenjing Fang
Li Wang
Lei Wang
Wei Han
40
12
0
18 Aug 2022
Sampling Through the Lens of Sequential Decision Making
Sampling Through the Lens of Sequential Decision Making
J. Dou
Alvin Pan
Runxue Bao
Haiyi Mao
Lei Luo
Zhi-Hong Mao
26
19
0
17 Aug 2022
AutoCAT: Reinforcement Learning for Automated Exploration of
  Cache-Timing Attacks
AutoCAT: Reinforcement Learning for Automated Exploration of Cache-Timing Attacks
Mulong Luo
Wenjie Xiong
G. G. Lee
Yueying Li
Xiaomeng Yang
Amy Zhang
Yuandong Tian
Hsien-Hsin S. Lee
G. E. Suh
AAML
40
10
0
17 Aug 2022
MoCapAct: A Multi-Task Dataset for Simulated Humanoid Control
MoCapAct: A Multi-Task Dataset for Simulated Humanoid Control
Nolan Wagener
Andrey Kolobov
Felipe Vieira Frujeri
Ricky Loynd
Ching-An Cheng
Matthew J. Hausknecht
27
21
0
15 Aug 2022
LATTE: LAnguage Trajectory TransformEr
LATTE: LAnguage Trajectory TransformEr
A. Bucker
Luis F. C. Figueredo
Sami Haddadin
Ashish Kapoor
Shuang Ma
Sai H. Vemprala
Rogerio Bonatti
LM&Ro
39
59
0
04 Aug 2022
Language Model Cascades
Language Model Cascades
David Dohan
Winnie Xu
Aitor Lewkowycz
Jacob Austin
David Bieber
...
Henryk Michalewski
Rif A. Saurous
Jascha Narain Sohl-Dickstein
Kevin Patrick Murphy
Charles Sutton
ReLM
LRM
38
101
0
21 Jul 2022
Addressing Optimism Bias in Sequence Modeling for Reinforcement Learning
Addressing Optimism Bias in Sequence Modeling for Reinforcement Learning
Adam R. Villaflor
Zheng Huang
Swapnil Pande
John M. Dolan
J. Schneider
OffRL
25
24
0
21 Jul 2022
Discriminator-Weighted Offline Imitation Learning from Suboptimal
  Demonstrations
Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations
Haoran Xu
Xianyuan Zhan
Honglei Yin
Huiling Qin
OffRL
26
66
0
20 Jul 2022
Hindsight Learning for MDPs with Exogenous Inputs
Hindsight Learning for MDPs with Exogenous Inputs
Sean R. Sinclair
Felipe Vieira Frujeri
Ching-An Cheng
Luke Marshall
Hugo Barbalho
Jingling Li
Jennifer Neville
Ishai Menache
Adith Swaminathan
18
23
0
13 Jul 2022
Transformer Neural Processes: Uncertainty-Aware Meta Learning Via
  Sequence Modeling
Transformer Neural Processes: Uncertainty-Aware Meta Learning Via Sequence Modeling
Tung Nguyen
Aditya Grover
BDL
UQCV
19
99
0
09 Jul 2022
Finding Fallen Objects Via Asynchronous Audio-Visual Integration
Finding Fallen Objects Via Asynchronous Audio-Visual Integration
Chuang Gan
Yi Gu
Siyuan Zhou
Jeremy Schwartz
S. Alter
James Traer
Dan Gutfreund
J. Tenenbaum
Josh H. McDermott
Antonio Torralba
57
19
0
07 Jul 2022
Pure Transformers are Powerful Graph Learners
Pure Transformers are Powerful Graph Learners
Jinwoo Kim
Tien Dat Nguyen
Seonwoo Min
Sungjun Cho
Moontae Lee
Honglak Lee
Seunghoon Hong
43
191
0
06 Jul 2022
Transformers are Adaptable Task Planners
Transformers are Adaptable Task Planners
Vidhi Jain
Yixin Lin
Eric Undersander
Yonatan Bisk
Akshara Rai
25
24
0
06 Jul 2022
Goal-Conditioned Generators of Deep Policies
Goal-Conditioned Generators of Deep Policies
Francesco Faccio
Vincent Herrmann
Aditya A. Ramesh
Louis Kirsch
Jürgen Schmidhuber
OffRL
40
8
0
04 Jul 2022
Phasic Self-Imitative Reduction for Sparse-Reward Goal-Conditioned
  Reinforcement Learning
Phasic Self-Imitative Reduction for Sparse-Reward Goal-Conditioned Reinforcement Learning
Yunfei Li
Tian Gao
Jiaqi Yang
Huazhe Xu
Yi Wu
OffRL
31
22
0
24 Jun 2022
MineDojo: Building Open-Ended Embodied Agents with Internet-Scale
  Knowledge
MineDojo: Building Open-Ended Embodied Agents with Internet-Scale Knowledge
Linxi Fan
Guanzhi Wang
Yunfan Jiang
Ajay Mandlekar
Yuncong Yang
Haoyi Zhu
Andrew Tang
De-An Huang
Yuke Zhu
Anima Anandkumar
LM&Ro
57
352
0
17 Jun 2022
Bootstrapped Transformer for Offline Reinforcement Learning
Bootstrapped Transformer for Offline Reinforcement Learning
Kerong Wang
Hanye Zhao
Xufang Luo
Kan Ren
Weinan Zhang
Dongsheng Li
OffRL
16
37
0
17 Jun 2022
Towards Understanding How Machines Can Learn Causal Overhypotheses
Towards Understanding How Machines Can Learn Causal Overhypotheses
Eliza Kosoy
David M. Chan
Adrian Liu
Jasmine Collins
Bryanna Kaufmann
Sandy Han Huang
Jessica B. Hamrick
John F. Canny
Nan Rosemary Ke
Alison Gopnik
CML
AI4CE
28
18
0
16 Jun 2022
Contrastive Learning as Goal-Conditioned Reinforcement Learning
Contrastive Learning as Goal-Conditioned Reinforcement Learning
Benjamin Eysenbach
Tianjun Zhang
Ruslan Salakhutdinov
Sergey Levine
SSL
OffRL
37
141
0
15 Jun 2022
Transformers are Meta-Reinforcement Learners
Transformers are Meta-Reinforcement Learners
Luckeciano C. Melo
OffRL
41
50
0
14 Jun 2022
Challenges and Opportunities in Offline Reinforcement Learning from
  Visual Observations
Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations
Cong Lu
Philip J. Ball
Tim G. J. Rudner
Jack Parker-Holder
Michael A. Osborne
Yee Whye Teh
OffRL
32
52
0
09 Jun 2022
Unveiling Transformers with LEGO: a synthetic reasoning task
Unveiling Transformers with LEGO: a synthetic reasoning task
Yi Zhang
A. Backurs
Sébastien Bubeck
Ronen Eldan
Suriya Gunasekar
Tal Wagner
LRM
36
85
0
09 Jun 2022
Offline RL for Natural Language Generation with Implicit Language Q
  Learning
Offline RL for Natural Language Generation with Implicit Language Q Learning
Charles Burton Snell
Ilya Kostrikov
Yi Su
Mengjiao Yang
Sergey Levine
OffRL
144
103
0
05 Jun 2022
Incorporating Explicit Uncertainty Estimates into Deep Offline
  Reinforcement Learning
Incorporating Explicit Uncertainty Estimates into Deep Offline Reinforcement Learning
David Brandfonbrener
Rémi Tachet des Combes
Romain Laroche
OffRL
37
5
0
02 Jun 2022
Deep Transformer Q-Networks for Partially Observable Reinforcement
  Learning
Deep Transformer Q-Networks for Partially Observable Reinforcement Learning
Kevin Esslinger
Robert W. Platt
Chris Amato
OffRL
35
35
0
02 Jun 2022
Multi-Agent Reinforcement Learning is a Sequence Modeling Problem
Multi-Agent Reinforcement Learning is a Sequence Modeling Problem
Muning Wen
J. Kuba
Runji Lin
Weinan Zhang
Ying Wen
Jun Wang
Yaodong Yang
26
179
0
30 May 2022
Temporal Latent Bottleneck: Synthesis of Fast and Slow Processing
  Mechanisms in Sequence Learning
Temporal Latent Bottleneck: Synthesis of Fast and Slow Processing Mechanisms in Sequence Learning
Aniket Didolkar
Kshitij Gupta
Anirudh Goyal
Nitesh B. Gundavarapu
Alex Lamb
Nan Rosemary Ke
Yoshua Bengio
AI4CE
118
17
0
30 May 2022
Non-Markovian policies occupancy measures
Non-Markovian policies occupancy measures
Romain Laroche
Rémi Tachet des Combes
Jacob Buckman
OffRL
37
1
0
27 May 2022
Quark: Controllable Text Generation with Reinforced Unlearning
Quark: Controllable Text Generation with Reinforced Unlearning
Ximing Lu
Sean Welleck
Jack Hessel
Liwei Jiang
Lianhui Qin
Peter West
Prithviraj Ammanabrolu
Yejin Choi
MU
66
206
0
26 May 2022
Towards Learning Universal Hyperparameter Optimizers with Transformers
Towards Learning Universal Hyperparameter Optimizers with Transformers
Yutian Chen
Xingyou Song
Chansoo Lee
Zehao Wang
Qiuyi Zhang
...
Greg Kochanski
Arnaud Doucet
MarcÁurelio Ranzato
Sagi Perel
Nando de Freitas
32
63
0
26 May 2022
TALM: Tool Augmented Language Models
TALM: Tool Augmented Language Models
Aaron T Parisi
Yao-Min Zhao
Noah Fiedel
KELM
RALM
LLMAG
41
144
0
24 May 2022
Chain of Thought Imitation with Procedure Cloning
Chain of Thought Imitation with Procedure Cloning
Mengjiao Yang
Dale Schuurmans
Pieter Abbeel
Ofir Nachum
OffRL
35
30
0
22 May 2022
User-Interactive Offline Reinforcement Learning
User-Interactive Offline Reinforcement Learning
Phillip Swazinna
Steffen Udluft
Thomas Runkler
OffRL
25
11
0
21 May 2022
Planning with Diffusion for Flexible Behavior Synthesis
Planning with Diffusion for Flexible Behavior Synthesis
Michael Janner
Yilun Du
J. Tenenbaum
Sergey Levine
DiffM
202
633
0
20 May 2022
A Generalist Agent
A Generalist Agent
Scott E. Reed
Konrad Zolna
Emilio Parisotto
Sergio Gomez Colmenarejo
Alexander Novikov
...
Yutian Chen
R. Hadsell
Oriol Vinyals
Mahyar Bordbar
Nando de Freitas
LM&Ro
LLMAG
AI4CE
95
793
0
12 May 2022
Minimal Neural Network Models for Permutation Invariant Agents
Minimal Neural Network Models for Permutation Invariant Agents
J. Pedersen
S. Risi
51
3
0
12 May 2022
Symphony: Learning Realistic and Diverse Agents for Autonomous Driving
  Simulation
Symphony: Learning Realistic and Diverse Agents for Autonomous Driving Simulation
Maximilian Igl
Daewoo Kim
Alex Kuefler
Paul Mougin
Punit Shah
K. Shiarlis
Drago Anguelov
Mark Palatucci
Brandyn White
Shimon Whiteson
35
64
0
06 May 2022
HDGT: Heterogeneous Driving Graph Transformer for Multi-Agent Trajectory
  Prediction via Scene Encoding
HDGT: Heterogeneous Driving Graph Transformer for Multi-Agent Trajectory Prediction via Scene Encoding
Xiaosong Jia
Peng Wu
Li Chen
Yunxing Liu
Hongyang Li
Junchi Yan
32
122
0
30 Apr 2022
Towards Flexible Inference in Sequential Decision Problems via
  Bidirectional Transformers
Towards Flexible Inference in Sequential Decision Problems via Bidirectional Transformers
Micah Carroll
Jessy Lin
Orr Paradise
Raluca Georgescu
Mingfei Sun
...
Stephanie Milani
Katja Hofmann
Matthew J. Hausknecht
Anca Dragan
Sam Devlin
OffRL
40
10
0
28 Apr 2022
Can Foundation Models Perform Zero-Shot Task Specification For Robot
  Manipulation?
Can Foundation Models Perform Zero-Shot Task Specification For Robot Manipulation?
Yuchen Cui
S. Niekum
Abhi Gupta
Vikash Kumar
Aravind Rajeswaran
LM&Ro
30
74
0
23 Apr 2022
Jump-Start Reinforcement Learning
Jump-Start Reinforcement Learning
Ikechukwu Uchendu
Ted Xiao
Yao Lu
Banghua Zhu
Mengyuan Yan
...
Chuyuan Fu
Cong Ma
Jiantao Jiao
Sergey Levine
Karol Hausman
OffRL
OnRL
44
109
0
05 Apr 2022
Unsupervised Learning of Temporal Abstractions with Slot-based
  Transformers
Unsupervised Learning of Temporal Abstractions with Slot-based Transformers
Anand Gopalakrishnan
Kazuki Irie
Jürgen Schmidhuber
Sjoerd van Steenkiste
OffRL
26
16
0
25 Mar 2022
Reshaping Robot Trajectories Using Natural Language Commands: A Study of
  Multi-Modal Data Alignment Using Transformers
Reshaping Robot Trajectories Using Natural Language Commands: A Study of Multi-Modal Data Alignment Using Transformers
A. Bucker
Luis F. C. Figueredo
Sami Haddadin
Ashish Kapoor
Shuang Ma
Rogerio Bonatti
LM&Ro
40
49
0
25 Mar 2022
Bailando: 3D Dance Generation by Actor-Critic GPT with Choreographic
  Memory
Bailando: 3D Dance Generation by Actor-Critic GPT with Choreographic Memory
Lian Siyao
Weijiang Yu
Tianpei Gu
Chunze Lin
Quan Wang
Chao Qian
Chen Change Loy
Ziwei Liu
SLR
42
184
0
24 Mar 2022
Near-optimal Offline Reinforcement Learning with Linear Representation:
  Leveraging Variance Information with Pessimism
Near-optimal Offline Reinforcement Learning with Linear Representation: Leveraging Variance Information with Pessimism
Ming Yin
Yaqi Duan
Mengdi Wang
Yu-Xiang Wang
OffRL
34
66
0
11 Mar 2022
Policy Architectures for Compositional Generalization in Control
Policy Architectures for Compositional Generalization in Control
Allan Zhou
Vikash Kumar
Chelsea Finn
Aravind Rajeswaran
26
22
0
10 Mar 2022
LISA: Learning Interpretable Skill Abstractions from Language
LISA: Learning Interpretable Skill Abstractions from Language
Divyansh Garg
Skanda Vaidyanath
Kuno Kim
Jiaming Song
Stefano Ermon
LM&Ro
OffRL
156
29
0
28 Feb 2022
Consistent Dropout for Policy Gradient Reinforcement Learning
Consistent Dropout for Policy Gradient Reinforcement Learning
Matthew J. Hausknecht
Nolan Wagener
OffRL
27
10
0
23 Feb 2022
Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement
  Learning
Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning
Chenjia Bai
Lingxiao Wang
Zhuoran Yang
Zhihong Deng
Animesh Garg
Peng Liu
Zhaoran Wang
OffRL
40
132
0
23 Feb 2022
Previous
12345678
Next