2106.01345
Decision Transformer: Reinforcement Learning via Sequence Modeling
2 June 2021
Lili Chen
Kevin Lu
Aravind Rajeswaran
Kimin Lee
Aditya Grover
Michael Laskin
Pieter Abbeel
A. Srinivas
Igor Mordatch
OffRL
Papers citing
"Decision Transformer: Reinforcement Learning via Sequence Modeling"
50 / 396 papers shown
Title
The RL/LLM Taxonomy Tree: Reviewing Synergies Between Reinforcement Learning and Large Language Models
M. Pternea
Prerna Singh
Abir Chakraborty
Y. Oruganti
M. Milletarí
Sayli Bapat
Kebei Jiang
OffRL
33
7
0
02 Feb 2024
Zero-Shot Reinforcement Learning via Function Encoders
Tyler Ingebrand
Amy Zhang
Ufuk Topcu
OffRL
43
3
0
30 Jan 2024
Multi-Object Navigation in real environments using hybrid policies
Assem Sadek
G. Bono
Boris Chidlovskii
A. Baskurt
Christian Wolf
47
5
0
24 Jan 2024
Closing the Gap between TD Learning and Supervised Learning -- A Generalisation Point of View
Raj Ghugare
Matthieu Geist
Glen Berseth
Benjamin Eysenbach
OffRL
35
14
0
20 Jan 2024
DiffClone: Enhanced Behaviour Cloning in Robotics with Diffusion-Driven Policy Learning
Sabariswaran Mani
Sreyas Venkataraman
Abhranil Chandra
Adyan Rizvi
Yash Sirvi
Soumojit Bhattacharya
Aritra Hazra
OffRL
34
1
0
17 Jan 2024
DDM-Lag: A Diffusion-based Decision-making Model for Autonomous Vehicles with Lagrangian Safety Enhancement
Jiaqi Liu
Peng Hang
Xiaocong Zhao
Jianqiang Wang
Jian Sun
54
10
0
08 Jan 2024
MOTO: Offline Pre-training to Online Fine-tuning for Model-based Robot Learning
Rafael Rafailov
Kyle Hatch
Victor Kolev
John D. Martin
Mariano Phielipp
Chelsea Finn
OffRL
OnRL
22
10
0
06 Jan 2024
The Truth is in There: Improving Reasoning in Language Models with Layer-Selective Rank Reduction
Pratyusha Sharma
Jordan T. Ash
Dipendra Kumar Misra
LRM
19
79
0
21 Dec 2023
In-Context Reinforcement Learning for Variable Action Spaces
Viacheslav Sinii
Alexander Nikulin
Vladislav Kurenkov
Ilya Zisman
Sergey Kolesnikov
26
14
0
20 Dec 2023
SkillDiffuser: Interpretable Hierarchical Planning via Skill Abstractions in Diffusion-Based Task Execution
Zhixuan Liang
Yao Mu
Hengbo Ma
Masayoshi Tomizuka
Mingyu Ding
Ping Luo
38
38
0
18 Dec 2023
An Invitation to Deep Reinforcement Learning
Bernhard Jaeger
Andreas Geiger
OffRL
OOD
80
5
0
13 Dec 2023
Traffic Signal Control Using Lightweight Transformers: An Offline-to-Online RL Approach
Xingshuai Huang
Di Wu
Benoit Boulet
OffRL
27
2
0
12 Dec 2023
Saturn Platform: Foundation Model Operations and Generative AI for Financial Services
Antonio Busson
Rennan Gaio
Rafael H. Rocha
Francisco Evangelista
Bruno Rizzi
Luan Carvalho
Rafael Miceli
Marcos Rabaioli
David Favaro
28
1
0
12 Dec 2023
Workflow-Guided Response Generation for Task-Oriented Dialogue
Do June Min
Paloma Sodhi
Ramya Ramakrishnan
45
0
0
14 Nov 2023
Unified machine learning tasks and datasets for enhancing renewable energy
Arsam Aryandoust
Thomas Rigoni
Francesco di Stefano
Anthony Patt
40
0
0
12 Nov 2023
Uncovering Intermediate Variables in Transformers using Circuit Probing
Michael A. Lepori
Thomas Serre
Ellie Pavlick
78
7
0
07 Nov 2023
A Tractable Inference Perspective of Offline RL
Xuejie Liu
Guy Van den Broeck
Mathias Niepert
Yitao Liang
OffRL
36
1
0
31 Oct 2023
Hybrid Search for Efficient Planning with Completeness Guarantees
Kalle Kujanpää
Joni Pajarinen
Alexander Ilin
29
3
0
19 Oct 2023
CLIN: A Continually Learning Language Agent for Rapid Task Adaptation and Generalization
Bodhisattwa Prasad Majumder
Bhavana Dalvi
Peter Alexander Jansen
Oyvind Tafjord
Niket Tandon
Li Zhang
Chris Callison-Burch
Peter Clark
LRM
LLMAG
CLL
21
39
0
16 Oct 2023
Universal Visual Decomposer: Long-Horizon Manipulation Made Easy
Zichen Zhang
Yunshuang Li
Osbert Bastani
Abhishek Gupta
Dinesh Jayaraman
Yecheng Jason Ma
Luca Weihs
37
17
0
12 Oct 2023
Transformers as Decision Makers: Provable In-Context Reinforcement Learning via Supervised Pretraining
Licong Lin
Yu Bai
Song Mei
OffRL
37
45
0
12 Oct 2023
Predicting Player Engagement in Tom Clancy's The Division 2: A Multimodal Approach via Pixels and Gamepad Actions
Kosmas Pinitas
David Renaudie
Mike Thomsen
M. Barthet
Konstantinos Makantasis
Antonios Liapis
Georgios N. Yannakakis
23
13
0
09 Oct 2023
GEAR: A GPU-Centric Experience Replay System for Large Reinforcement Learning Models
Hanjing Wang
Man-Kit Sit
Cong He
Ying Wen
Weinan Zhang
Jun Wang
Yaodong Yang
Luo Mai
OffRL
VLM
40
1
0
08 Oct 2023
Large Language Model (LLM) as a System of Multiple Expert Agents: An Approach to solve the Abstraction and Reasoning Corpus (ARC) Challenge
J. Tan
Mehul Motani
LLMAG
44
8
0
08 Oct 2023
Beyond Uniform Sampling: Offline Reinforcement Learning with Imbalanced Datasets
Zhang-Wei Hong
Aviral Kumar
Sathwik Karnik
Abhishek Bhandwaldar
Akash Srivastava
Joni Pajarinen
Romain Laroche
Abhishek Gupta
Pulkit Agrawal
OffRL
38
19
0
06 Oct 2023
Amortized Network Intervention to Steer the Excitatory Point Processes
Zitao Song
Wendi Ren
Sourav Garg
29
1
0
06 Oct 2023
PrototypeFormer: Learning to Explore Prototype Relationships for Few-shot Image Classification
Feihong He
Gang Li
Jiangmeng Li
VLM
ViT
54
2
0
05 Oct 2023
PCGPT: Procedural Content Generation via Transformers
Sajad Mohaghegh
Mohammad Amin Ramezan Dehnavi
Golnoosh Abdollahinejad
Matin Hashemi
ViT
24
2
0
03 Oct 2023
How Physics and Background Attributes Impact Video Transformers in Robotic Manipulation: A Case Study on Planar Pushing
Shutong Jin
Ruiyu Wang
Muhammad Zahid
Florian T. Pokorny
38
1
0
03 Oct 2023
Pre-training with Synthetic Data Helps Offline Reinforcement Learning
Zecheng Wang
Che Wang
Zixuan Dong
Keith Ross
OffRL
38
5
0
01 Oct 2023
Language models in molecular discovery
Chaoqi Wang
Yibo Jiang
Chenghao Yang
Han Liu
Yuxin Chen
30
7
0
28 Sep 2023
Zero-Shot Reinforcement Learning from Low Quality Data
Scott Jeen
Tom Bewley
Jonathan M. Cullen
OffRL
OnRL
38
1
0
26 Sep 2023
Machine Learning Meets Advanced Robotic Manipulation
Saeid Nahavandi
R. Alizadehsani
D. Nahavandi
Chee Peng Lim
Kevin Kelly
Fernando Bello
24
17
0
22 Sep 2023
OpenChat: Advancing Open-source Language Models with Mixed-Quality Data
Guan-Bo Wang
Sijie Cheng
Xianyuan Zhan
Xiangang Li
Sen Song
Yang Liu
ALM
27
233
0
20 Sep 2023
ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning
Chenxiao Gao
Chenyang Wu
Mingjun Cao
Rui Kong
Zongzhang Zhang
Yang Yu
OffRL
34
13
0
12 Sep 2023
Subwords as Skills: Tokenization for Sparse-Reward Reinforcement Learning
David Yunis
Justin Jung
Falcon Z. Dai
Matthew R. Walter
OffRL
47
0
0
08 Sep 2023
Hundreds Guide Millions: Adaptive Offline Reinforcement Learning with Expert Guidance
Qisen Yang
Shenzhi Wang
Qihang Zhang
Gao Huang
Shiji Song
OffRL
OnRL
30
8
0
04 Sep 2023
Rule-Based Error Detection and Correction to Operationalize Movement Trajectory Classification
B. Xi
Kevin Scaria
Paulo Shakarian
34
2
0
28 Aug 2023
Skill Transformer: A Monolithic Policy for Mobile Manipulation
Xiaoyu Huang
Dhruv Batra
Akshara Rai
Andrew Szot
LM&Ro
38
21
0
19 Aug 2023
MTD-GPT: A Multi-Task Decision-Making GPT Model for Autonomous Driving at Unsignalized Intersections
Jiaqi Liu
Peng Hang
Xiao Qi
Jianqiang Wang
Jian Sun
26
42
0
30 Jul 2023
Dynamic deep-reinforcement-learning algorithm in Partially Observed Markov Decision Processes
Saki Omi
Hyo-Sang Shin
Namhoon Cho
Antonios Tsourdos
27
3
0
29 Jul 2023
Benchmarking Offline Reinforcement Learning on Real-Robot Hardware
Nico Gürtler
Sebastian Blaes
Pavel Kolev
Felix Widmaier
Manuel Wüthrich
Stefan Bauer
Bernhard Schölkopf
Georg Martius
OffRL
33
28
0
28 Jul 2023
PASTA: Pretrained Action-State Transformer Agents
Raphael Boige
Yannis Flet-Berliac
Arthur Flajolet
Guillaume Richard
Thomas Pierrot
LM&Ro
OffRL
40
5
0
20 Jul 2023
Towards Self-Assembling Artificial Neural Networks through Neural Developmental Programs
Elias Najarro
Shyam Sudhakaran
S. Risi
28
16
0
17 Jul 2023
Image Transformation Sequence Retrieval with General Reinforcement Learning
Enrique Mas-Candela
Antonio Ríos-Vila
Jorge Calvo-Zaragoza
27
0
0
13 Jul 2023
Transformers in Reinforcement Learning: A Survey
Pranav Agarwal
A. Rahman
P. St-Charles
Simon J. D. Prince
Samira Ebrahimi Kahou
OffRL
32
19
0
12 Jul 2023
Diffusion Policies for Out-of-Distribution Generalization in Offline Reinforcement Learning
S. E. Ada
Erhan Öztop
Emre Ugur
OffRL
49
15
0
10 Jul 2023
Large Language Models as General Pattern Machines
Suvir Mirchandani
F. Xia
Peter R. Florence
Brian Ichter
Danny Driess
Montse Gonzalez Arenas
Kanishka Rao
Dorsa Sadigh
Andy Zeng
LLMAG
61
186
0
10 Jul 2023
Deep Attention Q-Network for Personalized Treatment Recommendation
Simin Ma
Junghwan Lee
N. Serban
Shihao Yang
OffRL
38
5
0
04 Jul 2023
Prioritized Trajectory Replay: A Replay Memory for Data-driven Reinforcement Learning
Jinyi Liu
Yi Ma
Jianye Hao
Yujing Hu
Yan Zheng
Tangjie Lv
Changjie Fan
OffRL
47
2
0
27 Jun 2023