A Minimalist Approach to Offline Reinforcement Learning

12 June 2021
Scott Fujimoto
S. Gu
    OffRL

Papers citing "A Minimalist Approach to Offline Reinforcement Learning"

50 / 522 papers shown
Title
GTA: Generative Trajectory Augmentation with Guidance for Offline Reinforcement Learning
Jaewoo Lee
Sujin Yun
Taeyoung Yun
Jinkyoo Park
46
6
0
27 May 2024
Constrained Ensemble Exploration for Unsupervised Skill Discovery
Chenjia Bai
Rushuai Yang
Qiaosheng Zhang
Kang Xu
Yi Chen
Ting Xiao
Xuelong Li
OffRL
40
3
0
25 May 2024
Cross-Domain Policy Adaptation by Capturing Representation Mismatch
Jiafei Lyu
Chenjia Bai
Jingwen Yang
Zongqing Lu
Xiu Li
30
8
0
24 May 2024
DIDI: Diffusion-Guided Diversity for Offline Behavioral Generation
Jinxin Liu
Xinghong Guo
Zifeng Zhuang
Donglin Wang
DiffM
OffRL
50
2
0
23 May 2024
State-Constrained Offline Reinforcement Learning
Charles A. Hepburn
Yue Jin
Giovanni Montana
OffRL
35
0
0
23 May 2024
Offline Reinforcement Learning from Datasets with Structured Non-Stationarity
Johannes Ackermann
Takayuki Osa
Masashi Sugiyama
OffRL
42
2
0
23 May 2024
Exclusively Penalized Q-learning for Offline Reinforcement Learning
Junghyuk Yeom
Yonghyeon Jo
Jungmo Kim
Sanghyeon Lee
Seungyul Han
OffRL
40
2
0
23 May 2024
Scrutinize What We Ignore: Reining In Task Representation Shift Of Context-Based Offline Meta Reinforcement Learning
Hai Zhang
Boyuan Zheng
Anqi Guo
Tianying Ji
Anqi Guo
Junqiao Zhao
Lanqing Li
OffRL
39
0
0
20 May 2024
Towards Robust Policy: Enhancing Offline Reinforcement Learning with Adversarial Attacks and Defenses
Thanh Nguyen
Tung M. Luu
Tri Ton
Chang D. Yoo
OffRL
AAML
34
0
0
18 May 2024
Reinformer: Max-Return Sequence Modeling for Offline RL
Zifeng Zhuang
Dengyun Peng
Jinxin Liu
Ziqi Zhang
Donglin Wang
OffRL
AI4TS
48
13
0
14 May 2024
Ensemble Successor Representations for Task Generalization in Offline-to-Online Reinforcement Learning
Changhong Wang
Xudong Yu
Chenjia Bai
Qiaosheng Zhang
Zhen Wang
40
1
0
12 May 2024
Improving Offline Reinforcement Learning with Inaccurate Simulators
Yiwen Hou
Haoyuan Sun
Jinming Ma
Feng Wu
OffRL
35
4
0
07 May 2024
Robot Air Hockey: A Manipulation Testbed for Robot Learning with Reinforcement Learning
Caleb Chuck
Carl Qi
M. Munje
Shuozhe Li
Max Rudolph
...
Kavan Mehta
Anthony Wang
Peter Stone
Amy Zhang
S. Niekum
40
4
0
06 May 2024
Pessimistic Value Iteration for Multi-Task Data Sharing in Offline Reinforcement Learning
Chenjia Bai
Lingxiao Wang
Jianye Hao
Zhuoran Yang
Bin Zhao
Zhen Wang
Xuelong Li
OffRL
34
9
0
30 Apr 2024
Generalize by Touching: Tactile Ensemble Skill Transfer for Robotic Furniture Assembly
Hao-ming Lin
Radu Corcodel
Ding Zhao
40
7
0
26 Apr 2024
Offline Reinforcement Learning with Behavioral Supervisor Tuning
Padmanaba Srinivasan
William J. Knottenbelt
OffRL
27
1
0
25 Apr 2024
TrajDeleter: Enabling Trajectory Forgetting in Offline Reinforcement Learning Agents
Chen Gong
Kecen Li
Jin Yao
Tianhao Wang
OnRL
36
0
0
18 Apr 2024
Continual Offline Reinforcement Learning via Diffusion-based Dual Generative Replay
Jinmei Liu
Wenbin Li
Xiangyu Yue
Shilin Zhang
Chunlin Chen
Zhi Wang
OffRL
DiffM
36
5
0
16 Apr 2024
Offline Trajectory Generalization for Offline Reinforcement Learning
Ziqi Zhao
Zhaochun Ren
Liu Yang
Fajie Yuan
Pengjie Ren
Zhumin Chen
Jun Ma
Xin Xin
OffRL
29
1
0
16 Apr 2024
Policy-Guided Diffusion
Matthew Jackson
Michael T. Matthews
Cong Lu
Benjamin Ellis
Shimon Whiteson
Jakob N. Foerster
OffRL
52
17
0
09 Apr 2024
Diverse Randomized Value Functions: A Provably Pessimistic Approach for Offline Reinforcement Learning
Xudong Yu
Chenjia Bai
Hongyi Guo
Changhong Wang
Zhen Wang
OffRL
39
0
0
09 Apr 2024
AD4RL: Autonomous Driving Benchmarks for Offline Reinforcement Learning with Value-based Dataset
Dongsu Lee
Chanin Eom
Minhae Kwon
GP
OffRL
24
5
0
03 Apr 2024
GeRM: A Generalist Robotic Model with Mixture-of-experts for Quadruped Robot
Wenxuan Song
Han Zhao
Pengxiang Ding
Can Cui
Shangke Lyu
Yaning Fan
Donglin Wang
OffRL
27
11
0
20 Mar 2024
Simple Ingredients for Offline Reinforcement Learning
Edoardo Cetin
Andrea Tirinzoni
Matteo Pirotta
A. Lazaric
Yann Ollivier
Ahmed Touati
OffRL
37
2
0
19 Mar 2024
A Simple Mixture Policy Parameterization for Improving Sample Efficiency of CVaR Optimization
Yudong Luo
Yangchen Pan
Han Wang
Philip H. S. Torr
Pascal Poupart
42
3
0
17 Mar 2024
Offline Goal-Conditioned Reinforcement Learning for Shape Control of Deformable Linear Objects
Rita Laezza
Mohammadreza Shetab-Bushehri
Gabriel Arslan Waltersson
Erol Özgür
Y. Mezouar
Y. Karayiannidis
OffRL
43
0
0
15 Mar 2024
A2PO: Towards Effective Offline Reinforcement Learning from an Advantage-aware Perspective
Yunpeng Qing
Shunyu Liu
Jingyuan Cong
Kaixuan Chen
Yihe Zhou
Mingli Song
OffRL
34
1
0
12 Mar 2024
Disentangling Policy from Offline Task Representation Learning via Adversarial Data Augmentation
Chengxing Jia
Fuxiang Zhang
Yi-Chen Li
Chenxiao Gao
Xu-Hui Liu
Lei Yuan
Zongzhang Zhang
Yang Yu
AAML
39
4
0
12 Mar 2024
Dissecting Deep RL with High Update Ratios: Combatting Value Divergence
Marcel Hussing
C. Voelcker
Igor Gilitschenski
Amir-massoud Farahmand
Eric Eaton
34
3
0
09 Mar 2024
Canonical Form of Datatic Description in Control Systems
Guojian Zhan
Ziang Zheng
Shengbo Eben Li
18
1
0
04 Mar 2024
Offline Goal-Conditioned Reinforcement Learning for Safety-Critical Tasks with Recovery Policy
Chenyang Cao
Zichen Yan
Renhao Lu
Junbo Tan
Xueqian Wang
OffRL
39
2
0
04 Mar 2024
SELFI: Autonomous Self-Improvement with Reinforcement Learning for Social Navigation
Noriaki Hirose
Dhruv Shah
Kyle Stachowicz
A. Sridhar
Sergey Levine
69
5
0
01 Mar 2024
ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL
Yifei Zhou
Andrea Zanette
Jiayi Pan
Sergey Levine
Aviral Kumar
65
50
0
29 Feb 2024
Trajectory-wise Iterative Reinforcement Learning Framework for Auto-bidding
Haoming Li
Yusen Huo
Shuai Dou
Zhenzhe Zheng
Zhilin Zhang
Chuan Yu
Jian Xu
Fan Wu
OffRL
24
3
0
23 Feb 2024
MORE-3S:Multimodal-based Offline Reinforcement Learning with Shared Semantic Spaces
Tianyu Zheng
Ge Zhang
Xingwei Qu
Ming Kuang
Stephen W. Huang
Zhaofeng He
OffRL
50
1
0
20 Feb 2024
The Edge-of-Reach Problem in Offline Model-Based Reinforcement Learning
Anya Sims
Cong Lu
Yee Whye Teh
OffRL
35
3
0
19 Feb 2024
Feudal Networks for Visual Navigation
Faith Johnson
Bryan Bo Cao
Kristin J. Dana
Shubham Jain
Ashwin Ashok
34
2
0
19 Feb 2024
Policy Learning for Off-Dynamics RL with Deficient Support
Linh Le Pham Van
Hung The Tran
Sunil R. Gupta
40
1
0
16 Feb 2024
Dataset Clustering for Improved Offline Policy Learning
Qiang Wang
Yixin Deng
Francisco Roldan Sanchez
Keru Wang
Kevin McGuinness
Noel E. O'Connor
Stephen J. Redmond
OffRL
29
2
0
14 Feb 2024
Single-Reset Divide & Conquer Imitation Learning
Alexandre Chenu
Olivier Serris
Olivier Sigaud
Nicolas Perrin-Gilbert
40
0
0
14 Feb 2024
Hybrid Inverse Reinforcement Learning
Juntao Ren
Gokul Swamy
Zhiwei Steven Wu
J. Andrew Bagnell
Sanjiban Choudhury
36
18
0
13 Feb 2024
Premier-TACO is a Few-Shot Policy Learner: Pretraining Multitask Representation via Temporal Action-Driven Contrastive Loss
Ruijie Zheng
Yongyuan Liang
Xiyao Wang
Shuang Ma
Hal Daumé
Huazhe Xu
John Langford
Praveen Palanisamy
Kalyan Shankar Basu
Furong Huang
40
5
0
09 Feb 2024
Federated Offline Reinforcement Learning: Collaborative Single-Policy Coverage Suffices
Jiin Woo
Laixi Shi
Gauri Joshi
Yuejie Chi
OffRL
26
3
0
08 Feb 2024
Offline Actor-Critic Reinforcement Learning Scales to Large Models
Jost Tobias Springenberg
A. Abdolmaleki
Jingwei Zhang
Oliver Groth
Michael Bloesch
...
Sarah Bechtle
Steven Kapturowski
Roland Hafner
N. Heess
Martin Riedmiller
OffRL
LRM
27
12
0
08 Feb 2024
Entropy-regularized Diffusion Policy with Q-Ensembles for Offline Reinforcement Learning
Ruoqing Zhang
Ziwei Luo
Jens Sjölund
Thomas B. Schön
Per Mattsson
15
7
0
06 Feb 2024
Return-Aligned Decision Transformer
Tsunehiko Tanaka
Kenshi Abe
Kaito Ariu
Tetsuro Morimura
Edgar Simo-Serra
OffRL
67
1
0
06 Feb 2024
SEABO: A Simple Search-Based Method for Offline Imitation Learning
Jiafei Lyu
Xiaoteng Ma
Le Wan
Runze Liu
Xiu Li
Zongqing Lu
OffRL
19
9
0
06 Feb 2024
Diffusion World Model: Future Modeling Beyond Step-by-Step Rollout for Offline Reinforcement Learning
Zihan Ding
Amy Zhang
Yuandong Tian
Qinqing Zheng
OffRL
47
17
0
05 Feb 2024
DiffStitch: Boosting Offline Reinforcement Learning with Diffusion-based Trajectory Stitching
Guanghe Li
Yixiang Shan
Zhengbang Zhu
Ting Long
Weinan Zhang
OffRL
26
9
0
04 Feb 2024
Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback
Yifu Yuan
Jianye Hao
Yi Ma
Zibin Dong
Hebin Liang
Jinyi Liu
Zhixin Feng
Kai-Wen Zhao
Yan Zheng
OffRL
ALM
24
14
0
04 Feb 2024