ResearchTrend.AI
A Minimalist Approach to Offline Reinforcement Learning

12 June 2021
Scott Fujimoto
S. Gu
    OffRL

Papers citing "A Minimalist Approach to Offline Reinforcement Learning"

50 / 522 papers shown
Navigating the Human Maze: Real-Time Robot Pathfinding with Generative Imitation Learning
Martin Moder
Stephen Adhisaputra
Josef Pauli
18
0
0
07 Aug 2024
SelfBC: Self Behavior Cloning for Offline Reinforcement Learning
Shirong Liu
Chenjia Bai
Zixian Guo
Hao Zhang
Gaurav Sharma
Yang Liu
OffRL
32
2
0
04 Aug 2024
Language-Conditioned Offline RL for Multi-Robot Navigation
Steven D. Morad
Ajay Shankar
J. Blumenkamp
Amanda Prorok
LM&Ro
OffRL
48
6
0
29 Jul 2024
Diffusion-DICE: In-Sample Diffusion Guidance for Offline Reinforcement Learning
Liyuan Mao
Haoran Xu
Weinan Zhang
Xianyuan Zhan
Amy Zhang
OffRL
41
5
0
29 Jul 2024
Reinforcement Learning for Sustainable Energy: A Survey
Koen Ponse
Felix Kleuker
Márton Fejér
Álvaro Serra-Gómez
Aske Plaat
Thomas M. Moerland
OffRL
AI4CE
40
1
0
26 Jul 2024
Offline Imitation Learning Through Graph Search and Retrieval
Zhao-Heng Yin
Pieter Abbeel
OffRL
48
3
0
22 Jul 2024
OASIS: Conditional Distribution Shaping for Offline Safe Reinforcement Learning
Yi-Fan Yao
Zhepeng Cen
Wenhao Ding
Hao-ming Lin
Shiqi Liu
Tingnan Zhang
Wenhao Yu
Ding Zhao
OffRL
OnRL
51
1
0
19 Jul 2024
Energy-Guided Diffusion Sampling for Offline-to-Online Reinforcement Learning
Xu-Hui Liu
Tian-Shuo Liu
Shengyi Jiang
Ruifeng Chen
Zhilong Zhang
Xinwei Chen
Yang Yu
OffRL
OnRL
34
2
0
17 Jul 2024
Offline Reinforcement Learning with Imputed Rewards
Carlo Romeo
Andrew D. Bagdanov
OffRL
31
0
0
15 Jul 2024
BECAUSE: Bilinear Causal Representation for Generalizable Offline Model-based Reinforcement Learning
Hao-ming Lin
Wenhao Ding
Jian Chen
Laixi Shi
Jiacheng Zhu
Bo-wen Li
Ding Zhao
OffRL
CML
52
0
0
15 Jul 2024
A Benchmark Environment for Offline Reinforcement Learning in Racing Games
Girolamo Macaluso
Alessandro Sestini
Andrew D. Bagdanov
OffRL
24
0
0
12 Jul 2024
MetaUrban: A Simulation Platform for Embodied AI in Urban Spaces
Wayne Wu
Honglin He
Yiran Wang
Chenda Duan
Jack He
Zhizheng Liu
Quanyi Li
Bolei Zhou
50
1
0
11 Jul 2024
Pretraining-finetuning Framework for Efficient Co-design: A Case Study on Quadruped Robot Parkour
Ci Chen
Jiyu Yu
Haojian Lu
Hongbo Gao
R. Xiong
Yue Wang
51
0
0
09 Jul 2024
FOSP: Fine-tuning Offline Safe Policy through World Models
Chenyang Cao
Yucheng Xin
Silang Wu
Longxiang He
Zichen Yan
Junbo Tan
Xueqian Wang
OffRL
58
0
0
06 Jul 2024
Benchmarks for Reinforcement Learning with Biased Offline Data and Imperfect Simulators
Ori Linial
Guy Tennenholtz
Uri Shalit
OffRL
40
1
0
30 Jun 2024
Multimodal foundation world models for generalist embodied agents
Pietro Mazzaglia
Tim Verbelen
Bart Dhoedt
Aaron C. Courville
Sai Rajeswar
OffRL
LM&Ro
50
5
0
26 Jun 2024
Learning Temporal Distances: Contrastive Successor Features Can Provide a Metric Structure for Decision-Making
Vivek Myers
Chongyi Zheng
Anca Dragan
Sergey Levine
Benjamin Eysenbach
OffRL
45
7
0
24 Jun 2024
Equivariant Offline Reinforcement Learning
Arsh Tangri
Ondrej Biza
Dian Wang
David M. Klee
Owen Howell
Robert Platt
OffRL
39
3
0
20 Jun 2024
Efficient Offline Reinforcement Learning: The Critic is Critical
Adam Jelley
Trevor A. McInroe
Sam Devlin
Amos Storkey
OffRL
39
1
0
19 Jun 2024
Offline Imitation Learning with Model-based Reverse Augmentation
Jie-Jing Shao
Hao-Sen Shi
Lan-Zhe Guo
Yu-Feng Li
OffRL
35
5
0
18 Jun 2024
Dialogue Action Tokens: Steering Language Models in Goal-Directed Dialogue with a Multi-Turn Planner
Kenneth Li
Yiming Wang
Fernanda Viégas
Martin Wattenberg
38
6
0
17 Jun 2024
An Imitative Reinforcement Learning Framework for Autonomous Dogfight
Siyuan Li
Rongchang Zuo
Peng Liu
Yingnan Zhao
40
1
0
17 Jun 2024
CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making
Zibin Dong
Yifu Yuan
Jianye Hao
Fei Ni
Yi Ma
Pengyi Li
Yan Zheng
DiffM
58
9
0
13 Jun 2024
Is Value Learning Really the Main Bottleneck in Offline RL?
Seohong Park
Kevin Frans
Sergey Levine
Aviral Kumar
OffRL
51
8
0
13 Jun 2024
DiffPoGAN: Diffusion Policies with Generative Adversarial Networks for Offline Reinforcement Learning
Xuemin Hu
Shen Li
Yingfen Xu
Bo Tang
Long Chen
36
0
0
13 Jun 2024
Dispelling the Mirage of Progress in Offline MARL through Standardised Baselines and Evaluation
Claude Formanek
C. Tilbury
Louise Beyers
Jonathan P. Shock
Arnu Pretorius
OffRL
39
1
0
13 Jun 2024
A Dual Approach to Imitation Learning from Observations with Offline Datasets
Harshit S. Sikchi
Caleb Chuck
Amy Zhang
S. Niekum
OffRL
33
4
0
13 Jun 2024
Residual Learning and Context Encoding for Adaptive Offline-to-Online Reinforcement Learning
Mohammadreza Nakhaei
Aidan Scannell
Joni Pajarinen
OffRL
49
1
0
12 Jun 2024
CDSA: Conservative Denoising Score-based Algorithm for Offline Reinforcement Learning
Zeyuan Liu
Kai Yang
Xiu Li
OffRL
44
0
0
11 Jun 2024
Augmenting Offline RL with Unlabeled Data
Zhao Wang
Briti Gangopadhyay
Jia-Fong Yeh
Shingo Takamatsu
OffRL
28
0
0
11 Jun 2024
Integrating Domain Knowledge for handling Limited Data in Offline RL
Briti Gangopadhyay
Zhao Wang
Jia-Fong Yeh
Shingo Takamatsu
OffRL
32
0
0
11 Jun 2024
PlanDQ: Hierarchical Plan Orchestration via D-Conductor and Q-Performer
Chang Chen
Junyeob Baek
Fei Deng
Kenji Kawaguchi
Çağlar Gülçehre
Sungjin Ahn
OffRL
33
1
0
10 Jun 2024
Is Value Functions Estimation with Classification Plug-and-play for Offline Reinforcement Learning?
Denis Tarasov
Kirill Brilliantov
Dmitrii Kharlapenko
OffRL
32
2
0
10 Jun 2024
Discovering Multiple Solutions from a Single Task in Offline Reinforcement Learning
Takayuki Osa
Tatsuya Harada
OffRL
36
2
0
10 Jun 2024
Strategically Conservative Q-Learning
Yutaka Shimizu
Joey Hong
Sergey Levine
M. Tomizuka
OffRL
OnRL
42
0
0
06 Jun 2024
Mamba as Decision Maker: Exploring Multi-scale Sequence Modeling in Offline Reinforcement Learning
Jiahang Cao
Qiang Zhang
Ziqing Wang
Jiaxu Wang
Hao Cheng
Yecheng Shao
Wen Zhao
Gang Han
Yijie Guo
Renjing Xu
Mamba
59
2
0
04 Jun 2024
Bayesian Design Principles for Offline-to-Online Reinforcement Learning
Haotian Hu
Yiqin Yang
Jianing Ye
Chengjie Wu
Ziqing Mai
Yujing Hu
Tangjie Lv
Changjie Fan
Qianchuan Zhao
Chongjie Zhang
OffRL
OnRL
39
3
0
31 May 2024
Decision Mamba: Reinforcement Learning via Hybrid Selective Sequence Modeling
Sili Huang
Jifeng Hu
Zhe Yang
Liwei Yang
Tao Luo
Hechang Chen
Lichao Sun
Bo Yang
Mamba
29
3
0
31 May 2024
In-Context Decision Transformer: Reinforcement Learning via Hierarchical Chain-of-Thought
Sili Huang
Jifeng Hu
Hechang Chen
Lichao Sun
Bo Yang
OffRL
LRM
29
7
0
31 May 2024
Diffusion Actor-Critic: Formulating Constrained Policy Iteration as Diffusion Noise Regression for Offline Reinforcement Learning
Linjiajie Fang
Ruoxue Liu
Jing Zhang
Wenjia Wang
Bing-Yi Jing
OffRL
56
1
0
31 May 2024
Adaptive Advantage-Guided Policy Regularization for Offline Reinforcement Learning
Tenglong Liu
Yang Li
Yixing Lan
Hao Gao
Wei Pan
Xin Xu
OffRL
36
5
0
30 May 2024
Learning from Random Demonstrations: Offline Reinforcement Learning with Importance-Sampled Diffusion Models
Zeyu Fang
Tian Lan
OffRL
36
2
0
30 May 2024
Diffusion Policies creating a Trust Region for Offline Reinforcement Learning
Tianyu Chen
Zhendong Wang
Mingyuan Zhou
OffRL
29
5
0
30 May 2024
Long-Horizon Rollout via Dynamics Diffusion for Offline Reinforcement Learning
Hanye Zhao
Xiaoshen Han
Zhengbang Zhu
Minghuan Liu
Yong Yu
Weinan Zhang
OffRL
42
0
0
29 May 2024
Causal Action Influence Aware Counterfactual Data Augmentation
Núria Armengol Urpí
Marco Bagatella
Marin Vlastelica
Georg Martius
CML
33
5
0
29 May 2024
Preferred-Action-Optimized Diffusion Policies for Offline Reinforcement Learning
Tianle Zhang
Jiayi Guan
Lin Zhao
Yihang Li
Dongjiang Li
...
Lei Sun
Yue Chen
Xuelong Wei
Lusong Li
Xiaodong He
43
1
0
29 May 2024
Efficient Preference-based Reinforcement Learning via Aligned Experience Estimation
Fengshuo Bai
Rui Zhao
Hongming Zhang
Sijia Cui
Ying Wen
Yaodong Yang
Bo Xu
Lei Han
OffRL
24
6
0
29 May 2024
Reinforcement Learning in Dynamic Treatment Regimes Needs Critical Reexamination
Zhiyao Luo
Yangchen Pan
Peter Watkinson
Tingting Zhu
OffRL
33
0
0
28 May 2024
Offline-Boosted Actor-Critic: Adaptively Blending Optimal Historical Behaviors in Deep Off-Policy RL
Yu-Juan Luo
Tianying Ji
Gang Hua
Jianwei Zhang
Huazhe Xu
Xianyuan Zhan
OffRL
OnRL
36
2
0
28 May 2024
HarmoDT: Harmony Multi-Task Decision Transformer for Offline Reinforcement Learning
Shengchao Hu
Ziqing Fan
Li Shen
Ya-Qin Zhang
Yanfeng Wang
Dacheng Tao
OffRL
45
9
0
28 May 2024