ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2106.06860
  4. Cited By
A Minimalist Approach to Offline Reinforcement Learning

A Minimalist Approach to Offline Reinforcement Learning

12 June 2021
Scott Fujimoto
S. Gu
    OffRL
ArXivPDFHTML

Papers citing "A Minimalist Approach to Offline Reinforcement Learning"

50 / 522 papers shown
Title
Reward-Consistent Dynamics Models are Strongly Generalizable for Offline
  Reinforcement Learning
Reward-Consistent Dynamics Models are Strongly Generalizable for Offline Reinforcement Learning
Fan Luo
Tian Xu
Xingchen Cao
Yang Yu
OffRL
26
7
0
09 Oct 2023
DiffCPS: Diffusion Model based Constrained Policy Search for Offline
  Reinforcement Learning
DiffCPS: Diffusion Model based Constrained Policy Search for Offline Reinforcement Learning
Longxiang He
Li Shen
Linrui Zhang
Junbo Tan
Xueqian Wang
OffRL
28
8
0
09 Oct 2023
Improving Offline-to-Online Reinforcement Learning with Q Conditioned
  State Entropy Exploration
Improving Offline-to-Online Reinforcement Learning with Q Conditioned State Entropy Exploration
Ziqi Zhang
Xiao Xiong
Zifeng Zhuang
Jinxin Liu
Donglin Wang
OffRL
OnRL
45
0
0
07 Oct 2023
Beyond Uniform Sampling: Offline Reinforcement Learning with Imbalanced
  Datasets
Beyond Uniform Sampling: Offline Reinforcement Learning with Imbalanced Datasets
Zhang-Wei Hong
Aviral Kumar
Sathwik Karnik
Abhishek Bhandwaldar
Akash Srivastava
Joni Pajarinen
Romain Laroche
Abhishek Gupta
Pulkit Agrawal
OffRL
38
19
0
06 Oct 2023
Understanding, Predicting and Better Resolving Q-Value Divergence in
  Offline-RL
Understanding, Predicting and Better Resolving Q-Value Divergence in Offline-RL
Yang Yue
Rui Lu
Bingyi Kang
Shiji Song
Gao Huang
OffRL
35
16
0
06 Oct 2023
Self-Confirming Transformer for Belief-Conditioned Adaptation in Offline Multi-Agent Reinforcement Learning
Self-Confirming Transformer for Belief-Conditioned Adaptation in Offline Multi-Agent Reinforcement Learning
Tao Li
Juan Guevara
Xinghong Xie
Quanyan Zhu
OffRL
34
1
0
06 Oct 2023
Decision ConvFormer: Local Filtering in MetaFormer is Sufficient for
  Decision Making
Decision ConvFormer: Local Filtering in MetaFormer is Sufficient for Decision Making
Jeonghye Kim
Suyoung Lee
Woojun Kim
Young-Jin Sung
OffRL
33
17
0
04 Oct 2023
AlignDiff: Aligning Diverse Human Preferences via Behavior-Customisable
  Diffusion Model
AlignDiff: Aligning Diverse Human Preferences via Behavior-Customisable Diffusion Model
Zibin Dong
Yifu Yuan
Jianye Hao
Fei Ni
Yao Mu
Yan Zheng
Yujing Hu
Tangjie Lv
Changjie Fan
Zhipeng Hu
45
29
0
03 Oct 2023
Efficient Planning with Latent Diffusion
Efficient Planning with Latent Diffusion
Wenhao Li
DiffM
40
4
0
30 Sep 2023
Consistency Models as a Rich and Efficient Policy Class for
  Reinforcement Learning
Consistency Models as a Rich and Efficient Policy Class for Reinforcement Learning
Daoce Wang
Chi Jin
OffRL
DiffM
27
25
0
29 Sep 2023
Towards Robust Offline-to-Online Reinforcement Learning via Uncertainty
  and Smoothness
Towards Robust Offline-to-Online Reinforcement Learning via Uncertainty and Smoothness
Xiaoyu Wen
Xudong Yu
Rui Yang
Chenjia Bai
Zhen Wang
OffRL
OnRL
21
10
0
29 Sep 2023
Stackelberg Batch Policy Learning
Stackelberg Batch Policy Learning
Wenzhuo Zhou
Annie Qu
OffRL
32
0
0
28 Sep 2023
Boosting Offline Reinforcement Learning for Autonomous Driving with
  Hierarchical Latent Skills
Boosting Offline Reinforcement Learning for Autonomous Driving with Hierarchical Latent Skills
Zenan Li
Fan Nie
Q. Sun
Fang Da
Hang Zhao
OffRL
36
6
0
24 Sep 2023
Counterfactual Conservative Q Learning for Offline Multi-agent
  Reinforcement Learning
Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning
Jianzhun Shao
Yun Qu
Chen Chen
Hongchang Zhang
Xiangyang Ji
OffRL
20
19
0
22 Sep 2023
H2O+: An Improved Framework for Hybrid Offline-and-Online RL with Dynamics Gaps
H2O+: An Improved Framework for Hybrid Offline-and-Online RL with Dynamics Gaps
Haoyi Niu
Tianying Ji
Bingqi Liu
Haocheng Zhao
Xiangyu Zhu
Jianying Zheng
Pengfei Huang
Guyue Zhou
Jianming Hu
Xianyuan Zhan
OffRL
OnRL
AI4CE
27
6
0
22 Sep 2023
Q-Transformer: Scalable Offline Reinforcement Learning via
  Autoregressive Q-Functions
Q-Transformer: Scalable Offline Reinforcement Learning via Autoregressive Q-Functions
Yevgen Chebotar
Q. Vuong
A. Irpan
Karol Hausman
F. Xia
...
Brianna Zitkovich
Tomas Jackson
Kanishka Rao
Chelsea Finn
Sergey Levine
OffRL
129
81
0
18 Sep 2023
DOMAIN: MilDly COnservative Model-BAsed OfflINe Reinforcement Learning
DOMAIN: MilDly COnservative Model-BAsed OfflINe Reinforcement Learning
Xiao-Yin Liu
Xiao-Hu Zhou
Xiaoliang Xie
Shiqi Liu
Zhen-Qiu Feng
Hao Li
Mei-Jiang Gui
Tian-Yu Xiang
De-Xing Huang
Zeng-Guang Hou
OffRL
OOD
21
5
0
16 Sep 2023
A Real-World Quadrupedal Locomotion Benchmark for Offline Reinforcement
  Learning
A Real-World Quadrupedal Locomotion Benchmark for Offline Reinforcement Learning
Sidney Besnard
Shuyu Yang
M. Fadili
OffRL
24
2
0
13 Sep 2023
ACT: Empowering Decision Transformer with Dynamic Programming via
  Advantage Conditioning
ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning
Chenxiao Gao
Chenyang Wu
Mingjun Cao
Rui Kong
Zongzhang Zhang
Yang Yu
OffRL
29
13
0
12 Sep 2023
ORL-AUDITOR: Dataset Auditing in Offline Deep Reinforcement Learning
ORL-AUDITOR: Dataset Auditing in Offline Deep Reinforcement Learning
L. Du
Min Chen
Mingyang Sun
Shouling Ji
Peng Cheng
Jiming Chen
Zhikun Zhang
OffRL
40
8
0
06 Sep 2023
Model-based Offline Policy Optimization with Adversarial Network
Model-based Offline Policy Optimization with Adversarial Network
Junming Yang
Xingguo Chen
Shengyuan Wang
Bolei Zhang
OffRL
14
2
0
05 Sep 2023
Hundreds Guide Millions: Adaptive Offline Reinforcement Learning with
  Expert Guidance
Hundreds Guide Millions: Adaptive Offline Reinforcement Learning with Expert Guidance
Qisen Yang
Shenzhi Wang
Qihang Zhang
Gao Huang
Shiji Song
OffRL
OnRL
26
8
0
04 Sep 2023
Multi-Objective Decision Transformers for Offline Reinforcement Learning
Multi-Objective Decision Transformers for Offline Reinforcement Learning
Abdelghani Ghanem
P. Ciblat
Mounir Ghogho
OffRL
27
1
0
31 Aug 2023
Real Robot Challenge 2022: Learning Dexterous Manipulation from Offline
  Data in the Real World
Real Robot Challenge 2022: Learning Dexterous Manipulation from Offline Data in the Real World
Nicolas Gurtler
Felix Widmaier
Cansu Sancaktar
Sebastian Blaes
Pavel Kolev
...
Arman Raayatsanati
Hehui Zheng
Barnabas Gavin Cangan
Bernhard Schölkopf
Georg Martius
OffRL
35
2
0
15 Aug 2023
Benchmarking Offline Reinforcement Learning on Real-Robot Hardware
Benchmarking Offline Reinforcement Learning on Real-Robot Hardware
Nico Gürtler
Sebastian Blaes
Pavel Kolev
Felix Widmaier
Manuel Wüthrich
Stefan Bauer
Bernhard Schölkopf
Georg Martius
OffRL
33
28
0
28 Jul 2023
Offline Reinforcement Learning with On-Policy Q-Function Regularization
Offline Reinforcement Learning with On-Policy Q-Function Regularization
Laixi Shi
Robert Dadashi
Yuejie Chi
Pablo Samuel Castro
M. Geist
OffRL
27
5
0
25 Jul 2023
Contrastive Example-Based Control
Contrastive Example-Based Control
Kyle Hatch
Benjamin Eysenbach
Rafael Rafailov
Tianhe Yu
Ruslan Salakhutdinov
Sergey Levine
Chelsea Finn
OffRL
31
4
0
24 Jul 2023
Model-based Offline Reinforcement Learning with Count-based Conservatism
Model-based Offline Reinforcement Learning with Count-based Conservatism
Byeongchang Kim
Min Hwan Oh
OffRL
17
12
0
21 Jul 2023
PASTA: Pretrained Action-State Transformer Agents
PASTA: Pretrained Action-State Transformer Agents
Raphael Boige
Yannis Flet-Berliac
Arthur Flajolet
Guillaume Richard
Thomas Pierrot
LM&Ro
OffRL
37
5
0
20 Jul 2023
Layered controller synthesis for dynamic multi-agent systems
Layered controller synthesis for dynamic multi-agent systems
Emily Clement
Nicolas Perrin-Gilbert
Philipp Schlehuber-Caissier
21
1
0
13 Jul 2023
Budgeting Counterfactual for Offline RL
Budgeting Counterfactual for Offline RL
Yao Liu
Pratik Chaudhari
Rasool Fakoor
OffRL
25
2
0
12 Jul 2023
Diffusion Policies for Out-of-Distribution Generalization in Offline
  Reinforcement Learning
Diffusion Policies for Out-of-Distribution Generalization in Offline Reinforcement Learning
S. E. Ada
Erhan Öztop
Emre Ugur
OffRL
40
15
0
10 Jul 2023
Goal-Conditioned Predictive Coding for Offline Reinforcement Learning
Goal-Conditioned Predictive Coding for Offline Reinforcement Learning
Zilai Zeng
Ce Zhang
Shijie Wang
Chen Sun
OffRL
29
5
0
07 Jul 2023
Offline Reinforcement Learning with Imbalanced Datasets
Offline Reinforcement Learning with Imbalanced Datasets
Li Jiang
Sijie Cheng
Jielin Qiu
Haoran Xu
Wai Kin Victor Chan
Zhao Ding
OffRL
34
3
0
06 Jul 2023
Crossway Diffusion: Improving Diffusion-based Visuomotor Policy via
  Self-supervised Learning
Crossway Diffusion: Improving Diffusion-based Visuomotor Policy via Self-supervised Learning
Xiang Li
Varun Belagali
Jinghuan Shang
Michael S. Ryoo
37
28
0
04 Jul 2023
Beyond Conservatism: Diffusion Policies in Offline Multi-agent
  Reinforcement Learning
Beyond Conservatism: Diffusion Policies in Offline Multi-agent Reinforcement Learning
Zhuoran Li
Ling Pan
Longbo Huang
DiffM
OffRL
23
7
0
04 Jul 2023
Prioritized Trajectory Replay: A Replay Memory for Data-driven Reinforcement Learning
Prioritized Trajectory Replay: A Replay Memory for Data-driven Reinforcement Learning
Jinyi Liu
Y. Ma
Jianye Hao
Yujing Hu
Yan Zheng
Tangjie Lv
Changjie Fan
OffRL
44
2
0
27 Jun 2023
CEIL: Generalized Contextual Imitation Learning
CEIL: Generalized Contextual Imitation Learning
Jinxin Liu
Li He
Yachen Kang
Zifeng Zhuang
Donglin Wang
Huazhe Xu
36
18
0
26 Jun 2023
Design from Policies: Conservative Test-Time Adaptation for Offline
  Policy Optimization
Design from Policies: Conservative Test-Time Adaptation for Offline Policy Optimization
Jinxin Liu
Hongyin Zhang
Zifeng Zhuang
Yachen Kang
Donglin Wang
Bin Wang
OffRL
42
8
0
26 Jun 2023
Waypoint Transformer: Reinforcement Learning via Supervised Learning
  with Intermediate Targets
Waypoint Transformer: Reinforcement Learning via Supervised Learning with Intermediate Targets
Anirudhan Badrinath
Yannis Flet-Berliac
Allen Nie
Emma Brunskill
OffRL
25
16
0
24 Jun 2023
Offline Skill Graph (OSG): A Framework for Learning and Planning using
  Offline Reinforcement Learning Skills
Offline Skill Graph (OSG): A Framework for Learning and Planning using Offline Reinforcement Learning Skills
Ben-ya Halevy
Y. Aperstein
Dotan Di Castro
GP
OffRL
33
1
0
23 Jun 2023
CLUE: Calibrated Latent Guidance for Offline Reinforcement Learning
CLUE: Calibrated Latent Guidance for Offline Reinforcement Learning
Jinxin Liu
Lipeng Zu
Li He
Donglin Wang
OffRL
45
8
0
23 Jun 2023
TACO: Temporal Latent Action-Driven Contrastive Loss for Visual
  Reinforcement Learning
TACO: Temporal Latent Action-Driven Contrastive Loss for Visual Reinforcement Learning
Ruijie Zheng
Xiyao Wang
Yanchao Sun
Shuang Ma
Jieyu Zhao
Huazhe Xu
Hal Daumé
Furong Huang
43
35
0
22 Jun 2023
Harnessing Mixed Offline Reinforcement Learning Datasets via Trajectory
  Weighting
Harnessing Mixed Offline Reinforcement Learning Datasets via Trajectory Weighting
Zhang-Wei Hong
Pulkit Agrawal
Rémi Tachet des Combes
Romain Laroche
OffRL
31
17
0
22 Jun 2023
Beyond OOD State Actions: Supported Cross-Domain Offline Reinforcement
  Learning
Beyond OOD State Actions: Supported Cross-Domain Offline Reinforcement Learning
Jinxin Liu
Ziqi Zhang
Zhenyu Wei
Zifeng Zhuang
Yachen Kang
Sibo Gai
Donglin Wang
OffRL
20
16
0
22 Jun 2023
Mimicking Better by Matching the Approximate Action Distribution
Mimicking Better by Matching the Approximate Action Distribution
Joao A. Candido Ramos
Lionel Blondé
Naoya Takeishi
Alexandros Kalousis
35
2
0
16 Jun 2023
Residual Q-Learning: Offline and Online Policy Customization without
  Value
Residual Q-Learning: Offline and Online Policy Customization without Value
Chenran Li
Chen Tang
Haruki Nishimura
Jean-Pierre Mercat
M. Tomizuka
Wei Zhan
OffRL
30
6
0
15 Jun 2023
Katakomba: Tools and Benchmarks for Data-Driven NetHack
Katakomba: Tools and Benchmarks for Data-Driven NetHack
Vladislav Kurenkov
Alexander Nikulin
Denis Tarasov
Sergey Kolesnikov
OffRL
30
5
0
14 Jun 2023
A Simple Unified Uncertainty-Guided Framework for Offline-to-Online
  Reinforcement Learning
A Simple Unified Uncertainty-Guided Framework for Offline-to-Online Reinforcement Learning
Siyuan Guo
Yanchao Sun
Jifeng Hu
Sili Huang
Hechang Chen
Haiyin Piao
Lichao Sun
Yi-Ju Chang
OffRL
OnRL
31
7
0
13 Jun 2023
Improving Offline-to-Online Reinforcement Learning with Q-Ensembles
Improving Offline-to-Online Reinforcement Learning with Q-Ensembles
Kai-Wen Zhao
Yi Ma
Jianye Hao
Jinyi Liu
Yan Zheng
Zhaopeng Meng
OffRL
OnRL
20
12
0
12 Jun 2023
Previous
123...567...91011
Next