A Survey on Offline Reinforcement Learning: Taxonomy, Review, and Open Problems

2 March 2022
Rafael Figueiredo Prudencio
Marcos R. O. A. Máximo
Esther Luna Colombini
    OffRL

Papers citing "A Survey on Offline Reinforcement Learning: Taxonomy, Review, and Open Problems"

48 / 48 papers shown
Title
Enhancing Trust Management System for Connected Autonomous Vehicles Using Machine Learning Methods: A Survey
Qian Xu
Lei Zhang
Yong-Jin Liu
29
0
0
10 May 2025
Do We Need Transformers to Play FPS Video Games?
Karmanbir Batth
Krish Sethi
Aly Shariff
Leo Shi
Hetul Patel
OffRL
AI4CE
31
0
0
24 Apr 2025
Reward Generation via Large Vision-Language Model in Offline Reinforcement Learning
Younghwan Lee
Tung M. Luu
Donghoon Lee
Chang D. Yoo
3DV
VLM
OffRL
41
0
0
03 Apr 2025
DPR: Diffusion Preference-based Reward for Offline Reinforcement Learning
Teng Pang
Bingzheng Wang
Guoqiang Wu
Yilong Yin
OffRL
70
0
0
03 Mar 2025
Behavioral Entropy-Guided Dataset Generation for Offline Reinforcement Learning
Wesley A. Suttle
A. Suresh
Carlos Nieto-Granda
OffRL
95
0
0
06 Feb 2025
Temporal Logic Specification-Conditioned Decision Transformer for Offline Safe Reinforcement Learning
Zijian Guo
Weichao Zhou
Wenchao Li
OffRL
102
2
0
28 Jan 2025
ACL-QL: Adaptive Conservative Level in Q-Learning for Offline Reinforcement Learning
Kun Wu
Yinuo Zhao
Zhihao Xu
Zhengping Che
Chengxiang Yin
C. Liu
Qinru Qiu
Feifei Feng
OffRL
100
1
0
22 Dec 2024
Hierarchical Subspaces of Policies for Continual Offline Reinforcement Learning
Anthony Kobanda
Rémy Portelas
Odalric-Ambrym Maillard
Ludovic Denoyer
OffRL
CLL
77
0
0
19 Dec 2024
CATP-LLM: Empowering Large Language Models for Cost-Aware Tool Planning
Duo Wu
Yufei Guo
Yuan Meng
Yanning Zhang
Le Sun
Zhi Wang
189
0
0
25 Nov 2024
Latent Feature Mining for Predictive Model Enhancement with Large Language Models
Bingxuan Li
Pengyi Shi
Amy Ward
57
9
0
06 Oct 2024
Uncertainty-aware Reward Model: Teaching Reward Models to Know What is Unknown
Xingzhou Lou
Dong Yan
Wei Shen
Yuzi Yan
Jian Xie
Junge Zhang
47
22
0
01 Oct 2024
Offline Reinforcement Learning for Learning to Dispatch for Job Shop Scheduling
Jesse van Remmerden
Z. Bukhsh
Yingqian Zhang
OffRL
OnRL
45
1
0
16 Sep 2024
KAN v.s. MLP for Offline Reinforcement Learning
Haihong Guo
Fengxin Li
Jiao Li
Hongyan Liu
OffRL
33
0
0
15 Sep 2024
MA-CDMR: An Intelligent Cross-domain Multicast Routing Method based on Multiagent Deep Reinforcement Learning in Multi-domain SDWN
Miao Ye
Hongwen Hu
Xiaoli Wang
Yuping Wang
Yong Wang
Wen Peng
Jihao Zheng
23
1
0
27 Aug 2024
Reinforcement Learning for Sustainable Energy: A Survey
Koen Ponse
Felix Kleuker
Márton Fejér
Álvaro Serra-Gómez
Aske Plaat
Thomas M. Moerland
OffRL
AI4CE
40
1
0
26 Jul 2024
Coordination Failure in Cooperative Offline MARL
C. Tilbury
Claude Formanek
Louise Beyers
Jonathan P. Shock
Arnu Pretorius
OffRL
38
1
0
01 Jul 2024
Dispelling the Mirage of Progress in Offline MARL through Standardised Baselines and Evaluation
Claude Formanek
C. Tilbury
Louise Beyers
Jonathan P. Shock
Arnu Pretorius
OffRL
39
1
0
13 Jun 2024
Residual Learning and Context Encoding for Adaptive Offline-to-Online Reinforcement Learning
Mohammadreza Nakhaei
Aidan Scannell
Joni Pajarinen
OffRL
49
1
0
12 Jun 2024
Integrating Domain Knowledge for handling Limited Data in Offline RL
Briti Gangopadhyay
Zhao Wang
Jia-Fong Yeh
Shingo Takamatsu
OffRL
32
0
0
11 Jun 2024
SwiftRL: Towards Efficient Reinforcement Learning on Real Processing-In-Memory Systems
Kailash Gogineni
Sai Santosh Dayapule
Juan Gómez Luna
Karthikeya Gogineni
Peng Wei
Tian-Shing Lan
Mohammad Sadrosadati
Onur Mutlu
Guru Venkataramani
50
10
0
07 May 2024
IBCB: Efficient Inverse Batched Contextual Bandit for Behavioral Evolution History
Yi Xu
Weiran Shen
Xiao Zhang
Jun Xu
OffRL
41
0
0
24 Mar 2024
A Model-Based Approach for Improving Reinforcement Learning Efficiency Leveraging Expert Observations
E. C. Ozcan
Vittorio Giammarino
James Queeney
I. Paschalidis
OffRL
39
0
0
29 Feb 2024
Transductive Reward Inference on Graph
B. Qu
Xiaofeng Cao
Qing-Wu Guo
Yi Chang
Ivor W. Tsang
Chengqi Zhang
OffRL
35
0
0
06 Feb 2024
The RL/LLM Taxonomy Tree: Reviewing Synergies Between Reinforcement Learning and Large Language Models
M. Pternea
Prerna Singh
Abir Chakraborty
Y. Oruganti
M. Milletarí
Sayli Bapat
Kebei Jiang
OffRL
24
7
0
02 Feb 2024
An Invitation to Deep Reinforcement Learning
Bernhard Jaeger
Andreas Geiger
OffRL
OOD
78
5
0
13 Dec 2023
The Safety Challenges of Deep Learning in Real-World Type 1 Diabetes Management
Harry Emerson
Ryan McConville
Matt Guy
33
0
0
23 Oct 2023
Raijū: Reinforcement Learning-Guided Post-Exploitation for Automating Security Assessment of Network Systems
V. Pham
Hien Do Hoang
Phan Thanh Trung
Van Dinh Quoc
T. To
Phan The Duy
14
0
0
27 Sep 2023
Hundreds Guide Millions: Adaptive Offline Reinforcement Learning with Expert Guidance
Qisen Yang
Shenzhi Wang
Qihang Zhang
Gao Huang
Shiji Song
OffRL
OnRL
26
8
0
04 Sep 2023
Real Robot Challenge 2022: Learning Dexterous Manipulation from Offline Data in the Real World
Nicolas Gurtler
Felix Widmaier
Cansu Sancaktar
Sebastian Blaes
Pavel Kolev
...
Arman Raayatsanati
Hehui Zheng
Barnabas Gavin Cangan
Bernhard Schölkopf
Georg Martius
OffRL
35
2
0
15 Aug 2023
Benchmarking Offline Reinforcement Learning on Real-Robot Hardware
Nico Gürtler
Sebastian Blaes
Pavel Kolev
Felix Widmaier
Manuel Wüthrich
Stefan Bauer
Bernhard Schölkopf
Georg Martius
OffRL
33
28
0
28 Jul 2023
IQL-TD-MPC: Implicit Q-Learning for Hierarchical Model Predictive Control
Rohan Chitnis
Yingchen Xu
B. Hashemi
Lucas Lehnert
Ürün Dogan
Zheqing Zhu
Olivier Delalleau
OffRL
26
9
0
01 Jun 2023
Revisiting the Minimalist Approach to Offline Reinforcement Learning
Denis Tarasov
Vladislav Kurenkov
Alexander Nikulin
Sergey Kolesnikov
OffRL
33
36
0
16 May 2023
Autonomous Navigation for Robot-assisted Intraluminal and Endovascular Procedures: A Systematic Review
Ameya Pore
Zhen Li
Diego Dall'Alba
A. Hernansanz
Elena De Momi
A. Menciassi
Alicia Casals Gelpí
J. Dankelman
Paolo Fiorini
E. V. Poorten
24
29
0
06 May 2023
Learning to Control Autonomous Fleets from Observation via Offline Reinforcement Learning
Carolin Schmidt
Daniele Gammelli
Francisco Câmara Pereira
Filipe Rodrigues
OffRL
14
4
0
28 Feb 2023
Swapped goal-conditioned offline reinforcement learning
Wenyan Yang
Huiling Wang
Dingding Cai
Joni Pajarinen
Joni-Kristen Kämäräinen
OffRL
OnRL
33
1
0
17 Feb 2023
AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners
Zhixuan Liang
Yao Mu
Mingyu Ding
Fei Ni
M. Tomizuka
Ping Luo
80
99
0
03 Feb 2023
Domain Generalization for Robust Model-Based Offline Reinforcement Learning
Alan Clark
Shoaib Ahmed Siddiqui
Robert Kirk
Usman Anwar
Stephen Chung
David M. Krueger
OOD
OffRL
27
0
0
27 Nov 2022
Active Example Selection for In-Context Learning
Yiming Zhang
Shi Feng
Chenhao Tan
SILM
LRM
32
186
0
08 Nov 2022
Multi-Step Prediction in Linearized Latent State Spaces for Representation Learning
A. Tytarenko
BDL
30
1
0
02 Sep 2022
Offline Equilibrium Finding
Shuxin Li
Xinrun Wang
Youzhi Zhang
Jakub Cerny
Pengdeng Li
Hau Chan
Bo An
OffRL
43
2
0
12 Jul 2022
Offline Deep Reinforcement Learning for Dynamic Pricing of Consumer Credit
Raad Khraishi
Ramin Okhrati
OffRL
21
5
0
06 Mar 2022
A Survey on Safety-Critical Driving Scenario Generation -- A Methodological Perspective
Wenhao Ding
Chejian Xu
Mansur Arief
Hao-ming Lin
Bo-wen Li
Ding Zhao
30
144
0
04 Feb 2022
d3rlpy: An Offline Deep Reinforcement Learning Library
Takuma Seno
M. Imai
OffRL
GP
60
100
0
06 Nov 2021
Offline Reinforcement Learning with Implicit Q-Learning
Ilya Kostrikov
Ashvin Nair
Sergey Levine
OffRL
214
843
0
12 Oct 2021
A Workflow for Offline Model-Free Robotic Reinforcement Learning
Aviral Kumar
Anika Singh
Stephen Tian
Chelsea Finn
Sergey Levine
OffRL
143
85
0
22 Sep 2021
COMBO: Conservative Offline Model-Based Policy Optimization
Tianhe Yu
Aviral Kumar
Rafael Rafailov
Aravind Rajeswaran
Sergey Levine
Chelsea Finn
OffRL
219
413
0
16 Feb 2021
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
OffRL
GP
340
1,960
0
04 May 2020
Deep Reinforcement Learning for Autonomous Driving: A Survey
B. R. Kiran
Ibrahim Sobh
V. Talpaert
Patrick Mannion
A. A. Sallab
S. Yogamani
P. Pérez
165
1,632
0
02 Feb 2020