ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1910.01708
  4. Cited By
Benchmarking Batch Deep Reinforcement Learning Algorithms

Benchmarking Batch Deep Reinforcement Learning Algorithms

3 October 2019
Shih-Han Chou
Wen-Yen Chang
W. Hsu
Jianlong Fu
    OffRL
ArXivPDFHTML

Papers citing "Benchmarking Batch Deep Reinforcement Learning Algorithms"

50 / 105 papers shown
Title
An Investigation of Offline Reinforcement Learning in Factorisable Action Spaces
Alex Beeson
David Ireland
Giovanni Montana
OffRL
43
2
0
17 Nov 2024
Teaching Embodied Reinforcement Learning Agents: Informativeness and
  Diversity of Language Use
Teaching Embodied Reinforcement Learning Agents: Informativeness and Diversity of Language Use
Jiajun Xi
Yinong He
Jianing Yang
Yinpei Dai
Joyce Chai
LM&Ro
24
2
0
31 Oct 2024
Random Policy Enables In-Context Reinforcement Learning within Trust Horizons
Random Policy Enables In-Context Reinforcement Learning within Trust Horizons
Weiqin Chen
Santiago Paternain
OffRL
44
0
0
25 Oct 2024
Development and Validation of Heparin Dosing Policies Using an Offline
  Reinforcement Learning Algorithm
Development and Validation of Heparin Dosing Policies Using an Offline Reinforcement Learning Algorithm
Yooseok Lim
Inbeom Park
Sujee Lee
OffRL
28
0
0
24 Sep 2024
Domain Adaptation for Offline Reinforcement Learning with Limited Samples
Domain Adaptation for Offline Reinforcement Learning with Limited Samples
Weiqin Chen
Sandipan Mishra
Santiago Paternain
OffRL
48
2
0
22 Aug 2024
Integrating Domain Knowledge for handling Limited Data in Offline RL
Integrating Domain Knowledge for handling Limited Data in Offline RL
Briti Gangopadhyay
Zhao Wang
Jia-Fong Yeh
Shingo Takamatsu
OffRL
32
0
0
11 Jun 2024
Offline Model-Based Optimization via Policy-Guided Gradient Search
Offline Model-Based Optimization via Policy-Guided Gradient Search
Yassine Chemingui
Aryan Deshwal
Trong Nghia Hoang
J. Doppa
OffRL
53
9
0
08 May 2024
Scaling Vision-and-Language Navigation With Offline RL
Scaling Vision-and-Language Navigation With Offline RL
Valay Bundele
Mahesh Bhupati
Biplab Banerjee
Aditya Grover
OffRL
29
1
0
27 Mar 2024
EasyRL4Rec: An Easy-to-use Library for Reinforcement Learning Based
  Recommender Systems
EasyRL4Rec: An Easy-to-use Library for Reinforcement Learning Based Recommender Systems
Yuanqing Yu
Chongming Gao
Jiawei Chen
Heng Tang
Yuefeng Sun
Qian Chen
Weizhi Ma
Min Zhang
OffRL
45
2
0
23 Feb 2024
Dataset Clustering for Improved Offline Policy Learning
Dataset Clustering for Improved Offline Policy Learning
Qiang Wang
Yixin Deng
Francisco Roldan Sanchez
Keru Wang
Kevin McGuinness
Noel E. O'Connor
Stephen J. Redmond
OffRL
31
2
0
14 Feb 2024
RL-MPCA: A Reinforcement Learning Based Multi-Phase Computation
  Allocation Approach for Recommender Systems
RL-MPCA: A Reinforcement Learning Based Multi-Phase Computation Allocation Approach for Recommender Systems
Jiahong Zhou
Shunhui Mao
Guoliang Yang
Bo Tang
Qianlong Xie
Lebin Lin
Xingxing Wang
Dong Wang
37
7
0
27 Dec 2023
The Generalization Gap in Offline Reinforcement Learning
The Generalization Gap in Offline Reinforcement Learning
Ishita Mediratta
Qingfei You
Minqi Jiang
Roberta Raileanu
OffRL
92
10
0
10 Dec 2023
Learning RL-Policies for Joint Beamforming Without Exploration: A Batch
  Constrained Off-Policy Approach
Learning RL-Policies for Joint Beamforming Without Exploration: A Batch Constrained Off-Policy Approach
Heasung Kim
S. Ankireddy
OffRL
37
0
0
12 Oct 2023
Robust Offline Reinforcement Learning -- Certify the Confidence Interval
Aayush Mishra
Simon S. Du
OffRL
36
0
0
28 Sep 2023
A Real-World Quadrupedal Locomotion Benchmark for Offline Reinforcement
  Learning
A Real-World Quadrupedal Locomotion Benchmark for Offline Reinforcement Learning
Sidney Besnard
Shuyu Yang
M. Fadili
OffRL
34
2
0
13 Sep 2023
ORL-AUDITOR: Dataset Auditing in Offline Deep Reinforcement Learning
ORL-AUDITOR: Dataset Auditing in Offline Deep Reinforcement Learning
L. Du
Min Chen
Mingyang Sun
Shouling Ji
Peng Cheng
Jiming Chen
Zhikun Zhang
OffRL
50
8
0
06 Sep 2023
Pruning the Way to Reliable Policies: A Multi-Objective Deep Q-Learning
  Approach to Critical Care
Pruning the Way to Reliable Policies: A Multi-Objective Deep Q-Learning Approach to Critical Care
Ali Shirali
Alexander Schubert
Ahmed Alaa
OffRL
32
3
0
13 Jun 2023
Explaining RL Decisions with Trajectories
Explaining RL Decisions with Trajectories
Shripad Deshmukh
Arpan Dasgupta
Balaji Krishnamurthy
Nan Jiang
Chirag Agarwal
Georgios Theocharous
J. Subramanian
OffRL
31
3
0
06 May 2023
Knowledge Transfer from Teachers to Learners in Growing-Batch
  Reinforcement Learning
Knowledge Transfer from Teachers to Learners in Growing-Batch Reinforcement Learning
P. Emedom-Nnamdi
A. Friesen
Bobak Shahriari
Nando de Freitas
Matthew W. Hoffman
OffRL
31
0
0
05 May 2023
Leveraging Factored Action Spaces for Efficient Offline Reinforcement
  Learning in Healthcare
Leveraging Factored Action Spaces for Efficient Offline Reinforcement Learning in Healthcare
Shengpu Tang
Maggie Makar
Michael Sjoding
Finale Doshi-Velez
Jenna Wiens
OffRL
65
40
0
02 May 2023
BCQQ: Batch-Constraint Quantum Q-Learning with Cyclic Data Re-uploading
BCQQ: Batch-Constraint Quantum Q-Learning with Cyclic Data Re-uploading
Maniraman Periyasamy
Marc Hölle
Marco Wiedmann
Daniel D. Scherer
Axel Plinge
Christopher Mutschler
OffRL
49
6
0
27 Apr 2023
Boosting Reinforcement Learning and Planning with Demonstrations: A
  Survey
Boosting Reinforcement Learning and Planning with Demonstrations: A Survey
Tongzhou Mu
H. Su
OffRL
35
1
0
23 Mar 2023
Efficient Communication via Self-supervised Information Aggregation for
  Online and Offline Multi-agent Reinforcement Learning
Efficient Communication via Self-supervised Information Aggregation for Online and Offline Multi-agent Reinforcement Learning
Cong Guan
F. Chen
Lei Yuan
Zongzhang Zhang
Yang Yu
OffRL
37
4
0
19 Feb 2023
Swapped goal-conditioned offline reinforcement learning
Swapped goal-conditioned offline reinforcement learning
Wenyan Yang
Huiling Wang
Dingding Cai
Joni Pajarinen
Joni-Kristen Kämäräinen
OffRL
OnRL
36
1
0
17 Feb 2023
Identifying Expert Behavior in Offline Training Datasets Improves
  Behavioral Cloning of Robotic Manipulation Policies
Identifying Expert Behavior in Offline Training Datasets Improves Behavioral Cloning of Robotic Manipulation Policies
Qiang-qiang Wang
Robert McCarthy
David Córdova Bulens
Francisco Roldan Sanchez
Kevin McGuinness
Noel E. O'Connor
S. Redmond
OffRL
30
3
0
30 Jan 2023
Improving Behavioural Cloning with Positive Unlabeled Learning
Improving Behavioural Cloning with Positive Unlabeled Learning
Qiang-qiang Wang
Robert McCarthy
David Córdova Bulens
Kevin McGuinness
Noel E. O'Connor
Nico Gürtler
Felix Widmaier
Francisco Roldan Sanchez
S. Redmond
OffRL
OnRL
34
8
0
27 Jan 2023
Safe Evaluation For Offline Learning: Are We Ready To Deploy?
Safe Evaluation For Offline Learning: Are We Ready To Deploy?
Hager Radi
Josiah P. Hanna
Peter Stone
Matthew E. Taylor
OffRL
ELM
36
0
0
16 Dec 2022
Benchmarking Offline Reinforcement Learning Algorithms for E-Commerce
  Order Fraud Evaluation
Benchmarking Offline Reinforcement Learning Algorithms for E-Commerce Order Fraud Evaluation
Soysal Degirmenci
Chris Jones
OffRL
27
1
0
05 Dec 2022
Offline Q-Learning on Diverse Multi-Task Data Both Scales And
  Generalizes
Offline Q-Learning on Diverse Multi-Task Data Both Scales And Generalizes
Aviral Kumar
Rishabh Agarwal
Xinyang Geng
George Tucker
Sergey Levine
OffRL
44
48
0
28 Nov 2022
The Pump Scheduling Problem: A Real-World Scenario for Reinforcement Learning
The Pump Scheduling Problem: A Real-World Scenario for Reinforcement Learning
Henrique Donancio
L. Vercouter
H. Roclawski
AI4CE
18
1
0
20 Oct 2022
GoalsEye: Learning High Speed Precision Table Tennis on a Physical Robot
GoalsEye: Learning High Speed Precision Table Tennis on a Physical Robot
Tianli Ding
L. Graesser
Saminda Abeyruwan
David B. DÁmbrosio
Anish Shankar
P. Sermanet
Pannag R. Sanketi
Corey Lynch
59
20
0
07 Oct 2022
BCRLSP: An Offline Reinforcement Learning Framework for Sequential
  Targeted Promotion
BCRLSP: An Offline Reinforcement Learning Framework for Sequential Targeted Promotion
Fanglin Chen
Xiao Liu
Bo Tang
Zhiyu Li
Serim Hwang
Guomian Zhuang
OffRL
25
1
0
16 Jul 2022
An Empirical Study of Implicit Regularization in Deep Offline RL
An Empirical Study of Implicit Regularization in Deep Offline RL
Çağlar Gülçehre
Srivatsan Srinivasan
Jakub Sygnowski
Georg Ostrovski
Mehrdad Farajtabar
Matt Hoffman
Razvan Pascanu
Arnaud Doucet
OffRL
14
16
0
05 Jul 2022
Modular Lifelong Reinforcement Learning via Neural Composition
Modular Lifelong Reinforcement Learning via Neural Composition
Jorge Armando Mendez Mendez
H. V. Seijen
Eric Eaton
OffRL
KELM
CLL
86
38
0
01 Jul 2022
Predicting the Need for Blood Transfusion in Intensive Care Units with
  Reinforcement Learning
Predicting the Need for Blood Transfusion in Intensive Care Units with Reinforcement Learning
Yuqing Wang
Yun Zhao
Linda R. Petzold
OffRL
37
5
0
26 Jun 2022
Robust Task Representations for Offline Meta-Reinforcement Learning via
  Contrastive Learning
Robust Task Representations for Offline Meta-Reinforcement Learning via Contrastive Learning
Haoqi Yuan
Zongqing Lu
SSL
OffRL
38
36
0
21 Jun 2022
Provably Efficient Offline Reinforcement Learning with Trajectory-Wise
  Reward
Provably Efficient Offline Reinforcement Learning with Trajectory-Wise Reward
Tengyu Xu
Yue Wang
Shaofeng Zou
Yingbin Liang
OffRL
35
13
0
13 Jun 2022
Incorporating Explicit Uncertainty Estimates into Deep Offline
  Reinforcement Learning
Incorporating Explicit Uncertainty Estimates into Deep Offline Reinforcement Learning
David Brandfonbrener
Rémi Tachet des Combes
Romain Laroche
OffRL
37
5
0
02 Jun 2022
Non-Markovian policies occupancy measures
Non-Markovian policies occupancy measures
Romain Laroche
Rémi Tachet des Combes
Jacob Buckman
OffRL
37
1
0
27 May 2022
Data Valuation for Offline Reinforcement Learning
Data Valuation for Offline Reinforcement Learning
Amir Abolfazli
Gregory Palmer
D. Kudenko
OffRL
28
0
0
19 May 2022
Semi-Markov Offline Reinforcement Learning for Healthcare
Semi-Markov Offline Reinforcement Learning for Healthcare
Mehdi Fatemi
Mary Wu
J. Petch
Walter Nelson
S. Connolly
Alexander Benz
A. Carnicelli
Marzyeh Ghassemi
OffRL
33
13
0
17 Mar 2022
Safe Reinforcement Learning for Legged Locomotion
Safe Reinforcement Learning for Legged Locomotion
Tsung-Yen Yang
Tingnan Zhang
Linda Luu
Sehoon Ha
Jie Tan
Wenhao Yu
29
40
0
05 Mar 2022
A Survey on Offline Reinforcement Learning: Taxonomy, Review, and Open
  Problems
A Survey on Offline Reinforcement Learning: Taxonomy, Review, and Open Problems
Rafael Figueiredo Prudencio
Marcos R. O. A. Máximo
Esther Luna Colombini
OffRL
26
222
0
02 Mar 2022
Learning to Liquidate Forex: Optimal Stopping via Adaptive Top-K
  Regression
Learning to Liquidate Forex: Optimal Stopping via Adaptive Top-K Regression
Diksha Garg
Pankaj Malhotra
Anil Bhatia
Sanjay Bhat
L. Vig
Gautam M. Shroff
29
0
0
25 Feb 2022
Why Should I Trust You, Bellman? The Bellman Error is a Poor Replacement
  for Value Error
Why Should I Trust You, Bellman? The Bellman Error is a Poor Replacement for Value Error
Scott Fujimoto
David Meger
Doina Precup
Ofir Nachum
S. Gu
30
32
0
28 Jan 2022
Offline Reinforcement Learning for Road Traffic Control
Offline Reinforcement Learning for Road Traffic Control
Mayuresh Kunjir
Sanjay Chawla
OffRL
32
4
0
07 Jan 2022
Compressive Features in Offline Reinforcement Learning for Recommender
  Systems
Compressive Features in Offline Reinforcement Learning for Recommender Systems
Hung Nguyen
Minh Nguyen
Long Pham
Jennifer Adorno Nieves
OffRL
18
2
0
16 Nov 2021
A Dataset Perspective on Offline Reinforcement Learning
A Dataset Perspective on Offline Reinforcement Learning
Kajetan Schweighofer
Andreas Radler
Marius-Constantin Dinu
M. Hofmarcher
Vihang Patil
Angela Bitto-Nemling
Hamid Eghbalzadeh
Sepp Hochreiter
OffRL
30
17
0
08 Nov 2021
Batch Reinforcement Learning from Crowds
Batch Reinforcement Learning from Crowds
Guoxi Zhang
H. Kashima
OffRL
40
5
0
08 Nov 2021
d3rlpy: An Offline Deep Reinforcement Learning Library
d3rlpy: An Offline Deep Reinforcement Learning Library
Takuma Seno
M. Imai
OffRL
GP
65
100
0
06 Nov 2021
123
Next