Reproducibility of Benchmarked Deep Reinforcement Learning Tasks for Continuous Control

10 August 2017
Riashat Islam
Peter Henderson
Maziar Gomrokchi
Doina Precup
    BDL
    OffRL

Papers citing "Reproducibility of Benchmarked Deep Reinforcement Learning Tasks for Continuous Control"

50 / 55 papers shown
Multi-parameter Control for the (1+($λ$,$λ$))-GA on OneMax via Deep Reinforcement Learning
Tai Nguyen
Phong Le
Carola Doerr
Nguyen Dang
27
0
0
19 May 2025
CaRL: Learning Scalable Planning Policies with Simple Rewards
Bernhard Jaeger
D. Dauner
Jens Beißwenger
Simon Gerstenecker
Kashyap Chitta
Andreas Geiger
60
1
0
24 Apr 2025
AlgOS: Algorithm Operating System
Llewyn Salt
Marcus Gallagher
VLM
37
0
0
07 Apr 2025
ACL-QL: Adaptive Conservative Level in Q-Learning for Offline Reinforcement Learning
Kun Wu
Yinuo Zhao
Zhihao Xu
Zhengping Che
Chengxiang Yin
C. Liu
Qinru Qiu
Feifei Feng
OffRL
109
1
0
22 Dec 2024
What is Reproducibility in Artificial Intelligence and Machine Learning Research?
Abhyuday Desai
Mohamed Abdelhamid
N. R. Padalkar
AI4CE
32
2
0
29 Apr 2024
Task-optimal data-driven surrogate models for eNMPC via differentiable simulation and optimization
Daniel Mayfrank
Na Young Ahn
Alexander Mitsos
Manuel Dahmen
34
2
0
21 Mar 2024
Signal Temporal Logic-Guided Apprenticeship Learning
Aniruddh Gopinath Puranic
Jyotirmoy V. Deshmukh
Stefanos Nikolaidis
46
2
0
09 Nov 2023
Quantifying Language Models' Sensitivity to Spurious Features in Prompt Design or: How I learned to start worrying about prompt formatting
Melanie Sclar
Yejin Choi
Yulia Tsvetkov
Alane Suhr
53
308
0
17 Oct 2023
PASTA: Pretrained Action-State Transformer Agents
Raphael Boige
Yannis Flet-Berliac
Arthur Flajolet
Guillaume Richard
Thomas Pierrot
LM&Ro
OffRL
45
5
0
20 Jul 2023
Off-Policy RL Algorithms Can be Sample-Efficient for Continuous Control via Sample Multiple Reuse
Jiafei Lyu
Le Wan
Zongqing Lu
Xiu Li
OffRL
36
9
0
29 May 2023
CACTO: Continuous Actor-Critic with Trajectory Optimization -- Towards global optimality
Gianluigi Grandesso
Elisa Alboni
G. P. R. Papini
Patrick M. Wensing
Andrea Del Prete
30
15
0
12 Nov 2022
Hyperbolic Deep Reinforcement Learning
Edoardo Cetin
B. Chamberlain
Michael M. Bronstein
Jonathan J. Hunt
50
21
0
04 Oct 2022
Towards a Standardised Performance Evaluation Protocol for Cooperative MARL
R. Gorsane
Omayma Mahjoub
Ruan de Kock
Roland Dubb
Siddarth S. Singh
Arnu Pretorius
OffRL
44
50
0
21 Sep 2022
Measuring Interventional Robustness in Reinforcement Learning
Katherine Avery
Jack Kenney
Pracheta Amaranath
Erica Cai
David D. Jensen
21
0
0
19 Sep 2022
Towards Augmented Microscopy with Reinforcement Learning-Enhanced Workflows
Michael Xu
Abinash Kumar
J. Lebeau
20
7
0
04 Aug 2022
Leakage and the Reproducibility Crisis in ML-based Science
Sayash Kapoor
Arvind Narayanan
25
177
0
14 Jul 2022
ARLO: A Framework for Automated Reinforcement Learning
Marco Mussi
Davide Lombarda
Alberto Maria Metelli
F. Trovò
Marcello Restelli
OffRL
41
4
0
20 May 2022
Deep Learning Reproducibility and Explainable AI (XAI)
Anastasia-Maria Leventi-Peetz
T. Östreich
19
9
0
23 Feb 2022
Hyperparameter Tuning for Deep Reinforcement Learning Applications
M. Kiran
Melis Ozyildirim
40
22
0
26 Jan 2022
Reproducibility in Learning
R. Impagliazzo
Rex Lei
T. Pitassi
Jessica Sorrell
32
43
0
20 Jan 2022
Automated Reinforcement Learning (AutoRL): A Survey and Open Problems
Jack Parker-Holder
Raghunandan Rajan
Xingyou Song
André Biedenkapp
Yingjie Miao
...
Vu-Linh Nguyen
Roberto Calandra
Aleksandra Faust
Frank Hutter
Marius Lindauer
AI4CE
33
100
0
11 Jan 2022
Parallelized and Randomized Adversarial Imitation Learning for Safety-Critical Self-Driving Vehicles
Won Joon Yun
Myungjae Shin
Soyi Jung
S. Kwon
Joongheon Kim
24
5
0
26 Dec 2021
Aggressive Q-Learning with Ensembles: Achieving Both High Sample Efficiency and High Asymptotic Performance
Yanqiu Wu
Xinyue Chen
Che Wang
Yiming Zhang
Keith Ross
OffRL
17
9
0
17 Nov 2021
Kernel-based diffusion approximated Markov decision processes for autonomous navigation and control on unstructured terrains
Junhong Xu
Kai-Li Yin
Zheng Chen
Jason M. Gregory
Ethan Stump
Lantao Liu
37
2
0
16 Nov 2021
Which Model to Trust: Assessing the Influence of Models on the Performance of Reinforcement Learning Algorithms for Continuous Control Tasks
Giacomo Arcieri
David Wölfle
Eleni Chatzi
OffRL
27
5
0
25 Oct 2021
Learning Pessimism for Robust and Efficient Off-Policy Reinforcement Learning
Edoardo Cetin
Oya Celiktutan
OffRL
47
17
0
07 Oct 2021
Deep Reinforcement Learning at the Edge of the Statistical Precipice
Rishabh Agarwal
Max Schwarzer
Pablo Samuel Castro
Aaron Courville
Marc G. Bellemare
OffRL
61
639
0
30 Aug 2021
Variational Actor-Critic Algorithms
Yuhua Zhu
Lexing Ying
OffRL
15
0
0
03 Aug 2021
Simplifying Deep Reinforcement Learning via Self-Supervision
Daochen Zha
Kwei-Herng Lai
Kaixiong Zhou
Xia Hu
SSL
49
15
0
10 Jun 2021
What Matters for Adversarial Imitation Learning?
Manu Orsini
Anton Raichuk
Léonard Hussenot
Damien Vincent
Robert Dadashi
Sertan Girgin
M. Geist
Olivier Bachem
Olivier Pietquin
Marcin Andrychowicz
55
77
0
01 Jun 2021
On the Importance of Hyperparameter Optimization for Model-based Reinforcement Learning
Bangnig Zhang
Raghunandan Rajan
Luis Pineda
Nathan Lambert
André Biedenkapp
Kurtland Chua
Frank Hutter
Roberto Calandra
29
100
0
26 Feb 2021
A Methodology for the Development of RL-Based Adaptive Traffic Signal Controllers
G. Varela
Pedro P. Santos
Alberto Sardinha
Francisco S. Melo
26
2
0
24 Jan 2021
A Study of Checkpointing in Large Scale Training of Deep Neural Networks
Elvis Rojas
A. Kahira
Esteban Meneses
L. Bautista-Gomez
Rosa M. Badia
29
22
0
01 Dec 2020
Dirichlet policies for reinforced factor portfolios
Eric André
Guillaume Coqueret
25
7
0
10 Nov 2020
How to Make Deep RL Work in Practice
Nirnai Rao
Elie Aljalbout
Axel Sauer
Sami Haddadin
OffRL
29
11
0
25 Oct 2020
Deep Reinforcement Learning for Closed-Loop Blood Glucose Control
Ian Fox
Joyce M. Lee
R. Pop-Busui
Jenna Wiens
BDL
OffRL
30
50
0
18 Sep 2020
Quantity vs. Quality: On Hyperparameter Optimization for Deep Reinforcement Learning
L. Hertel
Pierre Baldi
D. Gillen
BDL
31
12
0
29 Jul 2020
One Policy to Control Them All: Shared Modular Policies for Agent-Agnostic Control
Wenlong Huang
Igor Mordatch
Deepak Pathak
51
167
0
09 Jul 2020
What Matters In On-Policy Reinforcement Learning? A Large-Scale Empirical Study
Marcin Andrychowicz
Anton Raichuk
Piotr Stańczyk
Manu Orsini
Sertan Girgin
...
M. Geist
Olivier Pietquin
Marcin Michalski
Sylvain Gelly
Olivier Bachem
OffRL
31
214
0
10 Jun 2020
Randomized Policy Learning for Continuous State and Action MDPs
Hiteshi Sharma
Rahul Jain
21
1
0
08 Jun 2020
Robotic Arm Control and Task Training through Deep Reinforcement Learning
Andrea Franceschetti
E. Tosello
Nicola Castaman
Stefano Ghidoni
12
32
0
06 May 2020
Explore and Exploit with Heterotic Line Bundle Models
Magdalena Larfors
Robin Schneider
41
38
0
10 Mar 2020
SLM Lab: A Comprehensive Benchmark and Modular Software Framework for Reproducible Deep Reinforcement Learning
Keng Wah Loon
L. Graesser
Milan Cvitkovic
OffRL
26
13
0
28 Dec 2019
Recruitment-imitation Mechanism for Evolutionary Reinforcement Learning
Shuai Lu
Shuai Han
Wenbo Zhou
Junwei Zhang
29
26
0
13 Dec 2019
On the Global Convergence of Actor-Critic: A Case for Linear Quadratic Regulator with Ergodic Cost
Zhuoran Yang
Yongxin Chen
Mingyi Hong
Zhaoran Wang
32
39
0
14 Jul 2019
Q-Learning for Continuous Actions with Cross-Entropy Guided Policies
Riley Simmons-Edler
Ben Eisner
E. Mitchell
Sebastian Seung
Daniel D. Lee
44
28
0
25 Mar 2019
Dopamine: A Research Framework for Deep Reinforcement Learning
Pablo Samuel Castro
Subhodeep Moitra
Carles Gelada
Saurabh Kumar
Marc G. Bellemare
OffRL
28
276
0
14 Dec 2018
Deterministic Implementations for Reproducibility in Deep Reinforcement Learning
P. Nagarajan
Garrett A. Warnell
Peter Stone
22
51
0
15 Sep 2018
Policy Search in Continuous Action Domains: an Overview
Olivier Sigaud
F. Stulp
16
72
0
13 Mar 2018
The History Began from AlexNet: A Comprehensive Survey on Deep Learning Approaches
Md. Zahangir Alom
T. Taha
C. Yakopcic
Stefan Westberg
P. Sidike
Mst Shamima Nasrin
B. Van Essen
A. Awwal
V. Asari
VLM
29
875
0
03 Mar 2018