ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1904.12901
  4. Cited By
Challenges of Real-World Reinforcement Learning

Challenges of Real-World Reinforcement Learning

29 April 2019
Gabriel Dulac-Arnold
D. Mankowitz
Todd Hester
    OffRL
ArXivPDFHTML

Papers citing "Challenges of Real-World Reinforcement Learning"

50 / 108 papers shown
Title
ABIDES-Gym: Gym Environments for Multi-Agent Discrete Event Simulation
  and Application to Financial Markets
ABIDES-Gym: Gym Environments for Multi-Agent Discrete Event Simulation and Application to Financial Markets
Selim Amrouni
Aymeric Moulin
Jared Vann
Svitlana Vyetrenko
T. Balch
Manuela Veloso
AI4CE
26
42
0
27 Oct 2021
Learning Robust Controllers Via Probabilistic Model-Based Policy Search
Learning Robust Controllers Via Probabilistic Model-Based Policy Search
V. Charvet
B. S. Jensen
R. Murray-Smith
19
2
0
26 Oct 2021
GrowSpace: Learning How to Shape Plants
GrowSpace: Learning How to Shape Plants
Yasmeen Hitti
Ionelia Buzatu
Manuel Del Verme
M. Lefsrud
Florian Golemo
A. Durand
19
2
0
15 Oct 2021
Correct Me if I am Wrong: Interactive Learning for Robotic Manipulation
Correct Me if I am Wrong: Interactive Learning for Robotic Manipulation
Eugenio Chisari
Tim Welschehold
Joschka Boedecker
Wolfram Burgard
Abhinav Valada
19
37
0
07 Oct 2021
Adaptive control of a mechatronic system using constrained residual
  reinforcement learning
Adaptive control of a mechatronic system using constrained residual reinforcement learning
Tom Staessens
Tom Lefebvre
Guillaume Crevecoeur
19
16
0
06 Oct 2021
Deep reinforcement learning for guidewire navigation in coronary artery
  phantom
Deep reinforcement learning for guidewire navigation in coronary artery phantom
Jihoon Kweon
Kyunghwan Kim
Chaehyuk Lee
Hwi Kwon
Jinwoo Park
...
Inwook Back
J. Roh
Y. Moon
Jaesoon Choi
Young-Hak Kim
OnRL
18
33
0
05 Oct 2021
Deep Reinforcement Learning with Adjustments
Deep Reinforcement Learning with Adjustments
H. Khorasgani
Haiyan Wang
Chetan Gupta
Susumu Serita
18
2
0
28 Sep 2021
Semi-Supervised Imitation Learning with Mixed Qualities of
  Demonstrations for Autonomous Driving
Semi-Supervised Imitation Learning with Mixed Qualities of Demonstrations for Autonomous Driving
Gunmin Lee
Wooseok Oh
Seungyoung Shin
Dohyeong Kim
Jeongwoo Oh
Jaeyeon Jeong
Sungjoon Choi
Songhwai Oh
SSL
33
2
0
23 Sep 2021
A Survey of Text Games for Reinforcement Learning informed by Natural
  Language
A Survey of Text Games for Reinforcement Learning informed by Natural Language
P. Osborne
Heido Nomm
André Freitas
AI4CE
32
24
0
20 Sep 2021
Learning Robot Swarm Tactics over Complex Adversarial Environments
Learning Robot Swarm Tactics over Complex Adversarial Environments
A. Behjat
Hemanth Manjunatha
Prajit KrisshnaKumar
Apurv Jani
Leighton Collins
...
Joseph P. Distefano
David Doermann
Karthik Dantu
Ehsan Esfahani
Souma Chowdhury
6
11
0
13 Sep 2021
Reinforcement Learning based Condition-oriented Maintenance Scheduling
  for Flow Line Systems
Reinforcement Learning based Condition-oriented Maintenance Scheduling for Flow Line Systems
Raphael Lamprecht
Ferdinand Wurst
Marco F. Huber
14
3
0
27 Aug 2021
Accelerating the Learning of TAMER with Counterfactual Explanations
Accelerating the Learning of TAMER with Counterfactual Explanations
Jakob Karalus
F. Lindner
OffRL
29
4
0
03 Aug 2021
The Benchmark Lottery
The Benchmark Lottery
Mostafa Dehghani
Yi Tay
A. Gritsenko
Zhe Zhao
N. Houlsby
Fernando Diaz
Donald Metzler
Oriol Vinyals
42
89
0
14 Jul 2021
RRL: Resnet as representation for Reinforcement Learning
RRL: Resnet as representation for Reinforcement Learning
Rutav Shah
Vikash Kumar
OffRL
30
111
0
07 Jul 2021
Evaluating the progress of Deep Reinforcement Learning in the real
  world: aligning domain-agnostic and domain-specific research
Evaluating the progress of Deep Reinforcement Learning in the real world: aligning domain-agnostic and domain-specific research
J. Luis
E. Crawley
B. Cameron
OffRL
25
6
0
07 Jul 2021
Unsupervised Skill Discovery with Bottleneck Option Learning
Unsupervised Skill Discovery with Bottleneck Option Learning
Jaekyeom Kim
Seohong Park
Gunhee Kim
32
32
0
27 Jun 2021
Learning Policies with Zero or Bounded Constraint Violation for
  Constrained MDPs
Learning Policies with Zero or Bounded Constraint Violation for Constrained MDPs
Tao-Wen Liu
Ruida Zhou
D. Kalathil
P. R. Kumar
Chao Tian
29
78
0
04 Jun 2021
Universal Off-Policy Evaluation
Universal Off-Policy Evaluation
Yash Chandak
S. Niekum
Bruno C. da Silva
Erik Learned-Miller
Emma Brunskill
Philip S. Thomas
OffRL
ELM
32
52
0
26 Apr 2021
Model-aided Deep Reinforcement Learning for Sample-efficient UAV
  Trajectory Design in IoT Networks
Model-aided Deep Reinforcement Learning for Sample-efficient UAV Trajectory Design in IoT Networks
Omid Esrafilian
Harald Bayerlein
David Gesbert
16
6
0
21 Apr 2021
Bellman: A Toolbox for Model-Based Reinforcement Learning in TensorFlow
Bellman: A Toolbox for Model-Based Reinforcement Learning in TensorFlow
John Mcleod
Hrvoje Stojić
Vincent Adam
Dongho Kim
Jordi Grau-Moya
Peter Vrancx
Felix Leibfried
OffRL
21
2
0
26 Mar 2021
Robust Multi-Modal Policies for Industrial Assembly via Reinforcement
  Learning and Demonstrations: A Large-Scale Study
Robust Multi-Modal Policies for Industrial Assembly via Reinforcement Learning and Demonstrations: A Large-Scale Study
Jianlan Luo
Oleg O. Sushkov
Rugile Pevceviciute
Wenzhao Lian
Chang Su
Mel Vecerík
Ning Ye
S. Schaal
Jonathan Scholz
OffRL
27
60
0
21 Mar 2021
Combining Pessimism with Optimism for Robust and Efficient Model-Based
  Deep Reinforcement Learning
Combining Pessimism with Optimism for Robust and Efficient Model-Based Deep Reinforcement Learning
Sebastian Curi
Ilija Bogunovic
Andreas Krause
39
17
0
18 Mar 2021
RecSim NG: Toward Principled Uncertainty Modeling for Recommender
  Ecosystems
RecSim NG: Toward Principled Uncertainty Modeling for Recommender Ecosystems
Martin Mladenov
Chih-Wei Hsu
Vihan Jain
Eugene Ie
Christopher Colby
Nicolas Mayoraz
H. Pham
Dustin Tran
Ivan Vendrov
Craig Boutilier
BDL
15
31
0
14 Mar 2021
Gym-ANM: Reinforcement Learning Environments for Active Network
  Management Tasks in Electricity Distribution Systems
Gym-ANM: Reinforcement Learning Environments for Active Network Management Tasks in Electricity Distribution Systems
Robin Henry
D. Ernst
21
34
0
14 Mar 2021
Offline Reinforcement Learning with Pseudometric Learning
Offline Reinforcement Learning with Pseudometric Learning
Robert Dadashi
Shideh Rezaeifar
Nino Vieillard
Léonard Hussenot
Olivier Pietquin
M. Geist
OffRL
33
40
0
02 Mar 2021
GST: Group-Sparse Training for Accelerating Deep Reinforcement Learning
GST: Group-Sparse Training for Accelerating Deep Reinforcement Learning
Juhyoung Lee
Sangyeob Kim
Sangjin Kim
Wooyoung Jo
H. Yoo
OffRL
21
9
0
24 Jan 2021
Social NCE: Contrastive Learning of Socially-aware Motion
  Representations
Social NCE: Contrastive Learning of Socially-aware Motion Representations
Yuejiang Liu
Qi Yan
Alexandre Alahi
29
101
0
21 Dec 2020
Resonance: Replacing Software Constants with Context-Aware Models in
  Real-time Communication
Resonance: Replacing Software Constants with Context-Aware Models in Real-time Communication
J. Gupchup
A. Aazami
Yaran Fan
Senja Filipi
Tom Finley
...
D. Perednya
Sriram Srinivasan
John Langford
Ross Cutler
J. Gehrke
OffRL
19
1
0
23 Nov 2020
Language-guided Navigation via Cross-Modal Grounding and Alternate
  Adversarial Learning
Language-guided Navigation via Cross-Modal Grounding and Alternate Adversarial Learning
Weixia Zhang
Chao Ma
Qi Wu
Xiaokang Yang
39
44
0
22 Nov 2020
OPAL: Offline Primitive Discovery for Accelerating Offline Reinforcement
  Learning
OPAL: Offline Primitive Discovery for Accelerating Offline Reinforcement Learning
Anurag Ajay
Aviral Kumar
Pulkit Agrawal
Sergey Levine
Ofir Nachum
OffRL
OnRL
34
155
0
26 Oct 2020
Multi-UAV Path Planning for Wireless Data Harvesting with Deep
  Reinforcement Learning
Multi-UAV Path Planning for Wireless Data Harvesting with Deep Reinforcement Learning
Harald Bayerlein
Mirco Theile
Marco Caccamo
David Gesbert
29
120
0
23 Oct 2020
Reinforcement Learning with Combinatorial Actions: An Application to
  Vehicle Routing
Reinforcement Learning with Combinatorial Actions: An Application to Vehicle Routing
A. Delarue
Ross Anderson
Christian Tjandraatmadja
35
93
0
22 Oct 2020
Robust Constrained Reinforcement Learning for Continuous Control with
  Model Misspecification
Robust Constrained Reinforcement Learning for Continuous Control with Model Misspecification
D. Mankowitz
D. A. Calian
Rae Jeong
Cosmin Paduraru
N. Heess
Sumanth Dathathri
Martin Riedmiller
Timothy A. Mann
24
11
0
20 Oct 2020
Artificial Intelligence for UAV-enabled Wireless Networks: A Survey
Artificial Intelligence for UAV-enabled Wireless Networks: A Survey
Mohamed-Amine Lahmeri
Mustafa A. Kishk
Mohamed-Slim Alouini
26
102
0
24 Sep 2020
Human Engagement Providing Evaluative and Informative Advice for
  Interactive Reinforcement Learning
Human Engagement Providing Evaluative and Informative Advice for Interactive Reinforcement Learning
Adam Bignold
Francisco Cruz
Richard Dazeley
Peter Vamplew
Cameron Foale
22
18
0
21 Sep 2020
QPLEX: Duplex Dueling Multi-Agent Q-Learning
QPLEX: Duplex Dueling Multi-Agent Q-Learning
Jianhao Wang
Zhizhou Ren
Terry Liu
Yang Yu
Chongjie Zhang
OffRL
51
437
0
03 Aug 2020
Probabilistic Active Meta-Learning
Probabilistic Active Meta-Learning
Jean Kaddour
Steindór Sæmundsson
M. Deisenroth
22
34
0
17 Jul 2020
UAV Path Planning for Wireless Data Harvesting: A Deep Reinforcement
  Learning Approach
UAV Path Planning for Wireless Data Harvesting: A Deep Reinforcement Learning Approach
Harald Bayerlein
Mirco Theile
Marco Caccamo
David Gesbert
18
54
0
01 Jul 2020
Critic Regularized Regression
Critic Regularized Regression
Ziyun Wang
Alexander Novikov
Konrad Zolna
Jost Tobias Springenberg
Scott E. Reed
...
Noah Y. Siegel
J. Merel
Çağlar Gülçehre
N. Heess
Nando de Freitas
OffRL
36
317
0
26 Jun 2020
Learning to Play Table Tennis From Scratch using Muscular Robots
Learning to Play Table Tennis From Scratch using Muscular Robots
Le Chen
Simon Guist
Roberto Calandra
V. Berenz
Bernhard Schölkopf
Jan Peters
11
88
0
10 Jun 2020
Off-policy Learning for Remote Electrical Tilt Optimization
Off-policy Learning for Remote Electrical Tilt Optimization
Filippo Vannella
Jaeseong Jeong
Alexandre Proutière
OffRL
14
14
0
21 May 2020
A Survey of Reinforcement Learning Algorithms for Dynamically Varying
  Environments
A Survey of Reinforcement Learning Algorithms for Dynamically Varying Environments
Sindhu Padakandla
25
144
0
19 May 2020
Optimizing for the Future in Non-Stationary MDPs
Optimizing for the Future in Non-Stationary MDPs
Yash Chandak
Georgios Theocharous
Shiv Shankar
Martha White
Sridhar Mahadevan
Philip S. Thomas
OffRL
13
65
0
17 May 2020
DeepSoCS: A Neural Scheduler for Heterogeneous System-on-Chip (SoC)
  Resource Scheduling
DeepSoCS: A Neural Scheduler for Heterogeneous System-on-Chip (SoC) Resource Scheduling
Tegg Taekyong Sung
J. Ha
Jeewoo Kim
Alex Yahja
Chae-Bong Sohn
Bo Ryu
21
9
0
15 May 2020
How Do You Act? An Empirical Study to Understand Behavior of Deep
  Reinforcement Learning Agents
How Do You Act? An Empirical Study to Understand Behavior of Deep Reinforcement Learning Agents
Richard Meyes
Moritz Schneider
Tobias Meisen
28
2
0
07 Apr 2020
ACNMP: Skill Transfer and Task Extrapolation through Learning from
  Demonstration and Reinforcement Learning via Representation Sharing
ACNMP: Skill Transfer and Task Extrapolation through Learning from Demonstration and Reinforcement Learning via Representation Sharing
M. Akbulut
Erhan Öztop
M. Yunus Seker
Y. Nagai
Ahmet E. Tekden
Emre Ugur
14
2
0
25 Mar 2020
An empirical investigation of the challenges of real-world reinforcement
  learning
An empirical investigation of the challenges of real-world reinforcement learning
Gabriel Dulac-Arnold
Nir Levine
D. Mankowitz
Jerry Li
Cosmin Paduraru
Sven Gowal
Todd Hester
OffRL
34
120
0
24 Mar 2020
Provably Efficient Safe Exploration via Primal-Dual Policy Optimization
Provably Efficient Safe Exploration via Primal-Dual Policy Optimization
Dongsheng Ding
Xiaohan Wei
Zhuoran Yang
Zhaoran Wang
M. Jovanović
20
159
0
01 Mar 2020
Learning in Markov Decision Processes under Constraints
Learning in Markov Decision Processes under Constraints
Rahul Singh
Abhishek Gupta
Ness B. Shroff
41
27
0
27 Feb 2020
Scalable Multi-Task Imitation Learning with Autonomous Improvement
Scalable Multi-Task Imitation Learning with Autonomous Improvement
Avi Singh
Eric Jang
A. Irpan
Daniel Kappler
Murtaza Dalal
Sergey Levine
Mohi Khansari
Chelsea Finn
48
35
0
25 Feb 2020
Previous
123
Next