Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1606.01540
Cited By
OpenAI Gym
5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL
ODL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"OpenAI Gym"
50 / 2,578 papers shown
Title
Distributional Soft Actor-Critic: Off-Policy Reinforcement Learning for Addressing Value Estimation Errors
Jingliang Duan
Yang Guan
Shengbo Eben Li
Yangang Ren
B. Cheng
OffRL
88
183
0
09 Jan 2020
Sample-based Distributional Policy Gradient
Rahul Singh
Keuntaek Lee
Yongxin Chen
64
19
0
08 Jan 2020
Blue River Controls: A toolkit for Reinforcement Learning Control Systems on Hardware
Kirill Polzounov
R. Sundar
L. Redden
38
10
0
07 Jan 2020
Reanalysis of Variance Reduced Temporal Difference Learning
Tengyu Xu
Zhe Wang
Yi Zhou
Yingbin Liang
OffRL
104
39
0
07 Jan 2020
Learning Reusable Options for Multi-Task Reinforcement Learning
Francisco M. Garcia
Chris Nota
Philip S. Thomas
26
4
0
06 Jan 2020
Universal Successor Features for Transfer Reinforcement Learning
Chen Ma
Dylan R. Ashley
Junfeng Wen
Yoshua Bengio
OffRL
58
26
0
05 Jan 2020
Joint Goal and Strategy Inference across Heterogeneous Demonstrators via Reward Network Distillation
Letian Chen
Rohan R. Paleja
Muyleng Ghuy
Matthew C. Gombolay
116
39
0
02 Jan 2020
Model Inversion Networks for Model-Based Optimization
Aviral Kumar
Sergey Levine
OffRL
97
100
0
31 Dec 2019
Uncertainty-Based Out-of-Distribution Classification in Deep Reinforcement Learning
Andreas Sedlmeier
Thomas Gabor
Thomy Phan
Lenz Belzner
Claudia Linnhoff-Popien
46
25
0
31 Dec 2019
Information Theoretic Model Predictive Q-Learning
M. Bhardwaj
Ankur Handa
Dieter Fox
Byron Boots
80
23
0
31 Dec 2019
Augmented Replay Memory in Reinforcement Learning With Continuous Control
Mirza Ramicic
Andrea Bonarini
KELM
CLL
OffRL
28
1
0
29 Dec 2019
Quasi-Newton Trust Region Policy Optimization
Devesh K. Jha
A. Raghunathan
Diego Romeres
57
9
0
26 Dec 2019
Learning an Interpretable Traffic Signal Control Policy
James Ault
Josiah P. Hanna
Guni Sharon
45
50
0
23 Dec 2019
A Survey of Deep Reinforcement Learning in Video Games
Kun Shao
Zhentao Tang
Yuanheng Zhu
Nannan Li
Dongbin Zhao
OffRL
AI4TS
131
193
0
23 Dec 2019
Monte-Carlo Tree Search for Policy Optimization
Xiaobai Ma
Katherine Driggs-Campbell
Zongzhang Zhang
Mykel J. Kochenderfer
114
6
0
23 Dec 2019
Taming an autonomous surface vehicle for path following and collision avoidance using deep reinforcement learning
Eivind Meyer
Haakon Robinson
Adil Rasheed
Omer San
65
66
0
18 Dec 2019
Relational Mimic for Visual Adversarial Imitation Learning
Lionel Blondé
Yichuan Tang
Jian Zhang
Russ Webb
40
0
0
18 Dec 2019
Learning to grow: control of material self-assembly using evolutionary reinforcement learning
S. Whitelam
Isaac Tamblyn
53
34
0
18 Dec 2019
Analysing Deep Reinforcement Learning Agents Trained with Domain Randomisation
Tianhong Dai
Kai Arulkumaran
Tamara Gerbert
Samyakh Tukra
Feryal M. P. Behbahani
Anil Anthony Bharath
87
28
0
18 Dec 2019
Dota 2 with Large Scale Deep Reinforcement Learning
OpenAI OpenAI
:
Christopher Berner
Greg Brockman
Brooke Chan
...
Szymon Sidor
Ilya Sutskever
Jie Tang
Filip Wolski
Susan Zhang
GNN
VLM
CLL
AI4CE
LRM
181
1,839
0
13 Dec 2019
Recruitment-imitation Mechanism for Evolutionary Reinforcement Learning
Shuai Lu
Shuai Han
Wenbo Zhou
Junwei Zhang
72
26
0
13 Dec 2019
Zero-shot generalization using cascaded system-representations
A. Malik
OffRL
24
2
0
11 Dec 2019
Efficacy of Modern Neuro-Evolutionary Strategies for Continuous Control Optimization
Paolo Pagliuca
Nicola Milano
S. Nolfi
80
31
0
11 Dec 2019
Efficient Robotic Task Generalization Using Deep Model Fusion Reinforcement Learning
Tianying Wang
Hao Zhang
Wei Qi Toh
Erik Cambria
Cheston Tan
Yan Wu
Yong Liu
Wei Jing
33
6
0
11 Dec 2019
Marginalized State Distribution Entropy Regularization in Policy Optimization
Riashat Islam
Zafarali Ahmed
Doina Precup
59
17
0
11 Dec 2019
Adversarial recovery of agent rewards from latent spaces of the limit order book
Jacobo Roa-Vicens
Yuanbo Wang
Virgile Mison
Y. Gal
Ricardo M. A. Silva
51
3
0
09 Dec 2019
Learning Latent State Spaces for Planning through Reward Prediction
Aaron J. Havens
Ouyang Yi
P. Nagarajan
Yasuhiro Fujita
36
7
0
09 Dec 2019
Transformer Based Reinforcement Learning For Games
Uddeshya Upadhyay
Nikunj Shah
Sucheta Ravikanti
Mayanka Medhe
OffRL
52
11
0
09 Dec 2019
Value-of-Information based Arbitration between Model-based and Model-free Control
Krishn Bera
Yash Mandilwar
R. Bapi
11
3
0
08 Dec 2019
VALAN: Vision and Language Agent Navigation
L. Lansing
Vihan Jain
Harsh Mehta
Haoshuo Huang
Eugene Ie
LM&Ro
AI4TS
58
8
0
06 Dec 2019
Training Agents using Upside-Down Reinforcement Learning
R. Srivastava
Pranav Shyam
Filipe Wall Mutz
Wojciech Ja'skowski
Jürgen Schmidhuber
OffRL
93
126
0
05 Dec 2019
Reinforcement Learning with Convolutional Reservoir Computing
Hanten Chang
K. Futagami
51
22
0
05 Dec 2019
Learning Human Objectives by Evaluating Hypothetical Behavior
S. Reddy
Anca Dragan
Sergey Levine
Shane Legg
Jan Leike
87
77
0
05 Dec 2019
Optimizing Norm-Bounded Weighted Ambiguity Sets for Robust MDPs
R. Russel
Bahram Behzadian
Marek Petrik
43
3
0
04 Dec 2019
Adaptive Online Planning for Continual Lifelong Learning
Kevin Lu
Igor Mordatch
Pieter Abbeel
OffRL
OnRL
CLL
73
15
0
03 Dec 2019
Flow Rate Control in Smart District Heating Systems Using Deep Reinforcement Learning
Tinghao Zhang
Jing Luo
Ping Chen
Jie Liu
AI4CE
47
5
0
01 Dec 2019
Playing Games in the Dark: An approach for cross-modality transfer in reinforcement learning
Rui Silva
Miguel Vasco
Francisco S. Melo
Ana Paiva
Manuela Veloso
OffRL
41
14
0
28 Nov 2019
LeRoP: A Learning-Based Modular Robot Photography Framework
Hao Kang
Jianming Zhang
Haoxiang Li
Zhe Lin
TJ Rhodes
Bedrich Benes
42
4
0
28 Nov 2019
Adversarial Deep Reinforcement Learning based Adaptive Moving Target Defense
Taha Eghtesad
Yevgeniy Vorobeychik
Aron Laszka
AAML
49
8
0
27 Nov 2019
Biologically inspired architectures for sample-efficient deep reinforcement learning
Pierre Harvey Richemond
Arinbjorn Kolbeinsson
Yike Guo
59
2
0
25 Nov 2019
ORL: Reinforcement Learning Benchmarks for Online Stochastic Optimization Problems
Bharathan Balaji
Jordan Bell-Masterson
Enes Bilgin
Andreas C. Damianou
Pablo Moreno Garcia
Arpit Jain
Runfei Luo
Alvaro Maggiar
Balakrishnan Narayanaswamy
Chun Jimmie Ye
OffRL
63
32
0
24 Nov 2019
DeepSynth: Automata Synthesis for Automatic Task Segmentation in Deep Reinforcement Learning
Mohammadhosein Hasanbeig
N. Jeppu
Alessandro Abate
T. Melham
Daniel Kroening
95
20
0
22 Nov 2019
Actively Learning Gaussian Process Dynamics
Mona Buisson-Fenet
Friedrich Solowjow
Sebastian Trimpe
GP
103
64
0
22 Nov 2019
Sample-Efficient Reinforcement Learning with Maximum Entropy Mellowmax Episodic Control
Marta Sarrico
Kai Arulkumaran
A. Agostinelli
Pierre Harvey Richemond
Anil Anthony Bharath
24
2
0
21 Nov 2019
Memory-Efficient Episodic Control Reinforcement Learning with Dynamic Online k-means
A. Agostinelli
Kai Arulkumaran
Marta Sarrico
Pierre Harvey Richemond
Anil Anthony Bharath
OffRL
22
4
0
21 Nov 2019
Agent Probing Interaction Policies
Siddharth Ghiya
Oluwafemi Azeez
Brendan Miller
20
0
0
21 Nov 2019
Unsupervised Object Segmentation with Explicit Localization Module
Weitang Liu
Lifeng Wei
James Sharpnack
John Douglas Owens
SSeg
23
4
0
21 Nov 2019
Evaluating task-agnostic exploration for fixed-batch learning of arbitrary future tasks
Vibhavari Dasagi
Robert Lee
Jake Bruce
Jurgen Leitner
OffRL
56
2
0
20 Nov 2019
MANGA: Method Agnostic Neural-policy Generalization and Adaptation
Homanga Bharadhwaj
Shoichiro Yamaguchi
S. Maeda
43
4
0
19 Nov 2019
Fuzzy Tiling Activations: A Simple Approach to Learning Sparse Representations Online
Yangchen Pan
Kirby Banman
Martha White
22
0
0
19 Nov 2019
Previous
1
2
3
...
39
40
41
...
50
51
52
Next