Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1606.01540
Cited By
OpenAI Gym
5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL
ODL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"OpenAI Gym"
50 / 2,578 papers shown
Title
Learning Compositional Neural Programs for Continuous Control
Thomas Pierrot
Nicolas Perrin
Feryal M. P. Behbahani
Alexandre Laterre
Olivier Sigaud
Karim Beguir
Nando de Freitas
CLL
95
4
0
27 Jul 2020
Weak Human Preference Supervision For Deep Reinforcement Learning
Zehong Cao
Kaichiu Wong
Chin-Teng Lin
60
5
0
25 Jul 2020
Clinician-in-the-Loop Decision Making: Reinforcement Learning with Near-Optimal Set-Valued Policies
Shengpu Tang
Aditya Modi
Michael Sjoding
Jenna Wiens
OffRL
68
26
0
24 Jul 2020
Maximum Mutation Reinforcement Learning for Scalable Control
Karush Suri
Xiaolong Shi
Konstantinos N. Plataniotis
Y. Lawryshyn
82
4
0
24 Jul 2020
Distributional Reinforcement Learning via Moment Matching
Thanh Tang Nguyen
Sunil R. Gupta
Svetha Venkatesh
OOD
22
22
0
24 Jul 2020
Adaptive Traffic Control with Deep Reinforcement Learning: Towards State-of-the-art and Beyond
Siavash Alemzadeh
Ramin Moslemi
Ratnesh K. Sharma
M. Mesbahi
OffRL
36
5
0
21 Jul 2020
Soft Expert Reward Learning for Vision-and-Language Navigation
Hu Wang
Qi Wu
Chunhua Shen
55
51
0
21 Jul 2020
Integrating Deep Reinforcement Learning Networks with Health System Simulations
Michael Allen
T. Monks
AI4CE
41
4
0
21 Jul 2020
Battlesnake Challenge: A Multi-agent Reinforcement Learning Playground with Human-in-the-loop
Jonathan Chung
Anna Luo
Xavier Raffin
Scott Perry
OffRL
52
3
0
20 Jul 2020
Multi-agent Reinforcement Learning in Bayesian Stackelberg Markov Games for Adaptive Moving Target Defense
Sailik Sengupta
S. Kambhampati
AAML
63
46
0
20 Jul 2020
A Hierarchical Approach to Scaling Batch Active Search Over Structured Data
Vivek Myers
Peyton Greenside
74
1
0
20 Jul 2020
Multi-robot Cooperative Object Transportation using Decentralized Deep Reinforcement Learning
Lin Zhang
Hao Xiong
Ou Ma
Zhaokui Wang
109
6
0
17 Jul 2020
CoNES: Convex Natural Evolutionary Strategies
Sushant Veer
Anirudha Majumdar
74
3
0
16 Jul 2020
PC-PG: Policy Cover Directed Exploration for Provable Policy Gradient Learning
Alekh Agarwal
Mikael Henaff
Sham Kakade
Wen Sun
OffRL
94
110
0
16 Jul 2020
DRIFT: Deep Reinforcement Learning for Functional Software Testing
Luke R. Harries
Rebekah Storan Clarke
Timothy Chapman
Swamy V. P. L. N. Nallamalli
Levent Özgür
...
Aaron Dietrich
José Miguel Hernández-Lobato
Tom Ellis
Cheng Zhang
K. Ciosek
VLM
OffRL
24
14
0
16 Jul 2020
Analysis of Q-learning with Adaptation and Momentum Restart for Gradient Descent
Chuhan Wu
Fangzhao Wu
Tao Qi
Yongfeng Huang
51
25
0
15 Jul 2020
Learning to Sample with Local and Global Contexts in Experience Replay Buffer
Youngmin Oh
Kimin Lee
Jinwoo Shin
Eunho Yang
Sung Ju Hwang
OffRL
66
16
0
14 Jul 2020
Efficient Empowerment Estimation for Unsupervised Stabilization
Ruihan Zhao
Kevin Lu
Pieter Abbeel
Stas Tiomkin
58
8
0
14 Jul 2020
Explore and Explain: Self-supervised Navigation and Recounting
Roberto Bigazzi
Federico Landi
Marcella Cornia
S. Cascianelli
Lorenzo Baraldi
Rita Cucchiara
EgoV
LM&Ro
78
17
0
14 Jul 2020
Robustifying Reinforcement Learning Agents via Action Space Adversarial Training
Kai Liang Tan
Yasaman Esfandiari
Xian Yeow Lee
Aakanksha
Soumik Sarkar
AAML
135
57
0
14 Jul 2020
DinerDash Gym: A Benchmark for Policy Learning in High-Dimensional Action Space
Siwei Chen
Xiao Ma
David Hsu
45
3
0
13 Jul 2020
OtoWorld: Towards Learning to Separate by Learning to Move
Omkar Ranadive
Grant Gasser
David Terpay
Prem Seetharaman
47
1
0
12 Jul 2020
An Equivalence between Loss Functions and Non-Uniform Sampling in Experience Replay
Scott Fujimoto
David Meger
Doina Precup
96
58
0
12 Jul 2020
Learning Retrospective Knowledge with Reverse Reinforcement Learning
Shangtong Zhang
Vivek Veeriah
Shimon Whiteson
OffRL
AI4TS
76
13
0
09 Jul 2020
ThreeDWorld: A Platform for Interactive Multi-Modal Physical Simulation
Chuang Gan
Jeremy Schwartz
S. Alter
Damian Mrowca
Martin Schrimpf
...
Antonio Torralba
J. DiCarlo
J. Tenenbaum
Josh H. McDermott
Daniel L. K. Yamins
VGen
177
317
0
09 Jul 2020
SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning
Kimin Lee
Michael Laskin
A. Srinivas
Pieter Abbeel
OffRL
119
205
0
09 Jul 2020
Task-Agnostic Exploration via Policy Gradient of a Non-Parametric State Entropy Estimate
Mirco Mutti
Lorenzo Pratissoli
Marcello Restelli
73
19
0
09 Jul 2020
Adaptive Regret for Control of Time-Varying Dynamics
Paula Gradu
Elad Hazan
Edgar Minasyan
97
47
0
08 Jul 2020
Double Prioritized State Recycled Experience Replay
Fanchen Bu
D. Chang
OffRL
40
11
0
08 Jul 2020
Guided Exploration with Proximal Policy Optimization using a Single Demonstration
Gabriele Libardi
Gianni De Fabritiis
58
24
0
07 Jul 2020
Natural Emergence of Heterogeneous Strategies in Artificially Intelligent Competitive Teams
A. Deka
Katia Sycara
AAML
124
32
0
06 Jul 2020
robo-gym -- An Open Source Toolkit for Distributed Deep Reinforcement Learning on Real and Simulated Robots
M. Lucchi
Friedemann Zindler
Stephan Mühlbacher-Karrer
Horst Pichler
OffRL
86
30
0
06 Jul 2020
Explaining Fast Improvement in Online Imitation Learning
Xinyan Yan
Byron Boots
Ching-An Cheng
OnRL
72
1
0
06 Jul 2020
Bidirectional Model-based Policy Optimization
Hang Lai
Jian Shen
Weinan Zhang
Yong Yu
76
58
0
04 Jul 2020
Meta-SAC: Auto-tune the Entropy Temperature of Soft Actor-Critic via Metagradient
Yufei Wang
Tianwei Ni
70
21
0
03 Jul 2020
Unsupervised Learning of Lagrangian Dynamics from Images for Prediction and Control
Yaofeng Desmond Zhong
Naomi Ehrich Leonard
DRL
AI4CE
95
43
0
03 Jul 2020
Temporal-Logic-Based Reward Shaping for Continuing Reinforcement Learning Tasks
Yuqian Jiang
Suda Bharadwaj
Bo Wu
Rishi Shah
Ufuk Topcu
Peter Stone
CLL
OffRL
LRM
58
43
0
03 Jul 2020
Policy Improvement via Imitation of Multiple Oracles
Ching-An Cheng
Andrey Kolobov
Alekh Agarwal
44
5
0
01 Jul 2020
Falsification-Based Robust Adversarial Reinforcement Learning
Xiao Wang
Saasha Nair
Matthias Althoff
AAML
64
19
0
01 Jul 2020
Convex Regularization in Monte-Carlo Tree Search
Tuan Dam
Carlo DÉramo
Jan Peters
Joni Pajarinen
OffRL
77
11
0
01 Jul 2020
Group Equivariant Deep Reinforcement Learning
Arnab Kumar Mondal
Pratheeksha Nair
Kaleem Siddiqi
61
34
0
01 Jul 2020
Enforcing Almost-Sure Reachability in POMDPs
Sebastian Junges
N. Jansen
Sanjit A. Seshia
61
26
0
30 Jun 2020
MDP Homomorphic Networks: Group Symmetries in Reinforcement Learning
Elise van der Pol
Daniel E. Worrall
H. V. Hoof
F. Oliehoek
Max Welling
BDL
AI4CE
111
164
0
30 Jun 2020
Model-based Reinforcement Learning: A Survey
Thomas M. Moerland
Joost Broekens
Aske Plaat
Catholijn M. Jonker
OffRL
132
49
0
30 Jun 2020
Extracting Latent State Representations with Linear Dynamics from Rich Observations
Abraham Frandsen
Rong Ge
30
2
0
29 Jun 2020
Concept and the implementation of a tool to convert industry 4.0 environments modeled as FSM to an OpenAI Gym wrapper
Kallil M. C. Zielinski
Marcelo Teixeira
Richardson Ribeiro
Dalcimar Casanova
OffRL
33
1
0
29 Jun 2020
Using Reinforcement Learning to Herd a Robotic Swarm to a Target Distribution
Z. Kakish
Karthik Elamvazhuthi
Spring Berman
32
8
0
29 Jun 2020
Deep Bayesian Quadrature Policy Optimization
Akella Ravi Tej
Kamyar Azizzadenesheli
Mohammad Ghavamzadeh
Anima Anandkumar
Yisong Yue
62
5
0
28 Jun 2020
DDPG++: Striving for Simplicity in Continuous-control Off-Policy Reinforcement Learning
Rasool Fakoor
Pratik Chaudhari
Alex Smola
OffRL
39
4
0
26 Jun 2020
Intrinsic Reward Driven Imitation Learning via Generative Model
Xingrui Yu
Yueming Lyu
Ivor W. Tsang
45
55
0
26 Jun 2020
Previous
1
2
3
...
34
35
36
...
50
51
52
Next