1606.01540
Cited By
OpenAI Gym
5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL
ODL
Papers citing
"OpenAI Gym"
50 / 2,578 papers shown
Title
Visual Adversarial Imitation Learning using Variational Models
Rafael Rafailov
Tianhe Yu
Aravind Rajeswaran
Chelsea Finn
SSL
116
50
0
16 Jul 2021
PC-MLP: Model-based Reinforcement Learning with Policy Cover Guided Exploration
Yuda Song
Wen Sun
112
21
0
15 Jul 2021
MURAL: Meta-Learning Uncertainty-Aware Rewards for Outcome-Driven Reinforcement Learning
Kevin Wenliang Li
Abhishek Gupta
Ashwin Reddy
Vitchyr H. Pong
Aurick Zhou
Justin Yu
Sergey Levine
UQCV
71
31
0
15 Jul 2021
Scalable Evaluation of Multi-Agent Reinforcement Learning with Melting Pot
Joel Z Leibo
Edgar A. Duénez-Guzmán
A. Vezhnevets
J. Agapiou
P. Sunehag
Raphael Köster
Jayd Matyas
Charlie Beattie
Igor Mordatch
T. Graepel
OffRL
93
111
0
14 Jul 2021
A Deep Reinforcement Learning Approach for Traffic Signal Control Optimization
Zhenning Li
Chengzhong Xu
Guohui Zhang
28
6
0
13 Jul 2021
Carle's Game: An Open-Ended Challenge in Exploratory Machine Creativity
Q. Davis
AI4CE
47
2
0
13 Jul 2021
Out-of-Distribution Dynamics Detection: RL-Relevant Benchmarks and Results
Mohamad H. Danesh
Alan Fern
152
14
0
11 Jul 2021
NVCell: Standard Cell Layout in Advanced Technology Nodes with Reinforcement Learning
Haoxing Ren
Matthew R. Fojtik
Brucek Khailany
OffRL
71
13
0
09 Jul 2021
Offline Meta-Reinforcement Learning with Online Self-Supervision
Vitchyr H. Pong
Ashvin Nair
Laura M. Smith
Catherine Huang
Sergey Levine
OffRL
152
67
0
08 Jul 2021
Computational Benefits of Intermediate Rewards for Goal-Reaching Policy Learning
Yuexiang Zhai
Christina Baek
Zhengyuan Zhou
Jiantao Jiao
Yi-An Ma
85
23
0
08 Jul 2021
Imitation by Predicting Observations
Andrew Jaegle
Yury Sulsky
Arun Ahuja
Jake Bruce
Rob Fergus
Greg Wayne
41
12
0
08 Jul 2021
Towards Autonomous Pipeline Inspection with Hierarchical Reinforcement Learning
N. Botteghi
L.J.L. Grefte
M. Poel
B. Sirmaçek
C. Brune
Edwin Dertien
Stefano Stramigioli
57
4
0
08 Jul 2021
Quadruped Locomotion on Non-Rigid Terrain using Reinforcement Learning
Taehei Kim
Sung-Hee Lee
62
4
0
07 Jul 2021
Reward-Adaptive Reinforcement Learning: Dynamic Policy Gradient Optimization for Bipedal Locomotion
Changxin Huang
Guangrun Wang
Zhibo Zhou
Ronghui Zhang
Liang Lin
76
20
0
05 Jul 2021
Sample Efficient Reinforcement Learning via Model-Ensemble Exploration and Exploitation
Yaowen Yao
Li Xiao
Zhicheng An
Wanpeng Zhang
Dijun Luo
101
21
0
05 Jul 2021
Examining average and discounted reward optimality criteria in reinforcement learning
Vektor Dewanto
M. Gallagher
OffRL
57
17
0
03 Jul 2021
Experience-Driven PCG via Reinforcement Learning: A Super Mario Bros Study
Tianye Shu
Jialin Liu
Georgios N. Yannakakis
94
41
0
30 Jun 2021
Limited depth bandit-based strategy for Monte Carlo planning in continuous action spaces
R. Quinteiro
Francisco S. Melo
P. A. Santos
26
1
0
29 Jun 2021
Curious Explorer: a provable exploration strategy in Policy Learning
M. Miani
Maurizio Parton
M. Romito
126
0
0
29 Jun 2021
Action Set Based Policy Optimization for Safe Power Grid Management
Bo Zhou
Hongsheng Zeng
Yuecheng Liu
Kejiao Li
Fan Wang
Hao Tian
56
21
0
29 Jun 2021
Compositional Reinforcement Learning from Logical Specifications
Kishor Jothimurugan
Suguman Bansal
Osbert Bastani
Rajeev Alur
CoGe
135
81
0
25 Jun 2021
Closed-form Continuous-time Neural Models
Ramin Hasani
Mathias Lechner
Alexander Amini
Lucas Liebenwein
Aaron Ray
Max Tschaikowski
G. Teschl
Daniela Rus
PINN
AI4TS
109
92
0
25 Jun 2021
Branch Prediction as a Reinforcement Learning Problem: Why, How and Case Studies
Anastasios Zouzias
Kleovoulos Kalaitzidis
Boris Grot
OffRL
23
9
0
25 Jun 2021
Bayesian Optimization with High-Dimensional Outputs
Wesley J. Maddox
Maximilian Balandat
A. Wilson
E. Bakshy
UQCV
84
51
0
24 Jun 2021
Towards Exploiting Geometry and Time for Fast Off-Distribution Adaptation in Multi-Task Robot Learning
K.R. Zentner
Ryan Julian
Ujjwal Puri
Yulun Zhang
Gaurav Sukhatme
27
0
0
24 Jun 2021
Evolving Hierarchical Memory-Prediction Machines in Multi-Task Reinforcement Learning
Stephen Kelly
Tatiana Voegerl
W. Banzhaf
C. Gondro
76
13
0
23 Jun 2021
Bregman Gradient Policy Optimization
Feihu Huang
Shangqian Gao
Heng-Chiao Huang
164
16
0
23 Jun 2021
Policy Smoothing for Provably Robust Reinforcement Learning
Aounon Kumar
Alexander Levine
Soheil Feizi
AAML
127
59
0
21 Jun 2021
Analytically Tractable Bayesian Deep Q-Learning
Luong Ha
L. Nguyen
J. Goulet
BDL
OffRL
35
2
0
21 Jun 2021
Scalable Safety-Critical Policy Evaluation with Accelerated Rare Event Sampling
Mengdi Xu
Peide Huang
Fengpei Li
Jiacheng Zhu
Xuewei Qi
K. Oguchi
Zhiyuan Huang
Henry Lam
Ding Zhao
51
4
0
19 Jun 2021
Scenic4RL: Programmatic Modeling and Generation of Reinforcement Learning Environments
Abdus Salam Azad
Edward Kim
M. Wu
Kimin Lee
Ion Stoica
Pieter Abbeel
Sanjit A. Seshia
54
7
0
18 Jun 2021
Learning from Demonstration without Demonstrations
Tom Blau
Gilad Francis
Philippe Morere
OffRL
44
1
0
17 Jun 2021
Mungojerrie: Reinforcement Learning of Linear-Time Objectives
E. M. Hahn
Mateo Perez
S. Schewe
Fabio Somenzi
Ashutosh Trivedi
D. Wojtczak
64
10
0
16 Jun 2021
Deep Reinforcement Learning for Conservation Decisions
Marcus Lapeyrolerie
Melissa S. Chapman
Kari E. A. Norman
C. Boettiger
OffRL
124
18
0
15 Jun 2021
Population-coding and Dynamic-neurons improved Spiking Actor Network for Reinforcement Learning
Duzhen Zhang
Tielin Zhang
Shuncheng Jia
Xiang Cheng
Bo Xu
AI4CE
73
1
0
15 Jun 2021
Randomized Exploration for Reinforcement Learning with General Value Function Approximation
Haque Ishfaq
Qiwen Cui
V. Nguyen
Alex Ayoub
Zhuoran Yang
Zhaoran Wang
Doina Precup
Lin F. Yang
96
48
0
15 Jun 2021
rSoccer: A Framework for Studying Reinforcement Learning in Small and Very Small Size Robot Soccer
Felipe Bezerra Martins
Mateus G. Machado
H. Bassani
Pedro H. M. Braga
Edna N. S. Barros
32
16
0
15 Jun 2021
Poisoning Deep Reinforcement Learning Agents with In-Distribution Triggers
C. Ashcraft
Kiran Karra
57
28
0
14 Jun 2021
A Minimalist Approach to Offline Reinforcement Learning
Scott Fujimoto
S. Gu
OffRL
145
834
0
12 Jun 2021
Recomposing the Reinforcement Learning Building Blocks with Hypernetworks
Shai Keynan
Elad Sarafian
Sarit Kraus
OffRL
97
30
0
12 Jun 2021
Model-free Reinforcement Learning for Branching Markov Decision Processes
E. M. Hahn
Mateo Perez
S. Schewe
Fabio Somenzi
Ashutosh Trivedi
D. Wojtczak
AI4CE
8
1
0
12 Jun 2021
Preferential Temporal Difference Learning
N. Anand
Doina Precup
OOD
48
9
0
11 Jun 2021
Safe Reinforcement Learning with Linear Function Approximation
Sanae Amani
Christos Thrampoulidis
Lin F. Yang
73
36
0
11 Jun 2021
Adversarial Option-Aware Hierarchical Imitation Learning
Mingxuan Jing
Wenbing Huang
F. Sun
Xiaojian Ma
Tao Kong
Chuang Gan
Lei Li
GAN
63
24
0
10 Jun 2021
Reinforcement Learning for Industrial Control Network Cyber Security Orchestration
John Mern
Kyle Hatch
Ryan Silva
J. Brush
Mykel J. Kochenderfer
65
4
0
09 Jun 2021
Taxonomy of Machine Learning Safety: A Survey and Primer
Sina Mohseni
Haotao Wang
Zhiding Yu
Chaowei Xiao
Zhangyang Wang
J. Yadawa
95
32
0
09 Jun 2021
Safe Deep Q-Network for Autonomous Vehicles at Unsignalized Intersection
Kasra Mokhtari
Alan R. Wagner
62
9
0
08 Jun 2021
Towards Practical Credit Assignment for Deep Reinforcement Learning
Vyacheslav Alipov
Riley Simmons-Edler
N.Yu. Putintsev
Pavel Kalinin
Dmitry Vetrov
OffRL
80
11
0
08 Jun 2021
There Is No Turning Back: A Self-Supervised Approach for Reversibility-Aware Reinforcement Learning
Nathan Grinsztajn
Johan Ferret
Olivier Pietquin
Philippe Preux
Matthieu Geist
SSL
123
14
0
08 Jun 2021
Exploration and preference satisfaction trade-off in reward-free learning
Noor Sajid
P. Tigas
Alexey Zakharov
Zafeirios Fountas
Karl J. Friston
91
20
0
08 Jun 2021