arXiv:1606.01540
OpenAI Gym
5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL
ODL
Papers citing "OpenAI Gym"
50 / 2,578 papers shown
Title
Robust Reinforcement Learning as a Stackelberg Game via Adaptively-Regularized Adversarial Training
Peide Huang
Mengdi Xu
Fei Fang
Ding Zhao
155
38
0
19 Feb 2022
Transformation Coding: Simple Objectives for Equivariant Representations
Mehran Shakerinava
A. Mondal
Siamak Ravanbakhsh
OffRL
72
0
0
19 Feb 2022
Shaping Advice in Deep Reinforcement Learning
Baicen Xiao
Bhaskar Ramasubramanian
Radha Poovendran
46
0
0
19 Feb 2022
Design-Bench: Benchmarks for Data-Driven Offline Model-Based Optimization
Brandon Trabucco
Xinyang Geng
Aviral Kumar
Sergey Levine
OffRL
99
102
0
17 Feb 2022
Robust Reinforcement Learning via Genetic Curriculum
Yeeho Song
J. Schneider
71
9
0
17 Feb 2022
Deep Koopman Operator with Control for Nonlinear Systems
Hao-bin Shi
Max Meng
73
79
0
16 Feb 2022
Policy Learning and Evaluation with Randomized Quasi-Monte Carlo
Sébastien M. R. Arnold
P. L'Écuyer
Liyu Chen
Yi-fan Chen
Fei Sha
OffRL
84
4
0
16 Feb 2022
CUP: A Conservative Update Policy Algorithm for Safe Reinforcement Learning
Long Yang
Jiaming Ji
Juntao Dai
Yu Zhang
Pengfei Li
Gang Pan
69
17
0
15 Feb 2022
L2C2: Locally Lipschitz Continuous Constraint towards Stable and Smooth Reinforcement Learning
Taisuke Kobayashi
94
16
0
15 Feb 2022
QuadSim: A Quadcopter Rotational Dynamics Simulation Framework For Reinforcement Learning Algorithms
Burak Han Demirbilek
24
0
0
14 Feb 2022
Strategy Discovery and Mixture in Lifelong Learning from Heterogeneous Demonstration
Sravan Jayanthi
Letian Chen
Matthew C. Gombolay
48
0
0
14 Feb 2022
Autonomous Vehicles on the Edge: A Survey on Autonomous Vehicle Racing
Johannes Betz
Hongrui Zheng
Alexander Liniger
Ugo Rosolia
Phillip Karle
Madhur Behl
Venkat Krovi
Rahul Mangharam
82
232
0
14 Feb 2022
Saute RL: Almost Surely Safe Reinforcement Learning Using State Augmentation
Aivar Sootla
Alexander I. Cowen-Rivers
Taher Jafferjee
Ziyan Wang
D. Mguni
Jun Wang
Haitham Bou-Ammar
130
54
0
14 Feb 2022
Evolving Neural Networks with Optimal Balance between Information Flow and Connections Cost
A. Khalili
A. Bouchachia
82
0
0
12 Feb 2022
End-to-end Reinforcement Learning of Robotic Manipulation with Robust Keypoints Representation
Tianying Wang
En Yen Puang
Marcus Lee
Yongpeng Wu
Wei Jing
SSL
67
5
0
12 Feb 2022
Uncertainty Aware System Identification with Universal Policies
B. L. Semage
Thommen George Karimpanal
Santu Rana
Svetha Venkatesh
105
3
0
11 Feb 2022
Fast Model-based Policy Search for Universal Policy Networks
B. L. Semage
Thommen George Karimpanal
Santu Rana
Svetha Venkatesh
77
1
0
11 Feb 2022
Uncovering Instabilities in Variational-Quantum Deep Q-Networks
Maja Franz
Lucas Wolf
Maniraman Periyasamy
Christian Ufrecht
Daniel D. Scherer
Axel Plinge
Christopher Mutschler
Wolfgang Mauerer
131
30
0
10 Feb 2022
Bayesian Nonparametrics for Offline Skill Discovery
Valentin Villecroze
H. Braviner
Panteha Naderian
Chris J. Maddison
Gabriel Loaiza-Ganem
BDL
OffRL
85
8
0
09 Feb 2022
Contextualize Me -- The Case for Context in Reinforcement Learning
C. Benjamins
Theresa Eimer
Frederik Schubert
Aditya Mohan
Sebastian Dohler
André Biedenkapp
Bodo Rosenhahn
Frank Hutter
Marius Lindauer
OffRL
104
32
0
09 Feb 2022
Scenario-Assisted Deep Reinforcement Learning
Raz Yerushalmi
Guy Amir
Achiya Elyasaf
D. Harel
Guy Katz
Assaf Marron
OffRL
55
13
0
09 Feb 2022
skrl: Modular and Flexible Library for Reinforcement Learning
Antonio Serrano-Muñoz
D. Chrysostomou
Simon Boegh
N. Arana-Arexolaleiba
89
32
0
08 Feb 2022
Approximating Gradients for Differentiable Quality Diversity in Reinforcement Learning
Bryon Tjanaka
Matthew C. Fontaine
Julian Togelius
Stefanos Nikolaidis
86
54
0
08 Feb 2022
Local Explanations for Reinforcement Learning
Ronny Luss
Amit Dhurandhar
Miao Liu
FAtt
OffRL
81
3
0
08 Feb 2022
Transfer Reinforcement Learning for Differing Action Spaces via Q-Network Representations
Nathan Beck
Abhiramon Rajasekharan
H. Tran
24
3
0
05 Feb 2022
Offline Reinforcement Learning for Mobile Notifications
Yiping Yuan
A. Muralidharan
Preetam Nandy
Miao Cheng
Prakruthi Prabhakar
OffRL
61
10
0
04 Feb 2022
SAFE-OCC: A Novelty Detection Framework for Convolutional Neural Network Sensors and its Application in Process Control
J. Pulsipher
Luke D. J. Coutinho
Tyler A. Soderstrom
Victor M. Zavala
HAI
48
7
0
03 Feb 2022
ExPoSe: Combining State-Based Exploration with Gradient-Based Online Search
Dixant Mittal
Siddharth Aravindan
W. Lee
OnRL
48
3
0
03 Feb 2022
A General, Evolution-Inspired Reward Function for Social Robotics
Thomas Kingsford
135
0
0
01 Feb 2022
Finding the optimal human strategy for Wordle using maximum correct letter probabilities and reinforcement learning
B. Anderson
Jesse G. Meyer
87
24
0
01 Feb 2022
Accelerating Deep Reinforcement Learning for Digital Twin Network Optimization with Evolutionary Strategies
Carlos Güemes-Palau
Paul Almasan
Shihan Xiao
Xiangle Cheng
Xiang Shi
Pere Barlet-Ros
A. Cabellos-Aparicio
58
9
0
01 Feb 2022
PAGE-PG: A Simple and Loopless Variance-Reduced Policy Gradient Method with Probabilistic Gradient Estimation
Matilde Gargiani
Andrea Zanelli
Andrea Martinelli
Tyler H. Summers
John Lygeros
78
14
0
01 Feb 2022
Adversarial Imitation Learning from Video using a State Observer
Haresh Karnan
Garrett A. Warnell
F. Torabi
Peter Stone
GAN
108
13
0
01 Feb 2022
CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery
Michael Laskin
Hao Liu
Xue Bin Peng
Denis Yarats
Aravind Rajeswaran
Pieter Abbeel
SSL
164
69
0
01 Feb 2022
You May Not Need Ratio Clipping in PPO
Mingfei Sun
Vitaly Kurin
Guoqing Liu
Sam Devlin
Tao Qin
Katja Hofmann
Shimon Whiteson
62
16
0
31 Jan 2022
ApolloRL: a Reinforcement Learning Platform for Autonomous Driving
Fei Gao
Peng Geng
Jiaqi Guo
YuanQiang Liu
Dingfeng Guo
Yabo Su
Jie Zhou
Xiao Wei
Jin Li
Xu Liu
41
9
0
29 Jan 2022
Bellman Meets Hawkes: Model-Based Reinforcement Learning via Temporal Point Processes
Chao Qu
Jue Chen
Siqiao Xue
Xiaoming Shi
James Y. Zhang
Hongyuan Mei
OffRL
74
18
0
29 Jan 2022
Zeroth-Order Actor-Critic: An Evolutionary Framework for Sequential Decision Problems
Yuheng Lei
Jianyu Chen
Guojian Zhan
Tao Zhang
Jiangtao Li
Jianyu Chen
Shengbo Eben Li
Sifa Zheng
OffRL
82
3
0
29 Jan 2022
Towards Safe Reinforcement Learning with a Safety Editor Policy
Haonan Yu
Wei Xu
Haichao Zhang
OffRL
149
31
0
28 Jan 2022
Using Deep Reinforcement Learning for Zero Defect Smart Forging
Yunpeng Ma
A. Kassler
Bestoun S. Ahmed
P. Krakhmalev
A. Thore
Arash Toyser
Hans Lindback
OffRL
AI4CE
21
4
0
25 Jan 2022
Evolution Gym: A Large-Scale Benchmark for Evolving Soft Robots
Jagdeep Bhatia
Holly Jackson
Yunsheng Tian
Jie Xu
Wojciech Matusik
88
82
0
24 Jan 2022
STOPS: Short-Term-based Volatility-controlled Policy Search and its Global Convergence
Liang Xu
Daoming Lyu
Yangchen Pan
Aiwen Jiang
Bo Liu
95
0
0
24 Jan 2022
Deep Q-learning: a robust control approach
B. Varga
Balázs Kulcsár
M. Chehreghani
OOD
55
11
0
21 Jan 2022
Tensor and Matrix Low-Rank Value-Function Approximation in Reinforcement Learning
Sergio Rozada
Santiago Paternain
A. Marques
101
15
0
21 Jan 2022
DROPO: Sim-to-Real Transfer with Offline Domain Randomization
Gabriele Tiboni
Karol Arndt
Ville Kyrki
63
28
0
20 Jan 2022
Goal-Conditioned Reinforcement Learning: Problems and Solutions
Minghuan Liu
Menghui Zhu
Weinan Zhang
104
144
0
20 Jan 2022
A Deep Learning Approach To Estimation Using Measurements Received Over a Network
S. Agarwal
S. Kaul
Saket Anand
P. B. Sujit
25
0
0
20 Jan 2022
AdaTerm: Adaptive T-Distribution Estimated Robust Moments for Noise-Robust Stochastic Gradient Optimization
Wendyam Eric Lionel Ilboudo
Taisuke Kobayashi
Takamitsu Matsubara
89
13
0
18 Jan 2022
Summarising and Comparing Agent Dynamics with Contrastive Spatiotemporal Abstraction
Tom Bewley
J. Lawry
Arthur G. Richards
46
3
0
17 Jan 2022
Parameterized Convex Universal Approximators for Decision-Making Problems
Jinrae Kim
Youdan Kim
54
5
0
17 Jan 2022