1606.01540
OpenAI Gym
5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL
ODL
Papers citing "OpenAI Gym"
50 / 2,578 papers shown
Title
The Medkit-Learn(ing) Environment: Medical Decision Modelling through Simulation
Alex J. Chan
Ioana Bica
Alihan Huyuk
Daniel Jarrett
M. Schaar
88
14
0
08 Jun 2021
Amortized Generation of Sequential Algorithmic Recourses for Black-box Models
Sahil Verma
Keegan E. Hines
John P. Dickerson
94
24
0
07 Jun 2021
3DB: A Framework for Debugging Computer Vision Models
Guillaume Leclerc
Hadi Salman
Andrew Ilyas
Sai H. Vemprala
Logan Engstrom
...
Pengchuan Zhang
Shibani Santurkar
Greg Yang
Ashish Kapoor
Aleksander Madry
120
42
0
07 Jun 2021
Average-Reward Reinforcement Learning with Trust Region Methods
Xiaoteng Ma
Xiao-Jing Tang
Li Xia
Jun Yang
Qianchuan Zhao
61
18
0
07 Jun 2021
Efficient Continuous Control with Double Actors and Regularized Critics
Jiafei Lyu
Xiaoteng Ma
Jiangpeng Yan
Xiu Li
OffRL
47
50
0
06 Jun 2021
Same State, Different Task: Continual Reinforcement Learning without Interference
Samuel Kessler
Jack Parker-Holder
Philip J. Ball
S. Zohren
Stephen J. Roberts
CLL
OffRL
91
47
0
05 Jun 2021
Lifetime policy reuse and the importance of task capacity
David M. Bossens
Adam Sobey
CLL
OffRL
71
3
0
03 Jun 2021
Variational Empowerment as Representation Learning for Goal-Based Reinforcement Learning
Jongwook Choi
Archit Sharma
Honglak Lee
Sergey Levine
S. Gu
DRL
67
21
0
02 Jun 2021
Deep Reinforcement Learning-based UAV Navigation and Control: A Soft Actor-Critic with Hindsight Experience Replay Approach
Myoung-Hoon Lee
Jun Moon
40
8
0
02 Jun 2021
What Matters for Adversarial Imitation Learning?
Manu Orsini
Anton Raichuk
Léonard Hussenot
Damien Vincent
Robert Dadashi
Sertan Girgin
Matthieu Geist
Olivier Bachem
Olivier Pietquin
Marcin Andrychowicz
126
78
0
01 Jun 2021
Did I do that? Blame as a means to identify controlled effects in reinforcement learning
Oriol Corcoll
Youssef Mohamed
Raul Vicente
60
3
0
01 Jun 2021
AppBuddy: Learning to Accomplish Tasks in Mobile Apps via Reinforcement Learning
Maayan Shvo
Zhiming Hu
Rodrigo Toro Icarte
Iqbal Mohomed
A. Jepson
Sheila A. McIlraith
92
14
0
31 May 2021
Q-attention: Enabling Efficient Learning for Vision-based Robotic Manipulation
Stephen James
Andrew J. Davison
87
129
0
31 May 2021
Procedural Content Generation: Better Benchmarks for Transfer Reinforcement Learning
Matthias Muller-Brockhausen
Mike Preuss
Aske Plaat
66
9
0
31 May 2021
Shaped Policy Search for Evolutionary Strategies using Waypoints
Kiran Lekkala
Laurent Itti
32
1
0
30 May 2021
Reducing the Deployment-Time Inference Control Costs of Deep Reinforcement Learning Agents via an Asymmetric Architecture
Chin-Jui Chang
Yu-Wei Chu
Chao-Hsien Ting
Hao-Kang Liu
Zhang-Wei Hong
Chun-Yi Lee
AI4CE
36
1
0
30 May 2021
Gradient-Free Neural Network Training via Synaptic-Level Reinforcement Learning
Aman Bhargava
Mohammadreza Rezaei
M. Lankarany
35
5
0
29 May 2021
Towards a Very Large Scale Traffic Simulator for Multi-Agent Reinforcement Learning Testbeds
Zijian Hu
Chengxiang Zhuge
Wei-Ying Ma
AI4CE
20
6
0
28 May 2021
Hyperparameter Selection for Imitation Learning
Léonard Hussenot
Marcin Andrychowicz
Damien Vincent
Robert Dadashi
Anton Raichuk
...
Sabela Ramos
Manu Orsini
Olivier Bachem
Matthieu Geist
Olivier Pietquin
115
18
0
25 May 2021
Affine Transport for Sim-to-Real Domain Adaptation
Anton Mallasto
Karol Arndt
Markus Heinonen
Samuel Kaski
Ville Kyrki
51
4
0
25 May 2021
Ankle Joints Are Beneficial When Optimizing Supported Real-world Bipedal Robot Gaits
Hilmar Elverhoy
Steinar Boe
V. Søyseth
T. Nygaard
30
1
0
22 May 2021
Cross-domain Imitation from Observations
Dripta S. Raychaudhuri
S. Paul
J. Baar
Amit K. Roy-Chowdhury
OOD
82
45
0
20 May 2021
A Stochastic Composite Augmented Lagrangian Method For Reinforcement Learning
Yongfeng Li
Mingming Zhao
Weijie Chen
Zaiwen Wen
50
5
0
20 May 2021
Improved Exploring Starts by Kernel Density Estimation-Based State-Space Coverage Acceleration in Reinforcement Learning
Maximilian Schenke
Oliver Wallscheid
OffRL
34
5
0
19 May 2021
Reinforcement Learning With Sparse-Executing Actions via Sparsity Regularization
Jing-Cheng Pang
Tian Xu
Shengyi Jiang
Yu-Ren Liu
Yang Yu
74
1
0
18 May 2021
DACBench: A Benchmark Library for Dynamic Algorithm Configuration
Theresa Eimer
André Biedenkapp
Maximilian V Reimer
Steven Adriaensen
Frank Hutter
Marius Lindauer
86
29
0
18 May 2021
Reinforcement Learning for Adaptive Video Compressive Sensing
Sidi Lu
Xin Yuan
Aggelos K. Katsaggelos
Weisong Shi
34
3
0
18 May 2021
Make Bipedal Robots Learn How to Imitate
Vishal Kumar
Sinnu Susan Thomas
72
0
0
15 May 2021
Feature-Based Interpretable Reinforcement Learning based on State-Transition Models
Omid Davoodi
Majid Komeili
FAtt
OffRL
61
6
0
14 May 2021
An Open-Source Multi-Goal Reinforcement Learning Environment for Robotic Manipulation with Pybullet
Xintong Yang
Ze Ji
Jing Wu
Yu-kun Lai
58
15
0
12 May 2021
Value Iteration in Continuous Actions, States and Time
M. Lutter
Shie Mannor
Jan Peters
Dieter Fox
Animesh Garg
52
37
0
10 May 2021
Deeply-Debiased Off-Policy Interval Estimation
C. Shi
Runzhe Wan
Victor Chernozhukov
R. Song
OffRL
61
38
0
10 May 2021
Generative Actor-Critic: An Off-policy Algorithm Using the Push-forward Model
Lingwei Peng
Hui Qian
Zhebang Shen
Chao Zhang
Fei Li
68
2
0
08 May 2021
Context-Based Soft Actor Critic for Environments with Non-stationary Dynamics
Yuan Pu
Shaochen Wang
Xin Yao
Bin Li
40
1
0
07 May 2021
UVIP: Model-Free Approach to Evaluate Reinforcement Learning Algorithms
Denis Belomestny
I. Levin
Eric Moulines
A. Naumov
S. Samsonov
V. Zorina
OffRL
46
0
0
05 May 2021
On Lottery Tickets and Minimal Task Representations in Deep Reinforcement Learning
Marc Aurel Vischer
R. T. Lange
Henning Sprekeler
OOD
UQCV
OffRL
91
25
0
04 May 2021
Robotic Surgery With Lean Reinforcement Learning
Yotam Barnoy
Molly O'Brien
Wenjie Wang
Gregory D. Hager
OffRL
74
21
0
03 May 2021
Ensemble Feature Extraction for Multi-Container Quality-Diversity Algorithms
L. Cazenille
73
9
0
03 May 2021
Pedestrian Collision Avoidance for Autonomous Vehicles at Unsignalized Intersection Using Deep Q-Network
Kasra Mokhtari
Alan R. Wagner
38
6
0
01 May 2021
On the Emergence of Whole-body Strategies from Humanoid Robot Push-recovery Learning
Diego Ferigo
Raffaello Camoriano
Paolo Maria Viceconte
Daniele Calandriello
Silvio Traversaro
Lorenzo Rosasco
Daniele Pucci
60
15
0
29 Apr 2021
A Reinforcement Learning Environment for Polyhedral Optimizations
Alexander Brauckmann
Andrés Goens
J. Castrillón
47
7
0
28 Apr 2021
Continual Learning Approach for Improving the Data and Computation Mapping in Near-Memory Processing System
Pritam Majumder
Jiayi Huang
Sungkeun Kim
A. Muzahid
Dylan Siegers
Chia-Che Tsai
Eun Jung Kim
117
1
0
28 Apr 2021
Implementing Reinforcement Learning Algorithms in Retail Supply Chains with OpenAI Gym Toolkit
Shaun C. D'Souza
OffRL
14
2
0
27 Apr 2021
Reinforcement Learning using Guided Observability
Stephan Weigand
Pascal Klink
Jan Peters
Joni Pajarinen
OffRL
31
4
0
22 Apr 2021
XAI-N: Sensor-based Robot Navigation using Expert Policies and Decision Trees
Aaron M. Roth
Jing Liang
Tianyi Zhou
87
8
0
22 Apr 2021
Network Defense is Not a Game
Andres Molina-Markham
Ransom K. Winder
Ahmad Ridley
AAML
40
14
0
20 Apr 2021
GDDR: GNN-based Data-Driven Routing
Oliver Hope
Eiko Yoneki
GNN
42
24
0
20 Apr 2021
Model-predictive control and reinforcement learning in multi-energy system case studies
Glenn Ceusters
Román Cantú Rodríguez
A. García
R. Franke
Geert Deconinck
L. Helsen
Ann Nowé
M. Messagie
L. R. Camargo
55
90
0
20 Apr 2021
Probabilistic Mixture-of-Experts for Efficient Deep Reinforcement Learning
Jie Ren
Yewen Li
Zihan Ding
Wei Pan
Hao Dong
BDL
MoE
52
26
0
19 Apr 2021
Low-rank State-action Value-function Approximation
Sergio Rozada
Victor M. Tenorio
A. Marques
OffRL
79
9
0
18 Apr 2021