ResearchTrend.AI
OpenAI Gym

5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL, ODL

Papers citing "OpenAI Gym"

50 / 2,578 papers shown
The Medkit-Learn(ing) Environment: Medical Decision Modelling through Simulation
Alex J. Chan
Ioana Bica
Alihan Huyuk
Daniel Jarrett
M. Schaar
88
14
0
08 Jun 2021
Amortized Generation of Sequential Algorithmic Recourses for Black-box Models
Sahil Verma
Keegan E. Hines
John P. Dickerson
94
24
0
07 Jun 2021
3DB: A Framework for Debugging Computer Vision Models
Guillaume Leclerc
Hadi Salman
Andrew Ilyas
Sai H. Vemprala
Logan Engstrom
...
Pengchuan Zhang
Shibani Santurkar
Greg Yang
Ashish Kapoor
Aleksander Madry
120
42
0
07 Jun 2021
Average-Reward Reinforcement Learning with Trust Region Methods
Xiaoteng Ma
Xiao-Jing Tang
Li Xia
Jun Yang
Qianchuan Zhao
61
18
0
07 Jun 2021
Efficient Continuous Control with Double Actors and Regularized Critics
Jiafei Lyu
Xiaoteng Ma
Jiangpeng Yan
Xiu Li
OffRL
47
50
0
06 Jun 2021
Same State, Different Task: Continual Reinforcement Learning without Interference
Samuel Kessler
Jack Parker-Holder
Philip J. Ball
S. Zohren
Stephen J. Roberts
CLL, OffRL
91
47
0
05 Jun 2021
Lifetime policy reuse and the importance of task capacity
David M. Bossens
Adam Sobey
CLL, OffRL
71
3
0
03 Jun 2021
Variational Empowerment as Representation Learning for Goal-Based Reinforcement Learning
Jongwook Choi
Archit Sharma
Honglak Lee
Sergey Levine
S. Gu
DRL
67
21
0
02 Jun 2021
Deep Reinforcement Learning-based UAV Navigation and Control: A Soft Actor-Critic with Hindsight Experience Replay Approach
Myoung-Hoon Lee
Jun Moon
40
8
0
02 Jun 2021
What Matters for Adversarial Imitation Learning?
Manu Orsini
Anton Raichuk
Léonard Hussenot
Damien Vincent
Robert Dadashi
Sertan Girgin
Matthieu Geist
Olivier Bachem
Olivier Pietquin
Marcin Andrychowicz
126
78
0
01 Jun 2021
Did I do that? Blame as a means to identify controlled effects in reinforcement learning
Oriol Corcoll
Youssef Mohamed
Raul Vicente
60
3
0
01 Jun 2021
AppBuddy: Learning to Accomplish Tasks in Mobile Apps via Reinforcement Learning
Maayan Shvo
Zhiming Hu
Rodrigo Toro Icarte
Iqbal Mohomed
A. Jepson
Sheila A. McIlraith
92
14
0
31 May 2021
Q-attention: Enabling Efficient Learning for Vision-based Robotic Manipulation
Stephen James
Andrew J. Davison
87
129
0
31 May 2021
Procedural Content Generation: Better Benchmarks for Transfer Reinforcement Learning
Matthias Muller-Brockhausen
Mike Preuss
Aske Plaat
66
9
0
31 May 2021
Shaped Policy Search for Evolutionary Strategies using Waypoints
Kiran Lekkala
Laurent Itti
32
1
0
30 May 2021
Reducing the Deployment-Time Inference Control Costs of Deep Reinforcement Learning Agents via an Asymmetric Architecture
Chin-Jui Chang
Yu-Wei Chu
Chao-Hsien Ting
Hao-Kang Liu
Zhang-Wei Hong
Chun-Yi Lee
AI4CE
36
1
0
30 May 2021
Gradient-Free Neural Network Training via Synaptic-Level Reinforcement Learning
Aman Bhargava
Mohammadreza Rezaei
M. Lankarany
35
5
0
29 May 2021
Towards a Very Large Scale Traffic Simulator for Multi-Agent Reinforcement Learning Testbeds
Zijian Hu
Chengxiang Zhuge
Wei-Ying Ma
AI4CE
20
6
0
28 May 2021
Hyperparameter Selection for Imitation Learning
Léonard Hussenot
Marcin Andrychowicz
Damien Vincent
Robert Dadashi
Anton Raichuk
...
Sabela Ramos
Manu Orsini
Olivier Bachem
Matthieu Geist
Olivier Pietquin
115
18
0
25 May 2021
Affine Transport for Sim-to-Real Domain Adaptation
Anton Mallasto
Karol Arndt
Markus Heinonen
Samuel Kaski
Ville Kyrki
51
4
0
25 May 2021
Ankle Joints Are Beneficial When Optimizing Supported Real-world Bipedal Robot Gaits
Hilmar Elverhoy
Steinar Boe
V. Søyseth
T. Nygaard
30
1
0
22 May 2021
Cross-domain Imitation from Observations
Dripta S. Raychaudhuri
S. Paul
J. Baar
Amit K. Roy-Chowdhury
OOD
82
45
0
20 May 2021
A Stochastic Composite Augmented Lagrangian Method For Reinforcement Learning
Yongfeng Li
Mingming Zhao
Weijie Chen
Zaiwen Wen
50
5
0
20 May 2021
Improved Exploring Starts by Kernel Density Estimation-Based State-Space Coverage Acceleration in Reinforcement Learning
Maximilian Schenke
Oliver Wallscheid
OffRL
34
5
0
19 May 2021
Reinforcement Learning With Sparse-Executing Actions via Sparsity Regularization
Jing-Cheng Pang
Tian Xu
Shengyi Jiang
Yu-Ren Liu
Yang Yu
74
1
0
18 May 2021
DACBench: A Benchmark Library for Dynamic Algorithm Configuration
Theresa Eimer
André Biedenkapp
Maximilian V Reimer
Steven Adriaensen
Frank Hutter
Marius Lindauer
86
29
0
18 May 2021
Reinforcement Learning for Adaptive Video Compressive Sensing
Sidi Lu
Xin Yuan
Aggelos K. Katsaggelos
Weisong Shi
34
3
0
18 May 2021
Make Bipedal Robots Learn How to Imitate
Vishal Kumar
Sinnu Susan Thomas
72
0
0
15 May 2021
Feature-Based Interpretable Reinforcement Learning based on State-Transition Models
Omid Davoodi
Majid Komeili
FAtt, OffRL
61
6
0
14 May 2021
An Open-Source Multi-Goal Reinforcement Learning Environment for Robotic Manipulation with Pybullet
Xintong Yang
Ze Ji
Jing Wu
Yu-kun Lai
58
15
0
12 May 2021
Value Iteration in Continuous Actions, States and Time
M. Lutter
Shie Mannor
Jan Peters
Dieter Fox
Animesh Garg
52
37
0
10 May 2021
Deeply-Debiased Off-Policy Interval Estimation
C. Shi
Runzhe Wan
Victor Chernozhukov
R. Song
OffRL
61
38
0
10 May 2021
Generative Actor-Critic: An Off-policy Algorithm Using the Push-forward Model
Lingwei Peng
Hui Qian
Zhebang Shen
Chao Zhang
Fei Li
68
2
0
08 May 2021
Context-Based Soft Actor Critic for Environments with Non-stationary Dynamics
Yuan Pu
Shaochen Wang
Xin Yao
Bin Li
40
1
0
07 May 2021
UVIP: Model-Free Approach to Evaluate Reinforcement Learning Algorithms
Denis Belomestny
I. Levin
Eric Moulines
A. Naumov
S. Samsonov
V. Zorina
OffRL
46
0
0
05 May 2021
On Lottery Tickets and Minimal Task Representations in Deep Reinforcement Learning
Marc Aurel Vischer
R. T. Lange
Henning Sprekeler
OOD, UQCV, OffRL
91
25
0
04 May 2021
Robotic Surgery With Lean Reinforcement Learning
Yotam Barnoy
Molly O'Brien
Wenjie Wang
Gregory D. Hager
OffRL
74
21
0
03 May 2021
Ensemble Feature Extraction for Multi-Container Quality-Diversity Algorithms
L. Cazenille
73
9
0
03 May 2021
Pedestrian Collision Avoidance for Autonomous Vehicles at Unsignalized Intersection Using Deep Q-Network
Kasra Mokhtari
Alan R. Wagner
38
6
0
01 May 2021
On the Emergence of Whole-body Strategies from Humanoid Robot Push-recovery Learning
Diego Ferigo
Raffaello Camoriano
Paolo Maria Viceconte
Daniele Calandriello
Silvio Traversaro
Lorenzo Rosasco
Daniele Pucci
60
15
0
29 Apr 2021
A Reinforcement Learning Environment for Polyhedral Optimizations
Alexander Brauckmann
Andrés Goens
J. Castrillón
47
7
0
28 Apr 2021
Continual Learning Approach for Improving the Data and Computation Mapping in Near-Memory Processing System
Pritam Majumder
Jiayi Huang
Sungkeun Kim
A. Muzahid
Dylan Siegers
Chia-Che Tsai
Eun Jung Kim
117
1
0
28 Apr 2021
Implementing Reinforcement Learning Algorithms in Retail Supply Chains with OpenAI Gym Toolkit
Shaun C. D'Souza
OffRL
14
2
0
27 Apr 2021
Reinforcement Learning using Guided Observability
Stephan Weigand
Pascal Klink
Jan Peters
Joni Pajarinen
OffRL
31
4
0
22 Apr 2021
XAI-N: Sensor-based Robot Navigation using Expert Policies and Decision Trees
Aaron M. Roth
Jing Liang
Tianyi Zhou
87
8
0
22 Apr 2021
Network Defense is Not a Game
Andres Molina-Markham
Ransom K. Winder
Ahmad Ridley
AAML
40
14
0
20 Apr 2021
GDDR: GNN-based Data-Driven Routing
Oliver Hope
Eiko Yoneki
GNN
42
24
0
20 Apr 2021
Model-predictive control and reinforcement learning in multi-energy system case studies
Glenn Ceusters
Román Cantú Rodríguez
A. García
R. Franke
Geert Deconinck
L. Helsen
Ann Nowé
M. Messagie
L. R. Camargo
55
90
0
20 Apr 2021
Probabilistic Mixture-of-Experts for Efficient Deep Reinforcement Learning
Jie Ren
Yewen Li
Zihan Ding
Wei Pan
Hao Dong
BDL, MoE
52
26
0
19 Apr 2021
Low-rank State-action Value-function Approximation
Sergio Rozada
Victor M. Tenorio
A. Marques
OffRL
79
9
0
18 Apr 2021