ResearchTrend.AI
OpenAI Gym


5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
    OffRL ODL

Papers citing "OpenAI Gym"

50 / 2,578 papers shown
A reinforcement learning approach to hybrid control design
Meet Gandhi
A. Kundu
S. Bhatnagar
15
0
0
02 Sep 2020
Ranking Policy Decisions
Hadrien Pouget
Hana Chockler
Youcheng Sun
Daniel Kroening
OffRL
49
6
0
31 Aug 2020
Reinforcement Learning with Feedback-modulated TD-STDP
Stephen Chung
R. Kozma
30
3
0
29 Aug 2020
AllenAct: A Framework for Embodied AI Research
Luca Weihs
Jordi Salvador
Klemen Kotar
Unnat Jain
Kuo-Hao Zeng
Roozbeh Mottaghi
Aniruddha Kembhavi
LM&Ro AI4CE
80
75
0
28 Aug 2020
learn2learn: A Library for Meta-Learning Research
Sébastien M. R. Arnold
Praateek Mahajan
Debajyoti Datta
Ian Bunner
Konstantinos Saitas Zarkias
134
96
0
27 Aug 2020
CLAN: Continuous Learning using Asynchronous Neuroevolution on Commodity Edge Devices
Parth Mannan
A. Samajdar
T. Krishna
58
2
0
27 Aug 2020
Constrained Markov Decision Processes via Backward Value Functions
Harsh Satija
Philip Amortila
Joelle Pineau
104
52
0
26 Aug 2020
Identifying Critical States by the Action-Based Variance of Expected Return
Izumi Karino
Yoshiyuki Ohmura
Yasuo Kuniyoshi
OffRL
26
2
0
26 Aug 2020
Inverse Policy Evaluation for Value-based Sequential Decision-making
Alan Chan
Kristopher De Asis
R. Sutton
OffRL
87
1
0
26 Aug 2020
t-Soft Update of Target Network for Deep Reinforcement Learning
Taisuke Kobayashi
Wendyam Eric Lionel Ilboudo
133
52
0
25 Aug 2020
Neural Bridge Sampling for Evaluating Safety-Critical Autonomous Systems
Aman Sinha
Matthew O'Kelly
Russ Tedrake
John C. Duchi
102
49
0
24 Aug 2020
Adaptive and Multiple Time-scale Eligibility Traces for Online Deep Reinforcement Learning
Taisuke Kobayashi
OffRL
44
8
0
23 Aug 2020
Adversarial Imitation Learning via Random Search
Myungjae Shin
Joongheon Kim
40
11
0
21 Aug 2020
Biomechanic Posture Stabilisation via Iterative Training of Multi-policy Deep Reinforcement Learning Agents
M. Hossny
Julie Iskander
51
0
0
21 Aug 2020
Meta-Sim2: Unsupervised Learning of Scene Structure for Synthetic Data Generation
Jeevan Devaranjan
Amlan Kar
Sanja Fidler
76
89
0
20 Aug 2020
Optimal control towards sustainable wastewater treatment plants based on multi-agent reinforcement learning
Kehua Chen
Hongcheng Wang
Borja Valverde Perez
Siyuan Zhai
L. Vezzaro
Aijie Wang
24
0
0
19 Aug 2020
A Framework for Studying Reinforcement Learning and Sim-to-Real in Robot Soccer
H. Bassani
R. A. Delgado
J. N. D. O. L. Junior
H. R. Medeiros
Pedro H. M. Braga
Mateus G. Machado
L. H. C. Santos
Alain Tapp
22
11
0
18 Aug 2020
Heteroscedastic Uncertainty for Robust Generative Latent Dynamics
Oliver Limoyo
Bryan Chan
Filip Marić
Brandon Wagstaff
Rupam Mahmood
Jonathan Kelly
74
8
0
18 Aug 2020
SuperSuit: Simple Microwrappers for Reinforcement Learning Environments
J. K. Terry
Benjamin Black
Ananth Hari
44
22
0
17 Aug 2020
An adaptive synchronization approach for weights of deep reinforcement learning
S. Badran
M. Rezghi
30
0
0
16 Aug 2020
Reducing Sampling Error in Batch Temporal Difference Learning
Brahma S. Pavse
Ishan Durugkar
Josiah P. Hanna
Peter Stone
OffRL
71
12
0
15 Aug 2020
Accountable Off-Policy Evaluation With Kernel Bellman Statistics
Yihao Feng
Zhaolin Ren
Ziyang Tang
Qiang Liu
OffRL
148
44
0
15 Aug 2020
Reinforcement Learning with Quantum Variational Circuits
Owen Lockwood
Mei Si
75
140
0
15 Aug 2020
Interactive Visualization for Debugging RL
Shuby Deshpande
Benjamin Eysenbach
J. Schneider
77
7
0
14 Aug 2020
Sample-efficient Cross-Entropy Method for Real-time Planning
Cristina Pinneri
Shambhuraj Sawant
Sebastian Blaes
Jan Achterhold
Joerg Stueckler
Michal Rolínek
Georg Martius
86
103
0
14 Aug 2020
OR-Gym: A Reinforcement Learning Library for Operations Research Problems
Christian D. Hubbs
Hector D. Perez
Owais Sarwar
N. Sahinidis
I. Grossmann
J. Wassick
OffRL AI4CE
67
74
0
14 Aug 2020
Imitating Unknown Policies via Exploration
Nathan Gavenski
Juarez Monteiro
R. Granada
Felipe Meneguzzi
Rodrigo C. Barros
OffRL
49
7
0
13 Aug 2020
Model-Based Offline Planning
Arthur Argenson
Gabriel Dulac-Arnold
OffRL
111
155
0
12 Aug 2020
Overcoming Model Bias for Robust Offline Deep Reinforcement Learning
Phillip Swazinna
Steffen Udluft
Thomas Runkler
OffRL
84
84
0
12 Aug 2020
An ocular biomechanics environment for reinforcement learning
Julie Iskander
M. Hossny
39
4
0
12 Aug 2020
Learning Event-triggered Control from Data through Joint Optimization
Niklas Funk
Dominik Baumann
V. Berenz
Sebastian Trimpe
74
18
0
11 Aug 2020
Deep Model-Based Reinforcement Learning for High-Dimensional Problems, a Survey
Aske Plaat
W. Kosters
Mike Preuss
BDL OffRL
127
17
0
11 Aug 2020
TriFinger: An Open-Source Robot for Learning Dexterity
Manuel Wüthrich
Felix Widmaier
F. Grimminger
J. Akpo
S. Joshi
...
Julian Viereck
M. Naveau
Ludovic Righetti
Bernhard Schölkopf
Stefan Bauer
82
72
0
08 Aug 2020
Error Autocorrelation Objective Function for Improved System Modeling
Anand Ramakrishnan
Warren B. Jackson
Kent Evans
DRL
60
0
0
08 Aug 2020
Assisted Perception: Optimizing Observations to Communicate State
S. Reddy
Sergey Levine
Anca Dragan
92
15
0
06 Aug 2020
Deep Reinforcement Learning for Tactile Robotics: Learning to Type on a Braille Keyboard
Alex Church
John Lloyd
R. Hadsell
Nathan Lepora
80
31
0
06 Aug 2020
ClipUp: A Simple and Powerful Optimizer for Distribution-based Policy Evolution
N. E. Toklu
Paweł Liskowski
R. Srivastava
52
11
0
05 Aug 2020
An Imitation from Observation Approach to Transfer Learning with Dynamics Mismatch
Siddharth Desai
Ishan Durugkar
Haresh Karnan
Garrett A. Warnell
Josiah P. Hanna
Peter Stone
48
5
0
04 Aug 2020
Reinforced Grounded Action Transformation for Sim-to-Real Transfer
Haresh Karnan
Siddharth Desai
Josiah P. Hanna
Garrett A. Warnell
Peter Stone
59
24
0
04 Aug 2020
Concurrent Training Improves the Performance of Behavioral Cloning from Observation
Zachary Robertson
Matthew R. Walter
OffRL
57
3
0
03 Aug 2020
Proximal Deterministic Policy Gradient
Marco Maggipinto
Gian Antonio Susto
Pratik Chaudhari
OffRL
38
5
0
03 Aug 2020
Learning to Drive (L2D) as a Low-Cost Benchmark for Real-World Reinforcement Learning
A. Viitala
Rinu Boney
Yi Zhao
Alexander Ilin
Arno Solin
OffRL
57
7
0
03 Aug 2020
BenchBot: Evaluating Robotics Research in Photorealistic 3D Simulation and on Real Robots
Ben Talbot
David Hall
Haoyang Zhang
S. Bista
Rohan Smith
Feras Dayoub
Niko Sünderhauf
62
15
0
03 Aug 2020
Interactive Imitation Learning in State-Space
Snehal Jauhri
C. Celemin
Jens Kober
143
14
0
02 Aug 2020
Deep Reinforcement Learning using Cyclical Learning Rates
Ralf Gulde
Marc Tuscher
A. Csiszar
O. Riedel
A. Verl
28
9
0
31 Jul 2020
Towards Deep Robot Learning with Optimizer applicable to Non-stationary Problems
Taisuke Kobayashi
ODL
56
9
0
31 Jul 2020
Data-efficient Hindsight Off-policy Option Learning
Markus Wulfmeier
Dushyant Rao
Roland Hafner
Thomas Lampe
A. Abdolmaleki
...
Michael Neunert
Dhruva Tirumala
Noah Y. Siegel
N. Heess
Martin Riedmiller
OffRL
91
47
0
30 Jul 2020
Quantity vs. Quality: On Hyperparameter Optimization for Deep Reinforcement Learning
L. Hertel
Pierre Baldi
D. Gillen
BDL
75
13
0
29 Jul 2020
Statistical Bootstrapping for Uncertainty Estimation in Off-Policy Evaluation
Ilya Kostrikov
Ofir Nachum
OffRL
70
31
0
27 Jul 2020
Self-Adapting Recurrent Models for Object Pushing from Learning in Simulation
Lin Cong
Michael Görner
Philipp Ruppel
Hongzhuo Liang
Norman Hendrich
Jianwei Zhang
99
13
0
27 Jul 2020