ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1606.01540
  4. Cited By
OpenAI Gym

OpenAI Gym

5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
    OffRLODL
ArXiv (abs)PDFHTML

Papers citing "OpenAI Gym"

50 / 2,578 papers shown
Title
Pylot: A Modular Platform for Exploring Latency-Accuracy Tradeoffs in
  Autonomous Vehicles
Pylot: A Modular Platform for Exploring Latency-Accuracy Tradeoffs in Autonomous Vehicles
Ionel Gog
Sukrit Kalra
Peter Schafhalter
Matthew A. Wright
Joseph E. Gonzalez
Ion Stoica
114
70
0
16 Apr 2021
Quantum Architecture Search via Deep Reinforcement Learning
Quantum Architecture Search via Deep Reinforcement Learning
En-Jui Kuo
Yao-Lung L. Fang
Samuel Yen-Chi Chen
AI4CE
90
90
0
15 Apr 2021
GAN-Based Interactive Reinforcement Learning from Demonstration and
  Human Evaluative Feedback
GAN-Based Interactive Reinforcement Learning from Demonstration and Human Evaluative Feedback
Jie Huang
Rongshun Juan
R. Gomez
Keisuke Nakamura
Q. Sha
Bo He
Guangliang Li
73
10
0
14 Apr 2021
TAAC: Temporally Abstract Actor-Critic for Continuous Control
TAAC: Temporally Abstract Actor-Critic for Continuous Control
Haonan Yu
Wei Xu
Haichao Zhang
OffRL
56
21
0
13 Apr 2021
Subgoal-based Reward Shaping to Improve Efficiency in Reinforcement
  Learning
Subgoal-based Reward Shaping to Improve Efficiency in Reinforcement Learning
Takato Okudo
Seiji Yamada
OffRL
44
21
0
13 Apr 2021
Reward Shaping with Dynamic Trajectory Aggregation
Reward Shaping with Dynamic Trajectory Aggregation
Takato Okudo
Seiji Yamada
28
2
0
13 Apr 2021
Muesli: Combining Improvements in Policy Optimization
Muesli: Combining Improvements in Policy Optimization
Matteo Hessel
Ivo Danihelka
Fabio Viola
A. Guez
Simon Schmitt
Laurent Sifre
T. Weber
David Silver
H. V. Hasselt
111
66
0
13 Apr 2021
Augmented World Models Facilitate Zero-Shot Dynamics Generalization From
  a Single Offline Environment
Augmented World Models Facilitate Zero-Shot Dynamics Generalization From a Single Offline Environment
Philip J. Ball
Cong Lu
Jack Parker-Holder
Stephen J. Roberts
OffRL
112
45
0
12 Apr 2021
Learn Goal-Conditioned Policy with Intrinsic Motivation for Deep
  Reinforcement Learning
Learn Goal-Conditioned Policy with Intrinsic Motivation for Deep Reinforcement Learning
Jinxin Liu
Donglin Wang
Qiangxing Tian
Zhengyu Chen
92
23
0
11 Apr 2021
Behavior-Guided Actor-Critic: Improving Exploration via Learning Policy
  Behavior Representation for Deep Reinforcement Learning
Behavior-Guided Actor-Critic: Improving Exploration via Learning Policy Behavior Representation for Deep Reinforcement Learning
Ammar Fayad
M. Ibrahim
BDL
55
3
0
09 Apr 2021
CropGym: a Reinforcement Learning Environment for Crop Management
CropGym: a Reinforcement Learning Environment for Crop Management
H. Overweg
H. Berghuijs
Ioannis Athanasiadis
OffRL
57
34
0
09 Apr 2021
Learning to Reweight Imaginary Transitions for Model-Based Reinforcement
  Learning
Learning to Reweight Imaginary Transitions for Model-Based Reinforcement Learning
Wenzhen Huang
Qiyue Yin
Junge Zhang
Kaiqi Huang
56
3
0
09 Apr 2021
ACERAC: Efficient reinforcement learning in fine time discretization
ACERAC: Efficient reinforcement learning in fine time discretization
Jakub Łyskawa
Pawel Wawrzyñski
34
2
0
08 Apr 2021
A Bayesian Approach to Reinforcement Learning of Vision-Based Vehicular
  Control
A Bayesian Approach to Reinforcement Learning of Vision-Based Vehicular Control
Zahra Gharaee
Karl Holmquist
Linbo He
Michael Felsberg
BDL
45
4
0
08 Apr 2021
Py-Feat: Python Facial Expression Analysis Toolbox
Py-Feat: Python Facial Expression Analysis Toolbox
J. H. Cheong
Eshin Jolly
Tiankang Xie
Sophie Byrne
Matthew Kenney
Luke J. Chang
CVBM
69
93
0
08 Apr 2021
Bootstrapping of memetic from genetic evolution via inter-agent
  selection pressures
Bootstrapping of memetic from genetic evolution via inter-agent selection pressures
N. Guttenberg
Marek Rosa
6
0
0
07 Apr 2021
GEM: Group Enhanced Model for Learning Dynamical Control Systems
GEM: Group Enhanced Model for Learning Dynamical Control Systems
Philippe Hansen-Estruch
Wenling Shang
Lerrel Pinto
Pieter Abbeel
Stas Tiomkin
AI4CE
63
3
0
07 Apr 2021
Ecole: A Library for Learning Inside MILP Solvers
Ecole: A Library for Learning Inside MILP Solvers
Antoine Prouvost
Justin Dumouchelle
Maxime Gasse
Didier Chételat
Andrea Lodi
33
4
0
06 Apr 2021
Zeus: Efficiently Localizing Actions in Videos using Reinforcement
  Learning
Zeus: Efficiently Localizing Actions in Videos using Reinforcement Learning
Pramod Chunduri
J. Bang
Yao Lu
Joy Arulraj
54
12
0
06 Apr 2021
Fast Design Space Exploration of Nonlinear Systems: Part II
Fast Design Space Exploration of Nonlinear Systems: Part II
Prerit Terway
Kenza Hamidouche
N. Jha
48
4
0
05 Apr 2021
No Need for Interactions: Robust Model-Based Imitation Learning using
  Neural ODE
No Need for Interactions: Robust Model-Based Imitation Learning using Neural ODE
HaoChih Lin
Baopu Li
Xin Zhou
Jiankun Wang
Max Meng
41
6
0
03 Apr 2021
Optimization Algorithm for Feedback and Feedforward Policies towards
  Robot Control Robust to Sensing Failures
Optimization Algorithm for Feedback and Feedforward Policies towards Robot Control Robust to Sensing Failures
Taisuke Kobayashi
Kenta Yoshizawa
29
3
0
01 Apr 2021
Towards Real-World Deployment of Reinforcement Learning for Traffic
  Signal Control
Towards Real-World Deployment of Reinforcement Learning for Traffic Signal Control
Arthur Muller
Vishal S. Rangras
Georg Schnittker
Michael Waldmann
Maxim Friesen
Tobias Ferfers
Lukas Schreckenberg
Florian Hufen
J. Jasperneite
M. Wiering
OffRL
59
15
0
30 Mar 2021
pH-RL: A personalization architecture to bring reinforcement learning to
  health practice
pH-RL: A personalization architecture to bring reinforcement learning to health practice
Ali el Hassouni
Mark Hoogendoorn
Marketa Ciharova
A. Kleiboer
K. Amarti
Vesa Muhonen
H. Riper
A. E. Eiben
OffRL
30
2
0
29 Mar 2021
Fundamental Challenges in Deep Learning for Stiff Contact Dynamics
Fundamental Challenges in Deep Learning for Stiff Contact Dynamics
Mihir Parmar
Mathew Halm
Michael Posa
74
38
0
29 Mar 2021
Robust Reinforcement Learning under model misspecification
Robust Reinforcement Learning under model misspecification
Lebin Yu
Jian Wang
Xudong Zhang
OOD
64
2
0
29 Mar 2021
Co-Imitation Learning without Expert Demonstration
Co-Imitation Learning without Expert Demonstration
Kun-Peng Ning
Hu Xu
Kun Zhu
Sheng-Jun Huang
OffRL
36
3
0
27 Mar 2021
Character Controllers Using Motion VAEs
Character Controllers Using Motion VAEs
Hung Yu Ling
F. Zinno
George Cheng
M. van de Panne
DRL
87
254
0
26 Mar 2021
Adversarial Imitation Learning with Trajectorial Augmentation and
  Correction
Adversarial Imitation Learning with Trajectorial Augmentation and Correction
Dafni Antotsiou
C. Ciliberto
Tae-Kyun Kim
63
10
0
25 Mar 2021
Model Predictive Actor-Critic: Accelerating Robot Skill Acquisition with
  Deep Reinforcement Learning
Model Predictive Actor-Critic: Accelerating Robot Skill Acquisition with Deep Reinforcement Learning
A. S. Morgan
Daljeet Nandha
Georgia Chalvatzaki
Carlo DÉramo
A. Dollar
Jan Peters
96
44
0
25 Mar 2021
Policy Information Capacity: Information-Theoretic Measure for Task
  Complexity in Deep Reinforcement Learning
Policy Information Capacity: Information-Theoretic Measure for Task Complexity in Deep Reinforcement Learning
Hiroki Furuta
T. Matsushima
Tadashi Kozuno
Y. Matsuo
Sergey Levine
Ofir Nachum
S. Gu
OffRL
58
14
0
23 Mar 2021
Online Baum-Welch algorithm for Hierarchical Imitation Learning
Online Baum-Welch algorithm for Hierarchical Imitation Learning
Vittorio Giammarino
I. Paschalidis
OffRL
42
2
0
22 Mar 2021
Reward-Reinforced Reinforcement Learning for Multi-agent Systems
Reward-Reinforced Reinforcement Learning for Multi-agent Systems
Changgang Zheng
Shufan Yang
Juan Marcelo Parra Ullauri
A. García-Domínguez
Nelly Bencomo
53
11
0
22 Mar 2021
Introspective Visuomotor Control: Exploiting Uncertainty in Deep
  Visuomotor Control for Failure Recovery
Introspective Visuomotor Control: Exploiting Uncertainty in Deep Visuomotor Control for Failure Recovery
Chia-Man Hung
Li Sun
Yizhe Wu
Ioannis Havoutis
Ingmar Posner
36
5
0
22 Mar 2021
MaAST: Map Attention with Semantic Transformersfor Efficient Visual
  Navigation
MaAST: Map Attention with Semantic Transformersfor Efficient Visual Navigation
Zachary Seymour
Kowshik Thopalli
Niluthpol Chowdhury Mithun
Han-Pang Chiu
S. Samarasekera
Rakesh Kumar
3DPC
69
18
0
21 Mar 2021
TeachMyAgent: a Benchmark for Automatic Curriculum Learning in Deep RL
TeachMyAgent: a Benchmark for Automatic Curriculum Learning in Deep RL
Clément Romac
Rémy Portelas
Katja Hofmann
Pierre-Yves Oudeyer
91
23
0
17 Mar 2021
Building Safer Autonomous Agents by Leveraging Risky Driving Behavior
  Knowledge
Building Safer Autonomous Agents by Leveraging Risky Driving Behavior Knowledge
Ashish Rana
A. Malhi
91
9
0
16 Mar 2021
Inclined Quadrotor Landing using Deep Reinforcement Learning
Inclined Quadrotor Landing using Deep Reinforcement Learning
Jacob E. Kooi
Robert Babuška
64
30
0
16 Mar 2021
Sample-efficient Reinforcement Learning Representation Learning with
  Curiosity Contrastive Forward Dynamics Model
Sample-efficient Reinforcement Learning Representation Learning with Curiosity Contrastive Forward Dynamics Model
Thanh Nguyen
Tung M. Luu
Thang Vu
Chang D. Yoo
47
17
0
15 Mar 2021
RL-Controller: a reinforcement learning framework for active structural
  control
RL-Controller: a reinforcement learning framework for active structural control
S. S. Eshkevari
Soheil Sadeghi Eshkevari
Debarshi Sen
S. Pakzad
AI4CE
30
2
0
13 Mar 2021
Network Environment Design for Autonomous Cyberdefense
Network Environment Design for Autonomous Cyberdefense
Andres Molina-Markham
Cory Miniter
Becky Powell
Ahmad Ridley
72
43
0
13 Mar 2021
Domain Curiosity: Learning Efficient Data Collection Strategies for
  Domain Adaptation
Domain Curiosity: Learning Efficient Data Collection Strategies for Domain Adaptation
Karol Arndt
Oliver Struckmeier
Ville Kyrki
46
1
0
12 Mar 2021
Discovering Diverse Solutions in Deep Reinforcement Learning by
  Maximizing State-Action-Based Mutual Information
Discovering Diverse Solutions in Deep Reinforcement Learning by Maximizing State-Action-Based Mutual Information
Takayuki Osa
Voot Tangkaratt
Masashi Sugiyama
73
33
0
12 Mar 2021
A Quadratic Actor Network for Model-Free Reinforcement Learning
A Quadratic Actor Network for Model-Free Reinforcement Learning
Matthias Weissenbacher
Yoshinobu Kawahara
25
0
0
11 Mar 2021
Generalizable Episodic Memory for Deep Reinforcement Learning
Generalizable Episodic Memory for Deep Reinforcement Learning
Haotian Hu
Jianing Ye
Guangxiang Zhu
Zhizhou Ren
Chongjie Zhang
OffRL
84
39
0
11 Mar 2021
Full Gradient DQN Reinforcement Learning: A Provably Convergent Scheme
Full Gradient DQN Reinforcement Learning: A Provably Convergent Scheme
Konstantin Avrachenkov
Vivek Borkar
H. Dolhare
K. Patil
60
9
0
10 Mar 2021
Learning from Imperfect Demonstrations from Agents with Varying Dynamics
Learning from Imperfect Demonstrations from Agents with Varying Dynamics
Zhangjie Cao
Dorsa Sadigh
88
29
0
10 Mar 2021
Model-free Policy Learning with Reward Gradients
Model-free Policy Learning with Reward Gradients
Qingfeng Lan
Samuele Tosatto
Homayoon Farrahi
Rupam Mahmood
49
6
0
09 Mar 2021
A Survey of Embodied AI: From Simulators to Research Tasks
A Survey of Embodied AI: From Simulators to Research Tasks
Jiafei Duan
Samson Yu
Tangyao Li
Huaiyu Zhu
Cheston Tan
LM&Ro
134
296
0
08 Mar 2021
A Crash Course on Reinforcement Learning
A Crash Course on Reinforcement Learning
F. Yaghmaie
L. Ljung
90
2
0
08 Mar 2021
Previous
123...272829...505152
Next