ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1606.01540
  4. Cited By
OpenAI Gym

OpenAI Gym

5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
    OffRL
    ODL
ArXivPDFHTML

Papers citing "OpenAI Gym"

50 / 1,654 papers shown
Title
Curiosity-Driven Experience Prioritization via Density Estimation
Curiosity-Driven Experience Prioritization via Density Estimation
Rui Zhao
Volker Tresp
27
54
0
20 Feb 2019
Emergent Coordination Through Competition
Emergent Coordination Through Competition
Siqi Liu
Guy Lever
J. Merel
S. Tunyasuvunakool
N. Heess
T. Graepel
47
149
0
19 Feb 2019
Fast Efficient Hyperparameter Tuning for Policy Gradients
Fast Efficient Hyperparameter Tuning for Policy Gradients
Supratik Paul
Vitaly Kurin
Shimon Whiteson
22
32
0
18 Feb 2019
Robust Reinforcement Learning in POMDPs with Incomplete and Noisy
  Observations
Robust Reinforcement Learning in POMDPs with Incomplete and Noisy Observations
Yuhui Wang
Hao He
Xiaoyang Tan
30
9
0
15 Feb 2019
Network Offloading Policies for Cloud Robotics: a Learning-based
  Approach
Network Offloading Policies for Cloud Robotics: a Learning-based Approach
Sandeep P. Chinchali
Apoorva Sharma
James Harrison
Amine Elhafsi
Daniel Kang
Evgenya Pergament
Eyal Cidon
Sachin Katti
Marco Pavone
OffRL
11
105
0
15 Feb 2019
VERIFAI: A Toolkit for the Design and Analysis of Artificial
  Intelligence-Based Systems
VERIFAI: A Toolkit for the Design and Analysis of Artificial Intelligence-Based Systems
T. Dreossi
Daniel J. Fremont
Shromona Ghosh
Edward J. Kim
H. Ravanbakhsh
Marcell Vazquez-Chanlatte
S. Seshia
18
29
0
12 Feb 2019
A Meta-MDP Approach to Exploration for Lifelong Reinforcement Learning
A Meta-MDP Approach to Exploration for Lifelong Reinforcement Learning
Francisco M. Garcia
Philip S. Thomas
24
38
0
03 Feb 2019
Certified Reinforcement Learning with Logic Guidance
Certified Reinforcement Learning with Logic Guidance
Mohammadhosein Hasanbeig
Daniel Kroening
Alessandro Abate
24
53
0
02 Feb 2019
Contrasting Exploration in Parameter and Action Space: A Zeroth-Order
  Optimization Perspective
Contrasting Exploration in Parameter and Action Space: A Zeroth-Order Optimization Perspective
Anirudh Vemula
Wen Sun
J. Andrew Bagnell
19
40
0
31 Jan 2019
Improving Evolutionary Strategies with Generative Neural Networks
Improving Evolutionary Strategies with Generative Neural Networks
Louis Faury
Clément Calauzènes
Olivier Fercoq
Syrine Krichene
27
12
0
31 Jan 2019
Go-Explore: a New Approach for Hard-Exploration Problems
Go-Explore: a New Approach for Hard-Exploration Problems
Adrien Ecoffet
Joost Huizinga
Joel Lehman
Kenneth O. Stanley
Jeff Clune
AI4TS
24
363
0
30 Jan 2019
Discretizing Continuous Action Space for On-Policy Optimization
Discretizing Continuous Action Space for On-Policy Optimization
Yunhao Tang
Shipra Agrawal
OffRL
26
118
0
29 Jan 2019
Modularization of End-to-End Learning: Case Study in Arcade Games
Modularization of End-to-End Learning: Case Study in Arcade Games
Andrew Melnik
Sascha Fleer
M. Schilling
Helge J. Ritter
OffRL
39
12
0
27 Jan 2019
Trust Region Value Optimization using Kalman Filtering
Trust Region Value Optimization using Kalman Filtering
Shirli Di-Castro Shashua
Shie Mannor
19
7
0
23 Jan 2019
Neuroflight: Next Generation Flight Control Firmware
Neuroflight: Next Generation Flight Control Firmware
W. Koch
R. Mancuso
Azer Bestavros
35
29
0
19 Jan 2019
AutoPhase: Compiler Phase-Ordering for High Level Synthesis with Deep
  Reinforcement Learning
AutoPhase: Compiler Phase-Ordering for High Level Synthesis with Deep Reinforcement Learning
Ameer Haj-Ali
Qijing Huang
William S. Moses
J. Xiang
Ion Stoica
Krste Asanović
J. Wawrzynek
29
36
0
15 Jan 2019
Model-Predictive Policy Learning with Uncertainty Regularization for
  Driving in Dense Traffic
Model-Predictive Policy Learning with Uncertainty Regularization for Driving in Dense Traffic
Mikael Henaff
A. Canziani
Yann LeCun
OOD
28
122
0
08 Jan 2019
Adversarial Learning of a Sampler Based on an Unnormalized Distribution
Adversarial Learning of a Sampler Based on an Unnormalized Distribution
Chunyuan Li
Ke Bai
Jianqiao Li
Guoyin Wang
Changyou Chen
Lawrence Carin
16
10
0
03 Jan 2019
Complementary reinforcement learning towards explainable agents
Complementary reinforcement learning towards explainable agents
J. H. Lee
27
12
0
01 Jan 2019
Deconfounding Reinforcement Learning in Observational Settings
Deconfounding Reinforcement Learning in Observational Settings
Chaochao Lu
Bernhard Schölkopf
José Miguel Hernández-Lobato
CML
OOD
39
73
0
26 Dec 2018
Learning to Walk via Deep Reinforcement Learning
Learning to Walk via Deep Reinforcement Learning
Tuomas Haarnoja
Sehoon Ha
Aurick Zhou
Jie Tan
George Tucker
Sergey Levine
54
433
0
26 Dec 2018
VMAV-C: A Deep Attention-based Reinforcement Learning Algorithm for
  Model-based Control
VMAV-C: A Deep Attention-based Reinforcement Learning Algorithm for Model-based Control
Xingxing Liang
Qi Wang
Yanghe Feng
Zhong Liu
Jincai Huang
29
5
0
24 Dec 2018
TD-Regularized Actor-Critic Methods
TD-Regularized Actor-Critic Methods
Simone Parisi
Voot Tangkaratt
Jan Peters
Mohammad Emtiyaz Khan
OffRL
30
32
0
19 Dec 2018
Communication-Efficient Policy Gradient Methods for Distributed
  Reinforcement Learning
Communication-Efficient Policy Gradient Methods for Distributed Reinforcement Learning
Tianyi Chen
Kaipeng Zhang
G. Giannakis
Tamer Basar
OffRL
29
41
0
07 Dec 2018
Active Deep Q-learning with Demonstration
Active Deep Q-learning with Demonstration
Si-An Chen
Voot Tangkaratt
Hsuan-Tien Lin
Masashi Sugiyama
18
32
0
06 Dec 2018
Relative Entropy Regularized Policy Iteration
Relative Entropy Regularized Policy Iteration
A. Abdolmaleki
Jost Tobias Springenberg
Jonas Degrave
Steven Bohez
Yuval Tassa
Dan Belov
N. Heess
Martin Riedmiller
27
72
0
05 Dec 2018
PNS: Population-Guided Novelty Search for Reinforcement Learning in Hard
  Exploration Environments
PNS: Population-Guided Novelty Search for Reinforcement Learning in Hard Exploration Environments
Qihao Liu
Yujia Wang
Xiao-Fei Liu
25
8
0
26 Nov 2018
Coordinating Disaster Emergency Response with Heuristic Reinforcement
  Learning
Coordinating Disaster Emergency Response with Heuristic Reinforcement Learning
L. Nguyen
Zhou Yang
Jiazhen Zhu
Jia Ming Li
Fang Jin
24
21
0
12 Nov 2018
Towards Governing Agent's Efficacy: Action-Conditional $β$-VAE for
  Deep Transparent Reinforcement Learning
Towards Governing Agent's Efficacy: Action-Conditional βββ-VAE for Deep Transparent Reinforcement Learning
John Yang
Gyujeong Lee
Minsung Hyun
Simyung Chang
Nojun Kwak
29
3
0
11 Nov 2018
Sample-Efficient Policy Learning based on Completely Behavior Cloning
Sample-Efficient Policy Learning based on Completely Behavior Cloning
Qiming Zou
Ling Wang
K. Lu
Yu Li
OffRL
25
0
0
09 Nov 2018
Deep Reinforcement Learning via L-BFGS Optimization
Deep Reinforcement Learning via L-BFGS Optimization
Chris Paxton
Roummel F. Marcia
OffRL
21
0
0
06 Nov 2018
Learning to Defend by Learning to Attack
Learning to Defend by Learning to Attack
Haoming Jiang
Zhehui Chen
Yuyang Shi
Bo Dai
T. Zhao
21
22
0
03 Nov 2018
VIREL: A Variational Inference Framework for Reinforcement Learning
VIREL: A Variational Inference Framework for Reinforcement Learning
M. Fellows
Anuj Mahajan
Tim G. J. Rudner
Shimon Whiteson
DRL
38
54
0
03 Nov 2018
Towards a Simple Approach to Multi-step Model-based Reinforcement
  Learning
Towards a Simple Approach to Multi-step Model-based Reinforcement Learning
Kavosh Asadi
Evan Cater
Dipendra Kumar Misra
Michael L. Littman
OffRL
29
13
0
31 Oct 2018
Assessing Generalization in Deep Reinforcement Learning
Assessing Generalization in Deep Reinforcement Learning
Charles Packer
Katelyn Gao
Jernej Kos
Philipp Krahenbuhl
V. Koltun
D. Song
OffRL
18
233
0
29 Oct 2018
Sample-Efficient Learning of Nonprehensile Manipulation Policies via
  Physics-Based Informed State Distributions
Sample-Efficient Learning of Nonprehensile Manipulation Policies via Physics-Based Informed State Distributions
Lerrel Pinto
Aditya Mandalika
Brian Hou
S. Srinivasa
33
13
0
24 Oct 2018
RLgraph: Modular Computation Graphs for Deep Reinforcement Learning
RLgraph: Modular Computation Graphs for Deep Reinforcement Learning
Michael Schaarschmidt
Sven Mika
Kai Fricke
Eiko Yoneki
OffRL
23
5
0
21 Oct 2018
Autonomous Self-Explanation of Behavior for Interactive Reinforcement
  Learning Agents
Autonomous Self-Explanation of Behavior for Interactive Reinforcement Learning Agents
Yosuke Fukuchi
Masahiko Osawa
Hiroshi Yamakawa
M. Imai
19
31
0
20 Oct 2018
O2A: One-shot Observational learning with Action vectors
O2A: One-shot Observational learning with Action vectors
Leo Pauly
Wisdom C. Agboh
David C. Hogg
R. Fuentes
57
9
0
17 Oct 2018
Multiple Interactions Made Easy (MIME): Large Scale Demonstrations Data
  for Imitation
Multiple Interactions Made Easy (MIME): Large Scale Demonstrations Data for Imitation
Pratyusha Sharma
Lekha Mohan
Lerrel Pinto
Abhinav Gupta
23
119
0
16 Oct 2018
Batch Active Preference-Based Learning of Reward Functions
Batch Active Preference-Based Learning of Reward Functions
Erdem Biyik
Dorsa Sadigh
22
108
0
10 Oct 2018
Reinforcement Learning for Improving Agent Design
Reinforcement Learning for Improving Agent Design
David R Ha
35
124
0
09 Oct 2018
PPO-CMA: Proximal Policy Optimization with Covariance Matrix Adaptation
PPO-CMA: Proximal Policy Optimization with Covariance Matrix Adaptation
Perttu Hämäläinen
Amin Babadi
Xiaoxiao Ma
J. Lehtinen
32
62
0
05 Oct 2018
Energy-Based Hindsight Experience Prioritization
Energy-Based Hindsight Experience Prioritization
Rui Zhao
Volker Tresp
19
74
0
02 Oct 2018
Directed-Info GAIL: Learning Hierarchical Policies from Unsegmented
  Demonstrations using Directed Information
Directed-Info GAIL: Learning Hierarchical Policies from Unsegmented Demonstrations using Directed Information
Arjun Sharma
Mohit Sharma
Nicholas Rhinehart
Kris Kitani
27
68
0
29 Sep 2018
Boosting Trust Region Policy Optimization by Normalizing Flows Policy
Boosting Trust Region Policy Optimization by Normalizing Flows Policy
Yunhao Tang
Shipra Agrawal
TPM
39
29
0
27 Sep 2018
Omega-Regular Objectives in Model-Free Reinforcement Learning
Omega-Regular Objectives in Model-Free Reinforcement Learning
E. M. Hahn
Mateo Perez
S. Schewe
Fabio Somenzi
Ashutosh Trivedi
D. Wojtczak
15
145
0
26 Sep 2018
Switching Isotropic and Directional Exploration with Parameter Space
  Noise in Deep Reinforcement Learning
Switching Isotropic and Directional Exploration with Parameter Space Noise in Deep Reinforcement Learning
Izumi Karino
Kazutoshi Tanaka
Ryuma Niiyama
Yasuo Kuniyoshi
19
3
0
18 Sep 2018
Deep Learning with Experience Ranking Convolutional Neural Network for
  Robot Manipulator
Deep Learning with Experience Ranking Convolutional Neural Network for Robot Manipulator
Hai V. Nguyen
Hung M. La
M. Deans
SSL
OffRL
12
8
0
16 Sep 2018
Deterministic Implementations for Reproducibility in Deep Reinforcement
  Learning
Deterministic Implementations for Reproducibility in Deep Reinforcement Learning
P. Nagarajan
Garrett A. Warnell
Peter Stone
22
51
0
15 Sep 2018
Previous
123...3031323334
Next