ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1606.01540
  4. Cited By
OpenAI Gym

OpenAI Gym

5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
    OffRLODL
ArXiv (abs)PDFHTML

Papers citing "OpenAI Gym"

50 / 2,578 papers shown
Title
TD-Regularized Actor-Critic Methods
TD-Regularized Actor-Critic Methods
Simone Parisi
Voot Tangkaratt
Jan Peters
Mohammad Emtiyaz Khan
OffRL
61
31
0
19 Dec 2018
Domain Adaptation for Reinforcement Learning on the Atari
Domain Adaptation for Reinforcement Learning on the Atari
Thomas Carr
Maria Chli
George Vogiatzis
38
22
0
18 Dec 2018
Soft Actor-Critic Algorithms and Applications
Soft Actor-Critic Algorithms and Applications
Tuomas Haarnoja
Aurick Zhou
Kristian Hartikainen
George Tucker
Sehoon Ha
...
Vikash Kumar
Henry Zhu
Abhishek Gupta
Pieter Abbeel
Sergey Levine
159
2,456
0
13 Dec 2018
Communication-Efficient Policy Gradient Methods for Distributed
  Reinforcement Learning
Communication-Efficient Policy Gradient Methods for Distributed Reinforcement Learning
Tianyi Chen
Kai Zhang
G. Giannakis
Tamer Basar
OffRL
100
41
0
07 Dec 2018
Off-Policy Deep Reinforcement Learning without Exploration
Off-Policy Deep Reinforcement Learning without Exploration
Scott Fujimoto
David Meger
Doina Precup
OffRLBDL
296
1,626
0
07 Dec 2018
Active Deep Q-learning with Demonstration
Active Deep Q-learning with Demonstration
Si-An Chen
Voot Tangkaratt
Hsuan-Tien Lin
Masashi Sugiyama
48
33
0
06 Dec 2018
Relative Entropy Regularized Policy Iteration
Relative Entropy Regularized Policy Iteration
A. Abdolmaleki
Jost Tobias Springenberg
Jonas Degrave
Steven Bohez
Yuval Tassa
Dan Belov
N. Heess
Martin Riedmiller
68
72
0
05 Dec 2018
JANUS: Fast and Flexible Deep Learning via Symbolic Graph Execution of
  Imperative Programs
JANUS: Fast and Flexible Deep Learning via Symbolic Graph Execution of Imperative Programs
Eunji Jeong
Sungwoo Cho
Gyeong-In Yu
Joo Seong Jeong
Dongjin Shin
Byung-Gon Chun
59
25
0
04 Dec 2018
Adversarial Domain Randomization
Adversarial Domain Randomization
Rawal Khirodkar
Kris Kitani
34
5
0
03 Dec 2018
Hierarchical Policy Design for Sample-Efficient Learning of Robot Table
  Tennis Through Self-Play
Hierarchical Policy Design for Sample-Efficient Learning of Robot Table Tennis Through Self-Play
R. Mahjourian
Navdeep Jaitly
N. Lazić
Sergey Levine
Risto Miikkulainen
73
16
0
30 Nov 2018
An Introduction to Deep Reinforcement Learning
An Introduction to Deep Reinforcement Learning
Vincent François-Lavet
Peter Henderson
Riashat Islam
Marc G. Bellemare
Joelle Pineau
OffRLAI4CE
173
1,279
0
30 Nov 2018
Exploring Restart Distributions
Exploring Restart Distributions
Arash Tavakoli
Vitaly Levdik
Riashat Islam
Christopher M. Smith
Petar Kormushev
OffRL
35
5
0
27 Nov 2018
Distributed traffic light control at uncoupled intersections with
  real-world topology by deep reinforcement learning
Distributed traffic light control at uncoupled intersections with real-world topology by deep reinforcement learning
Mark Schutera
Niklas Goby
S. Smolarek
Markus Reischl
33
7
0
27 Nov 2018
Understanding the impact of entropy on policy optimization
Understanding the impact of entropy on policy optimization
Zafarali Ahmed
Nicolas Le Roux
Mohammad Norouzi
Dale Schuurmans
83
238
0
27 Nov 2018
Genetic-Gated Networks for Deep Reinforcement
Genetic-Gated Networks for Deep Reinforcement
Simyung Chang
John Yang
Jaeseok Choi
Nojun Kwak
AI4CE
51
17
0
26 Nov 2018
PNS: Population-Guided Novelty Search for Reinforcement Learning in Hard
  Exploration Environments
PNS: Population-Guided Novelty Search for Reinforcement Learning in Hard Exploration Environments
Qihao Liu
Yujia Wang
Xiao-Fei Liu
84
8
0
26 Nov 2018
Coordinating Disaster Emergency Response with Heuristic Reinforcement
  Learning
Coordinating Disaster Emergency Response with Heuristic Reinforcement Learning
L. Nguyen
Zhou Yang
Jiazhen Zhu
Jia Ming Li
Fang Jin
44
23
0
12 Nov 2018
Learning data augmentation policies using augmented random search
Learning data augmentation policies using augmented random search
Mingyang Geng
Kele Xu
Bo Ding
Huaimin Wang
Lei Zhang
56
9
0
12 Nov 2018
Towards Governing Agent's Efficacy: Action-Conditional $β$-VAE for
  Deep Transparent Reinforcement Learning
Towards Governing Agent's Efficacy: Action-Conditional βββ-VAE for Deep Transparent Reinforcement Learning
John Yang
Gyujeong Lee
Minsung Hyun
Simyung Chang
Nojun Kwak
65
3
0
11 Nov 2018
Sample-Efficient Policy Learning based on Completely Behavior Cloning
Sample-Efficient Policy Learning based on Completely Behavior Cloning
Qiming Zou
Ling Wang
K. Lu
Yu Li
OffRL
52
0
0
09 Nov 2018
Baselines for Reinforcement Learning in Text Games
Baselines for Reinforcement Learning in Text Games
Mikuláš Zelinka
31
6
0
07 Nov 2018
Deep Reinforcement Learning via L-BFGS Optimization
Deep Reinforcement Learning via L-BFGS Optimization
Chris Paxton
Roummel F. Marcia
OffRL
45
0
0
06 Nov 2018
Learning to Defend by Learning to Attack
Learning to Defend by Learning to Attack
Haoming Jiang
Zhehui Chen
Yuyang Shi
Bo Dai
T. Zhao
98
22
0
03 Nov 2018
VIREL: A Variational Inference Framework for Reinforcement Learning
VIREL: A Variational Inference Framework for Reinforcement Learning
M. Fellows
Anuj Mahajan
Tim G. J. Rudner
Shimon Whiteson
DRL
106
56
0
03 Nov 2018
Horizon: Facebook's Open Source Applied Reinforcement Learning Platform
Horizon: Facebook's Open Source Applied Reinforcement Learning Platform
J. Gauci
Edoardo Conti
Yitao Liang
Kittipat Virochsiri
Yuchen He
Zachary Kaden
Vivek Narayanan
Xiaohui Ye
Zhengxing Chen
Scott Fujimoto
85
139
0
01 Nov 2018
Towards a Simple Approach to Multi-step Model-based Reinforcement
  Learning
Towards a Simple Approach to Multi-step Model-based Reinforcement Learning
Kavosh Asadi
Evan Cater
Dipendra Kumar Misra
Michael L. Littman
OffRL
86
13
0
31 Oct 2018
Relative Importance Sampling For Off-Policy Actor-Critic in Deep
  Reinforcement Learning
Relative Importance Sampling For Off-Policy Actor-Critic in Deep Reinforcement Learning
Mahammad Humayoo
Xueqi Cheng
BDLOffRL
26
5
0
30 Oct 2018
Assessing Generalization in Deep Reinforcement Learning
Assessing Generalization in Deep Reinforcement Learning
Charles Packer
Katelyn Gao
Jernej Kos
Philipp Krahenbuhl
V. Koltun
Basel Alomair
OffRL
124
238
0
29 Oct 2018
Sample-Efficient Learning of Nonprehensile Manipulation Policies via
  Physics-Based Informed State Distributions
Sample-Efficient Learning of Nonprehensile Manipulation Policies via Physics-Based Informed State Distributions
Lerrel Pinto
Aditya Mandalika
Brian Hou
S. Srinivasa
60
13
0
24 Oct 2018
Reconciling $λ$-Returns with Experience Replay
Reconciling λλλ-Returns with Experience Replay
Brett Daley
Chris Amato
59
4
0
23 Oct 2018
The Faults in Our Pi Stars: Security Issues and Open Challenges in Deep
  Reinforcement Learning
The Faults in Our Pi Stars: Security Issues and Open Challenges in Deep Reinforcement Learning
Vahid Behzadan
Arslan Munir
80
27
0
23 Oct 2018
RLgraph: Modular Computation Graphs for Deep Reinforcement Learning
RLgraph: Modular Computation Graphs for Deep Reinforcement Learning
Michael Schaarschmidt
Sven Mika
Kai Fricke
Eiko Yoneki
OffRL
51
5
0
21 Oct 2018
Autonomous Self-Explanation of Behavior for Interactive Reinforcement
  Learning Agents
Autonomous Self-Explanation of Behavior for Interactive Reinforcement Learning Agents
Yosuke Fukuchi
Masahiko Osawa
Hiroshi Yamakawa
M. Imai
61
31
0
20 Oct 2018
Safe Reinforcement Learning with Model Uncertainty Estimates
Safe Reinforcement Learning with Model Uncertainty Estimates
Björn Lütjens
Michael Everett
Jonathan P. How
81
169
0
19 Oct 2018
O2A: One-shot Observational learning with Action vectors
O2A: One-shot Observational learning with Action vectors
Leo Pauly
Wisdom C. Agboh
David C. Hogg
R. Fuentes
92
9
0
17 Oct 2018
Data Association with Gaussian Processes
Data Association with Gaussian Processes
Markus Kaiser
Clemens Otte
Thomas Runkler
Carl Henrik Ek
28
0
0
16 Oct 2018
Multiple Interactions Made Easy (MIME): Large Scale Demonstrations Data
  for Imitation
Multiple Interactions Made Easy (MIME): Large Scale Demonstrations Data for Imitation
Pratyusha Sharma
Lekha Mohan
Lerrel Pinto
Abhinav Gupta
75
121
0
16 Oct 2018
Batch Active Preference-Based Learning of Reward Functions
Batch Active Preference-Based Learning of Reward Functions
Erdem Biyik
Dorsa Sadigh
120
113
0
10 Oct 2018
Reinforcement Learning for Improving Agent Design
Reinforcement Learning for Improving Agent Design
David R Ha
106
127
0
09 Oct 2018
SFV: Reinforcement Learning of Physical Skills from Videos
SFV: Reinforcement Learning of Physical Skills from Videos
Xue Bin Peng
Angjoo Kanazawa
Jitendra Malik
Pieter Abbeel
Sergey Levine
99
65
0
08 Oct 2018
PPO-CMA: Proximal Policy Optimization with Covariance Matrix Adaptation
PPO-CMA: Proximal Policy Optimization with Covariance Matrix Adaptation
Perttu Hämäläinen
Amin Babadi
Xiaoxiao Ma
J. Lehtinen
116
63
0
05 Oct 2018
Reinforcement Learning Meets Hybrid Zero Dynamics: A Case Study for
  RABBIT
Reinforcement Learning Meets Hybrid Zero Dynamics: A Case Study for RABBIT
Guillermo A. Castillo
Bowen Weng
Ayonga Hereid
Wei Zhang
51
23
0
03 Oct 2018
Energy-Based Hindsight Experience Prioritization
Energy-Based Hindsight Experience Prioritization
Rui Zhao
Volker Tresp
173
74
0
02 Oct 2018
Injective State-Image Mapping facilitates Visual Adversarial Imitation
  Learning
Injective State-Image Mapping facilitates Visual Adversarial Imitation Learning
Subhajit Chaudhury
Daiki Kimura
Asim Munawar
Ryuki Tachibana
GANVGen
42
3
0
02 Oct 2018
Reinforcement Learning with Perturbed Rewards
Reinforcement Learning with Perturbed Rewards
Jingkang Wang
Yang Liu
Yue Liu
NoLa
95
131
0
02 Oct 2018
Deep Quality-Value (DQV) Learning
Deep Quality-Value (DQV) Learning
M. Sabatelli
Gilles Louppe
Pierre Geurts
M. Wiering
OffRL
50
16
0
30 Sep 2018
Using State Predictions for Value Regularization in Curiosity Driven
  Deep Reinforcement Learning
Using State Predictions for Value Regularization in Curiosity Driven Deep Reinforcement Learning
Gino Brunner
Manuel Fritsche
Oliver Richter
Roger Wattenhofer
48
7
0
30 Sep 2018
Directed-Info GAIL: Learning Hierarchical Policies from Unsegmented
  Demonstrations using Directed Information
Directed-Info GAIL: Learning Hierarchical Policies from Unsegmented Demonstrations using Directed Information
Arjun Sharma
Mohit Sharma
Nicholas Rhinehart
Kris Kitani
84
68
0
29 Sep 2018
Learning to Coordinate Multiple Reinforcement Learning Agents for
  Diverse Query Reformulation
Learning to Coordinate Multiple Reinforcement Learning Agents for Diverse Query Reformulation
Rodrigo Nogueira
Jannis Bulian
Massimiliano Ciaramita
54
11
0
27 Sep 2018
Boosting Trust Region Policy Optimization by Normalizing Flows Policy
Boosting Trust Region Policy Optimization by Normalizing Flows Policy
Yunhao Tang
Shipra Agrawal
TPM
110
30
0
27 Sep 2018
Previous
123...464748...505152
Next