ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1606.01540
  4. Cited By
OpenAI Gym

OpenAI Gym

5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
    OffRLODL
ArXiv (abs)PDFHTML

Papers citing "OpenAI Gym"

50 / 2,578 papers shown
Title
The Temporal Singularity: time-accelerated simulated civilizations and
  their implications
The Temporal Singularity: time-accelerated simulated civilizations and their implications
G. Spigler
3DGSAI4CE
19
1
0
22 Jun 2018
How Many Random Seeds? Statistical Power Analysis in Deep Reinforcement
  Learning Experiments
How Many Random Seeds? Statistical Power Analysis in Deep Reinforcement Learning Experiments
Cédric Colas
Olivier Sigaud
Pierre-Yves Oudeyer
63
93
0
21 Jun 2018
A Dissection of Overfitting and Generalization in Continuous
  Reinforcement Learning
A Dissection of Overfitting and Generalization in Continuous Reinforcement Learning
Amy Zhang
Nicolas Ballas
Joelle Pineau
CLLOffRL
110
180
0
20 Jun 2018
RUDDER: Return Decomposition for Delayed Rewards
RUDDER: Return Decomposition for Delayed Rewards
Jose A. Arjona-Medina
Michael Gillhofer
Michael Widrich
Thomas Unterthiner
Johannes Brandstetter
Sepp Hochreiter
130
222
0
20 Jun 2018
Sim-to-Real Reinforcement Learning for Deformable Object Manipulation
Sim-to-Real Reinforcement Learning for Deformable Object Manipulation
J. Matas
Stephen James
Andrew J. Davison
AI4CE
77
361
0
20 Jun 2018
VirtualHome: Simulating Household Activities via Programs
VirtualHome: Simulating Household Activities via Programs
Xavier Puig
K. Ra
Marko Boben
Jiaman Li
Tingwu Wang
Sanja Fidler
Antonio Torralba
LM&Ro
118
503
0
19 Jun 2018
Laplacian Smoothing Gradient Descent
Laplacian Smoothing Gradient Descent
Stanley Osher
Bao Wang
Penghang Yin
Xiyang Luo
Farzin Barekat
Minh Pham
A. Lin
ODL
113
43
0
17 Jun 2018
Surprising Negative Results for Generative Adversarial Tree Search
Surprising Negative Results for Generative Adversarial Tree Search
Kamyar Azizzadenesheli
Brandon Yang
Weitang Liu
Zachary Chase Lipton
Anima Anandkumar
91
13
0
15 Jun 2018
Self-Imitation Learning
Self-Imitation Learning
Junhyuk Oh
Yijie Guo
Satinder Singh
Honglak Lee
SSL
85
251
0
14 Jun 2018
Qualitative Measurements of Policy Discrepancy for Return-Based Deep
  Q-Network
Qualitative Measurements of Policy Discrepancy for Return-Based Deep Q-Network
Wenjia Meng
Qian Zheng
L. Yang
Pengfei Li
Gang Pan
45
21
0
14 Jun 2018
Accelerating Imitation Learning with Predictive Models
Accelerating Imitation Learning with Predictive Models
Ching-An Cheng
Xinyan Yan
Evangelos A. Theodorou
Byron Boots
90
21
0
12 Jun 2018
Continuous-time Value Function Approximation in Reproducing Kernel
  Hilbert Spaces
Continuous-time Value Function Approximation in Reproducing Kernel Hilbert Spaces
Motoya Ohnishi
M. Yukawa
M. Johansson
Masashi Sugiyama
45
3
0
08 Jun 2018
Re-evaluating Evaluation
Re-evaluating Evaluation
David Balduzzi
K. Tuyls
Julien Perolat
T. Graepel
MoMe
89
101
0
07 Jun 2018
Graph Convolutional Policy Network for Goal-Directed Molecular Graph
  Generation
Graph Convolutional Policy Network for Goal-Directed Molecular Graph Generation
Jiaxuan You
Bowen Liu
Rex Ying
Vijay S. Pande
J. Leskovec
GNN
347
905
0
07 Jun 2018
Deep Reinforcement Learning for General Video Game AI
Deep Reinforcement Learning for General Video Game AI
R. Torrado
Philip Bontrager
Julian Togelius
Jialin Liu
Diego Perez-Liebana
82
131
0
06 Jun 2018
Randomized Value Functions via Multiplicative Normalizing Flows
Randomized Value Functions via Multiplicative Normalizing Flows
Ahmed Touati
Harsh Satija
Joshua Romoff
Joelle Pineau
Pascal Vincent
55
36
0
06 Jun 2018
Human-like generalization in a machine through predicate learning
Human-like generalization in a machine through predicate learning
L. Doumas
Guillermo Puebla
Andrea E. Martin
NAI
49
9
0
05 Jun 2018
Boredom-driven curious learning by Homeo-Heterostatic Value Gradients
Boredom-driven curious learning by Homeo-Heterostatic Value Gradients
Yen Yu
A. Chang
Ryota Kanai
30
9
0
05 Jun 2018
BindsNET: A machine learning-oriented spiking neural networks library in
  Python
BindsNET: A machine learning-oriented spiking neural networks library in Python
Hananel Hazan
D. J. Saunders
Hassaan Khan
Darpan T. Sanghavi
H. Siegelmann
R. Kozma
AI4CE
81
234
0
04 Jun 2018
Mitigation of Policy Manipulation Attacks on Deep Q-Networks with
  Parameter-Space Noise
Mitigation of Policy Manipulation Attacks on Deep Q-Networks with Parameter-Space Noise
Vahid Behzadan
Arslan Munir
AAML
74
21
0
04 Jun 2018
Playing Atari with Six Neurons
Playing Atari with Six Neurons
Giuseppe Cuccu
Julian Togelius
Philippe Cudré-Mauroux
165
43
0
04 Jun 2018
Challenges in High-dimensional Reinforcement Learning with Evolution
  Strategies
Challenges in High-dimensional Reinforcement Learning with Evolution Strategies
Nils Müller
Tobias Glasmachers
61
28
0
04 Jun 2018
DAQN: Deep Auto-encoder and Q-Network
DAQN: Deep Auto-encoder and Q-Network
Daiki Kimura
50
18
0
02 Jun 2018
Multi-vehicle Flocking Control with Deep Deterministic Policy Gradient
  Method
Multi-vehicle Flocking Control with Deep Deterministic Policy Gradient Method
Yang Lyu
Quan Pan
Jin-wen Hu
Chunhui Zhao
Shuai Liu
108
33
0
01 Jun 2018
Sequential Attacks on Agents for Long-Term Adversarial Goals
Sequential Attacks on Agents for Long-Term Adversarial Goals
E. Tretschk
Seong Joon Oh
Mario Fritz
OnRL
399
48
1
31 May 2018
Supervised Policy Update for Deep Reinforcement Learning
Supervised Policy Update for Deep Reinforcement Learning
Q. Vuong
Yiming Zhang
George Andriopoulos
77
20
0
29 May 2018
Truncated Horizon Policy Search: Combining Reinforcement Learning &
  Imitation Learning
Truncated Horizon Policy Search: Combining Reinforcement Learning & Imitation Learning
Wen Sun
J. Andrew Bagnell
Byron Boots
128
94
0
29 May 2018
Reward Constrained Policy Optimization
Reward Constrained Policy Optimization
Chen Tessler
D. Mankowitz
Shie Mannor
97
546
0
28 May 2018
Fingerprint Policy Optimisation for Robust Reinforcement Learning
Fingerprint Policy Optimisation for Robust Reinforcement Learning
Supratik Paul
Michael A. Osborne
Shimon Whiteson
102
18
0
27 May 2018
Fast Policy Learning through Imitation and Reinforcement
Fast Policy Learning through Imitation and Reinforcement
Ching-An Cheng
Xinyan Yan
Nolan Wagener
Byron Boots
77
84
0
26 May 2018
A0C: Alpha Zero in Continuous Action Space
A0C: Alpha Zero in Continuous Action Space
Thomas M. Moerland
Joost Broekens
Aske Plaat
Catholijn M. Jonker
95
49
0
24 May 2018
Intelligent Trainer for Model-Based Reinforcement Learning
Intelligent Trainer for Model-Based Reinforcement Learning
Yuanlong Li
Linsen Dong
Xin Zhou
Yonggang Wen
K. Guan
OffRL
43
0
0
24 May 2018
Representation Balancing MDPs for Off-Policy Policy Evaluation
Representation Balancing MDPs for Off-Policy Policy Evaluation
Yao Liu
Omer Gottesman
Aniruddh Raghu
Matthieu Komorowski
A. Faisal
Finale Doshi-Velez
Emma Brunskill
OffRL
70
75
0
23 May 2018
A General Family of Robust Stochastic Operators for Reinforcement
  Learning
A General Family of Robust Stochastic Operators for Reinforcement Learning
Yingdong Lu
M. Squillante
C. Wu
44
3
0
21 May 2018
Where Do You Think You're Going?: Inferring Beliefs about Dynamics from
  Behavior
Where Do You Think You're Going?: Inferring Beliefs about Dynamics from Behavior
S. Reddy
Anca Dragan
Sergey Levine
89
105
0
21 May 2018
Evolution-Guided Policy Gradient in Reinforcement Learning
Evolution-Guided Policy Gradient in Reinforcement Learning
Shauharda Khadka
Kagan Tumer
132
232
0
21 May 2018
Two geometric input transformation methods for fast online reinforcement
  learning with neural nets
Two geometric input transformation methods for fast online reinforcement learning with neural nets
Sina Ghiassian
Huizhen Yu
Banafsheh Rafiee
R. Sutton
OffRL
62
10
0
18 May 2018
Leveraging human knowledge in tabular reinforcement learning: A study of
  human subjects
Leveraging human knowledge in tabular reinforcement learning: A study of human subjects
Ariel Rosenfeld
Moshe Cohen
Matthew E. Taylor
Sarit Kraus
OffRL
38
34
0
15 May 2018
GAN Q-learning
GAN Q-learning
T. Doan
Bogdan Mazoure
Clare Lyle
OODOffRL
64
19
0
13 May 2018
Task Transfer by Preference-Based Cost Learning
Task Transfer by Preference-Based Cost Learning
Chi-Hung Hsu
Shu-Huan Chang
Da-Cheng Juan
Yu-Ting Chen
Shih-Chieh Chang
79
54
0
12 May 2018
Interactive Reinforcement Learning with Dynamic Reuse of Prior Knowledge
  from Human/Agent's Demonstration
Interactive Reinforcement Learning with Dynamic Reuse of Prior Knowledge from Human/Agent's Demonstration
Zhaodong Wang
Matthew E. Taylor
OnRL
14
5
0
11 May 2018
Deep Hierarchical Reinforcement Learning Algorithm in Partially
  Observable Markov Decision Processes
Deep Hierarchical Reinforcement Learning Algorithm in Partially Observable Markov Decision Processes
T. P. Le
Ngo Anh Vien
Abu Layek
TaeChoong Chung
53
52
0
11 May 2018
Reward Estimation for Variance Reduction in Deep Reinforcement Learning
Reward Estimation for Variance Reduction in Deep Reinforcement Learning
Joshua Romoff
Peter Henderson
Alexandre Piché
Vincent François-Lavet
Joelle Pineau
96
42
0
09 May 2018
Deep Reinforcement Learning for Playing 2.5D Fighting Games
Deep Reinforcement Learning for Playing 2.5D Fighting Games
Yu-Jhe Li
Hsin-Yu Chang
Yu-Jing Lin
Po-Wei Wu
Y. Wang
GAN
28
5
0
05 May 2018
Behavioral Cloning from Observation
Behavioral Cloning from Observation
F. Torabi
Garrett A. Warnell
Peter Stone
OffRL
143
732
0
04 May 2018
Exploration by Distributional Reinforcement Learning
Exploration by Distributional Reinforcement Learning
Yunhao Tang
Shipra Agrawal
OOD
86
31
0
04 May 2018
VINE: An Open Source Interactive Data Visualization Tool for
  Neuroevolution
VINE: An Open Source Interactive Data Visualization Tool for Neuroevolution
Rui Wang
Jeff Clune
Kenneth O. Stanley
30
7
0
03 May 2018
Measuring the Intrinsic Dimension of Objective Landscapes
Measuring the Intrinsic Dimension of Objective Landscapes
Chunyuan Li
Heerad Farkhoor
Rosanne Liu
J. Yosinski
93
417
0
24 Apr 2018
Benchmarking projective simulation in navigation problems
Benchmarking projective simulation in navigation problems
A. Melnikov
A. Makmal
Hans J. Briegel
59
19
0
23 Apr 2018
State Distribution-aware Sampling for Deep Q-learning
State Distribution-aware Sampling for Deep Q-learning
Weichao Li
Fuxian Huang
Xi Li
G. Pan
Leilei Gan
TTA
46
4
0
23 Apr 2018
Previous
123...4849505152
Next