ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1606.01540
  4. Cited By
OpenAI Gym

OpenAI Gym

5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
    OffRLODL
ArXiv (abs)PDFHTML

Papers citing "OpenAI Gym"

50 / 2,578 papers shown
Title
Reinforcement Learning with Competitive Ensembles of
  Information-Constrained Primitives
Reinforcement Learning with Competitive Ensembles of Information-Constrained Primitives
Anirudh Goyal
Shagun Sodhani
Jonathan Binas
Xue Bin Peng
Sergey Levine
Yoshua Bengio
91
49
0
25 Jun 2019
Sequential Neural Processes
Sequential Neural Processes
Gautam Singh
Jaesik Yoon
Youngsung Son
Sungjin Ahn
BDLAI4TS
115
85
0
24 Jun 2019
Modern Deep Reinforcement Learning Algorithms
Modern Deep Reinforcement Learning Algorithms
Sergey Ivanov
A. Dýakonov
OffRL
61
39
0
24 Jun 2019
Proximal Distilled Evolutionary Reinforcement Learning
Proximal Distilled Evolutionary Reinforcement Learning
Cristian Bodnar
Ben Day
Pietro Lio
101
76
0
24 Jun 2019
Boosting for Control of Dynamical Systems
Boosting for Control of Dynamical Systems
Naman Agarwal
Nataly Brukhim
Elad Hazan
Zhou Lu
56
13
0
20 Jun 2019
Experience Replay Optimization
Experience Replay Optimization
Daochen Zha
Kwei-Herng Lai
Kaixiong Zhou
Helen Zhou
OffRL
59
105
0
19 Jun 2019
Calibrated Model-Based Deep Reinforcement Learning
Calibrated Model-Based Deep Reinforcement Learning
Ali Malik
Volodymyr Kuleshov
Jiaming Song
Danny Nemer
Harlan Seymour
Stefano Ermon
159
55
0
19 Jun 2019
Wasserstein Adversarial Imitation Learning
Wasserstein Adversarial Imitation Learning
Huang Xiao
Michael Herman
Joerg Wagner
Sebastian Ziesche
Jalal Etesami
T. H. Linh
43
72
0
19 Jun 2019
RadGrad: Active learning with loss gradients
RadGrad: Active learning with loss gradients
Paul Budnarain
Renato Ferreira Pinto Junior
Ilan Kogan
107
3
0
18 Jun 2019
Directed Exploration for Reinforcement Learning
Directed Exploration for Reinforcement Learning
Z. Guo
Emma Brunskill
71
12
0
18 Jun 2019
Hill Climbing on Value Estimates for Search-control in Dyna
Hill Climbing on Value Estimates for Search-control in Dyna
Yangchen Pan
Hengshuai Yao
Amir-massoud Farahmand
Martha White
87
18
0
18 Jun 2019
Gap-Increasing Policy Evaluation for Efficient and Noise-Tolerant
  Reinforcement Learning
Gap-Increasing Policy Evaluation for Efficient and Noise-Tolerant Reinforcement Learning
Tadashi Kozuno
Dongqi Han
Kenji Doya
OffRL
43
2
0
18 Jun 2019
Weight Agnostic Neural Networks
Weight Agnostic Neural Networks
Adam Gaier
David R Ha
OOD
67
242
0
11 Jun 2019
Learning to Score Behaviors for Guided Policy Optimization
Learning to Score Behaviors for Guided Policy Optimization
Aldo Pacchiano
Jack Parker-Holder
Yunhao Tang
A. Choromańska
K. Choromanski
Michael I. Jordan
91
39
0
11 Jun 2019
Exploration via Hindsight Goal Generation
Exploration via Hindsight Goal Generation
Zhizhou Ren
Kefan Dong
Yuanshuo Zhou
Qiang Liu
Jian-wei Peng
85
90
0
10 Jun 2019
Boosting Soft Actor-Critic: Emphasizing Recent Experience without
  Forgetting the Past
Boosting Soft Actor-Critic: Emphasizing Recent Experience without Forgetting the Past
Che Wang
George Andriopoulos
51
45
0
10 Jun 2019
Data-Efficient and Safe Learning for Humanoid Locomotion Aided by a
  Dynamic Balancing Model
Data-Efficient and Safe Learning for Humanoid Locomotion Aided by a Dynamic Balancing Model
Junhyeok Ahn
Jaemin Lee
Luis Sentis
55
15
0
10 Jun 2019
Neural Heterogeneous Scheduler
Neural Heterogeneous Scheduler
Tegg Taekyong Sung
Valliappa Chockalingam
Alex Yahja
Bo Ryu
18
3
0
09 Jun 2019
An Extensible Interactive Interface for Agent Design
An Extensible Interactive Interface for Agent Design
Matthew Rahtz
James Fang
Anca Dragan
Dylan Hadfield-Menell
60
1
0
06 Jun 2019
A Survey of Behavior Learning Applications in Robotics -- State of the
  Art and Perspectives
A Survey of Behavior Learning Applications in Robotics -- State of the Art and Perspectives
Alexander Fabisch
Christoph Petzoldt
M. Otto
Frank Kirchner
AI4CE
54
13
0
05 Jun 2019
Neuropathic Pain Diagnosis Simulator for Causal Discovery Algorithm
  Evaluation
Neuropathic Pain Diagnosis Simulator for Causal Discovery Algorithm Evaluation
Ruibo Tu
Kun Zhang
Bo Christer Bertilson
Hedvig Kjellström
Cheng Zhang
OODCML
95
43
0
04 Jun 2019
Reinforcement Learning with Low-Complexity Liquid State Machines
Reinforcement Learning with Low-Complexity Liquid State Machines
Wachirawit Ponghiran
G. Srinivasan
Kaushik Roy
47
14
0
04 Jun 2019
Lattice Map Spiking Neural Networks (LM-SNNs) for Clustering and
  Classifying Image Data
Lattice Map Spiking Neural Networks (LM-SNNs) for Clustering and Classifying Image Data
Hananel Hazan
D. J. Saunders
Darpan T. Sanghavi
H. Siegelmann
R. Kozma
61
17
0
04 Jun 2019
Off-Policy Evaluation via Off-Policy Classification
Off-Policy Evaluation via Off-Policy Classification
A. Irpan
Kanishka Rao
Konstantinos Bousmalis
Chris Harris
Julian Ibarz
Sergey Levine
OffRL
83
50
0
04 Jun 2019
Hamiltonian Neural Networks
Hamiltonian Neural Networks
S. Greydanus
Misko Dzamba
J. Yosinski
PINNAI4CE
143
900
0
04 Jun 2019
Analysis and Improvement of Adversarial Training in DQN Agents With
  Adversarially-Guided Exploration (AGE)
Analysis and Improvement of Adversarial Training in DQN Agents With Adversarially-Guided Exploration (AGE)
Vahid Behzadan
W. Hsu
AAML
64
8
0
03 Jun 2019
RL-Based Method for Benchmarking the Adversarial Resilience and
  Robustness of Deep Reinforcement Learning Policies
RL-Based Method for Benchmarking the Adversarial Resilience and Robustness of Deep Reinforcement Learning Policies
Vahid Behzadan
W. Hsu
AAMLOffRL
66
9
0
03 Jun 2019
Air Learning: A Deep Reinforcement Learning Gym for Autonomous Aerial
  Robot Visual Navigation
Air Learning: A Deep Reinforcement Learning Gym for Autonomous Aerial Robot Visual Navigation
Srivatsan Krishnan
Behzad Boroujerdian
William Fu
Aleksandra Faust
Vijay Janapa Reddi
75
37
0
02 Jun 2019
Attentional Policies for Cross-Context Multi-Agent Reinforcement
  Learning
Attentional Policies for Cross-Context Multi-Agent Reinforcement Learning
Matthew A. Wright
R. Horowitz
52
3
0
31 May 2019
Defining Admissible Rewards for High Confidence Policy Evaluation
Defining Admissible Rewards for High Confidence Policy Evaluation
Niranjani Prasad
Barbara E. Engelhardt
Finale Doshi-Velez
OffRL
53
6
0
30 May 2019
Towards Finding Longer Proofs
Towards Finding Longer Proofs
Zsolt Zombori
Adrián Csiszárik
Henryk Michalewski
C. Kaliszyk
Josef Urban
OffRLLRM
83
16
0
30 May 2019
Structured Monte Carlo Sampling for Nonisotropic Distributions via
  Determinantal Point Processes
Structured Monte Carlo Sampling for Nonisotropic Distributions via Determinantal Point Processes
K. Choromanski
Aldo Pacchiano
Jack Parker-Holder
Yunhao Tang
77
3
0
29 May 2019
Adversarial Imitation Learning from Incomplete Demonstrations
Adversarial Imitation Learning from Incomplete Demonstrations
Mingfei Sun
Xiaojuan Ma
78
29
0
29 May 2019
Linear interpolation gives better gradients than Gaussian smoothing in
  derivative-free optimization
Linear interpolation gives better gradients than Gaussian smoothing in derivative-free optimization
A. Berahas
Liyuan Cao
K. Choromanski
K. Scheinberg
136
19
0
29 May 2019
A General Markov Decision Process Framework for Directly Learning
  Optimal Control Policies
A General Markov Decision Process Framework for Directly Learning Optimal Control Policies
Yingdong Lu
M. Squillante
C. Wu
AI4CE
8
2
0
28 May 2019
Hypothesis-Driven Skill Discovery for Hierarchical Deep Reinforcement
  Learning
Hypothesis-Driven Skill Discovery for Hierarchical Deep Reinforcement Learning
Caleb Chuck
Supawit Chockchowwat
S. Niekum
67
14
0
27 May 2019
Physics-as-Inverse-Graphics: Unsupervised Physical Parameter Estimation
  from Video
Physics-as-Inverse-Graphics: Unsupervised Physical Parameter Estimation from Video
Miguel Jaques
Michael G. Burke
Timothy M. Hospedales
VGenPINN
126
45
0
27 May 2019
SQIL: Imitation Learning via Reinforcement Learning with Sparse Rewards
SQIL: Imitation Learning via Reinforcement Learning with Sparse Rewards
S. Reddy
Anca Dragan
Sergey Levine
OffRL
69
52
0
27 May 2019
Disentangling Dynamics and Returns: Value Function Decomposition with
  Future Prediction
Disentangling Dynamics and Returns: Value Function Decomposition with Future Prediction
Hongyao Tang
Jianye Hao
Guangyong Chen
Pengfei Chen
Zhaopeng Meng
Yaodong Yang
Li Wang
33
2
0
27 May 2019
Policy Search by Target Distribution Learning for Continuous Control
Policy Search by Target Distribution Learning for Continuous Control
Wei Shen
Yuanqi Li
Jian Li
60
6
0
27 May 2019
Explainable Reinforcement Learning Through a Causal Lens
Explainable Reinforcement Learning Through a Causal Lens
Prashan Madumal
Tim Miller
L. Sonenberg
F. Vetere
CML
116
362
0
27 May 2019
Provably Efficient Imitation Learning from Observation Alone
Provably Efficient Imitation Learning from Observation Alone
Wen Sun
Anirudh Vemula
Byron Boots
J. Andrew Bagnell
167
107
0
27 May 2019
Operation and Imitation under Safety-Aware Shared Control
Operation and Imitation under Safety-Aware Shared Control
Alexander Broad
Todd Murphey
B. Argall
67
10
0
26 May 2019
Prioritized Sequence Experience Replay
Prioritized Sequence Experience Replay
Marc Brittain
J. R. Bertram
Xuxi Yang
Peng Wei
73
45
0
25 May 2019
MCP: Learning Composable Hierarchical Control with Multiplicative
  Compositional Policies
MCP: Learning Composable Hierarchical Control with Multiplicative Compositional Policies
Xue Bin Peng
Michael Chang
Grace Zhang
Pieter Abbeel
Sergey Levine
83
197
0
23 May 2019
Deep Q-Learning with Q-Matrix Transfer Learning for Novel Fire
  Evacuation Environment
Deep Q-Learning with Q-Matrix Transfer Learning for Novel Fire Evacuation Environment
Jivitesh Sharma
Per-Arne Andersen
Ole-Christoffer Granmo
M. G. Olsen
AI4CE
72
70
0
23 May 2019
Combine PPO with NES to Improve Exploration
Combine PPO with NES to Improve Exploration
Lianjiang Li
Yunrong Yang
Bingna Li
13
1
0
23 May 2019
Learning Robust Options by Conditional Value at Risk Optimization
Learning Robust Options by Conditional Value at Risk Optimization
Takuya Hiraoka
Takahisa Imagawa
Tatsuya Mori
Takashi Onishi
Yoshimasa Tsuruoka
80
27
0
22 May 2019
Deep Signature Transforms
Deep Signature Transforms
Patrick Kidger
Patrick Kidger
Imanol Perez Arribas
C. Salvi
Terry Lyons
SLR
163
131
0
21 May 2019
Evolving Rewards to Automate Reinforcement Learning
Evolving Rewards to Automate Reinforcement Learning
Aleksandra Faust
Anthony G. Francis
Dar Mehta
77
51
0
18 May 2019
Previous
123...434445...505152
Next