ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1606.01540
  4. Cited By
OpenAI Gym

OpenAI Gym

5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
    OffRL
    ODL
ArXivPDFHTML

Papers citing "OpenAI Gym"

50 / 1,654 papers shown
Title
Model-Based Reinforcement Learning via Meta-Policy Optimization
Model-Based Reinforcement Learning via Meta-Policy Optimization
I. Clavera
Jonas Rothfuss
John Schulman
Yasuhiro Fujita
Tamim Asfour
Pieter Abbeel
30
225
0
14 Sep 2018
VPE: Variational Policy Embedding for Transfer Reinforcement Learning
VPE: Variational Policy Embedding for Transfer Reinforcement Learning
Isac Arnekvist
Danica Kragic
J. A. Stork
OffRL
25
37
0
10 Sep 2018
Unity: A General Platform for Intelligent Agents
Unity: A General Platform for Intelligent Agents
Arthur Juliani
Vincent-Pierre Berges
Esh Vckay
Andrew Cohen
Jonathan Harper
...
Chris Goy
Yuan Gao
Hunter Henry
Marwan Mattar
Danny Lange
39
808
0
07 Sep 2018
Recurrent World Models Facilitate Policy Evolution
Recurrent World Models Facilitate Policy Evolution
David R Ha
Jürgen Schmidhuber
SyDa
TPM
52
920
0
04 Sep 2018
Texar: A Modularized, Versatile, and Extensible Toolkit for Text
  Generation
Texar: A Modularized, Versatile, and Extensible Toolkit for Text Generation
Zhiting Hu
Haoran Shi
Bowen Tan
Wentao Wang
Zichao Yang
...
Zhengzhong Liu
Xiaodan Liang
Wangrong Zhu
Devendra Singh Sachan
Eric Xing
VLM
25
56
0
04 Sep 2018
SOLAR: Deep Structured Representations for Model-Based Reinforcement
  Learning
SOLAR: Deep Structured Representations for Model-Based Reinforcement Learning
Marvin Zhang
Sharad Vikram
Laura M. Smith
Pieter Abbeel
Matthew J. Johnson
Sergey Levine
OffRL
23
41
0
28 Aug 2018
SOTER: A Runtime Assurance Framework for Programming Safe Robotics
  Systems
SOTER: A Runtime Assurance Framework for Programming Safe Robotics Systems
Ankush Desai
Shromona Ghosh
S. Seshia
N. Shankar
A. Tiwari
17
12
0
23 Aug 2018
GeneSys: Enabling Continuous Learning through Neural Network Evolution
  in Hardware
GeneSys: Enabling Continuous Learning through Neural Network Evolution in Hardware
A. Samajdar
Parth Mannan
K. Garg
T. Krishna
32
20
0
03 Aug 2018
Learning Actionable Representations from Visual Observations
Learning Actionable Representations from Visual Observations
Debidatta Dwibedi
Jonathan Tompson
Corey Lynch
P. Sermanet
SSL
22
80
0
02 Aug 2018
Learning Dexterous In-Hand Manipulation
Learning Dexterous In-Hand Manipulation
OpenAI OpenAI
Marcin Andrychowicz
Bowen Baker
Maciek Chociej
Rafal Jozefowicz
...
Szymon Sidor
Joshua Tobin
Peter Welinder
Lilian Weng
Wojciech Zaremba
47
1,857
0
01 Aug 2018
ToriLLE: Learning Environment for Hand-to-Hand Combat
ToriLLE: Learning Environment for Hand-to-Hand Combat
Anssi Kanervisto
Ville Hautamaki
26
2
0
26 Jul 2018
Meta-Learning Priors for Efficient Online Bayesian Regression
Meta-Learning Priors for Efficient Online Bayesian Regression
James Harrison
Apoorva Sharma
Marco Pavone
BDL
24
99
0
24 Jul 2018
FuzzerGym: A Competitive Framework for Fuzzing and Learning
FuzzerGym: A Competitive Framework for Fuzzing and Learning
W. Drozd
Michael D. Wagner
33
32
0
19 Jul 2018
Online Robust Policy Learning in the Presence of Unknown Adversaries
Online Robust Policy Learning in the Presence of Unknown Adversaries
Aaron J. Havens
Zhanhong Jiang
S. Sarkar
AAML
18
43
0
16 Jul 2018
Toward Interpretable Deep Reinforcement Learning with Linear Model
  U-Trees
Toward Interpretable Deep Reinforcement Learning with Linear Model U-Trees
Guiliang Liu
Oliver Schulte
Wang Zhu
Qingcan Li
AI4CE
17
135
0
16 Jul 2018
Variance Reduction for Reinforcement Learning in Input-Driven
  Environments
Variance Reduction for Reinforcement Learning in Input-Driven Environments
Hongzi Mao
S. Venkatakrishnan
Malte Schwarzkopf
Mohammad Alizadeh
OffRL
41
95
0
06 Jul 2018
A Dissection of Overfitting and Generalization in Continuous
  Reinforcement Learning
A Dissection of Overfitting and Generalization in Continuous Reinforcement Learning
Amy Zhang
Nicolas Ballas
Joelle Pineau
CLL
OffRL
33
177
0
20 Jun 2018
RUDDER: Return Decomposition for Delayed Rewards
RUDDER: Return Decomposition for Delayed Rewards
Jose A. Arjona-Medina
Michael Gillhofer
Michael Widrich
Thomas Unterthiner
Johannes Brandstetter
Sepp Hochreiter
30
213
0
20 Jun 2018
Sim-to-Real Reinforcement Learning for Deformable Object Manipulation
Sim-to-Real Reinforcement Learning for Deformable Object Manipulation
J. Matas
Stephen James
Andrew J. Davison
AI4CE
29
357
0
20 Jun 2018
VirtualHome: Simulating Household Activities via Programs
VirtualHome: Simulating Household Activities via Programs
Xavier Puig
K. Ra
Marko Boben
Jiaman Li
Tingwu Wang
Sanja Fidler
Antonio Torralba
LM&Ro
30
478
0
19 Jun 2018
Laplacian Smoothing Gradient Descent
Laplacian Smoothing Gradient Descent
Stanley Osher
Bao Wang
Penghang Yin
Xiyang Luo
Farzin Barekat
Minh Pham
A. Lin
ODL
22
43
0
17 Jun 2018
Qualitative Measurements of Policy Discrepancy for Return-Based Deep
  Q-Network
Qualitative Measurements of Policy Discrepancy for Return-Based Deep Q-Network
Wenjia Meng
Qian Zheng
L. Yang
Pengfei Li
Gang Pan
20
21
0
14 Jun 2018
Accelerating Imitation Learning with Predictive Models
Accelerating Imitation Learning with Predictive Models
Ching-An Cheng
Xinyan Yan
Evangelos A. Theodorou
Byron Boots
32
21
0
12 Jun 2018
Re-evaluating Evaluation
Re-evaluating Evaluation
David Balduzzi
K. Tuyls
Julien Perolat
T. Graepel
MoMe
30
97
0
07 Jun 2018
Graph Convolutional Policy Network for Goal-Directed Molecular Graph
  Generation
Graph Convolutional Policy Network for Goal-Directed Molecular Graph Generation
Jiaxuan You
Bowen Liu
Rex Ying
Vijay S. Pande
J. Leskovec
GNN
215
887
0
07 Jun 2018
Human-like generalization in a machine through predicate learning
Human-like generalization in a machine through predicate learning
L. Doumas
Guillermo Puebla
Andrea E. Martin
NAI
33
9
0
05 Jun 2018
BindsNET: A machine learning-oriented spiking neural networks library in
  Python
BindsNET: A machine learning-oriented spiking neural networks library in Python
Hananel Hazan
D. J. Saunders
Hassaan Khan
Darpan T. Sanghavi
H. Siegelmann
R. Kozma
AI4CE
30
229
0
04 Jun 2018
Challenges in High-dimensional Reinforcement Learning with Evolution
  Strategies
Challenges in High-dimensional Reinforcement Learning with Evolution Strategies
Nils Müller
Tobias Glasmachers
33
28
0
04 Jun 2018
Sequential Attacks on Agents for Long-Term Adversarial Goals
Sequential Attacks on Agents for Long-Term Adversarial Goals
E. Tretschk
Seong Joon Oh
Mario Fritz
OnRL
329
47
1
31 May 2018
Supervised Policy Update for Deep Reinforcement Learning
Supervised Policy Update for Deep Reinforcement Learning
Q. Vuong
Yiming Zhang
Keith Ross
19
20
0
29 May 2018
Truncated Horizon Policy Search: Combining Reinforcement Learning &
  Imitation Learning
Truncated Horizon Policy Search: Combining Reinforcement Learning & Imitation Learning
Wen Sun
J. Andrew Bagnell
Byron Boots
28
93
0
29 May 2018
Fingerprint Policy Optimisation for Robust Reinforcement Learning
Fingerprint Policy Optimisation for Robust Reinforcement Learning
Supratik Paul
Michael A. Osborne
Shimon Whiteson
32
18
0
27 May 2018
Fast Policy Learning through Imitation and Reinforcement
Fast Policy Learning through Imitation and Reinforcement
Ching-An Cheng
Xinyan Yan
Nolan Wagener
Byron Boots
26
83
0
26 May 2018
A0C: Alpha Zero in Continuous Action Space
A0C: Alpha Zero in Continuous Action Space
Thomas M. Moerland
Joost Broekens
Aske Plaat
Catholijn M. Jonker
16
48
0
24 May 2018
Intelligent Trainer for Model-Based Reinforcement Learning
Intelligent Trainer for Model-Based Reinforcement Learning
Yuanlong Li
Linsen Dong
Xin Zhou
Yonggang Wen
K. Guan
OffRL
24
0
0
24 May 2018
Representation Balancing MDPs for Off-Policy Policy Evaluation
Representation Balancing MDPs for Off-Policy Policy Evaluation
Yao Liu
Omer Gottesman
Aniruddh Raghu
Matthieu Komorowski
A. Faisal
Finale Doshi-Velez
Emma Brunskill
OffRL
11
75
0
23 May 2018
A General Family of Robust Stochastic Operators for Reinforcement
  Learning
A General Family of Robust Stochastic Operators for Reinforcement Learning
Yingdong Lu
M. Squillante
C. Wu
12
3
0
21 May 2018
Leveraging human knowledge in tabular reinforcement learning: A study of
  human subjects
Leveraging human knowledge in tabular reinforcement learning: A study of human subjects
Ariel Rosenfeld
Moshe Cohen
Matthew E. Taylor
Sarit Kraus
OffRL
6
31
0
15 May 2018
Deep Hierarchical Reinforcement Learning Algorithm in Partially
  Observable Markov Decision Processes
Deep Hierarchical Reinforcement Learning Algorithm in Partially Observable Markov Decision Processes
T. P. Le
Ngo Anh Vien
Abu Layek
TaeChoong Chung
25
51
0
11 May 2018
Reward Estimation for Variance Reduction in Deep Reinforcement Learning
Reward Estimation for Variance Reduction in Deep Reinforcement Learning
Joshua Romoff
Peter Henderson
Alexandre Piché
Vincent François-Lavet
Joelle Pineau
11
42
0
09 May 2018
Behavioral Cloning from Observation
Behavioral Cloning from Observation
F. Torabi
Garrett A. Warnell
Peter Stone
OffRL
46
710
0
04 May 2018
Exploration by Distributional Reinforcement Learning
Exploration by Distributional Reinforcement Learning
Yunhao Tang
Shipra Agrawal
OOD
41
30
0
04 May 2018
Measuring the Intrinsic Dimension of Objective Landscapes
Measuring the Intrinsic Dimension of Objective Landscapes
Chunyuan Li
Heerad Farkhoor
Rosanne Liu
J. Yosinski
38
401
0
24 Apr 2018
Benchmarking projective simulation in navigation problems
Benchmarking projective simulation in navigation problems
A. Melnikov
A. Makmal
H. Briegel
16
19
0
23 Apr 2018
Gotta Learn Fast: A New Benchmark for Generalization in RL
Gotta Learn Fast: A New Benchmark for Generalization in RL
Alex Nichol
Vicki Pfau
Christopher Hesse
Oleg Klimov
John Schulman
VLM
OffRL
15
177
0
10 Apr 2018
Learning to Run challenge: Synthesizing physiologically accurate motion
  using deep reinforcement learning
Learning to Run challenge: Synthesizing physiologically accurate motion using deep reinforcement learning
L. Kidzinski
Sharada Mohanty
Carmichael F. Ong
Jennifer Hicks
Sean F. Carroll
Sergey Levine
M. Salathé
Scott L. Delp
34
60
0
31 Mar 2018
Safe end-to-end imitation learning for model predictive control
Safe end-to-end imitation learning for model predictive control
Keuntaek Lee
Kamil Saigol
Evangelos A. Theodorou
BDL
24
24
0
27 Mar 2018
World Models
World Models
David R Ha
Jürgen Schmidhuber
SyDa
50
1,036
0
27 Mar 2018
Setting up a Reinforcement Learning Task with a Real-World Robot
Setting up a Reinforcement Learning Task with a Real-World Robot
A. R. Mahmood
D. Korenkevych
Brent Komer
James Bergstra
26
75
0
19 Mar 2018
Learning to Sequence Robot Behaviors for Visual Navigation
Learning to Sequence Robot Behaviors for Visual Navigation
Hadi Salman
Puneet Singhal
Tanmay Shankar
Peng Yin
A. Salman
William Paivine
Guillaume Sartoretti
Matthew Travers
Howie Choset
20
8
0
05 Mar 2018
Previous
123...31323334
Next