ResearchTrend.AI
OpenAI Gym


5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL, ODL

Papers citing "OpenAI Gym"

50 / 2,578 papers shown
Action Branching Architectures for Deep Reinforcement Learning
Arash Tavakoli
Fabio Pardo
Petar Kormushev
74
265
0
24 Nov 2017
Deterministic Policy Optimization by Combining Pathwise and Score Function Estimators for Discrete Action Spaces
Daniel Levy
Stefano Ermon
66
4
0
21 Nov 2017
Vision-and-Language Navigation: Interpreting visually-grounded navigation instructions in real environments
Peter Anderson
Qi Wu
Damien Teney
Jake Bruce
Mark Johnson
Niko Sünderhauf
Ian Reid
Stephen Gould
Anton Van Den Hengel
LM&Ro
174
1,325
0
20 Nov 2017
Variational Adaptive-Newton Method for Explorative Learning
Mohammad Emtiyaz Khan
Wu Lin
Voot Tangkaratt
Zuozhu Liu
Didrik Nielsen
ODL
81
20
0
15 Nov 2017
Can Deep Reinforcement Learning Solve Erdos-Selfridge-Spencer Games?
M. Raghu
A. Irpan
Jacob Andreas
Robert D. Kleinberg
Quoc V. Le
Jon M. Kleinberg
105
28
0
07 Nov 2017
Acquiring Target Stacking Skills by Goal-Parameterized Deep Reinforcement Learning
Wenbin Li
Jeannette Bohg
Mario Fritz
51
8
0
01 Nov 2017
Backpropagation through the Void: Optimizing control variates for black-box gradient estimation
Will Grathwohl
Dami Choi
Yuhuai Wu
Geoffrey Roeder
David Duvenaud
138
300
0
31 Oct 2017
Action-depedent Control Variates for Policy Optimization via Stein's Identity
Hao Liu
Yihao Feng
Yi Mao
Dengyong Zhou
Jian-wei Peng
Qiang Liu
94
4
0
30 Oct 2017
Diff-DAC: Distributed Actor-Critic for Average Multitask Deep Reinforcement Learning
Sergio Valcarcel Macua
Aleksi Tukiainen
D. Hernández
David Baldazo
Enrique Munoz de Cote
S. Zazo
112
29
0
28 Oct 2017
Fast Model Identification via Physics Engines for Data-Efficient Policy Search
Shaojun Zhu
A. Kimmel
Kostas E. Bekris
Abdeslam Boularias
135
14
0
24 Oct 2017
Nonsmooth optimal value and policy functions in mechanical systems subject to unilateral constraints
Bora S. Banjanin
Samuel A. Burden
87
0
0
18 Oct 2017
Map-based Multi-Policy Reinforcement Learning: Enhancing Adaptability of Robots by Deep Reinforcement Learning
A. Kume
Eiichi Matsumoto
K. Takahashi
W. Ko
Jethro Tan
69
11
0
17 Oct 2017
Flow: A Modular Learning Framework for Mixed Autonomy Traffic
Cathy Wu
Abdul Rahman Kreidieh
Kanaad Parvate
Eugene Vinitsky
Alexandre M. Bayen
102
162
0
16 Oct 2017
Unsupervised Real-Time Control through Variational Empowerment
Maximilian Karl
Maximilian Soelch
Philip Becker-Ehmck
Djalel Benbouzid
Patrick van der Smagt
Justin Bayer
78
55
0
13 Oct 2017
AMBER: Adaptive Multi-Batch Experience Replay for Continuous Action Control
Seungyul Han
Y. Sung
OffRL
33
8
0
12 Oct 2017
Learning to Generalize: Meta-Learning for Domain Generalization
Da Li
Yongxin Yang
Yi-Zhe Song
Timothy M. Hospedales
OOD
111
1,436
0
10 Oct 2017
Deep TAMER: Interactive Agent Shaping in High-Dimensional State Spaces
Garrett A. Warnell
Nicholas R. Waytowich
Vernon J. Lawhern
Peter Stone
86
273
0
28 Sep 2017
Multi-task Learning with Gradient Guided Policy Specialization
Wenhao Yu
Chenxi Liu
Greg Turk
30
2
0
23 Sep 2017
OptLayer - Practical Constrained Optimization for Deep Reinforcement Learning in the Real World
Tu-Hoa Pham
Giovanni De Magistris
Ryuki Tachibana
OffRL
71
143
0
22 Sep 2017
OptionGAN: Learning Joint Reward-Policy Options using Generative Adversarial Inverse Reinforcement Learning
Peter Henderson
Wei-Di Chang
Pierre-Luc Bacon
David Meger
Joelle Pineau
Doina Precup
GAN
77
73
0
20 Sep 2017
Deep Reinforcement Learning that Matters
Peter Henderson
Riashat Islam
Philip Bachman
Joelle Pineau
Doina Precup
David Meger
OffRL
149
1,970
0
19 Sep 2017
DropoutDAgger: A Bayesian Approach to Safe Imitation Learning
Kunal Menda
Katherine Driggs-Campbell
Mykel J. Kochenderfer
110
28
0
18 Sep 2017
Revisiting the Arcade Learning Environment: Evaluation Protocols and Open Problems for General Agents
Marlos C. Machado
Marc G. Bellemare
Erik Talvitie
J. Veness
Matthew J. Hausknecht
Michael Bowling
114
558
0
18 Sep 2017
Shapechanger: Environments for Transfer Learning
Sébastien M. R. Arnold
Tsam Kiu Pun
Théo-Tim J. Denisart
Francisco J. Valero Cuevas
3DPC, LM&Ro
26
0
0
15 Sep 2017
Shared Learning : Enhancing Reinforcement in $Q$-Ensembles
Rakesh R Menon
Balaraman Ravindran
33
0
0
14 Sep 2017
One-Shot Visual Imitation Learning via Meta-Learning
Chelsea Finn
Tianhe Yu
Tianhao Zhang
Pieter Abbeel
Sergey Levine
SSL
133
566
0
14 Sep 2017
Pre-training Neural Networks with Human Demonstrations for Deep Reinforcement Learning
G. V. D. L. Cruz
Yunshu Du
Matthew E. Taylor
3DH, OffRL
84
58
0
12 Sep 2017
Mean Actor Critic
Cameron Allen
Kavosh Asadi
Melrose Roderick
Abdel-rahman Mohamed
George Konidaris
Michael Littman
92
45
0
01 Sep 2017
Deep Learning for Video Game Playing
Niels Justesen
Philip Bontrager
Julian Togelius
S. Risi
VLM
101
208
0
25 Aug 2017
Sim4CV: A Photo-Realistic Simulator for Computer Vision Applications
Matthias Muller
Vincent Casser
Jean Lahoud
Neil G. Smith
Guohao Li
VGen
74
181
0
19 Aug 2017
A Brief Survey of Deep Reinforcement Learning
Kai Arulkumaran
M. Deisenroth
Miles Brundage
Anil Anthony Bharath
OffRL
143
2,830
0
19 Aug 2017
Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation
Yuhuai Wu
Elman Mansimov
Shun Liao
Roger C. Grosse
Jimmy Ba
OffRL
135
630
0
17 Aug 2017
OpenML Benchmarking Suites
B. Bischl
Giuseppe Casalicchio
Matthias Feurer
Pieter Gijsbers
Frank Hutter
Michel Lang
R. G. Mantovani
Jan N. van Rijn
Joaquin Vanschoren
VLM, ELM
123
165
0
11 Aug 2017
Reproducibility of Benchmarked Deep Reinforcement Learning Tasks for Continuous Control
Riashat Islam
Peter Henderson
Maziar Gomrokchi
Doina Precup
BDL, OffRL
103
253
0
10 Aug 2017
Neural Network Dynamics for Model-Based Deep Reinforcement Learning with Model-Free Fine-Tuning
Anusha Nagabandi
G. Kahn
R. Fearing
Sergey Levine
116
977
0
08 Aug 2017
An Information-Theoretic Optimality Principle for Deep Reinforcement Learning
Felix Leibfried
Jordi Grau-Moya
Haitham Bou-Ammar
101
24
0
06 Aug 2017
Mutual Alignment Transfer Learning
Markus Wulfmeier
Ingmar Posner
Pieter Abbeel
152
61
0
25 Jul 2017
RAIL: Risk-Averse Imitation Learning
Anirban Santara
A. Naik
Balaraman Ravindran
Dipankar Das
Dheevatsa Mudigere
Sasikanth Avancha
Bharat Kaul
82
18
0
20 Jul 2017
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
678
19,343
0
20 Jul 2017
ADAPT: Zero-Shot Adaptive Policy Transfer for Stochastic Dynamical Systems
James Harrison
Animesh Garg
Boris Ivanovic
Yuke Zhu
Silvio Savarese
Li Fei-Fei
Marco Pavone
76
25
0
15 Jul 2017
ELF: An Extensive, Lightweight and Flexible Research Platform for Real-time Strategy Games
Yuandong Tian
Qucheng Gong
Wenling Shang
Yuxin Wu
C. L. Zitnick
OffRL
74
126
0
04 Jul 2017
OPEB: Open Physical Environment Benchmark for Artificial Intelligence
H. Mirzaei
Mona Fathollahi
T. Givargis
21
2
0
04 Jul 2017
Dex: Incremental Learning for Complex Environments in Deep Reinforcement Learning
Nick Erickson
Qi Zhao
CLL, OffRL
422
2
0
19 Jun 2017
Expected Policy Gradients
K. Ciosek
Shimon Whiteson
129
58
0
15 Jun 2017
Reinforcement Learning under Model Mismatch
Aurko Roy
Huan Xu
Sebastian Pokutta
OOD
87
80
0
15 Jun 2017
Symmetry Learning for Function Approximation in Reinforcement Learning
Anuj Mahajan
Theja Tulabandhula
67
31
0
09 Jun 2017
Parameter Space Noise for Exploration
Matthias Plappert
Rein Houthooft
Prafulla Dhariwal
Szymon Sidor
Richard Y. Chen
Xi Chen
Tamim Asfour
Pieter Abbeel
Marcin Andrychowicz
111
597
0
06 Jun 2017
Deep learning evaluation using deep linguistic processing
A. Kuhnle
Ann A. Copestake
ELM
59
11
0
05 Jun 2017
Non-Markovian Control with Gated End-to-End Memory Policy Networks
J. Perez
T. Silander
OffRL
65
6
0
31 May 2017
Sequential Dynamic Decision Making with Deep Neural Nets on a Test-Time Budget
Henghui Zhu
Feng Nan
I. Paschalidis
Venkatesh Saligrama
6
2
0
31 May 2017