ResearchTrend.AI
OpenAI Gym

5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
    OffRL, ODL

Papers citing "OpenAI Gym"

50 / 2,578 papers shown
Title
Learning Compositional Neural Programs for Continuous Control
Thomas Pierrot
Nicolas Perrin
Feryal M. P. Behbahani
Alexandre Laterre
Olivier Sigaud
Karim Beguir
Nando de Freitas
CLL
95
4
0
27 Jul 2020
Weak Human Preference Supervision For Deep Reinforcement Learning
Zehong Cao
Kaichiu Wong
Chin-Teng Lin
60
5
0
25 Jul 2020
Clinician-in-the-Loop Decision Making: Reinforcement Learning with Near-Optimal Set-Valued Policies
Shengpu Tang
Aditya Modi
Michael Sjoding
Jenna Wiens
OffRL
68
26
0
24 Jul 2020
Maximum Mutation Reinforcement Learning for Scalable Control
Karush Suri
Xiaolong Shi
Konstantinos N. Plataniotis
Y. Lawryshyn
82
4
0
24 Jul 2020
Distributional Reinforcement Learning via Moment Matching
Thanh Tang Nguyen
Sunil R. Gupta
Svetha Venkatesh
OOD
22
22
0
24 Jul 2020
Adaptive Traffic Control with Deep Reinforcement Learning: Towards State-of-the-art and Beyond
Siavash Alemzadeh
Ramin Moslemi
Ratnesh K. Sharma
M. Mesbahi
OffRL
36
5
0
21 Jul 2020
Soft Expert Reward Learning for Vision-and-Language Navigation
Hu Wang
Qi Wu
Chunhua Shen
55
51
0
21 Jul 2020
Integrating Deep Reinforcement Learning Networks with Health System Simulations
Michael Allen
T. Monks
AI4CE
41
4
0
21 Jul 2020
Battlesnake Challenge: A Multi-agent Reinforcement Learning Playground with Human-in-the-loop
Jonathan Chung
Anna Luo
Xavier Raffin
Scott Perry
OffRL
52
3
0
20 Jul 2020
Multi-agent Reinforcement Learning in Bayesian Stackelberg Markov Games for Adaptive Moving Target Defense
Sailik Sengupta
S. Kambhampati
AAML
63
46
0
20 Jul 2020
A Hierarchical Approach to Scaling Batch Active Search Over Structured Data
Vivek Myers
Peyton Greenside
74
1
0
20 Jul 2020
Multi-robot Cooperative Object Transportation using Decentralized Deep Reinforcement Learning
Lin Zhang
Hao Xiong
Ou Ma
Zhaokui Wang
109
6
0
17 Jul 2020
CoNES: Convex Natural Evolutionary Strategies
Sushant Veer
Anirudha Majumdar
74
3
0
16 Jul 2020
PC-PG: Policy Cover Directed Exploration for Provable Policy Gradient Learning
Alekh Agarwal
Mikael Henaff
Sham Kakade
Wen Sun
OffRL
94
110
0
16 Jul 2020
DRIFT: Deep Reinforcement Learning for Functional Software Testing
Luke R. Harries
Rebekah Storan Clarke
Timothy Chapman
Swamy V. P. L. N. Nallamalli
Levent Özgür
...
Aaron Dietrich
José Miguel Hernández-Lobato
Tom Ellis
Cheng Zhang
K. Ciosek
VLM, OffRL
24
14
0
16 Jul 2020
Analysis of Q-learning with Adaptation and Momentum Restart for Gradient Descent
Chuhan Wu
Fangzhao Wu
Tao Qi
Yongfeng Huang
51
25
0
15 Jul 2020
Learning to Sample with Local and Global Contexts in Experience Replay Buffer
Youngmin Oh
Kimin Lee
Jinwoo Shin
Eunho Yang
Sung Ju Hwang
OffRL
66
16
0
14 Jul 2020
Efficient Empowerment Estimation for Unsupervised Stabilization
Ruihan Zhao
Kevin Lu
Pieter Abbeel
Stas Tiomkin
58
8
0
14 Jul 2020
Explore and Explain: Self-supervised Navigation and Recounting
Roberto Bigazzi
Federico Landi
Marcella Cornia
S. Cascianelli
Lorenzo Baraldi
Rita Cucchiara
EgoV, LM&Ro
78
17
0
14 Jul 2020
Robustifying Reinforcement Learning Agents via Action Space Adversarial Training
Kai Liang Tan
Yasaman Esfandiari
Xian Yeow Lee
Aakanksha
Soumik Sarkar
AAML
135
57
0
14 Jul 2020
DinerDash Gym: A Benchmark for Policy Learning in High-Dimensional Action Space
Siwei Chen
Xiao Ma
David Hsu
45
3
0
13 Jul 2020
OtoWorld: Towards Learning to Separate by Learning to Move
Omkar Ranadive
Grant Gasser
David Terpay
Prem Seetharaman
47
1
0
12 Jul 2020
An Equivalence between Loss Functions and Non-Uniform Sampling in Experience Replay
Scott Fujimoto
David Meger
Doina Precup
96
58
0
12 Jul 2020
Learning Retrospective Knowledge with Reverse Reinforcement Learning
Shangtong Zhang
Vivek Veeriah
Shimon Whiteson
OffRL, AI4TS
76
13
0
09 Jul 2020
ThreeDWorld: A Platform for Interactive Multi-Modal Physical Simulation
Chuang Gan
Jeremy Schwartz
S. Alter
Damian Mrowca
Martin Schrimpf
...
Antonio Torralba
J. DiCarlo
J. Tenenbaum
Josh H. McDermott
Daniel L. K. Yamins
VGen
177
317
0
09 Jul 2020
SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning
Kimin Lee
Michael Laskin
A. Srinivas
Pieter Abbeel
OffRL
119
205
0
09 Jul 2020
Task-Agnostic Exploration via Policy Gradient of a Non-Parametric State Entropy Estimate
Mirco Mutti
Lorenzo Pratissoli
Marcello Restelli
73
19
0
09 Jul 2020
Adaptive Regret for Control of Time-Varying Dynamics
Paula Gradu
Elad Hazan
Edgar Minasyan
97
47
0
08 Jul 2020
Double Prioritized State Recycled Experience Replay
Fanchen Bu
D. Chang
OffRL
40
11
0
08 Jul 2020
Guided Exploration with Proximal Policy Optimization using a Single Demonstration
Gabriele Libardi
Gianni De Fabritiis
58
24
0
07 Jul 2020
Natural Emergence of Heterogeneous Strategies in Artificially Intelligent Competitive Teams
A. Deka
Katia Sycara
AAML
124
32
0
06 Jul 2020
robo-gym -- An Open Source Toolkit for Distributed Deep Reinforcement Learning on Real and Simulated Robots
M. Lucchi
Friedemann Zindler
Stephan Mühlbacher-Karrer
Horst Pichler
OffRL
86
30
0
06 Jul 2020
Explaining Fast Improvement in Online Imitation Learning
Xinyan Yan
Byron Boots
Ching-An Cheng
OnRL
72
1
0
06 Jul 2020
Bidirectional Model-based Policy Optimization
Hang Lai
Jian Shen
Weinan Zhang
Yong Yu
76
58
0
04 Jul 2020
Meta-SAC: Auto-tune the Entropy Temperature of Soft Actor-Critic via Metagradient
Yufei Wang
Tianwei Ni
70
21
0
03 Jul 2020
Unsupervised Learning of Lagrangian Dynamics from Images for Prediction and Control
Yaofeng Desmond Zhong
Naomi Ehrich Leonard
DRL, AI4CE
95
43
0
03 Jul 2020
Temporal-Logic-Based Reward Shaping for Continuing Reinforcement Learning Tasks
Yuqian Jiang
Suda Bharadwaj
Bo Wu
Rishi Shah
Ufuk Topcu
Peter Stone
CLL, OffRL, LRM
58
43
0
03 Jul 2020
Policy Improvement via Imitation of Multiple Oracles
Ching-An Cheng
Andrey Kolobov
Alekh Agarwal
44
5
0
01 Jul 2020
Falsification-Based Robust Adversarial Reinforcement Learning
Xiao Wang
Saasha Nair
Matthias Althoff
AAML
64
19
0
01 Jul 2020
Convex Regularization in Monte-Carlo Tree Search
Tuan Dam
Carlo D'Eramo
Jan Peters
Joni Pajarinen
OffRL
77
11
0
01 Jul 2020
Group Equivariant Deep Reinforcement Learning
Arnab Kumar Mondal
Pratheeksha Nair
Kaleem Siddiqi
61
34
0
01 Jul 2020
Enforcing Almost-Sure Reachability in POMDPs
Sebastian Junges
N. Jansen
Sanjit A. Seshia
61
26
0
30 Jun 2020
MDP Homomorphic Networks: Group Symmetries in Reinforcement Learning
Elise van der Pol
Daniel E. Worrall
H. V. Hoof
F. Oliehoek
Max Welling
BDL, AI4CE
111
164
0
30 Jun 2020
Model-based Reinforcement Learning: A Survey
Thomas M. Moerland
Joost Broekens
Aske Plaat
Catholijn M. Jonker
OffRL
132
49
0
30 Jun 2020
Extracting Latent State Representations with Linear Dynamics from Rich Observations
Abraham Frandsen
Rong Ge
30
2
0
29 Jun 2020
Concept and the implementation of a tool to convert industry 4.0 environments modeled as FSM to an OpenAI Gym wrapper
Kallil M. C. Zielinski
Marcelo Teixeira
Richardson Ribeiro
Dalcimar Casanova
OffRL
33
1
0
29 Jun 2020
Using Reinforcement Learning to Herd a Robotic Swarm to a Target Distribution
Z. Kakish
Karthik Elamvazhuthi
Spring Berman
32
8
0
29 Jun 2020
Deep Bayesian Quadrature Policy Optimization
Akella Ravi Tej
Kamyar Azizzadenesheli
Mohammad Ghavamzadeh
Anima Anandkumar
Yisong Yue
62
5
0
28 Jun 2020
DDPG++: Striving for Simplicity in Continuous-control Off-Policy Reinforcement Learning
Rasool Fakoor
Pratik Chaudhari
Alex Smola
OffRL
39
4
0
26 Jun 2020
Intrinsic Reward Driven Imitation Learning via Generative Model
Xingrui Yu
Yueming Lyu
Ivor W. Tsang
45
55
0
26 Jun 2020