ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1606.01540
  4. Cited By
OpenAI Gym

OpenAI Gym

5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
    OffRLODL
ArXiv (abs)PDFHTML

Papers citing "OpenAI Gym"

50 / 2,578 papers shown
Title
Learning Optimal Strategies for Temporal Tasks in Stochastic Games
Learning Optimal Strategies for Temporal Tasks in Stochastic Games
A. Bozkurt
Yu Wang
Michael M. Zavlanos
Miroslav Pajic
64
3
0
08 Feb 2021
RL-Scope: Cross-Stack Profiling for Deep Reinforcement Learning
  Workloads
RL-Scope: Cross-Stack Profiling for Deep Reinforcement Learning Workloads
James Gleeson
Srivatsan Krishnan
Moshe Gabel
Vijay Janapa Reddi
Eyal de Lara
Gennady Pekhimenko
OffRL
43
11
0
08 Feb 2021
Towards Hierarchical Task Decomposition using Deep Reinforcement
  Learning for Pick and Place Subtasks
Towards Hierarchical Task Decomposition using Deep Reinforcement Learning for Pick and Place Subtasks
Luca Marzari
Ameya Pore
Diego DallÁlba
G. Aragon-Camarasa
Alessandro Farinelli
Paolo Fiorini
76
29
0
08 Feb 2021
Model-Augmented Q-learning
Model-Augmented Q-learning
Youngmin Oh
Jinwoo Shin
Eunho Yang
Sung Ju Hwang
OffRL
47
1
0
07 Feb 2021
Tactical Optimism and Pessimism for Deep Reinforcement Learning
Tactical Optimism and Pessimism for Deep Reinforcement Learning
Theodore H. Moskovitz
Jack Parker-Holder
Aldo Pacchiano
Michael Arbel
Michael I. Jordan
99
59
0
07 Feb 2021
An Analysis of Frame-skipping in Reinforcement Learning
An Analysis of Frame-skipping in Reinforcement Learning
Shivaram Kalyanakrishnan
Siddharth Aravindan
Vishwajeet Bagdawat
Varun Bhatt
Harshith Goka
Archit Gupta
Kalpesh Krishna
Vihari Piratla
60
20
0
07 Feb 2021
How to Train Your Robot with Deep Reinforcement Learning; Lessons We've
  Learned
How to Train Your Robot with Deep Reinforcement Learning; Lessons We've Learned
Julian Ibarz
Jie Tan
Chelsea Finn
Mrinal Kalakrishnan
P. Pastor
Sergey Levine
OffRL
162
538
0
04 Feb 2021
Embodied Intelligence via Learning and Evolution
Embodied Intelligence via Learning and Evolution
Agrim Gupta
Silvio Savarese
Surya Ganguli
Li Fei-Fei
AI4CE
105
255
0
03 Feb 2021
Variance Penalized On-Policy and Off-Policy Actor-Critic
Variance Penalized On-Policy and Off-Policy Actor-Critic
Arushi Jain
Gandharv Patil
Ayush Jain
Khimya Khetarpal
Doina Precup
OffRL
55
10
0
03 Feb 2021
GymD2D: A Device-to-Device Underlay Cellular Offload Evaluation Platform
GymD2D: A Device-to-Device Underlay Cellular Offload Evaluation Platform
David Cotton
Z. Chaczko
17
2
0
27 Jan 2021
Reinforcement Learning for Selective Key Applications in Power Systems:
  Recent Advances and Future Challenges
Reinforcement Learning for Selective Key Applications in Power Systems: Recent Advances and Future Challenges
Xin Chen
Guannan Qu
Yujie Tang
S. Low
Na Li
84
241
0
27 Jan 2021
Learning Synthetic Environments for Reinforcement Learning with
  Evolution Strategies
Learning Synthetic Environments for Reinforcement Learning with Evolution Strategies
Fabio Ferreira
Thomas Nierhoff
Frank Hutter
69
8
0
24 Jan 2021
BF++: a language for general-purpose program synthesis
BF++: a language for general-purpose program synthesis
Vadim Liventsev
Aki Härmä
M. Petković
168
3
0
23 Jan 2021
Differentiable Trust Region Layers for Deep Reinforcement Learning
Differentiable Trust Region Layers for Deep Reinforcement Learning
Fabian Otto
P. Becker
Ngo Anh Vien
Hanna Ziesche
Gerhard Neumann
OffRL
76
19
0
22 Jan 2021
Prior Preference Learning from Experts:Designing a Reward with Active
  Inference
Prior Preference Learning from Experts:Designing a Reward with Active Inference
Jinyoung Shin
Cheolhyeong Kim
H. Hwang
98
9
0
22 Jan 2021
Shielding Atari Games with Bounded Prescience
Shielding Atari Games with Bounded Prescience
Mirco Giacobbe
Mohammadhosein Hasanbeig
Daniel Kroening
H. Wijk
72
23
0
20 Jan 2021
Deep Reinforcement Learning for Active High Frequency Trading
Deep Reinforcement Learning for Active High Frequency Trading
Antonio Briola
J. Turiel
Riccardo Marcaccioli
Alvaro Cauderan
T. Aste
AIFinAI4TS
89
36
0
18 Jan 2021
Mind the Gap when Conditioning Amortised Inference in Sequential
  Latent-Variable Models
Mind the Gap when Conditioning Amortised Inference in Sequential Latent-Variable Models
Justin Bayer
Maximilian Soelch
Atanas Mirchev
Baris Kayalibay
Patrick van der Smagt
117
15
0
18 Jan 2021
Stable deep reinforcement learning method by predicting uncertainty in
  rewards as a subtask
Stable deep reinforcement learning method by predicting uncertainty in rewards as a subtask
Kanata Suzuki
T. Ogata
78
2
0
18 Jan 2021
SimGAN: Hybrid Simulator Identification for Domain Adaptation via
  Adversarial Reinforcement Learning
SimGAN: Hybrid Simulator Identification for Domain Adaptation via Adversarial Reinforcement Learning
Yifeng Jiang
Tingnan Zhang
Daniel Ho
Yunfei Bai
Chenxi Liu
Sergey Levine
Jie Tan
GAN
99
57
0
15 Jan 2021
Differentiable Nonparametric Belief Propagation
Differentiable Nonparametric Belief Propagation
Anthony Opipari
Chao Chen
Shoutian Wang
Jana Pavlasek
Karthik Desingh
Odest Chadwicke Jenkins
50
5
0
15 Jan 2021
Smoothed functional-based gradient algorithms for off-policy
  reinforcement learning: A non-asymptotic viewpoint
Smoothed functional-based gradient algorithms for off-policy reinforcement learning: A non-asymptotic viewpoint
Nithia Vijayan
A. PrashanthL.
OffRL
59
7
0
06 Jan 2021
Deep Reinforcement Learning with Quantum-inspired Experience Replay
Deep Reinforcement Learning with Quantum-inspired Experience Replay
Qing Wei
Hailan Ma
Chunlin Chen
D. Dong
60
72
0
06 Jan 2021
Faults in Deep Reinforcement Learning Programs: A Taxonomy and A
  Detection Approach
Faults in Deep Reinforcement Learning Programs: A Taxonomy and A Detection Approach
Amin Nikanjam
Mohammad Mehdi Morovati
Foutse Khomh
Houssem Ben Braiek
113
33
0
01 Jan 2021
Multi-Agent Reinforcement Learning for Unmanned Aerial Vehicle
  Coordination by Multi-Critic Policy Gradient Optimization
Multi-Agent Reinforcement Learning for Unmanned Aerial Vehicle Coordination by Multi-Critic Policy Gradient Optimization
Yoav Alon
Huiyu Zhou
105
10
0
31 Dec 2020
Modeling Social Interaction for Baby in Simulated Environment for
  Developmental Robotics
Modeling Social Interaction for Baby in Simulated Environment for Developmental Robotics
Md Ashaduzzaman Rubel Mondol
Aishwarya Pothula
Deokgun Park
LM&Ro
34
0
0
29 Dec 2020
Federated Multi-Agent Actor-Critic Learning for Age Sensitive Mobile
  Edge Computing
Federated Multi-Agent Actor-Critic Learning for Age Sensitive Mobile Edge Computing
Zheqi Zhu
Shuo Wan
Pingyi Fan
Khaled B. Letaief
120
78
0
28 Dec 2020
Risk-Sensitive Deep RL: Variance-Constrained Actor-Critic Provably Finds
  Globally Optimal Policy
Risk-Sensitive Deep RL: Variance-Constrained Actor-Critic Provably Finds Globally Optimal Policy
Han Zhong
Xun Deng
Ethan X. Fang
Zhuoran Yang
Zhaoran Wang
Runze Li
71
3
0
28 Dec 2020
POPO: Pessimistic Offline Policy Optimization
POPO: Pessimistic Offline Policy Optimization
Qiang He
Xinwen Hou
OffRL
85
10
0
26 Dec 2020
Augmenting Policy Learning with Routines Discovered from a Single
  Demonstration
Augmenting Policy Learning with Routines Discovered from a Single Demonstration
Zelin Zhao
Chuang Gan
Jiajun Wu
Xiaoxiao Guo
J. Tenenbaum
OffRL
82
5
0
23 Dec 2020
Combining Deep Reinforcement Learning And Local Control For The Acrobot
  Swing-up And Balance Task
Combining Deep Reinforcement Learning And Local Control For The Acrobot Swing-up And Balance Task
Sean Gillen
Marco Molnar
Katie Byl
35
9
0
21 Dec 2020
myGym: Modular Toolkit for Visuomotor Robotic Tasks
myGym: Modular Toolkit for Visuomotor Robotic Tasks
M. Vavrecka
Nikita Sokovnin
Megi Mejdrechova
G. Sejnova
Marek Otáhal
30
6
0
21 Dec 2020
Trace-class Gaussian priors for Bayesian learning of neural networks
  with MCMC
Trace-class Gaussian priors for Bayesian learning of neural networks with MCMC
Torben Sell
Sumeetpal S. Singh
BDL
141
5
0
20 Dec 2020
Uncertainty-Aware Policy Optimization: A Robust, Adaptive Trust Region
  Approach
Uncertainty-Aware Policy Optimization: A Robust, Adaptive Trust Region Approach
James Queeney
I. Paschalidis
Christos G. Cassandras
71
9
0
19 Dec 2020
Learning Cross-Domain Correspondence for Control with Dynamics
  Cycle-Consistency
Learning Cross-Domain Correspondence for Control with Dynamics Cycle-Consistency
Qiang Zhang
Tete Xiao
Alexei A. Efros
Lerrel Pinto
Xiaolong Wang
91
65
0
17 Dec 2020
Model-free and Bayesian Ensembling Model-based Deep Reinforcement
  Learning for Particle Accelerator Control Demonstrated on the FERMI FEL
Model-free and Bayesian Ensembling Model-based Deep Reinforcement Learning for Particle Accelerator Control Demonstrated on the FERMI FEL
Simon Hirlaender
N. Bruchon
55
23
0
17 Dec 2020
Embodied Visual Active Learning for Semantic Segmentation
Embodied Visual Active Learning for Semantic Segmentation
David Nilsson
Aleksis Pirinen
Erik Gartner
C. Sminchisescu
86
35
0
17 Dec 2020
Learning Accurate Long-term Dynamics for Model-based Reinforcement
  Learning
Learning Accurate Long-term Dynamics for Model-based Reinforcement Learning
Nathan Lambert
Albert Wilcox
Howard Zhang
K. Pister
Roberto Calandra
77
34
0
16 Dec 2020
CARLA Real Traffic Scenarios -- novel training ground and benchmark for
  autonomous driving
CARLA Real Traffic Scenarios -- novel training ground and benchmark for autonomous driving
B. Osinski
Piotr Milos
Adam Jakubowski
Pawel Ziecina
Michal Martyniak
Christopher Galias
Antonia Breuer
S. Homoceanu
Henryk Michalewski
73
20
0
16 Dec 2020
Policy Manifold Search for Improving Diversity-based Neuroevolution
Policy Manifold Search for Improving Diversity-based Neuroevolution
Nemanja Rakićević
Antoine Cully
Petar Kormushev
58
0
0
15 Dec 2020
BeBold: Exploration Beyond the Boundary of Explored Regions
BeBold: Exploration Beyond the Boundary of Explored Regions
Tianjun Zhang
Huazhe Xu
Xiaolong Wang
Yi Wu
Kurt Keutzer
Joseph E. Gonzalez
Yuandong Tian
81
40
0
15 Dec 2020
TACTO: A Fast, Flexible, and Open-source Simulator for High-Resolution
  Vision-based Tactile Sensors
TACTO: A Fast, Flexible, and Open-source Simulator for High-Resolution Vision-based Tactile Sensors
Shaoxiong Wang
Mike Lambeta
Po-wei Chou
Roberto Calandra
90
144
0
15 Dec 2020
Learning Visual Robotic Control Efficiently with Contrastive
  Pre-training and Data Augmentation
Learning Visual Robotic Control Efficiently with Contrastive Pre-training and Data Augmentation
Albert Zhan
Philip Zhao
Lerrel Pinto
Pieter Abbeel
Michael Laskin
SSLDRL
95
21
0
14 Dec 2020
Policy Gradient RL Algorithms as Directed Acyclic Graphs
Policy Gradient RL Algorithms as Directed Acyclic Graphs
J. Luis
61
0
0
14 Dec 2020
Evolutionary learning of interpretable decision trees
Evolutionary learning of interpretable decision trees
Leonardo Lucio Custode
Giovanni Iacca
OffRL
104
41
0
14 Dec 2020
Tutoring Reinforcement Learning via Feedback Control
Tutoring Reinforcement Learning via Feedback Control
F. D. Lellis
G. Russo
M. D. Bernardo
43
6
0
12 Dec 2020
Protective Policy Transfer
Protective Policy Transfer
Wenhao Yu
Chenxi Liu
Greg Turk
AAML
126
2
0
11 Dec 2020
Regularizing Action Policies for Smooth Control with Reinforcement
  Learning
Regularizing Action Policies for Smooth Control with Reinforcement Learning
Siddharth Mysore
B. Mabsout
R. Mancuso
Kate Saenko
81
69
0
11 Dec 2020
OPAC: Opportunistic Actor-Critic
OPAC: Opportunistic Actor-Critic
Srinjoy Roy
Saptam Bakshi
Tamal Maharaj
51
2
0
11 Dec 2020
OpenHoldem: A Benchmark for Large-Scale Imperfect-Information Game
  Research
OpenHoldem: A Benchmark for Large-Scale Imperfect-Information Game Research
Kai Li
Hang Xu
Enmin Zhao
Zhe Wu
Junliang Xing
VLM
72
0
0
11 Dec 2020
Previous
123...293031...505152
Next