Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1606.01540
Cited By
OpenAI Gym
5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL
ODL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"OpenAI Gym"
50 / 2,578 papers shown
Title
Learning Optimal Strategies for Temporal Tasks in Stochastic Games
A. Bozkurt
Yu Wang
Michael M. Zavlanos
Miroslav Pajic
64
3
0
08 Feb 2021
RL-Scope: Cross-Stack Profiling for Deep Reinforcement Learning Workloads
James Gleeson
Srivatsan Krishnan
Moshe Gabel
Vijay Janapa Reddi
Eyal de Lara
Gennady Pekhimenko
OffRL
43
11
0
08 Feb 2021
Towards Hierarchical Task Decomposition using Deep Reinforcement Learning for Pick and Place Subtasks
Luca Marzari
Ameya Pore
Diego DallÁlba
G. Aragon-Camarasa
Alessandro Farinelli
Paolo Fiorini
76
29
0
08 Feb 2021
Model-Augmented Q-learning
Youngmin Oh
Jinwoo Shin
Eunho Yang
Sung Ju Hwang
OffRL
47
1
0
07 Feb 2021
Tactical Optimism and Pessimism for Deep Reinforcement Learning
Theodore H. Moskovitz
Jack Parker-Holder
Aldo Pacchiano
Michael Arbel
Michael I. Jordan
99
59
0
07 Feb 2021
An Analysis of Frame-skipping in Reinforcement Learning
Shivaram Kalyanakrishnan
Siddharth Aravindan
Vishwajeet Bagdawat
Varun Bhatt
Harshith Goka
Archit Gupta
Kalpesh Krishna
Vihari Piratla
60
20
0
07 Feb 2021
How to Train Your Robot with Deep Reinforcement Learning; Lessons We've Learned
Julian Ibarz
Jie Tan
Chelsea Finn
Mrinal Kalakrishnan
P. Pastor
Sergey Levine
OffRL
162
538
0
04 Feb 2021
Embodied Intelligence via Learning and Evolution
Agrim Gupta
Silvio Savarese
Surya Ganguli
Li Fei-Fei
AI4CE
105
255
0
03 Feb 2021
Variance Penalized On-Policy and Off-Policy Actor-Critic
Arushi Jain
Gandharv Patil
Ayush Jain
Khimya Khetarpal
Doina Precup
OffRL
55
10
0
03 Feb 2021
GymD2D: A Device-to-Device Underlay Cellular Offload Evaluation Platform
David Cotton
Z. Chaczko
17
2
0
27 Jan 2021
Reinforcement Learning for Selective Key Applications in Power Systems: Recent Advances and Future Challenges
Xin Chen
Guannan Qu
Yujie Tang
S. Low
Na Li
84
241
0
27 Jan 2021
Learning Synthetic Environments for Reinforcement Learning with Evolution Strategies
Fabio Ferreira
Thomas Nierhoff
Frank Hutter
69
8
0
24 Jan 2021
BF++: a language for general-purpose program synthesis
Vadim Liventsev
Aki Härmä
M. Petković
168
3
0
23 Jan 2021
Differentiable Trust Region Layers for Deep Reinforcement Learning
Fabian Otto
P. Becker
Ngo Anh Vien
Hanna Ziesche
Gerhard Neumann
OffRL
76
19
0
22 Jan 2021
Prior Preference Learning from Experts:Designing a Reward with Active Inference
Jinyoung Shin
Cheolhyeong Kim
H. Hwang
98
9
0
22 Jan 2021
Shielding Atari Games with Bounded Prescience
Mirco Giacobbe
Mohammadhosein Hasanbeig
Daniel Kroening
H. Wijk
72
23
0
20 Jan 2021
Deep Reinforcement Learning for Active High Frequency Trading
Antonio Briola
J. Turiel
Riccardo Marcaccioli
Alvaro Cauderan
T. Aste
AIFin
AI4TS
89
36
0
18 Jan 2021
Mind the Gap when Conditioning Amortised Inference in Sequential Latent-Variable Models
Justin Bayer
Maximilian Soelch
Atanas Mirchev
Baris Kayalibay
Patrick van der Smagt
117
15
0
18 Jan 2021
Stable deep reinforcement learning method by predicting uncertainty in rewards as a subtask
Kanata Suzuki
T. Ogata
78
2
0
18 Jan 2021
SimGAN: Hybrid Simulator Identification for Domain Adaptation via Adversarial Reinforcement Learning
Yifeng Jiang
Tingnan Zhang
Daniel Ho
Yunfei Bai
Chenxi Liu
Sergey Levine
Jie Tan
GAN
99
57
0
15 Jan 2021
Differentiable Nonparametric Belief Propagation
Anthony Opipari
Chao Chen
Shoutian Wang
Jana Pavlasek
Karthik Desingh
Odest Chadwicke Jenkins
50
5
0
15 Jan 2021
Smoothed functional-based gradient algorithms for off-policy reinforcement learning: A non-asymptotic viewpoint
Nithia Vijayan
A. PrashanthL.
OffRL
59
7
0
06 Jan 2021
Deep Reinforcement Learning with Quantum-inspired Experience Replay
Qing Wei
Hailan Ma
Chunlin Chen
D. Dong
60
72
0
06 Jan 2021
Faults in Deep Reinforcement Learning Programs: A Taxonomy and A Detection Approach
Amin Nikanjam
Mohammad Mehdi Morovati
Foutse Khomh
Houssem Ben Braiek
113
33
0
01 Jan 2021
Multi-Agent Reinforcement Learning for Unmanned Aerial Vehicle Coordination by Multi-Critic Policy Gradient Optimization
Yoav Alon
Huiyu Zhou
105
10
0
31 Dec 2020
Modeling Social Interaction for Baby in Simulated Environment for Developmental Robotics
Md Ashaduzzaman Rubel Mondol
Aishwarya Pothula
Deokgun Park
LM&Ro
34
0
0
29 Dec 2020
Federated Multi-Agent Actor-Critic Learning for Age Sensitive Mobile Edge Computing
Zheqi Zhu
Shuo Wan
Pingyi Fan
Khaled B. Letaief
120
78
0
28 Dec 2020
Risk-Sensitive Deep RL: Variance-Constrained Actor-Critic Provably Finds Globally Optimal Policy
Han Zhong
Xun Deng
Ethan X. Fang
Zhuoran Yang
Zhaoran Wang
Runze Li
71
3
0
28 Dec 2020
POPO: Pessimistic Offline Policy Optimization
Qiang He
Xinwen Hou
OffRL
85
10
0
26 Dec 2020
Augmenting Policy Learning with Routines Discovered from a Single Demonstration
Zelin Zhao
Chuang Gan
Jiajun Wu
Xiaoxiao Guo
J. Tenenbaum
OffRL
82
5
0
23 Dec 2020
Combining Deep Reinforcement Learning And Local Control For The Acrobot Swing-up And Balance Task
Sean Gillen
Marco Molnar
Katie Byl
35
9
0
21 Dec 2020
myGym: Modular Toolkit for Visuomotor Robotic Tasks
M. Vavrecka
Nikita Sokovnin
Megi Mejdrechova
G. Sejnova
Marek Otáhal
30
6
0
21 Dec 2020
Trace-class Gaussian priors for Bayesian learning of neural networks with MCMC
Torben Sell
Sumeetpal S. Singh
BDL
141
5
0
20 Dec 2020
Uncertainty-Aware Policy Optimization: A Robust, Adaptive Trust Region Approach
James Queeney
I. Paschalidis
Christos G. Cassandras
71
9
0
19 Dec 2020
Learning Cross-Domain Correspondence for Control with Dynamics Cycle-Consistency
Qiang Zhang
Tete Xiao
Alexei A. Efros
Lerrel Pinto
Xiaolong Wang
91
65
0
17 Dec 2020
Model-free and Bayesian Ensembling Model-based Deep Reinforcement Learning for Particle Accelerator Control Demonstrated on the FERMI FEL
Simon Hirlaender
N. Bruchon
55
23
0
17 Dec 2020
Embodied Visual Active Learning for Semantic Segmentation
David Nilsson
Aleksis Pirinen
Erik Gartner
C. Sminchisescu
86
35
0
17 Dec 2020
Learning Accurate Long-term Dynamics for Model-based Reinforcement Learning
Nathan Lambert
Albert Wilcox
Howard Zhang
K. Pister
Roberto Calandra
77
34
0
16 Dec 2020
CARLA Real Traffic Scenarios -- novel training ground and benchmark for autonomous driving
B. Osinski
Piotr Milos
Adam Jakubowski
Pawel Ziecina
Michal Martyniak
Christopher Galias
Antonia Breuer
S. Homoceanu
Henryk Michalewski
73
20
0
16 Dec 2020
Policy Manifold Search for Improving Diversity-based Neuroevolution
Nemanja Rakićević
Antoine Cully
Petar Kormushev
58
0
0
15 Dec 2020
BeBold: Exploration Beyond the Boundary of Explored Regions
Tianjun Zhang
Huazhe Xu
Xiaolong Wang
Yi Wu
Kurt Keutzer
Joseph E. Gonzalez
Yuandong Tian
81
40
0
15 Dec 2020
TACTO: A Fast, Flexible, and Open-source Simulator for High-Resolution Vision-based Tactile Sensors
Shaoxiong Wang
Mike Lambeta
Po-wei Chou
Roberto Calandra
90
144
0
15 Dec 2020
Learning Visual Robotic Control Efficiently with Contrastive Pre-training and Data Augmentation
Albert Zhan
Philip Zhao
Lerrel Pinto
Pieter Abbeel
Michael Laskin
SSL
DRL
95
21
0
14 Dec 2020
Policy Gradient RL Algorithms as Directed Acyclic Graphs
J. Luis
61
0
0
14 Dec 2020
Evolutionary learning of interpretable decision trees
Leonardo Lucio Custode
Giovanni Iacca
OffRL
104
41
0
14 Dec 2020
Tutoring Reinforcement Learning via Feedback Control
F. D. Lellis
G. Russo
M. D. Bernardo
43
6
0
12 Dec 2020
Protective Policy Transfer
Wenhao Yu
Chenxi Liu
Greg Turk
AAML
126
2
0
11 Dec 2020
Regularizing Action Policies for Smooth Control with Reinforcement Learning
Siddharth Mysore
B. Mabsout
R. Mancuso
Kate Saenko
81
69
0
11 Dec 2020
OPAC: Opportunistic Actor-Critic
Srinjoy Roy
Saptam Bakshi
Tamal Maharaj
51
2
0
11 Dec 2020
OpenHoldem: A Benchmark for Large-Scale Imperfect-Information Game Research
Kai Li
Hang Xu
Enmin Zhao
Zhe Wu
Junliang Xing
VLM
72
0
0
11 Dec 2020
Previous
1
2
3
...
29
30
31
...
50
51
52
Next