Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1606.01540
Cited By
OpenAI Gym
5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL
ODL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"OpenAI Gym"
50 / 2,578 papers shown
Title
AvE: Assistance via Empowerment
Yuqing Du
Stas Tiomkin
Emre Kıcıman
Daniel Polani
Pieter Abbeel
Anca Dragan
113
35
0
26 Jun 2020
A Closer Look at Invalid Action Masking in Policy Gradient Algorithms
Shengyi Huang
Santiago Ontañón
108
329
0
25 Jun 2020
Strictly Batch Imitation Learning by Energy-based Distribution Matching
Daniel Jarrett
Ioana Bica
M. Schaar
OffRL
84
63
0
25 Jun 2020
Learning Reward Functions from Diverse Sources of Human Feedback: Optimally Integrating Demonstrations and Preferences
Erdem Biyik
Dylan P. Losey
Malayandi Palan
Nicholas C. Landolfi
Gleb Shevchuk
Dorsa Sadigh
88
118
0
24 Jun 2020
Off-Dynamics Reinforcement Learning: Training for Transfer with Domain Classifiers
Benjamin Eysenbach
Swapnil Asawa
Shreyas Chaudhari
Sergey Levine
Ruslan Salakhutdinov
108
94
0
24 Jun 2020
Experience Replay with Likelihood-free Importance Weights
Samarth Sinha
Jiaming Song
Animesh Garg
Stefano Ermon
OffRL
105
58
0
23 Jun 2020
The Effect of Multi-step Methods on Overestimation in Deep Reinforcement Learning
Lingheng Meng
R. Gorbet
Dana Kulic
OffRL
76
27
0
23 Jun 2020
Neural Dynamical Systems: Balancing Structure and Flexibility in Physical Prediction
Viraj Mehta
I. Char
Willie Neiswanger
Youngseog Chung
A. Nelson
M. Boyer
E. Kolemen
J. Schneider
AI4CE
45
28
0
23 Jun 2020
Ecological Reinforcement Learning
John D. Co-Reyes
Suvansh Sanjeev
Glen Berseth
Abhishek Gupta
Sergey Levine
OffRL
105
23
0
22 Jun 2020
dm_control: Software and Tasks for Continuous Control
Yuval Tassa
S. Tunyasuvunakool
Alistair Muldal
Yotam Doron
Piotr Trochim
...
Steven Bohez
J. Merel
Tom Erez
Timothy Lillicrap
N. Heess
LM&Ro
153
419
0
22 Jun 2020
Safe Reinforcement Learning via Curriculum Induction
M. Turchetta
Andrey Kolobov
S. Shah
Andreas Krause
Alekh Agarwal
66
93
0
22 Jun 2020
Sample Factory: Egocentric 3D Control from Pixels at 100000 FPS with Asynchronous Reinforcement Learning
Aleksei Petrenko
Zhehui Huang
T. Kumar
Gaurav Sukhatme
V. Koltun
103
105
0
21 Jun 2020
Accelerating Safe Reinforcement Learning with Constraint-mismatched Policies
Tsung-Yen Yang
Justinian P. Rosca
Karthik Narasimhan
Peter J. Ramadge
103
19
0
20 Jun 2020
Robust Reinforcement Learning using Least Squares Policy Iteration with Provable Performance Guarantees
Kishan Panaganti
D. Kalathil
119
4
0
20 Jun 2020
Task-Agnostic Online Reinforcement Learning with an Infinite Mixture of Gaussian Processes
Mengdi Xu
Wenhao Ding
Jiacheng Zhu
Zuxin Liu
Baiming Chen
Ding Zhao
CLL
OffRL
102
35
0
19 Jun 2020
NROWAN-DQN: A Stable Noisy Network with Noise Reduction and Online Weight Adjustment for Exploration
Shuai Han
Wenbo Zhou
Jing Liu
Shuai Lu
43
28
0
19 Jun 2020
Learn to Earn: Enabling Coordination within a Ride Hailing Fleet
Harshal A. Chaudhari
J. Byers
Evimaria Terzi
77
6
0
19 Jun 2020
An adaptive stochastic gradient-free approach for high-dimensional blackbox optimization
Anton Dereventsov
Clayton Webster
Joseph Daws
65
10
0
18 Jun 2020
Reparameterized Variational Divergence Minimization for Stable Imitation
Dilip Arumugam
Debadeepta Dey
Alekh Agarwal
Asli Celikyilmaz
E. Nouri
W. Dolan
50
3
0
18 Jun 2020
Deep Reinforcement Learning amidst Lifelong Non-Stationarity
Annie Xie
James Harrison
Chelsea Finn
CLL
OffRL
94
65
0
18 Jun 2020
Learning to Track Dynamic Targets in Partially Known Environments
Heejin Jeong
Hamed Hassani
M. Morari
Daniel D. Lee
George J. Pappas
73
11
0
17 Jun 2020
Forgetful Experience Replay in Hierarchical Reinforcement Learning from Demonstrations
Alexey Skrynnik
A. Staroverov
Ermek Aitygulov
Kirill Aksenov
Vasilii Davydov
Aleksandr I. Panov
OffRL
66
4
0
17 Jun 2020
Deep Reinforcement Learning Controller for 3D Path-following and Collision Avoidance by Autonomous Underwater Vehicles
Simen Theie Havenstrom
Adil Rasheed
Omer San
AI4CE
16
39
0
17 Jun 2020
Data Driven Control with Learned Dynamics: Model-Based versus Model-Free Approach
Wenjian Hao
Yiqiang Han
45
6
0
16 Jun 2020
COLREG-Compliant Collision Avoidance for Unmanned Surface Vehicle using Deep Reinforcement Learning
Eivind Meyer
Amalie Heiberg
Adil Rasheed
Omer San
76
75
0
16 Jun 2020
Untangling tradeoffs between recurrence and self-attention in neural networks
Giancarlo Kerg
Bhargav Kanuparthi
Anirudh Goyal
Kyle Goyette
Yoshua Bengio
Guillaume Lajoie
57
9
0
16 Jun 2020
Model Embedding Model-Based Reinforcement Learning
Xiao Tan
Chao Qu
Junwu Xiong
James Y. Zhang
OffRL
32
0
0
16 Jun 2020
Q
Q
Q
-learning with Logarithmic Regret
Kunhe Yang
Lin F. Yang
S. Du
102
62
0
16 Jun 2020
Model-based Adversarial Meta-Reinforcement Learning
Zichuan Lin
G. Thomas
Guangwen Yang
Tengyu Ma
OOD
74
52
0
16 Jun 2020
Designing high-fidelity multi-qubit gates for semiconductor quantum dots through deep reinforcement learning
Sahar Daraeizadeh
S. Premaratne
A. Matsuura
43
5
0
15 Jun 2020
Analytic Manifold Learning: Unifying and Evaluating Representations for Continuous Control
Rika Antonova
Maksim Maydanskiy
Danica Kragic
Sam Devlin
Katja Hofmann
76
9
0
15 Jun 2020
Provably Efficient Model-based Policy Adaptation
Yuda Song
Aditi Mavalankar
Wen Sun
Sicun Gao
TTA
OffRL
74
10
0
14 Jun 2020
Optimistic Distributionally Robust Policy Optimization
Jun Song
Chaoyue Zhao
46
12
0
14 Jun 2020
Reinforcement Learning with Supervision from Noisy Demonstrations
Kun-Peng Ning
Sheng-Jun Huang
49
7
0
14 Jun 2020
Online Hyper-parameter Tuning in Off-policy Learning via Evolutionary Strategies
Yunhao Tang
K. Choromanski
OffRL
38
14
0
13 Jun 2020
Hindsight Expectation Maximization for Goal-conditioned Reinforcement Learning
Yunhao Tang
A. Kucukelbir
OffRL
68
16
0
13 Jun 2020
Human and Multi-Agent collaboration in a human-MARL teaming framework
N. Navidi
Francois Chabot
Sagar Kurandwad
Irv Lustigman
Vincent Robert
Gregory Szriftgiser
Andrea Schuch
33
2
0
12 Jun 2020
Data-driven Koopman Operators for Model-based Shared Control of Human-Machine Systems
Alexander Broad
Ian Abraham
Todd Murphey
B. Argall
77
37
0
12 Jun 2020
Meta-Reinforcement Learning Robust to Distributional Shift via Model Identification and Experience Relabeling
Russell Mendonca
Xinyang Geng
Chelsea Finn
Sergey Levine
OOD
OffRL
93
40
0
12 Jun 2020
SAMBA: Safe Model-Based & Active Reinforcement Learning
Alexander I. Cowen-Rivers
Daniel Palenicek
Vincent Moens
Mohammed Abdullah
Aivar Sootla
Jun Wang
Haitham Bou-Ammar
75
45
0
12 Jun 2020
From proprioception to long-horizon planning in novel environments: A hierarchical RL model
Nishad Gothoskar
Miguel Lázaro-Gredilla
Dileep George
33
0
0
11 Jun 2020
Zeroth-Order Supervised Policy Improvement
Hao Sun
Ziping Xu
Yuhang Song
Meng Fang
Jiechao Xiong
Bo Dai
Bolei Zhou
OffRL
59
9
0
11 Jun 2020
Exploration by Maximizing Rényi Entropy for Reward-Free RL Framework
Wei Shen
Yuanying Cai
Longbo Huang
Jian Li
OffRL
50
1
0
11 Jun 2020
Borrowing From the Future: Addressing Double Sampling in Model-free Control
Yuhua Zhu
Zachary Izzo
Lexing Ying
32
4
0
11 Jun 2020
Report from the NSF Future Directions Workshop, Toward User-Oriented Agents: Research Directions and Challenges
M. Eskénazi
Tiancheng Zhao
LLMAG
AI4TS
AI4CE
90
9
0
10 Jun 2020
What Matters In On-Policy Reinforcement Learning? A Large-Scale Empirical Study
Marcin Andrychowicz
Anton Raichuk
Piotr Stańczyk
Manu Orsini
Sertan Girgin
...
Matthieu Geist
Olivier Pietquin
Marcin Michalski
Sylvain Gelly
Olivier Bachem
OffRL
92
225
0
10 Jun 2020
Neuroevolution in Deep Neural Networks: Current Trends and Future Challenges
E. Galván
P. Mooney
111
136
0
09 Jun 2020
Distributed Learning on Heterogeneous Resource-Constrained Devices
Martin Rapp
R. Khalili
J. Henkel
FedML
68
7
0
09 Jun 2020
AR-DAE: Towards Unbiased Neural Entropy Gradient Estimation
Jae Hyun Lim
Aaron Courville
C. Pal
Chin-Wei Huang
DRL
71
23
0
09 Jun 2020
Primal Wasserstein Imitation Learning
Robert Dadashi
Léonard Hussenot
Matthieu Geist
Olivier Pietquin
105
129
0
08 Jun 2020
Previous
1
2
3
...
35
36
37
...
50
51
52
Next