1606.01540
Cited By
OpenAI Gym
5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL
ODL
Papers citing
"OpenAI Gym"
50 / 1,657 papers shown
Title
Towards Practical Credit Assignment for Deep Reinforcement Learning
Vyacheslav Alipov
Riley Simmons-Edler
N.Yu. Putintsev
Pavel Kalinin
Dmitry Vetrov
OffRL
35
11
0
08 Jun 2021
Exploration and preference satisfaction trade-off in reward-free learning
Noor Sajid
P. Tigas
Alexey Zakharov
Zafeirios Fountas
Karl J. Friston
27
20
0
08 Jun 2021
3DB: A Framework for Debugging Computer Vision Models
Guillaume Leclerc
Hadi Salman
Andrew Ilyas
Sai H. Vemprala
Logan Engstrom
...
Pengchuan Zhang
Shibani Santurkar
Greg Yang
Ashish Kapoor
Aleksander Madry
40
40
0
07 Jun 2021
Average-Reward Reinforcement Learning with Trust Region Methods
Xiaoteng Ma
Xiao-Jing Tang
Li Xia
Jun Yang
Qianchuan Zhao
24
16
0
07 Jun 2021
Same State, Different Task: Continual Reinforcement Learning without Interference
Samuel Kessler
Jack Parker-Holder
Philip J. Ball
S. Zohren
Stephen J. Roberts
CLL
OffRL
21
46
0
05 Jun 2021
What Matters for Adversarial Imitation Learning?
Manu Orsini
Anton Raichuk
Léonard Hussenot
Damien Vincent
Robert Dadashi
Sertan Girgin
M. Geist
Olivier Bachem
Olivier Pietquin
Marcin Andrychowicz
55
77
0
01 Jun 2021
Did I do that? Blame as a means to identify controlled effects in reinforcement learning
Oriol Corcoll
Youssef Mohamed
Raul Vicente
24
3
0
01 Jun 2021
Make Bipedal Robots Learn How to Imitate
Vishal Kumar
Sinnu Susan Thomas
20
0
0
15 May 2021
Deeply-Debiased Off-Policy Interval Estimation
C. Shi
Runzhe Wan
Victor Chernozhukov
R. Song
OffRL
30
36
0
10 May 2021
Robotic Surgery With Lean Reinforcement Learning
Yotam Barnoy
Molly O'Brien
Wenjie Wang
Gregory D. Hager
OffRL
46
20
0
03 May 2021
Ensemble Feature Extraction for Multi-Container Quality-Diversity Algorithms
L. Cazenille
32
9
0
03 May 2021
XAI-N: Sensor-based Robot Navigation using Expert Policies and Decision Trees
Aaron M. Roth
Jing Liang
Tianyi Zhou
52
8
0
22 Apr 2021
Low-rank State-action Value-function Approximation
Sergio Rozada
Victor M. Tenorio
A. Marques
OffRL
34
9
0
18 Apr 2021
Pylot: A Modular Platform for Exploring Latency-Accuracy Tradeoffs in Autonomous Vehicles
Ionel Gog
Sukrit Kalra
Peter Schafhalter
Matthew A. Wright
Joseph E. Gonzalez
Ion Stoica
35
69
0
16 Apr 2021
CropGym: a Reinforcement Learning Environment for Crop Management
H. Overweg
H. Berghuijs
Ioannis Athanasiadis
OffRL
12
34
0
09 Apr 2021
GEM: Group Enhanced Model for Learning Dynamical Control Systems
Philippe Hansen-Estruch
Wenling Shang
Lerrel Pinto
Pieter Abbeel
Stas Tiomkin
AI4CE
38
2
0
07 Apr 2021
Towards Real-World Deployment of Reinforcement Learning for Traffic Signal Control
Arthur Muller
Vishal S. Rangras
Georg Schnittker
Michael Waldmann
Maxim Friesen
Tobias Ferfers
Lukas Schreckenberg
Florian Hufen
J. Jasperneite
M. Wiering
OffRL
26
14
0
30 Mar 2021
Fundamental Challenges in Deep Learning for Stiff Contact Dynamics
Mihir Parmar
Mathew Halm
Michael Posa
29
36
0
29 Mar 2021
Model Predictive Actor-Critic: Accelerating Robot Skill Acquisition with Deep Reinforcement Learning
A. S. Morgan
Daljeet Nandha
Georgia Chalvatzaki
Carlo D'Eramo
A. Dollar
Jan Peters
48
43
0
25 Mar 2021
Online Baum-Welch algorithm for Hierarchical Imitation Learning
Vittorio Giammarino
I. Paschalidis
OffRL
22
2
0
22 Mar 2021
Reward-Reinforced Reinforcement Learning for Multi-agent Systems
Changgang Zheng
Shufan Yang
Juan Marcelo Parra Ullauri
A. García-Domínguez
Nelly Bencomo
12
9
0
22 Mar 2021
MaAST: Map Attention with Semantic Transformers for Efficient Visual Navigation
Zachary Seymour
Kowshik Thopalli
Niluthpol Chowdhury Mithun
Han-Pang Chiu
S. Samarasekera
Rakesh Kumar
3DPC
24
18
0
21 Mar 2021
TeachMyAgent: a Benchmark for Automatic Curriculum Learning in Deep RL
Clément Romac
Rémy Portelas
Katja Hofmann
Pierre-Yves Oudeyer
27
21
0
17 Mar 2021
Sample-efficient Reinforcement Learning Representation Learning with Curiosity Contrastive Forward Dynamics Model
Thanh Nguyen
Tung M. Luu
Thang Vu
Chang D. Yoo
23
17
0
15 Mar 2021
Vision-Based Mobile Robotics Obstacle Avoidance With Deep Reinforcement Learning
Patrick Wenzel
Torsten Schön
Laura Leal-Taixé
Daniel Cremers
28
36
0
08 Mar 2021
Comparing Popular Simulation Environments in the Scope of Robotics and Reinforcement Learning
Marian Korber
Johann Lange
S. Rediske
Simon Steinmann
Roland Glück
19
50
0
08 Mar 2021
Foresee then Evaluate: Decomposing Value Estimation with Latent Future Prediction
Hongyao Tang
Jianye Hao
Guangyong Chen
Pengfei Chen
Chong Chen
Yaodong Yang
Lu Zhang
Wulong Liu
Zhaopeng Meng
OffRL
35
4
0
03 Mar 2021
Learning to Fly -- a Gym Environment with PyBullet Physics for Reinforcement Learning of Multi-agent Quadcopter Control
Jacopo Panerati
Hehui Zheng
Siqi Zhou
James Xu
Amanda Prorok
Angela P. Schoellig
AI4CE
22
155
0
03 Mar 2021
Offline Reinforcement Learning with Pseudometric Learning
Robert Dadashi
Shideh Rezaeifar
Nino Vieillard
Léonard Hussenot
Olivier Pietquin
M. Geist
OffRL
39
40
0
02 Mar 2021
Generalizing to Unseen Domains: A Survey on Domain Generalization
Jindong Wang
Cuiling Lan
Chang-Shu Liu
Yidong Ouyang
Tao Qin
Wang Lu
Yiqiang Chen
Wenjun Zeng
Philip S. Yu
OOD
69
1,179
0
02 Mar 2021
Expected Value of Communication for Planning in Ad Hoc Teamwork
William Macke
Reuth Mirsky
Peter Stone
42
25
0
01 Mar 2021
Iterative Bounding MDPs: Learning Interpretable Policies via Non-Interpretable Methods
Nicholay Topin
Stephanie Milani
Fei Fang
Manuela Veloso
OffRL
29
32
0
25 Feb 2021
Modular Object-Oriented Games: A Task Framework for Reinforcement Learning, Psychology, and Neuroscience
Nicholas Watters
J. Tenenbaum
M. Jazayeri
GP
29
3
0
25 Feb 2021
Uncertainty Maximization in Partially Observable Domains: A Cognitive Perspective
Mirza Ramicic
Andrea Bonarini
21
3
0
22 Feb 2021
CKNet: A Convolutional Neural Network Based on Koopman Operator for Modeling Latent Dynamics from Pixels
Yongqian Xiao
Xin Xu
Yifei Shi
22
9
0
19 Feb 2021
Model-Invariant State Abstractions for Model-Based Reinforcement Learning
Manan Tomar
Amy Zhang
Roberto Calandra
Matthew E. Taylor
Joelle Pineau
27
24
0
19 Feb 2021
Sim-Env: Decoupling OpenAI Gym Environments from Simulation Models
Andreas Schuderer
Stefano Bromuri
M. V. Eekelen
AI4CE
18
2
0
19 Feb 2021
Training a Resilient Q-Network against Observational Interference
Chao-Han Huck Yang
I-Te Danny Hung
Ouyang Yi
Pin-Yu Chen
OOD
31
14
0
18 Feb 2021
Automated Curriculum Learning for Embodied Agents: A Neuroevolutionary Approach
Nicola Milano
S. Nolfi
92
10
0
17 Feb 2021
Learning from Demonstrations using Signal Temporal Logic
Aniruddh Gopinath Puranic
Jyotirmoy V. Deshmukh
Stefanos Nikolaidis
40
25
0
15 Feb 2021
Sliced Multi-Marginal Optimal Transport
Samuel N. Cohen
Alexander Terenin
Yannik Pitcan
Brandon Amos
M. Deisenroth
K. S. S. Kumar
OT
23
9
0
14 Feb 2021
PerSim: Data-Efficient Offline Reinforcement Learning with Heterogeneous Agents via Personalized Simulators
Anish Agarwal
Abdullah Alomar
Varkey Alumootil
Devavrat Shah
Dennis Shen
Zhi Xu
Cindy Yang
OffRL
18
18
0
13 Feb 2021
Scalable Bayesian Inverse Reinforcement Learning
Alex J. Chan
M. Schaar
OffRL
BDL
26
67
0
12 Feb 2021
Multi-Task Reinforcement Learning with Context-based Representations
Shagun Sodhani
Amy Zhang
Joelle Pineau
37
182
0
11 Feb 2021
Towards Hierarchical Task Decomposition using Deep Reinforcement Learning for Pick and Place Subtasks
Luca Marzari
Ameya Pore
Diego Dall'Alba
G. Aragon-Camarasa
Alessandro Farinelli
Paolo Fiorini
38
28
0
08 Feb 2021
How to Train Your Robot with Deep Reinforcement Learning; Lessons We've Learned
Julian Ibarz
Jie Tan
Chelsea Finn
Mrinal Kalakrishnan
P. Pastor
Sergey Levine
OffRL
20
520
0
04 Feb 2021
Embodied Intelligence via Learning and Evolution
Agrim Gupta
Silvio Savarese
Surya Ganguli
Li Fei-Fei
AI4CE
27
232
0
03 Feb 2021
Differentiable Trust Region Layers for Deep Reinforcement Learning
Fabian Otto
P. Becker
Ngo Anh Vien
Hanna Ziesche
Gerhard Neumann
OffRL
41
19
0
22 Jan 2021
Mind the Gap when Conditioning Amortised Inference in Sequential Latent-Variable Models
Justin Bayer
Maximilian Soelch
Atanas Mirchev
Baris Kayalibay
Patrick van der Smagt
31
15
0
18 Jan 2021
SimGAN: Hybrid Simulator Identification for Domain Adaptation via Adversarial Reinforcement Learning
Yifeng Jiang
Tingnan Zhang
Daniel Ho
Yunfei Bai
Chenxi Liu
Sergey Levine
Jie Tan
GAN
29
54
0
15 Jan 2021