Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1606.01540
Cited By
OpenAI Gym
5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL
ODL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"OpenAI Gym"
50 / 2,578 papers shown
Title
Critic Sequential Monte Carlo
Vasileios Lioutas
J. Lavington
Justice Sefas
Matthew Niedoba
Yunpeng Liu
Berend Zwartsenberg
Setareh Dabiri
Frank Wood
Adam Scibior
107
7
0
30 May 2022
TaSIL: Taylor Series Imitation Learning
Daniel Pfrommer
Thomas T. Zhang
Stephen Tu
Nikolai Matni
76
17
0
30 May 2022
BulletArm: An Open-Source Robotic Manipulation Benchmark and Learning Framework
Dian Wang
Colin Kohler
Xu Zhu
Ming Jia
Robert Platt
67
9
0
28 May 2022
Why So Pessimistic? Estimating Uncertainties for Offline RL through Ensembles, and Why Their Independence Matters
Seyed Kamyar Seyed Ghasemipour
S. Gu
Ofir Nachum
OffRL
90
72
0
27 May 2022
MyoSuite -- A contact-rich simulation suite for musculoskeletal motor control
Vittorio Caggiano
Huawei Wang
G. Durandau
Massimo Sartori
Vikash Kumar
80
99
0
26 May 2022
SFP: State-free Priors for Exploration in Off-Policy Reinforcement Learning
Marco Bagatella
Sammy Christen
Otmar Hilliges
OffRL
123
6
0
26 May 2022
Efficient Reinforcement Learning from Demonstration Using Local Ensemble and Reparameterization with Split and Merge of Expert Policies
Yu Wang
Fang Liu
86
0
0
23 May 2022
Offline Policy Comparison with Confidence: Benchmarks and Baselines
Anurag Koul
Mariano Phielipp
Alan Fern
OffRL
74
0
0
22 May 2022
ARLO: A Framework for Automated Reinforcement Learning
Marco Mussi
Davide Lombarda
Alberto Maria Metelli
F. Trovò
Marcello Restelli
OffRL
80
4
0
20 May 2022
The Sufficiency of Off-Policyness and Soft Clipping: PPO is still Insufficient according to an Off-Policy Measure
Xing Chen
Dongcui Diao
Hechang Chen
Hengshuai Yao
Haiyin Piao
Zhixiao Sun
Zhiwei Yang
Randy Goebel
Bei Jiang
Yi-Ju Chang
OffRL
142
9
0
20 May 2022
Towards biologically plausible Dreaming and Planning in recurrent spiking networks
C. Capone
P. Paolucci
CLL
51
7
0
20 May 2022
HandoverSim: A Simulation Framework and Benchmark for Human-to-Robot Object Handovers
Yu-Wei Chao
Chris Paxton
Yu Xiang
Wei Yang
Balakumar Sundaralingam
Tao Chen
Adithyavairavan Murali
Maya Cakmak
Dieter Fox
111
17
0
19 May 2022
Dexterous Robotic Manipulation using Deep Reinforcement Learning and Knowledge Transfer for Complex Sparse Reward-based Tasks
Qiang Wang
Francisco Roldan Sanchez
Robert McCarthy
David Córdova Bulens
Kevin McGuinness
Noel E. O'Connor
M. Wuthrich
Felix Widmaier
Stefan Bauer
S. Redmond
107
15
0
19 May 2022
Data Valuation for Offline Reinforcement Learning
Amir Abolfazli
Gregory Palmer
D. Kudenko
OffRL
38
0
0
19 May 2022
TC-Driver: Trajectory Conditioned Driving for Robust Autonomous Racing -- A Reinforcement Learning Approach
Edoardo Ghignone
Nicolas Baumann
Mike Boss
Michele Magno
92
15
0
19 May 2022
Generating Explanations from Deep Reinforcement Learning Using Episodic Memory
Sam Blakeman
D. Mareschal
63
3
0
18 May 2022
Neighborhood Mixup Experience Replay: Local Convex Interpolation for Improved Sample Efficiency in Continuous Control Tasks
Ryan M Sander
Wilko Schwarting
Tim Seyde
Igor Gilitschenski
S. Karaman
Daniela Rus
69
2
0
18 May 2022
Robust Losses for Learning Value Functions
Andrew Patterson
Victor Liao
Martha White
97
13
0
17 May 2022
MIDGARD: A Simulation Platform for Autonomous Navigation in Unstructured Environments
Giuseppe Vecchio
S. Palazzo
D. Guastella
Ignacio Carlucho
Stefano V. Albrecht
Giovanni Muscato
C. Spampinato
47
0
0
17 May 2022
Automatic Acquisition of a Repertoire of Diverse Grasping Trajectories through Behavior Shaping and Novelty Search
Aurélien Morel
Yakumo Kunimoto
Alexandre Coninx
Stéphane Doncieux
99
13
0
17 May 2022
Qualitative Differences Between Evolutionary Strategies and Reinforcement Learning Methods for Control of Autonomous Agents
Nicola Milano
S. Nolfi
59
0
0
16 May 2022
Policy Gradient Method For Robust Reinforcement Learning
Yue Wang
Shaofeng Zou
132
77
0
15 May 2022
A Learning Approach for Joint Design of Event-triggered Control and Power-Efficient Resource Allocation
Atefeh Termehchi
M. Rasti
26
5
0
14 May 2022
Cliff Diving: Exploring Reward Surfaces in Reinforcement Learning Environments
Ryan Sullivan
J. K. Terry
Benjamin Black
John P. Dickerson
92
8
0
14 May 2022
Efficient Off-Policy Reinforcement Learning via Brain-Inspired Computing
Yang Ni
Danny Abraham
Mariam Issa
Yeseong Kim
Pietro Mercati
Mohsen Imani
OffRL
57
12
0
14 May 2022
Unified Distributed Environment
Woong Gyu La
Sunil Muralidhara
Ling-Xue Kong
Pratik Nichat
53
2
0
14 May 2022
Distributed Transmission Control for Wireless Networks using Multi-Agent Reinforcement Learning
Collin Farquhar
P. Kumar
Anu Jagannath
Jithin Jagannath
AI4CE
50
2
0
13 May 2022
Characterizing the Action-Generalization Gap in Deep Q-Learning
Zhi Zhou
Cameron Allen
Kavosh Asadi
George Konidaris
50
2
0
11 May 2022
Simultaneous Double Q-learning with Conservative Advantage Learning for Actor-Critic Methods
Qing Li
Wen-gang Zhou
Zhenbo Lu
Houqiang Li
OffRL
37
2
0
08 May 2022
Search-Based Testing of Reinforcement Learning
Martin Tappler
Filip Cano Córdoba
B. Aichernig
Bettina Könighofer
ELM
OffRL
60
24
0
07 May 2022
Dynamically writing coupled memories using a reinforcement learning agent, meeting physical bounds
Théo Jules
Laura Michel
A. Douin
F. Lechenault
AI4CE
21
0
0
06 May 2022
Vehicle management in a modular production context using Deep Q-Learning
Lucain Pouget
Timo Hasenbichler
Jakob Auer
K. Lichtenegger
Andreas Windisch
21
0
0
06 May 2022
Variance Reduction based Partial Trajectory Reuse to Accelerate Policy Gradient Optimization
Hua Zheng
Wei Xie
76
3
0
06 May 2022
Learn-to-Race Challenge 2022: Benchmarking Safe Learning and Cross-domain Generalisation in Autonomous Racing
Jonathan M Francis
Bingqing Chen
Siddha Ganju
Sidharth Kathpal
Jyotish Poonganam
...
Ivan Zhukov
Max Kumskoy
Anirudh Koul
Jean Oh
Eric Nyberg
99
13
0
05 May 2022
RLFlow: Optimising Neural Network Subgraph Transformation with World Models
Sean Parker
Sami Alabed
Eiko Yoneki
34
0
0
03 May 2022
Triangular Dropout: Variable Network Width without Retraining
Edward W. Staley
Jared Markowitz
57
2
0
02 May 2022
Neural Implicit Representations for Physical Parameter Inference from a Single Video
Florian Hofherr
Lukas Koestler
Florian Bernard
Zorah Lähner
AI4CE
132
10
0
29 Apr 2022
Watts: Infrastructure for Open-Ended Learning
Aaron Dharna
Charlie Summers
Rohin Dasari
Julian Togelius
Amy K. Hoover
55
1
0
28 Apr 2022
Optimizing Nitrogen Management with Deep Reinforcement Learning and Crop Simulations
Jing Wu
Ran Tao
Pan Zhao
N. F. Martin
N. Hovakimyan
OffRL
75
47
0
21 Apr 2022
Learning to Fold Real Garments with One Arm: A Case Study in Cloud-Based Robotics Research
Ryan Hoque
K. Shivakumar
Shrey Aeron
Gabriel Deza
Aditya Ganapathi
Adrian S. Wong
Johnny Lee
Andy Zeng
Vincent Vanhoucke
Ken Goldberg
95
23
0
21 Apr 2022
FedKL: Tackling Data Heterogeneity in Federated Reinforcement Learning by Penalizing KL Divergence
Zhijie Xie
Shenghui Song
FedML
92
51
0
18 Apr 2022
Exploiting Embodied Simulation to Detect Novel Object Classes Through Interaction
Nikhil Krishnaswamy
Sadaf Ghaffari
50
4
0
17 Apr 2022
Efficient Bayesian Policy Reuse with a Scalable Observation Model in Deep Reinforcement Learning
Donghan Xie
Zhi Wang
Chunlin Chen
D. Dong
OffRL
96
2
0
16 Apr 2022
Accelerated Policy Learning with Parallel Differentiable Simulation
Jie Xu
Viktor Makoviychuk
Yashraj S. Narang
Fabio Ramos
Wojciech Matusik
Animesh Garg
Miles Macklin
77
95
0
14 Apr 2022
deep-significance - Easy and Meaningful Statistical Significance Testing in the Age of Neural Networks
Dennis Ulmer
Christian Hardmeier
J. Frellsen
139
42
0
14 Apr 2022
Effective Mutation Rate Adaptation through Group Elite Selection
Akarsh Kumar
B. Liu
Risto Miikkulainen
Peter Stone
28
10
0
11 Apr 2022
MR-iNet Gym: Framework for Edge Deployment of Deep Reinforcement Learning on Embedded Software Defined Radio
Jithin Jagannath
Kian Hamedani
Collin Farquhar
Keyvan Ramezanpour
Anu Jagannath
65
6
0
09 Apr 2022
A Spiking Neural Network Structure Implementing Reinforcement Learning
Mikhail Kiselev
38
0
0
09 Apr 2022
Grounding Hindsight Instructions in Multi-Goal Reinforcement Learning for Robotics
Frank Röder
Manfred Eppe
S. Wermter
80
7
0
08 Apr 2022
Imitating, Fast and Slow: Robust learning from demonstrations via decision-time planning
Carl Qi
Pieter Abbeel
Aditya Grover
OffRL
35
3
0
07 Apr 2022
Previous
1
2
3
...
18
19
20
...
50
51
52
Next