ResearchTrend.AI
OpenAI Gym


5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL, ODL

Papers citing "OpenAI Gym"

50 / 2,578 papers shown
Critic Sequential Monte Carlo
Vasileios Lioutas
J. Lavington
Justice Sefas
Matthew Niedoba
Yunpeng Liu
Berend Zwartsenberg
Setareh Dabiri
Frank Wood
Adam Scibior
107
7
0
30 May 2022
TaSIL: Taylor Series Imitation Learning
Daniel Pfrommer
Thomas T. Zhang
Stephen Tu
Nikolai Matni
76
17
0
30 May 2022
BulletArm: An Open-Source Robotic Manipulation Benchmark and Learning Framework
Dian Wang
Colin Kohler
Xu Zhu
Ming Jia
Robert Platt
67
9
0
28 May 2022
Why So Pessimistic? Estimating Uncertainties for Offline RL through Ensembles, and Why Their Independence Matters
Seyed Kamyar Seyed Ghasemipour
S. Gu
Ofir Nachum
OffRL
90
72
0
27 May 2022
MyoSuite -- A contact-rich simulation suite for musculoskeletal motor control
Vittorio Caggiano
Huawei Wang
G. Durandau
Massimo Sartori
Vikash Kumar
80
99
0
26 May 2022
SFP: State-free Priors for Exploration in Off-Policy Reinforcement Learning
Marco Bagatella
Sammy Christen
Otmar Hilliges
OffRL
123
6
0
26 May 2022
Efficient Reinforcement Learning from Demonstration Using Local Ensemble and Reparameterization with Split and Merge of Expert Policies
Yu Wang
Fang Liu
86
0
0
23 May 2022
Offline Policy Comparison with Confidence: Benchmarks and Baselines
Anurag Koul
Mariano Phielipp
Alan Fern
OffRL
74
0
0
22 May 2022
ARLO: A Framework for Automated Reinforcement Learning
Marco Mussi
Davide Lombarda
Alberto Maria Metelli
F. Trovò
Marcello Restelli
OffRL
80
4
0
20 May 2022
The Sufficiency of Off-Policyness and Soft Clipping: PPO is still Insufficient according to an Off-Policy Measure
Xing Chen
Dongcui Diao
Hechang Chen
Hengshuai Yao
Haiyin Piao
Zhixiao Sun
Zhiwei Yang
Randy Goebel
Bei Jiang
Yi-Ju Chang
OffRL
142
9
0
20 May 2022
Towards biologically plausible Dreaming and Planning in recurrent spiking networks
C. Capone
P. Paolucci
CLL
51
7
0
20 May 2022
HandoverSim: A Simulation Framework and Benchmark for Human-to-Robot Object Handovers
Yu-Wei Chao
Chris Paxton
Yu Xiang
Wei Yang
Balakumar Sundaralingam
Tao Chen
Adithyavairavan Murali
Maya Cakmak
Dieter Fox
111
17
0
19 May 2022
Dexterous Robotic Manipulation using Deep Reinforcement Learning and Knowledge Transfer for Complex Sparse Reward-based Tasks
Qiang Wang
Francisco Roldan Sanchez
Robert McCarthy
David Córdova Bulens
Kevin McGuinness
Noel E. O'Connor
M. Wuthrich
Felix Widmaier
Stefan Bauer
S. Redmond
107
15
0
19 May 2022
Data Valuation for Offline Reinforcement Learning
Amir Abolfazli
Gregory Palmer
D. Kudenko
OffRL
38
0
0
19 May 2022
TC-Driver: Trajectory Conditioned Driving for Robust Autonomous Racing -- A Reinforcement Learning Approach
Edoardo Ghignone
Nicolas Baumann
Mike Boss
Michele Magno
92
15
0
19 May 2022
Generating Explanations from Deep Reinforcement Learning Using Episodic Memory
Sam Blakeman
D. Mareschal
63
3
0
18 May 2022
Neighborhood Mixup Experience Replay: Local Convex Interpolation for Improved Sample Efficiency in Continuous Control Tasks
Ryan M Sander
Wilko Schwarting
Tim Seyde
Igor Gilitschenski
S. Karaman
Daniela Rus
69
2
0
18 May 2022
Robust Losses for Learning Value Functions
Andrew Patterson
Victor Liao
Martha White
97
13
0
17 May 2022
MIDGARD: A Simulation Platform for Autonomous Navigation in Unstructured Environments
Giuseppe Vecchio
S. Palazzo
D. Guastella
Ignacio Carlucho
Stefano V. Albrecht
Giovanni Muscato
C. Spampinato
47
0
0
17 May 2022
Automatic Acquisition of a Repertoire of Diverse Grasping Trajectories through Behavior Shaping and Novelty Search
Aurélien Morel
Yakumo Kunimoto
Alexandre Coninx
Stéphane Doncieux
99
13
0
17 May 2022
Qualitative Differences Between Evolutionary Strategies and Reinforcement Learning Methods for Control of Autonomous Agents
Nicola Milano
S. Nolfi
59
0
0
16 May 2022
Policy Gradient Method For Robust Reinforcement Learning
Yue Wang
Shaofeng Zou
132
77
0
15 May 2022
A Learning Approach for Joint Design of Event-triggered Control and Power-Efficient Resource Allocation
Atefeh Termehchi
M. Rasti
26
5
0
14 May 2022
Cliff Diving: Exploring Reward Surfaces in Reinforcement Learning Environments
Ryan Sullivan
J. K. Terry
Benjamin Black
John P. Dickerson
92
8
0
14 May 2022
Efficient Off-Policy Reinforcement Learning via Brain-Inspired Computing
Yang Ni
Danny Abraham
Mariam Issa
Yeseong Kim
Pietro Mercati
Mohsen Imani
OffRL
57
12
0
14 May 2022
Unified Distributed Environment
Woong Gyu La
Sunil Muralidhara
Ling-Xue Kong
Pratik Nichat
53
2
0
14 May 2022
Distributed Transmission Control for Wireless Networks using Multi-Agent Reinforcement Learning
Collin Farquhar
P. Kumar
Anu Jagannath
Jithin Jagannath
AI4CE
50
2
0
13 May 2022
Characterizing the Action-Generalization Gap in Deep Q-Learning
Zhi Zhou
Cameron Allen
Kavosh Asadi
George Konidaris
50
2
0
11 May 2022
Simultaneous Double Q-learning with Conservative Advantage Learning for Actor-Critic Methods
Qing Li
Wen-gang Zhou
Zhenbo Lu
Houqiang Li
OffRL
37
2
0
08 May 2022
Search-Based Testing of Reinforcement Learning
Martin Tappler
Filip Cano Córdoba
B. Aichernig
Bettina Könighofer
ELM, OffRL
60
24
0
07 May 2022
Dynamically writing coupled memories using a reinforcement learning agent, meeting physical bounds
Théo Jules
Laura Michel
A. Douin
F. Lechenault
AI4CE
21
0
0
06 May 2022
Vehicle management in a modular production context using Deep Q-Learning
Lucain Pouget
Timo Hasenbichler
Jakob Auer
K. Lichtenegger
Andreas Windisch
21
0
0
06 May 2022
Variance Reduction based Partial Trajectory Reuse to Accelerate Policy Gradient Optimization
Hua Zheng
Wei Xie
76
3
0
06 May 2022
Learn-to-Race Challenge 2022: Benchmarking Safe Learning and Cross-domain Generalisation in Autonomous Racing
Jonathan M Francis
Bingqing Chen
Siddha Ganju
Sidharth Kathpal
Jyotish Poonganam
...
Ivan Zhukov
Max Kumskoy
Anirudh Koul
Jean Oh
Eric Nyberg
99
13
0
05 May 2022
RLFlow: Optimising Neural Network Subgraph Transformation with World Models
Sean Parker
Sami Alabed
Eiko Yoneki
34
0
0
03 May 2022
Triangular Dropout: Variable Network Width without Retraining
Edward W. Staley
Jared Markowitz
57
2
0
02 May 2022
Neural Implicit Representations for Physical Parameter Inference from a Single Video
Florian Hofherr
Lukas Koestler
Florian Bernard
Zorah Lähner
AI4CE
132
10
0
29 Apr 2022
Watts: Infrastructure for Open-Ended Learning
Aaron Dharna
Charlie Summers
Rohin Dasari
Julian Togelius
Amy K. Hoover
55
1
0
28 Apr 2022
Optimizing Nitrogen Management with Deep Reinforcement Learning and Crop Simulations
Jing Wu
Ran Tao
Pan Zhao
N. F. Martin
N. Hovakimyan
OffRL
75
47
0
21 Apr 2022
Learning to Fold Real Garments with One Arm: A Case Study in Cloud-Based Robotics Research
Ryan Hoque
K. Shivakumar
Shrey Aeron
Gabriel Deza
Aditya Ganapathi
Adrian S. Wong
Johnny Lee
Andy Zeng
Vincent Vanhoucke
Ken Goldberg
95
23
0
21 Apr 2022
FedKL: Tackling Data Heterogeneity in Federated Reinforcement Learning by Penalizing KL Divergence
Zhijie Xie
Shenghui Song
FedML
92
51
0
18 Apr 2022
Exploiting Embodied Simulation to Detect Novel Object Classes Through Interaction
Nikhil Krishnaswamy
Sadaf Ghaffari
50
4
0
17 Apr 2022
Efficient Bayesian Policy Reuse with a Scalable Observation Model in Deep Reinforcement Learning
Donghan Xie
Zhi Wang
Chunlin Chen
D. Dong
OffRL
96
2
0
16 Apr 2022
Accelerated Policy Learning with Parallel Differentiable Simulation
Jie Xu
Viktor Makoviychuk
Yashraj S. Narang
Fabio Ramos
Wojciech Matusik
Animesh Garg
Miles Macklin
77
95
0
14 Apr 2022
deep-significance - Easy and Meaningful Statistical Significance Testing in the Age of Neural Networks
Dennis Ulmer
Christian Hardmeier
J. Frellsen
139
42
0
14 Apr 2022
Effective Mutation Rate Adaptation through Group Elite Selection
Akarsh Kumar
B. Liu
Risto Miikkulainen
Peter Stone
28
10
0
11 Apr 2022
MR-iNet Gym: Framework for Edge Deployment of Deep Reinforcement Learning on Embedded Software Defined Radio
Jithin Jagannath
Kian Hamedani
Collin Farquhar
Keyvan Ramezanpour
Anu Jagannath
65
6
0
09 Apr 2022
A Spiking Neural Network Structure Implementing Reinforcement Learning
Mikhail Kiselev
38
0
0
09 Apr 2022
Grounding Hindsight Instructions in Multi-Goal Reinforcement Learning for Robotics
Frank Röder
Manfred Eppe
S. Wermter
80
7
0
08 Apr 2022
Imitating, Fast and Slow: Robust learning from demonstrations via decision-time planning
Carl Qi
Pieter Abbeel
Aditya Grover
OffRL
35
3
0
07 Apr 2022