ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1606.01540
  4. Cited By
OpenAI Gym

OpenAI Gym

5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
    OffRLODL
ArXiv (abs)PDFHTML

Papers citing "OpenAI Gym"

50 / 2,578 papers shown
Title
μ-DDRL: A QoS-Aware Distributed Deep Reinforcement Learning
  Technique for Service Offloading in Fog computing Environments
μ-DDRL: A QoS-Aware Distributed Deep Reinforcement Learning Technique for Service Offloading in Fog computing Environments
M. Goudarzi
M. A. Rodriguez
Majid Sarvi
Rajkumar Buyya
OffRL
79
3
0
13 Oct 2023
Distance-rank Aware Sequential Reward Learning for Inverse Reinforcement
  Learning with Sub-optimal Demonstrations
Distance-rank Aware Sequential Reward Learning for Inverse Reinforcement Learning with Sub-optimal Demonstrations
Lu Li
Yuxin Pan
Ruobing Chen
Jie Liu
Zilin Wang
Yu Liu
Zhiheng Li
125
0
0
13 Oct 2023
Octopus: Embodied Vision-Language Programmer from Environmental Feedback
Octopus: Embodied Vision-Language Programmer from Environmental Feedback
Jingkang Yang
Yuhao Dong
Shuai Liu
Yue Liu
Ziyue Wang
...
Haoran Tan
Jiamu Kang
Yuanhan Zhang
Kaiyang Zhou
Ziwei Liu
LM&Ro
89
49
0
12 Oct 2023
LightZero: A Unified Benchmark for Monte Carlo Tree Search in General
  Sequential Decision Scenarios
LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios
Yazhe Niu
Yuan Pu
Zhenjie Yang
Xueyan Li
Tong Zhou
Jiyuan Ren
Shuai Hu
Hongsheng Li
Yu Liu
139
15
0
12 Oct 2023
Accountability in Offline Reinforcement Learning: Explaining Decisions
  with a Corpus of Examples
Accountability in Offline Reinforcement Learning: Explaining Decisions with a Corpus of Examples
Hao Sun
Alihan Huyuk
Daniel Jarrett
M. Schaar
OffRL
113
8
0
11 Oct 2023
RANS: Highly-Parallelised Simulator for Reinforcement Learning based
  Autonomous Navigating Spacecrafts
RANS: Highly-Parallelised Simulator for Reinforcement Learning based Autonomous Navigating Spacecrafts
Matteo El Hariry
Antoine Richard
Miguel Olivares-Mendez
74
4
0
11 Oct 2023
Imitation Learning from Purified Demonstration
Imitation Learning from Purified Demonstration
Yunke Wang
Minjing Dong
Bo Du
Chang Xu
68
1
0
11 Oct 2023
RoboHive: A Unified Framework for Robot Learning
RoboHive: A Unified Framework for Robot Learning
Vikash Kumar
Rutav Shah
Gaoyue Zhou
Vincent Moens
Vittorio Caggiano
Jay Vakil
Abhishek Gupta
Aravind Rajeswaran
67
25
0
10 Oct 2023
Realizing Stabilized Landing for Computation-Limited Reusable Rockets: A
  Quantum Reinforcement Learning Approach
Realizing Stabilized Landing for Computation-Limited Reusable Rockets: A Quantum Reinforcement Learning Approach
Gyusun Kim
Jaehyun Chung
Soohyun Park
51
8
0
10 Oct 2023
Initial Task Assignment in Multi-Human Multi-Robot Teams: An
  Attention-enhanced Hierarchical Reinforcement Learning Approach
Initial Task Assignment in Multi-Human Multi-Robot Teams: An Attention-enhanced Hierarchical Reinforcement Learning Approach
Ruiqi Wang
Dezhong Zhao
Arjun Gupte
Byung-Cheol Min
40
1
0
08 Oct 2023
Surgical Gym: A high-performance GPU-based platform for reinforcement
  learning with surgical robots
Surgical Gym: A high-performance GPU-based platform for reinforcement learning with surgical robots
Samuel Schmidgall
Axel Krieger
Jason K. Eshraghian
OOD
88
16
0
07 Oct 2023
Searching for Optimal Runtime Assurance via Reachability and
  Reinforcement Learning
Searching for Optimal Runtime Assurance via Reachability and Reinforcement Learning
Kristina Miller
Christopher K. Zeitler
William Shen
Kerianne L. Hobbs
Sayan Mitra
John Schierman
Mahesh Viswanathan
49
0
0
06 Oct 2023
Improving Reinforcement Learning Efficiency with Auxiliary Tasks in
  Non-Visual Environments: A Comparison
Improving Reinforcement Learning Efficiency with Auxiliary Tasks in Non-Visual Environments: A Comparison
Moritz Lange
Noah Krystiniak
Raphael C. Engelhardt
Wolfgang Konen
Laurenz Wiskott
OffRL
59
1
0
06 Oct 2023
On Representation Complexity of Model-based and Model-free Reinforcement
  Learning
On Representation Complexity of Model-based and Model-free Reinforcement Learning
Hanlin Zhu
Baihe Huang
Stuart Russell
OffRL
76
4
0
03 Oct 2023
Imitation Learning from Observation through Optimal Transport
Imitation Learning from Observation through Optimal Transport
Wei-Di Chang
Scott Fujimoto
David Meger
Gregory Dudek
61
4
0
02 Oct 2023
Accurate Simulation and Parameter Identification of Deformable Linear Objects using Discrete Elastic Rods in Generalized Coordinates
Accurate Simulation and Parameter Identification of Deformable Linear Objects using Discrete Elastic Rods in Generalized Coordinates
Qi Jing Chen
Timothy Bretl
AI4CE
48
0
0
02 Oct 2023
Optimizing with Low Budgets: a Comparison on the Black-box Optimization
  Benchmarking Suite and OpenAI Gym
Optimizing with Low Budgets: a Comparison on the Black-box Optimization Benchmarking Suite and OpenAI Gym
Elena Raponi
Nathanaël Carraz Rakotonirina
Jérémy Rapin
Carola Doerr
O. Teytaud
106
6
0
29 Sep 2023
HyperPPO: A scalable method for finding small policies for robotic
  control
HyperPPO: A scalable method for finding small policies for robotic control
Luming Tang
Zhehui Huang
Gaurav Sukhatme
65
4
0
28 Sep 2023
CasIL: Cognizing and Imitating Skills via a Dual Cognition-Action
  Architecture
CasIL: Cognizing and Imitating Skills via a Dual Cognition-Action Architecture
Zixuan Chen
Ze Ji
Shuyang Liu
Jing Huo
Yiyu Chen
Yang Gao
54
1
0
28 Sep 2023
Stackelberg Batch Policy Learning
Stackelberg Batch Policy Learning
Wenzhuo Zhou
Annie Qu
OffRL
78
1
0
28 Sep 2023
Enhancing data efficiency in reinforcement learning: a novel imagination
  mechanism based on mesh information propagation
Enhancing data efficiency in reinforcement learning: a novel imagination mechanism based on mesh information propagation
Zihang Wang
Maowei Jiang
AI4CE
78
0
0
25 Sep 2023
Distributional Shift-Aware Off-Policy Interval Estimation: A Unified
  Error Quantification Framework
Distributional Shift-Aware Off-Policy Interval Estimation: A Unified Error Quantification Framework
Wenzhuo Zhou
Yuhan Li
Ruoqing Zhu
Annie Qu
OffRL
83
5
0
23 Sep 2023
How to Fine-tune the Model: Unified Model Shift and Model Bias Policy
  Optimization
How to Fine-tune the Model: Unified Model Shift and Model Bias Policy Optimization
Hai Zhang
Hang Yu
Junqiao Zhao
Di Zhang
Chang Huang
Hongtu Zhou
Xiao Zhang
Chen Ye
87
10
0
22 Sep 2023
Learning Actions and Control of Focus of Attention with a Log-Polar-like
  Sensor
Learning Actions and Control of Focus of Attention with a Log-Polar-like Sensor
Robin Göransson
Volker Krueger
29
0
0
22 Sep 2023
Trip Planning for Autonomous Vehicles with Wireless Data Transfer Needs
  Using Reinforcement Learning
Trip Planning for Autonomous Vehicles with Wireless Data Transfer Needs Using Reinforcement Learning
Yousef AlSaqabi
Bhaskar Krishnamachari
55
2
0
21 Sep 2023
State2Explanation: Concept-Based Explanations to Benefit Agent Learning
  and User Understanding
State2Explanation: Concept-Based Explanations to Benefit Agent Learning and User Understanding
Devleena Das
Sonia Chernova
Been Kim
LRMLLMAG
114
24
0
21 Sep 2023
Learning to Recover for Safe Reinforcement Learning
Learning to Recover for Safe Reinforcement Learning
Haoyu Wang
Xin Yuan
Qinqing Ren
56
0
0
21 Sep 2023
Practical Probabilistic Model-based Deep Reinforcement Learning by
  Integrating Dropout Uncertainty and Trajectory Sampling
Practical Probabilistic Model-based Deep Reinforcement Learning by Integrating Dropout Uncertainty and Trajectory Sampling
Wenjun Huang
Yunduan Cui
Huiyun Li
Xin Wu
MU
121
0
0
20 Sep 2023
Monte-Carlo tree search with uncertainty propagation via optimal
  transport
Monte-Carlo tree search with uncertainty propagation via optimal transport
Tuan Dam
Pascal Stenger
Lukas Schneider
Joni Pajarinen
Carlo DÉramo
Odalric-Ambrym Maillard
46
1
0
19 Sep 2023
gym-saturation: Gymnasium environments for saturation provers (System
  description)
gym-saturation: Gymnasium environments for saturation provers (System description)
Boris Shminke
71
1
0
16 Sep 2023
DOMAIN: MilDly COnservative Model-BAsed OfflINe Reinforcement Learning
DOMAIN: MilDly COnservative Model-BAsed OfflINe Reinforcement Learning
Xiao-Yin Liu
Xiao-Hu Zhou
Xiaoliang Xie
Shiqi Liu
Zhen-Qiu Feng
Hao Li
Mei-Jiang Gui
Tian-Yu Xiang
De-Xing Huang
Zeng-Guang Hou
OffRLOOD
84
5
0
16 Sep 2023
Deep Multi-Agent Reinforcement Learning for Decentralized Active
  Hypothesis Testing
Deep Multi-Agent Reinforcement Learning for Decentralized Active Hypothesis Testing
Hadar Szostak
Kobi Cohen
59
4
0
14 Sep 2023
Stable In-hand Manipulation with Finger Specific Multi-agent Shadow
  Reward
Stable In-hand Manipulation with Finger Specific Multi-agent Shadow Reward
Lingfeng Tao
Jiucai Zhang
Xiaoli Zhang
56
0
0
13 Sep 2023
Investigating the Impact of Action Representations in Policy Gradient
  Algorithms
Investigating the Impact of Action Representations in Policy Gradient Algorithms
Jan Schneider-Barnes
Pierre Schumacher
Daniel Haeufle
Bernhard Scholkopf
Le Chen
OffRL
41
2
0
13 Sep 2023
Attention Loss Adjusted Prioritized Experience Replay
Attention Loss Adjusted Prioritized Experience Replay
Zhuoying Chen
Huiping Li
Rizhong Wang
53
2
0
13 Sep 2023
Fitness Approximation through Machine Learning
Fitness Approximation through Machine Learning
Itai Tzruia
Tomer Halperin
Moshe Sipper
Achiya Elyasaf
45
2
0
06 Sep 2023
ORL-AUDITOR: Dataset Auditing in Offline Deep Reinforcement Learning
ORL-AUDITOR: Dataset Auditing in Offline Deep Reinforcement Learning
L. Du
Min Chen
Mingyang Sun
Shouling Ji
Peng Cheng
Jiming Chen
Zhikun Zhang
OffRL
101
9
0
06 Sep 2023
Representation Learning for Sequential Volumetric Design Tasks
Representation Learning for Sequential Volumetric Design Tasks
Md Ferdous Alam
Yi Wang
Linh Tran
Chin-Yi Cheng
Jieliang Luo
3DV
91
2
0
05 Sep 2023
Distributionally Robust Model-based Reinforcement Learning with Large
  State Spaces
Distributionally Robust Model-based Reinforcement Learning with Large State Spaces
Shyam Sundhar Ramesh
Pier Giuseppe Sessa
Yifan Hu
Andreas Krause
Ilija Bogunovic
OOD
78
12
0
05 Sep 2023
Marginalized Importance Sampling for Off-Environment Policy Evaluation
Marginalized Importance Sampling for Off-Environment Policy Evaluation
Pulkit Katdare
Nan Jiang
Katherine Driggs-Campbell
OffRL
89
4
0
04 Sep 2023
Leveraging Reward Consistency for Interpretable Feature Discovery in
  Reinforcement Learning
Leveraging Reward Consistency for Interpretable Feature Discovery in Reinforcement Learning
Qisen Yang
Huanqian Wang
Mukun Tong
Wenjie Shi
Gao Huang
Shiji Song
72
5
0
04 Sep 2023
Hundreds Guide Millions: Adaptive Offline Reinforcement Learning with
  Expert Guidance
Hundreds Guide Millions: Adaptive Offline Reinforcement Learning with Expert Guidance
Qisen Yang
Shenzhi Wang
Qihang Zhang
Gao Huang
Shiji Song
OffRLOnRL
79
8
0
04 Sep 2023
Solving Non-Rectangular Reward-Robust MDPs via Frequency Regularization
Solving Non-Rectangular Reward-Robust MDPs via Frequency Regularization
Uri Gadot
E. Derman
Navdeep Kumar
Maxence Mohamed Elfatihi
Kfir Y. Levy
Shie Mannor
76
7
0
03 Sep 2023
Neurosymbolic Reinforcement Learning and Planning: A Survey
Neurosymbolic Reinforcement Learning and Planning: A Survey
Kamal Acharya
Waleed Raza
Carlos Dourado
Alvaro Velasquez
Houbing Song
NAIOffRL
90
17
0
02 Sep 2023
Suicidal Pedestrian: Generation of Safety-Critical Scenarios for
  Autonomous Vehicles
Suicidal Pedestrian: Generation of Safety-Critical Scenarios for Autonomous Vehicles
Yuhang Yang
Kalle Kujanpää
Amin Babadi
Joni Pajarinen
Alexander Ilin
69
3
0
01 Sep 2023
The Power of MEME: Adversarial Malware Creation with Model-Based
  Reinforcement Learning
The Power of MEME: Adversarial Malware Creation with Model-Based Reinforcement Learning
M. Rigaki
Sebastian Garcia
AAML
53
4
0
31 Aug 2023
DRL-Based Trajectory Tracking for Motion-Related Modules in Autonomous
  Driving
DRL-Based Trajectory Tracking for Motion-Related Modules in Autonomous Driving
Yinda Xu
Lidong Yu
64
7
0
30 Aug 2023
R^3: On-device Real-Time Deep Reinforcement Learning for Autonomous
  Robotics
R^3: On-device Real-Time Deep Reinforcement Learning for Autonomous Robotics
Zexin Li
Aritra Samanta
Yufei Li
Andrea Soltoggio
Hyoseung Kim
Cong Liu
113
7
0
29 Aug 2023
Target-independent XLA optimization using Reinforcement Learning
Target-independent XLA optimization using Reinforcement Learning
Milan Ganai
Haichen Li
Theodore Enns
Yida Wang
Randy Huang
74
0
0
28 Aug 2023
Distributionally Robust Statistical Verification with Imprecise Neural Networks
Distributionally Robust Statistical Verification with Imprecise Neural Networks
Souradeep Dutta
Michele Caprio
Vivian Lin
Matthew Cleaveland
Kuk Jin Jang
I. Ruchkin
O. Sokolsky
Insup Lee
OODAAML
217
8
0
28 Aug 2023
Previous
123...789...505152
Next