ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1606.01540
  4. Cited By
OpenAI Gym

OpenAI Gym

5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
    OffRLODL
ArXiv (abs)PDFHTML

Papers citing "OpenAI Gym"

50 / 2,578 papers shown
Title
Causal Repair of Learning-enabled Cyber-physical Systems
Causal Repair of Learning-enabled Cyber-physical Systems
Pengyuan Lu
I. Ruchkin
Matthew Cleaveland
O. Sokolsky
Insup Lee
45
2
0
06 Apr 2023
AutoRL Hyperparameter Landscapes
AutoRL Hyperparameter Landscapes
Aditya Mohan
C. Benjamins
Konrad Wienecke
A. Dockhorn
Marius Lindauer
136
8
0
05 Apr 2023
Online augmentation of learned grasp sequence policies for more
  adaptable and data-efficient in-hand manipulation
Online augmentation of learned grasp sequence policies for more adaptable and data-efficient in-hand manipulation
E. Gordon
Rana Soltani-Zarrin
OffRL
58
6
0
04 Apr 2023
Quantum Imitation Learning
Quantum Imitation Learning
Zhihao Cheng
Kaining Zhang
Li Shen
Dacheng Tao
67
1
0
04 Apr 2023
PyFlyt -- UAV Simulation Environments for Reinforcement Learning
  Research
PyFlyt -- UAV Simulation Environments for Reinforcement Learning Research
Jun Jet Tai
J. Wong
M. Innocente
N. Horri
J. Brusey
S. K. Phang
120
10
0
03 Apr 2023
Managing power grids through topology actions: A comparative study
  between advanced rule-based and reinforcement learning agents
Managing power grids through topology actions: A comparative study between advanced rule-based and reinforcement learning agents
Malte Lehna
J. Viebahn
Christoph Scholz
Antoine Marot
Sven Tomforde
77
22
0
03 Apr 2023
Physical Deep Reinforcement Learning Towards Safety Guarantee
Physical Deep Reinforcement Learning Towards Safety Guarantee
H. Cao
Y. Mao
L. Sha
Marco Caccamo
AI4CE
24
5
0
29 Mar 2023
Learning Complicated Manipulation Skills via Deterministic Policy with
  Limited Demonstrations
Learning Complicated Manipulation Skills via Deterministic Policy with Limited Demonstrations
Li Haofeng
C. Yiwen
Tan Jiayi
Marcelo H. Ang Jr
OffRL
35
2
0
29 Mar 2023
A Survey of Machine Learning-Based Ride-Hailing Planning
A Survey of Machine Learning-Based Ride-Hailing Planning
Dacheng Wen
Yupeng Li
F. Lau
68
4
0
26 Mar 2023
Sequential Knockoffs for Variable Selection in Reinforcement Learning
Sequential Knockoffs for Variable Selection in Reinforcement Learning
Tao Ma
Hengrui Cai
Zhengling Qi
C. Shi
Eric B. Laber
97
3
0
24 Mar 2023
marl-jax: Multi-Agent Reinforcement Leaning Framework
marl-jax: Multi-Agent Reinforcement Leaning Framework
K. Mehta
Anuj Mahajan
Kiran Ravish
109
3
0
24 Mar 2023
Towards Solving Fuzzy Tasks with Human Feedback: A Retrospective of the
  MineRL BASALT 2022 Competition
Towards Solving Fuzzy Tasks with Human Feedback: A Retrospective of the MineRL BASALT 2022 Competition
Stephanie Milani
Anssi Kanervisto
Karolis Ramanauskas
Sander Schulhoff
Brandon Houghton
...
Vinicius G. Goecks
Nicholas R. Waytowich
David Watkins
J. Miller
Rohin Shah
59
16
0
23 Mar 2023
Reinforcement Learning with Exogenous States and Rewards
Reinforcement Learning with Exogenous States and Rewards
George Trimponias
Thomas G. Dietterich
OffRL
65
2
0
22 Mar 2023
Matryoshka Policy Gradient for Entropy-Regularized RL: Convergence and
  Global Optimality
Matryoshka Policy Gradient for Entropy-Regularized RL: Convergence and Global Optimality
François Ged
M. H. Veiga
81
0
0
22 Mar 2023
A multi-functional simulation platform for on-demand ride service
  operations
A multi-functional simulation platform for on-demand ride service operations
Siyuan Feng
Taijie Chen
Yuhao Zhang
Jintao Ke
Zhengfei Zheng
Hai Yang
35
5
0
22 Mar 2023
Adaptive Road Configurations for Improved Autonomous Vehicle-Pedestrian
  Interactions using Reinforcement Learning
Adaptive Road Configurations for Improved Autonomous Vehicle-Pedestrian Interactions using Reinforcement Learning
Qiming Ye
Yuxiang Feng
Jose Javier Escribano Macias
M. Stettler
Panagiotis Angeloudis
44
4
0
22 Mar 2023
ChatGPT and a New Academic Reality: Artificial Intelligence-Written
  Research Papers and the Ethics of the Large Language Models in Scholarly
  Publishing
ChatGPT and a New Academic Reality: Artificial Intelligence-Written Research Papers and the Ethics of the Large Language Models in Scholarly Publishing
Brady Lund
Ting Wang
Nishith Reddy Mannuru
Bing Nie
S. Shimray
Ziang Wang
AI4CE
94
529
0
21 Mar 2023
Affective Workload Allocation for Multi-human Multi-robot Teams
Affective Workload Allocation for Multi-human Multi-robot Teams
Wonse Jo
Ruiqi Wang
B. Yang
D. Foti
M. Rastgaar
Byung-Cheol Min
33
1
0
18 Mar 2023
Comparing NARS and Reinforcement Learning: An Analysis of ONA and
  $Q$-Learning Algorithms
Comparing NARS and Reinforcement Learning: An Analysis of ONA and QQQ-Learning Algorithms
Ali Beikmohammadi
Sindri Magnússon
50
3
0
17 Mar 2023
Reinforcement Learning for Omega-Regular Specifications on
  Continuous-Time MDP
Reinforcement Learning for Omega-Regular Specifications on Continuous-Time MDP
A. Falah
Shibashis Guha
Ashutosh Trivedi
36
0
0
16 Mar 2023
Learning Rewards to Optimize Global Performance Metrics in Deep
  Reinforcement Learning
Learning Rewards to Optimize Global Performance Metrics in Deep Reinforcement Learning
Junqi Qian
Paul Weng
Chenmien Tan
71
1
0
16 Mar 2023
MAHTM: A Multi-Agent Framework for Hierarchical Transactive Microgrids
MAHTM: A Multi-Agent Framework for Hierarchical Transactive Microgrids
Nicolas Mauricio Cuadrado
Roberto Gutiérrez
Yongli Zhu
Martin Takáč
72
2
0
15 Mar 2023
Act-Then-Measure: Reinforcement Learning for Partially Observable
  Environments with Active Measuring
Act-Then-Measure: Reinforcement Learning for Partially Observable Environments with Active Measuring
Merlijn Krale
T. D. Simão
N. Jansen
OffRL
57
7
0
14 Mar 2023
Beyond Games: A Systematic Review of Neural Monte Carlo Tree Search
  Applications
Beyond Games: A Systematic Review of Neural Monte Carlo Tree Search Applications
Marco Kemmerling
Daniel Lutticke
Robert H. Schmitt
74
15
0
14 Mar 2023
Reinforcement Learning-based Wavefront Sensorless Adaptive Optics
  Approaches for Satellite-to-Ground Laser Communication
Reinforcement Learning-based Wavefront Sensorless Adaptive Optics Approaches for Satellite-to-Ground Laser Communication
Payam Parvizi
Runnan Zou
C. Bellinger
R. Cheriton
D. Spinello
42
2
0
13 Mar 2023
Kernel Density Bayesian Inverse Reinforcement Learning
Kernel Density Bayesian Inverse Reinforcement Learning
Aishwarya Mandyam
Didong Li
Jiayu Yao
Diana Cai
Andrew Jones
Barbara E. Engelhardt
OffRLBDL
90
3
0
13 Mar 2023
Synthetic Experience Replay
Synthetic Experience Replay
Cong Lu
Philip J. Ball
Yee Whye Teh
Jack Parker-Holder
OffRL
170
80
0
12 Mar 2023
Beware of Instantaneous Dependence in Reinforcement Learning
Beware of Instantaneous Dependence in Reinforcement Learning
Zhengmao Zhu
Yu-Ren Liu
Hong Tian
Yang Yu
Kun Zhang
OffRL
53
1
0
09 Mar 2023
On the Benefits of Biophysical Synapses
On the Benefits of Biophysical Synapses
Julian Lemmel
Radu Grosu
18
0
0
08 Mar 2023
A Multiplicative Value Function for Safe and Efficient Reinforcement
  Learning
A Multiplicative Value Function for Safe and Efficient Reinforcement Learning
Nick Bührer
Zhejun Zhang
Alexander Liniger
Feng Yu
Luc Van Gool
67
1
0
07 Mar 2023
Evolutionary Reinforcement Learning: A Survey
Evolutionary Reinforcement Learning: A Survey
Hui Bai
Ran Cheng
Yaochu Jin
OffRL
142
56
0
07 Mar 2023
Learning to Backdoor Federated Learning
Learning to Backdoor Federated Learning
Henger Li
Chen Wu
Senchun Zhu
Zizhan Zheng
FedML
82
10
0
06 Mar 2023
Ensemble Reinforcement Learning: A Survey
Ensemble Reinforcement Learning: A Survey
Yanjie Song
Ponnuthurai Nagaratnam Suganthan
Witold Pedrycz
Junwei Ou
Yongming He
Y. Chen
Yutong Wu
OffRL
91
41
0
05 Mar 2023
Demonstration-guided Deep Reinforcement Learning for Coordinated Ramp
  Metering and Perimeter Control in Large Scale Networks
Demonstration-guided Deep Reinforcement Learning for Coordinated Ramp Metering and Perimeter Control in Large Scale Networks
Zijian Hu
Wei-Ying Ma
39
5
0
04 Mar 2023
CoRL: Environment Creation and Management Focused on System Integration
CoRL: Environment Creation and Management Focused on System Integration
J. D. Merrick
Benjamin K. Heiner
Cameron Long
Brian Stieber
Steve Fierro
Vardaan Gangal
Madison Blake
Joshua Blackburn
AI4CE
69
2
0
03 Mar 2023
Synthetic Data Generator for Adaptive Interventions in Global Health
Synthetic Data Generator for Adaptive Interventions in Global Health
Aditya Rastogi
J. F. Garamendi
Ana Fernández del Río
Anna Guitart
Moiz Hassan Khan
Dexian Tang
África Periánez
63
0
0
03 Mar 2023
Dynamic Competency Self-Assessment for Autonomous Agents
Dynamic Competency Self-Assessment for Autonomous Agents
Nicholas Conlon
Nisar R. Ahmed
D. Szafir
49
3
0
03 Mar 2023
The Ladder in Chaos: A Simple and Effective Improvement to General DRL
  Algorithms by Policy Path Trimming and Boosting
The Ladder in Chaos: A Simple and Effective Improvement to General DRL Algorithms by Policy Path Trimming and Boosting
Hongyao Tang
Hao Fei
Jianye Hao
69
1
0
02 Mar 2023
Reinforced Labels: Multi-Agent Deep Reinforcement Learning for
  Point-Feature Label Placement
Reinforced Labels: Multi-Agent Deep Reinforcement Learning for Point-Feature Label Placement
Petr Bobák
Ladislav Čmolík
Martin Cadík
OffRL
68
3
0
02 Mar 2023
The Virtues of Laziness in Model-based RL: A Unified Objective and
  Algorithms
The Virtues of Laziness in Model-based RL: A Unified Objective and Algorithms
Anirudh Vemula
Yuda Song
Aarti Singh
J. Andrew Bagnell
Sanjiban Choudhury
OffRL
83
14
0
01 Mar 2023
Policy Dispersion in Non-Markovian Environment
B. Qu
Xiaofeng Cao
Jielong Yang
Hechang Chen
Chang Yi
Ivor W.Tsang
Yew-Soon Ong
56
0
0
28 Feb 2023
Taylor TD-learning
Taylor TD-learning
Michele Garibbo
Maxime Robeyns
Laurence Aitchison
OffRL
58
1
0
27 Feb 2023
CrystalBox: Future-Based Explanations for Input-Driven Deep RL Systems
CrystalBox: Future-Based Explanations for Input-Driven Deep RL Systems
Sagar Patel
Sangeetha Abdu Jyothi
Nina Narodytska
OffRL
54
0
0
27 Feb 2023
High-Precise Robot Arm Manipulation based on Online Iterative Learning
  and Forward Simulation with Positioning Error Below End-Effector Physical
  Minimum Displacement
High-Precise Robot Arm Manipulation based on Online Iterative Learning and Forward Simulation with Positioning Error Below End-Effector Physical Minimum Displacement
Weiming Qu
Tianlin Liu
D. Luo
75
2
0
26 Feb 2023
Q-Cogni: An Integrated Causal Reinforcement Learning Framework
Q-Cogni: An Integrated Causal Reinforcement Learning Framework
C. Cunha
Wen Liu
T. French
Ajmal Mian
71
1
0
26 Feb 2023
DeepCPG Policies for Robot Locomotion
DeepCPG Policies for Robot Locomotion
Aditya M. Deshpande
Eric Hurd
A. Minai
Manish Kumar
74
9
0
25 Feb 2023
Autonomous Exploration and Mapping for Mobile Robots via Cumulative
  Curriculum Reinforcement Learning
Autonomous Exploration and Mapping for Mobile Robots via Cumulative Curriculum Reinforcement Learning
Zehan Li
Jinghao Xin
Ning Li
67
5
0
25 Feb 2023
EvoTorch: Scalable Evolutionary Computation in Python
EvoTorch: Scalable Evolutionary Computation in Python
N. E. Toklu
Timothy James Atkinson
Vojtvech Micka
Paweł Liskowski
R. Srivastava
94
14
0
24 Feb 2023
Why Target Networks Stabilise Temporal Difference Methods
Why Target Networks Stabilise Temporal Difference Methods
Matt Fellows
Matthew Smith
Shimon Whiteson
OODAAML
97
8
0
24 Feb 2023
Improving Deep Policy Gradients with Value Function Search
Improving Deep Policy Gradients with Value Function Search
Enrico Marchesini
Chris Amato
49
9
0
20 Feb 2023
Previous
123...111213...505152
Next