ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1606.01540
  4. Cited By
OpenAI Gym

OpenAI Gym

5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
    OffRLODL
ArXiv (abs)PDFHTML

Papers citing "OpenAI Gym"

50 / 2,578 papers shown
Title
Neural Optimal Control using Learned System Dynamics
Neural Optimal Control using Learned System Dynamics
Selim Engin
Volkan Isler
85
3
0
20 Feb 2023
Leveraging Prior Knowledge in Reinforcement Learning via Double-Sided
  Bounds on the Value Function
Leveraging Prior Knowledge in Reinforcement Learning via Double-Sided Bounds on the Value Function
Jacob Adamczyk
Stas Tiomkin
R. Kulkarni
OffRL
25
0
0
19 Feb 2023
A State Augmentation based approach to Reinforcement Learning from Human
  Preferences
A State Augmentation based approach to Reinforcement Learning from Human Preferences
Mudit Verma
Subbarao Kambhampati
50
2
0
17 Feb 2023
Learning to Forecast Aleatoric and Epistemic Uncertainties over Long
  Horizon Trajectories
Learning to Forecast Aleatoric and Epistemic Uncertainties over Long Horizon Trajectories
Aastha Acharya
Rebecca L. Russell
Nisar R. Ahmed
73
6
0
17 Feb 2023
Conservative State Value Estimation for Offline Reinforcement Learning
Conservative State Value Estimation for Offline Reinforcement Learning
Liting Chen
Jie Yan
Zhengdao Shao
Lu Wang
Qingwei Lin
Saravan Rajmohan
Thomas Moscibroda
Dongmei Zhang
OffRL
66
6
0
14 Feb 2023
Universal Agent Mixtures and the Geometry of Intelligence
Universal Agent Mixtures and the Geometry of Intelligence
S. Alexander
David Quarel
Len Du
Marcus Hutter
60
1
0
13 Feb 2023
Graph Learning Based Decision Support for Multi-Aircraft Take-Off and
  Landing at Urban Air Mobility Vertiports
Graph Learning Based Decision Support for Multi-Aircraft Take-Off and Landing at Urban Air Mobility Vertiports
Prajit K. Kumar
Jhoel Witter
Steve Paul
Karthik Dantu
Souma Chowdhury
50
3
0
12 Feb 2023
UGAE: A Novel Approach to Non-exponential Discounting
UGAE: A Novel Approach to Non-exponential Discounting
Ariel Kwiatkowski
Vicky Kalogeiton
Julien Pettré
Marie-Paule Cani
OffRL
39
2
0
11 Feb 2023
Learning cooperative behaviours in adversarial multi-agent systems
Learning cooperative behaviours in adversarial multi-agent systems
Ni Wang
Gautham P. Das
Alan G. Millard
27
0
0
10 Feb 2023
A SWAT-based Reinforcement Learning Framework for Crop Management
A SWAT-based Reinforcement Learning Framework for Crop Management
Malvern Madondo
Muneeza Azmat
Kelsey L. DiPietro
R. Horesh
Michael Jacobs
Arun Bawa
Raghavan Srinivasan
Fearghal O'Donncha
14
8
0
10 Feb 2023
Scaling Goal-based Exploration via Pruning Proto-goals
Scaling Goal-based Exploration via Pruning Proto-goals
Akhil Bagaria
Ray Jiang
Ramana Kumar
Tom Schaul
LRM
55
2
0
09 Feb 2023
A Systematic Performance Analysis of Deep Perceptual Loss Networks:
  Breaking Transfer Learning Conventions
A Systematic Performance Analysis of Deep Perceptual Loss Networks: Breaking Transfer Learning Conventions
G. Pihlgren
Konstantina Nikolaidou
Prakash Chandra Chhipa
Nosheen Abid
Rajkumar Saini
Fredrik Sandin
Marcus Liwicki
95
10
0
08 Feb 2023
Online Reinforcement Learning with Uncertain Episode Lengths
Online Reinforcement Learning with Uncertain Episode Lengths
Debmalya Mandal
Goran Radanović
Jiarui Gan
Adish Singla
R. Majumdar
OffRL
70
8
0
07 Feb 2023
Two Losses Are Better Than One: Faster Optimization Using a Cheaper
  Proxy
Two Losses Are Better Than One: Faster Optimization Using a Cheaper Proxy
Blake E. Woodworth
Konstantin Mishchenko
Francis R. Bach
82
6
0
07 Feb 2023
Object-Centric Scene Representations using Active Inference
Object-Centric Scene Representations using Active Inference
Toon Van de Maele
Tim Verbelen
Pietro Mazzaglia
Stefano Ferraro
Bart Dhoedt
OCLBDL
88
5
0
07 Feb 2023
Diversity Induced Environment Design via Self-Play
Diversity Induced Environment Design via Self-Play
Dexun Li
Wenjun Li
Pradeep Varakantham
69
0
0
04 Feb 2023
Deep Reinforcement Learning for Cyber System Defense under Dynamic
  Adversarial Uncertainties
Deep Reinforcement Learning for Cyber System Defense under Dynamic Adversarial Uncertainties
Ashutosh Dutta
Samrat Chatterjee
A. Bhattacharya
M. Halappanavar
49
9
0
03 Feb 2023
DiSProD: Differentiable Symbolic Propagation of Distributions for
  Planning
DiSProD: Differentiable Symbolic Propagation of Distributions for Planning
Palash Chatterjee
Ashutosh Chapagain
Weizhe (Wesley) Chen
Roni Khardon
23
2
0
03 Feb 2023
Accelerating Policy Gradient by Estimating Value Function from Prior
  Computation in Deep Reinforcement Learning
Accelerating Policy Gradient by Estimating Value Function from Prior Computation in Deep Reinforcement Learning
Hassam Sheikh
Mariano Phielipp
OffRL
64
6
0
02 Feb 2023
Normalizing Flow Ensembles for Rich Aleatoric and Epistemic Uncertainty
  Modeling
Normalizing Flow Ensembles for Rich Aleatoric and Epistemic Uncertainty Modeling
Lucas Berry
David Meger
71
8
0
02 Feb 2023
Task Placement and Resource Allocation for Edge Machine Learning: A
  GNN-based Multi-Agent Reinforcement Learning Paradigm
Task Placement and Resource Allocation for Edge Machine Learning: A GNN-based Multi-Agent Reinforcement Learning Paradigm
Yihong Li
Xiaoxi Zhang
Tian Zeng
Jingpu Duan
Chuanxi Wu
Di Wu
Xu Chen
73
19
0
01 Feb 2023
QMP: Q-switch Mixture of Policies for Multi-Task Behavior Sharing
QMP: Q-switch Mixture of Policies for Multi-Task Behavior Sharing
Grace Zhang
Ayush Jain
Injune Hwang
Shao-Hua Sun
Joseph J. Lim
75
4
0
01 Feb 2023
Reducing Blackwell and Average Optimality to Discounted MDPs via the
  Blackwell Discount Factor
Reducing Blackwell and Average Optimality to Discounted MDPs via the Blackwell Discount Factor
Julien Grand-Clément
Marko Petrik
57
14
0
31 Jan 2023
Learning Vision-based Robotic Manipulation Tasks Sequentially in Offline
  Reinforcement Learning Settings
Learning Vision-based Robotic Manipulation Tasks Sequentially in Offline Reinforcement Learning Settings
Sudhir Pratap Yadav
R. Nagar
S. Shah
OffRL
59
3
0
31 Jan 2023
Sharp Variance-Dependent Bounds in Reinforcement Learning: Best of Both
  Worlds in Stochastic and Deterministic Environments
Sharp Variance-Dependent Bounds in Reinforcement Learning: Best of Both Worlds in Stochastic and Deterministic Environments
Runlong Zhou
Zihan Zhang
S. Du
87
12
0
31 Jan 2023
Enabling surrogate-assisted evolutionary reinforcement learning via
  policy embedding
Enabling surrogate-assisted evolutionary reinforcement learning via policy embedding
Lan Tang
Xiaxi Li
Jinyuan Zhang
Guiying Li
Peng Yang
Ke Tang
108
1
0
31 Jan 2023
PAC-Bayesian Soft Actor-Critic Learning
PAC-Bayesian Soft Actor-Critic Learning
Bahareh Tasdighi
Abdullah Akgul
Manuel Haussmann
Kenny Kazimirzak Brink
M. Kandemir
112
4
0
30 Jan 2023
EvoX: A Distributed GPU-accelerated Framework for Scalable Evolutionary
  Computation
EvoX: A Distributed GPU-accelerated Framework for Scalable Evolutionary Computation
Beichen Huang
Ran Cheng
Zhuozhao Li
Yaochu Jin
Kay Chen Tan
182
28
0
29 Jan 2023
Improving Behavioural Cloning with Positive Unlabeled Learning
Improving Behavioural Cloning with Positive Unlabeled Learning
Qiang-qiang Wang
Robert McCarthy
David Córdova Bulens
Kevin McGuinness
Noel E. O'Connor
Nico Gürtler
Felix Widmaier
Francisco Roldan Sanchez
S. Redmond
OffRLOnRL
79
8
0
27 Jan 2023
Single-Trajectory Distributionally Robust Reinforcement Learning
Single-Trajectory Distributionally Robust Reinforcement Learning
Zhipeng Liang
Xiaoteng Ma
Jose H. Blanchet
Jiheng Zhang
Zhengyuan Zhou
OODOffRL
86
12
0
27 Jan 2023
Certifiably Robust Reinforcement Learning through Model-Based Abstract
  Interpretation
Certifiably Robust Reinforcement Learning through Model-Based Abstract Interpretation
Chenxi Yang
Greg Anderson
Swarat Chaudhuri
65
1
0
26 Jan 2023
Which Experiences Are Influential for Your Agent? Policy Iteration with Turn-over Dropout
Takuya Hiraoka
Takashi Onishi
Yoshimasa Tsuruoka
OffRL
56
0
0
26 Jan 2023
Perceptive Locomotion with Controllable Pace and Natural Gait
  Transitions Over Uneven Terrains
Perceptive Locomotion with Controllable Pace and Natural Gait Transitions Over Uneven Terrains
Daniel C.H. Tan
Jenny Zhang
Michael
M. Chuah
Zhibin Li
63
2
0
26 Jan 2023
Evaluating Deception and Moving Target Defense with Network Attack
  Simulation
Evaluating Deception and Moving Target Defense with Network Attack Simulation
Daniel Reti
Karina Elzer
Daniel Fraunholz
Daniel Schneider
Hans D. Schotten
AAML
46
7
0
25 Jan 2023
Deep Reinforcement Learning for Concentric Tube Robot Path Following
Deep Reinforcement Learning for Concentric Tube Robot Path Following
Keshav Iyengar
Sarah Spurgeon
Danail Stoyanov
MedIm
28
5
0
22 Jan 2023
Asynchronous Deep Double Duelling Q-Learning for Trading-Signal
  Execution in Limit Order Book Markets
Asynchronous Deep Double Duelling Q-Learning for Trading-Signal Execution in Limit Order Book Markets
Peer Nagy
Jan-Peter Calliess
S. Zohren
46
3
0
20 Jan 2023
Revisiting Estimation Bias in Policy Gradients for Deep Reinforcement
  Learning
Revisiting Estimation Bias in Policy Gradients for Deep Reinforcement Learning
Haoxuan Pan
Deheng Ye
Xiaoming Duan
Qiang Fu
Wei Yang
Jianping He
Mingfei Sun
OffRL
65
2
0
20 Jan 2023
Multi-Agent Interplay in a Competitive Survival Environment
Multi-Agent Interplay in a Competitive Survival Environment
Andrea Fanti
67
0
0
19 Jan 2023
DIRECT: Learning from Sparse and Shifting Rewards using Discriminative
  Reward Co-Training
DIRECT: Learning from Sparse and Shifting Rewards using Discriminative Reward Co-Training
Philipp Altmann
Thomy Phan
Fabian Ritz
Thomas Gabor
Claudia Linnhoff-Popien
OffRL
63
1
0
18 Jan 2023
Adversarial Robust Deep Reinforcement Learning Requires Redefining
  Robustness
Adversarial Robust Deep Reinforcement Learning Requires Redefining Robustness
Ezgi Korkmaz
45
29
0
17 Jan 2023
Asynchronous training of quantum reinforcement learning
Asynchronous training of quantum reinforcement learning
Samuel Yen-Chi Chen
OffRL
93
24
0
12 Jan 2023
Efficient Preference-Based Reinforcement Learning Using Learned Dynamics
  Models
Efficient Preference-Based Reinforcement Learning Using Learned Dynamics Models
Yi Liu
Gaurav Datta
Ellen R. Novoseller
Daniel S. Brown
104
24
0
11 Jan 2023
schlably: A Python Framework for Deep Reinforcement Learning Based
  Scheduling Experiments
schlably: A Python Framework for Deep Reinforcement Learning Based Scheduling Experiments
Constantin Waubert de Puiseau
Jannik Peters
Christian Dörpelkus
Hasan Tercan
Tobias Meisen
OffRL
31
8
0
10 Jan 2023
Learning to Perceive in Deep Model-Free Reinforcement Learning
Learning to Perceive in Deep Model-Free Reinforcement Learning
Gonccalo Querido
Alberto Sardinha
Francisco S. Melo
29
0
0
10 Jan 2023
DRL-GAN: A Hybrid Approach for Binary and Multiclass Network Intrusion
  Detection
DRL-GAN: A Hybrid Approach for Binary and Multiclass Network Intrusion Detection
Caroline Strickland
Chandrika Saha
Muhammad Zakar
Sareh Nejad
Noshin Tasnim
D. Lizotte
Anwar Haque
64
10
0
05 Jan 2023
Character Simulation Using Imitation Learning With Game Engine Physics
Character Simulation Using Imitation Learning With Game Engine Physics
Joao Rodrigues
R. Nóbrega
AI4CE
69
2
0
05 Jan 2023
Genetic Imitation Learning by Reward Extrapolation
Genetic Imitation Learning by Reward Extrapolation
Boyuan Zheng
Jianlong Zhou
Fang Chen
71
0
0
03 Jan 2023
Explaining Imitation Learning through Frames
Explaining Imitation Learning through Frames
Boyuan Zheng
Jianlong Zhou
Chun-Hao Liu
Yiqiao Li
Fang Chen
60
0
0
03 Jan 2023
A Policy Optimization Method Towards Optimal-time Stability
A Policy Optimization Method Towards Optimal-time Stability
Shengjie Wang
Lan Fengb
Xiang Zheng
Yu-wen Cao
Oluwatosin Oseni
Haotian Xu
Tao Zhang
Yang Gao
88
1
0
02 Jan 2023
On the Challenges of using Reinforcement Learning in Precision Drug
  Dosing: Delay and Prolongedness of Action Effects
On the Challenges of using Reinforcement Learning in Precision Drug Dosing: Delay and Prolongedness of Action Effects
Sumana Basu
M. Legault
Adriana Romero Soriano
Doina Precup
OffRL
44
4
0
02 Jan 2023
Previous
123...121314...505152
Next