ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1606.01540
  4. Cited By
OpenAI Gym

OpenAI Gym

5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
    OffRL
    ODL
ArXivPDFHTML

Papers citing "OpenAI Gym"

50 / 1,673 papers shown
Title
Learning to Backdoor Federated Learning
Learning to Backdoor Federated Learning
Henger Li
Chen Wu
Senchun Zhu
Zizhan Zheng
FedML
36
10
0
06 Mar 2023
Ensemble Reinforcement Learning: A Survey
Ensemble Reinforcement Learning: A Survey
Yanjie Song
Ponnuthurai Nagaratnam Suganthan
Witold Pedrycz
Junwei Ou
Yongming He
Y. Chen
Yutong Wu
OffRL
59
38
0
05 Mar 2023
Demonstration-guided Deep Reinforcement Learning for Coordinated Ramp
  Metering and Perimeter Control in Large Scale Networks
Demonstration-guided Deep Reinforcement Learning for Coordinated Ramp Metering and Perimeter Control in Large Scale Networks
Zijian Hu
Wei-Ying Ma
27
5
0
04 Mar 2023
CoRL: Environment Creation and Management Focused on System Integration
CoRL: Environment Creation and Management Focused on System Integration
J. D. Merrick
Benjamin K. Heiner
Cameron Long
Brian Stieber
Steve Fierro
Vardaan Gangal
Madison Blake
Joshua Blackburn
AI4CE
30
2
0
03 Mar 2023
Synthetic Data Generator for Adaptive Interventions in Global Health
Synthetic Data Generator for Adaptive Interventions in Global Health
Aditya Rastogi
J. F. Garamendi
Ana Fernández del Río
Anna Guitart
Moiz Hassan Khan
Dexian Tang
África Periánez
42
0
0
03 Mar 2023
Dynamic Competency Self-Assessment for Autonomous Agents
Dynamic Competency Self-Assessment for Autonomous Agents
Nicholas Conlon
Nisar R. Ahmed
D. Szafir
34
3
0
03 Mar 2023
The Ladder in Chaos: A Simple and Effective Improvement to General DRL
  Algorithms by Policy Path Trimming and Boosting
The Ladder in Chaos: A Simple and Effective Improvement to General DRL Algorithms by Policy Path Trimming and Boosting
Hongyao Tang
Hao Fei
Jianye Hao
28
1
0
02 Mar 2023
Reinforced Labels: Multi-Agent Deep Reinforcement Learning for
  Point-Feature Label Placement
Reinforced Labels: Multi-Agent Deep Reinforcement Learning for Point-Feature Label Placement
Petr Bobák
Ladislav Čmolík
Martin Cadík
OffRL
40
3
0
02 Mar 2023
The Virtues of Laziness in Model-based RL: A Unified Objective and
  Algorithms
The Virtues of Laziness in Model-based RL: A Unified Objective and Algorithms
Anirudh Vemula
Yuda Song
Aarti Singh
J. Andrew Bagnell
Sanjiban Choudhury
OffRL
46
13
0
01 Mar 2023
Policy Dispersion in Non-Markovian Environment
B. Qu
Xiaofeng Cao
Jielong Yang
Hechang Chen
Chang Yi
Ivor W.Tsang
Yew-Soon Ong
27
0
0
28 Feb 2023
Taylor TD-learning
Taylor TD-learning
Michele Garibbo
Maxime Robeyns
Laurence Aitchison
OffRL
23
1
0
27 Feb 2023
CrystalBox: Future-Based Explanations for Input-Driven Deep RL Systems
CrystalBox: Future-Based Explanations for Input-Driven Deep RL Systems
Sagar Patel
Sangeetha Abdu Jyothi
Nina Narodytska
OffRL
27
0
0
27 Feb 2023
High-Precise Robot Arm Manipulation based on Online Iterative Learning
  and Forward Simulation with Positioning Error Below End-Effector Physical
  Minimum Displacement
High-Precise Robot Arm Manipulation based on Online Iterative Learning and Forward Simulation with Positioning Error Below End-Effector Physical Minimum Displacement
Weiming Qu
Tianlin Liu
D. Luo
26
2
0
26 Feb 2023
Q-Cogni: An Integrated Causal Reinforcement Learning Framework
Q-Cogni: An Integrated Causal Reinforcement Learning Framework
C. Cunha
Wen Liu
T. French
Ajmal Mian
38
1
0
26 Feb 2023
DeepCPG Policies for Robot Locomotion
DeepCPG Policies for Robot Locomotion
Aditya M. Deshpande
Eric Hurd
A. Minai
Manish Kumar
29
9
0
25 Feb 2023
Autonomous Exploration and Mapping for Mobile Robots via Cumulative
  Curriculum Reinforcement Learning
Autonomous Exploration and Mapping for Mobile Robots via Cumulative Curriculum Reinforcement Learning
Zehan Li
Jinghao Xin
Ning Li
31
4
0
25 Feb 2023
EvoTorch: Scalable Evolutionary Computation in Python
EvoTorch: Scalable Evolutionary Computation in Python
N. E. Toklu
Timothy James Atkinson
Vojtvech Micka
Paweł Liskowski
R. Srivastava
22
12
0
24 Feb 2023
Why Target Networks Stabilise Temporal Difference Methods
Why Target Networks Stabilise Temporal Difference Methods
Matt Fellows
Matthew Smith
Shimon Whiteson
OOD
AAML
21
7
0
24 Feb 2023
Improving Deep Policy Gradients with Value Function Search
Improving Deep Policy Gradients with Value Function Search
Enrico Marchesini
Chris Amato
31
9
0
20 Feb 2023
Neural Optimal Control using Learned System Dynamics
Neural Optimal Control using Learned System Dynamics
Selim Engin
Volkan Isler
26
3
0
20 Feb 2023
Leveraging Prior Knowledge in Reinforcement Learning via Double-Sided
  Bounds on the Value Function
Leveraging Prior Knowledge in Reinforcement Learning via Double-Sided Bounds on the Value Function
Jacob Adamczyk
Stas Tiomkin
R. Kulkarni
OffRL
22
0
0
19 Feb 2023
A State Augmentation based approach to Reinforcement Learning from Human
  Preferences
A State Augmentation based approach to Reinforcement Learning from Human Preferences
Mudit Verma
Subbarao Kambhampati
35
2
0
17 Feb 2023
Learning to Forecast Aleatoric and Epistemic Uncertainties over Long
  Horizon Trajectories
Learning to Forecast Aleatoric and Epistemic Uncertainties over Long Horizon Trajectories
Aastha Acharya
Rebecca L. Russell
Nisar R. Ahmed
34
6
0
17 Feb 2023
Conservative State Value Estimation for Offline Reinforcement Learning
Conservative State Value Estimation for Offline Reinforcement Learning
Liting Chen
Jie Yan
Zhengdao Shao
Lu Wang
Qingwei Lin
Saravan Rajmohan
Thomas Moscibroda
Dongmei Zhang
OffRL
26
6
0
14 Feb 2023
Universal Agent Mixtures and the Geometry of Intelligence
Universal Agent Mixtures and the Geometry of Intelligence
S. Alexander
David Quarel
Len Du
Marcus Hutter
23
1
0
13 Feb 2023
Graph Learning Based Decision Support for Multi-Aircraft Take-Off and
  Landing at Urban Air Mobility Vertiports
Graph Learning Based Decision Support for Multi-Aircraft Take-Off and Landing at Urban Air Mobility Vertiports
Prajit K. Kumar
Jhoel Witter
Steve Paul
Karthik Dantu
Souma Chowdhury
27
3
0
12 Feb 2023
UGAE: A Novel Approach to Non-exponential Discounting
UGAE: A Novel Approach to Non-exponential Discounting
Ariel Kwiatkowski
Vicky Kalogeiton
Julien Pettré
Marie-Paule Cani
OffRL
30
2
0
11 Feb 2023
Learning cooperative behaviours in adversarial multi-agent systems
Learning cooperative behaviours in adversarial multi-agent systems
Ni Wang
Gautham P. Das
Alan G. Millard
21
0
0
10 Feb 2023
A SWAT-based Reinforcement Learning Framework for Crop Management
A SWAT-based Reinforcement Learning Framework for Crop Management
Malvern Madondo
Muneeza Azmat
Kelsey L. DiPietro
R. Horesh
Michael Jacobs
Arun Bawa
Raghavan Srinivasan
Fearghal O'Donncha
11
8
0
10 Feb 2023
Scaling Goal-based Exploration via Pruning Proto-goals
Scaling Goal-based Exploration via Pruning Proto-goals
Akhil Bagaria
Ray Jiang
Ramana Kumar
Tom Schaul
LRM
16
2
0
09 Feb 2023
A Systematic Performance Analysis of Deep Perceptual Loss Networks:
  Breaking Transfer Learning Conventions
A Systematic Performance Analysis of Deep Perceptual Loss Networks: Breaking Transfer Learning Conventions
G. Pihlgren
Konstantina Nikolaidou
Prakash Chandra Chhipa
Nosheen Abid
Rajkumar Saini
Fredrik Sandin
Marcus Liwicki
29
10
0
08 Feb 2023
Online Reinforcement Learning with Uncertain Episode Lengths
Online Reinforcement Learning with Uncertain Episode Lengths
Debmalya Mandal
Goran Radanović
Jiarui Gan
Adish Singla
R. Majumdar
OffRL
38
5
0
07 Feb 2023
Two Losses Are Better Than One: Faster Optimization Using a Cheaper
  Proxy
Two Losses Are Better Than One: Faster Optimization Using a Cheaper Proxy
Blake E. Woodworth
Konstantin Mishchenko
Francis R. Bach
47
6
0
07 Feb 2023
Object-Centric Scene Representations using Active Inference
Object-Centric Scene Representations using Active Inference
Toon Van de Maele
Tim Verbelen
Pietro Mazzaglia
Stefano Ferraro
Bart Dhoedt
OCL
BDL
48
5
0
07 Feb 2023
Diversity Induced Environment Design via Self-Play
Diversity Induced Environment Design via Self-Play
Dexun Li
Wenjun Li
Pradeep Varakantham
29
0
0
04 Feb 2023
Deep Reinforcement Learning for Cyber System Defense under Dynamic
  Adversarial Uncertainties
Deep Reinforcement Learning for Cyber System Defense under Dynamic Adversarial Uncertainties
Ashutosh Dutta
Samrat Chatterjee
A. Bhattacharya
M. Halappanavar
32
8
0
03 Feb 2023
DiSProD: Differentiable Symbolic Propagation of Distributions for
  Planning
DiSProD: Differentiable Symbolic Propagation of Distributions for Planning
Palash Chatterjee
Ashutosh Chapagain
Weizhe (Wesley) Chen
Roni Khardon
21
1
0
03 Feb 2023
Accelerating Policy Gradient by Estimating Value Function from Prior
  Computation in Deep Reinforcement Learning
Accelerating Policy Gradient by Estimating Value Function from Prior Computation in Deep Reinforcement Learning
Hassam Sheikh
Mariano Phielipp
OffRL
24
6
0
02 Feb 2023
Normalizing Flow Ensembles for Rich Aleatoric and Epistemic Uncertainty
  Modeling
Normalizing Flow Ensembles for Rich Aleatoric and Epistemic Uncertainty Modeling
Lucas Berry
David Meger
26
7
0
02 Feb 2023
Task Placement and Resource Allocation for Edge Machine Learning: A
  GNN-based Multi-Agent Reinforcement Learning Paradigm
Task Placement and Resource Allocation for Edge Machine Learning: A GNN-based Multi-Agent Reinforcement Learning Paradigm
Yihong Li
Xiaoxi Zhang
Tian Zeng
Jingpu Duan
Chuanxi Wu
Di Wu
Xu Chen
34
16
0
01 Feb 2023
Reducing Blackwell and Average Optimality to Discounted MDPs via the
  Blackwell Discount Factor
Reducing Blackwell and Average Optimality to Discounted MDPs via the Blackwell Discount Factor
Julien Grand-Clément
Marko Petrik
35
14
0
31 Jan 2023
Learning Vision-based Robotic Manipulation Tasks Sequentially in Offline
  Reinforcement Learning Settings
Learning Vision-based Robotic Manipulation Tasks Sequentially in Offline Reinforcement Learning Settings
Sudhir Pratap Yadav
R. Nagar
S. Shah
OffRL
29
3
0
31 Jan 2023
Sharp Variance-Dependent Bounds in Reinforcement Learning: Best of Both
  Worlds in Stochastic and Deterministic Environments
Sharp Variance-Dependent Bounds in Reinforcement Learning: Best of Both Worlds in Stochastic and Deterministic Environments
Runlong Zhou
Zihan Zhang
S. Du
49
10
0
31 Jan 2023
Enabling surrogate-assisted evolutionary reinforcement learning via
  policy embedding
Enabling surrogate-assisted evolutionary reinforcement learning via policy embedding
Lan Tang
Xiaxi Li
Jinyuan Zhang
Guiying Li
Peng Yang
Ke Tang
18
0
0
31 Jan 2023
PAC-Bayesian Soft Actor-Critic Learning
PAC-Bayesian Soft Actor-Critic Learning
Bahareh Tasdighi
Abdullah Akgul
Manuel Haussmann
Kenny Kazimirzak Brink
M. Kandemir
41
3
0
30 Jan 2023
EvoX: A Distributed GPU-accelerated Framework for Scalable Evolutionary
  Computation
EvoX: A Distributed GPU-accelerated Framework for Scalable Evolutionary Computation
Beichen Huang
Ran Cheng
Zhuozhao Li
Yaochu Jin
Kay Chen Tan
18
26
0
29 Jan 2023
Improving Behavioural Cloning with Positive Unlabeled Learning
Improving Behavioural Cloning with Positive Unlabeled Learning
Qiang-qiang Wang
Robert McCarthy
David Córdova Bulens
Kevin McGuinness
Noel E. O'Connor
Nico Gürtler
Felix Widmaier
Francisco Roldan Sanchez
S. Redmond
OffRL
OnRL
36
8
0
27 Jan 2023
Single-Trajectory Distributionally Robust Reinforcement Learning
Single-Trajectory Distributionally Robust Reinforcement Learning
Zhipeng Liang
Xiaoteng Ma
Jose H. Blanchet
Jiheng Zhang
Zhengyuan Zhou
OOD
OffRL
39
11
0
27 Jan 2023
Certifiably Robust Reinforcement Learning through Model-Based Abstract
  Interpretation
Certifiably Robust Reinforcement Learning through Model-Based Abstract Interpretation
Chenxi Yang
Greg Anderson
Swarat Chaudhuri
37
1
0
26 Jan 2023
Which Experiences Are Influential for Your Agent? Policy Iteration with Turn-over Dropout
Takuya Hiraoka
Takashi Onishi
Yoshimasa Tsuruoka
OffRL
29
0
0
26 Jan 2023
Previous
123...111213...323334
Next