ResearchTrend.AI
OpenAI Gym
arXiv: 1606.01540

5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL
ODL

Papers citing "OpenAI Gym"

50 / 2,578 papers shown
Comparing Reinforcement Learning and Human Learning using the Game of Hidden Rules
Eric Pulick
Vladimir Menkov
Yonatan Dov Mintz
Paul B. Kantor
Vicki M. Bier
OffRL
28
0
0
30 Jun 2023
Zespol: A Lightweight Environment for Training Swarming Agents
Shay Snyder
Kevin A. Zhu
Ricardo Vega
Cameron Nowzari
Maryam Parsa
84
2
0
30 Jun 2023
Probabilistic Constraint for Safety-Critical Reinforcement Learning
Weiqin Chen
D. Subramanian
Santiago Paternain
87
15
0
29 Jun 2023
Learning Environment Models with Continuous Stochastic Dynamics
Martin Tappler
Edi Muškardin
B. Aichernig
Bettina Könighofer
AI4CE
50
2
0
29 Jun 2023
Eigensubspace of Temporal-Difference Dynamics and How It Improves Value Approximation in Reinforcement Learning
Qiang He
Dinesh Manocha
Meng Fang
S. Maghsudi
76
5
0
29 Jun 2023
Principles and Guidelines for Evaluating Social Robot Navigation Algorithms
Anthony G. Francis
Claudia Pérez-D'Arpino
Chengshu Li
Fei Xia
Alexandre Alahi
...
Xuesu Xiao
Peng Xu
Naoki Yokoyama
Alexander Toshev
Roberto Martin-Martin
120
77
0
29 Jun 2023
RL4CO: an Extensive Reinforcement Learning for Combinatorial Optimization Benchmark
Federico Berto
Chuanbo Hua
J. Park
Laurin Luttmann
Yining Ma
...
Guojie Song
Changhyun Kwon
Kevin Tierney
Lin Xie
Jinkyoo Park
OffRL
139
29
0
29 Jun 2023
MRHER: Model-based Relay Hindsight Experience Replay for Sequential Object Manipulation Tasks with Sparse Rewards
Yuming Huang
Bin Ren
Ziming Xu
Lianghong Wu
OffRL
61
0
0
28 Jun 2023
Rethinking Closed-loop Training for Autonomous Driving
Chris Zhang
R. Guo
Wenyuan Zeng
Yuwen Xiong
Binbin Dai
Rui Hu
Mengye Ren
R. Urtasun
OffRL
101
30
0
27 Jun 2023
Optimizing Credit Limit Adjustments Under Adversarial Goals Using Reinforcement Learning
Sherly Alfonso-Sánchez
Jesus Solano
Alejandro Correa-Bahnsen
Kristina P. Sendova
Cristián Bravo
20
7
0
27 Jun 2023
Creating Valid Adversarial Examples of Malware
M. Kozák
M. Jureček
Mark Stamp
Fabio Di Troia
AAML
67
10
0
23 Jun 2023
Transferable Curricula through Difficulty Conditioned Generators
Sidney Tio
Pradeep Varakantham
58
4
0
22 Jun 2023
Novelty Accommodating Multi-Agent Planning in High Fidelity Simulated Open World
James Chao
W. Piotrowski
Roni Stern
Héctor J. Ortiz-Peña
Mitch Manzanares
Shiwali Mohan
D. Lange
93
0
0
22 Jun 2023
Optimistic Active Exploration of Dynamical Systems
Bhavya Sukhija
Lenart Treven
Cansu Sancaktar
Sebastian Blaes
Stelian Coros
Andreas Krause
117
18
0
21 Jun 2023
AdCraft: An Advanced Reinforcement Learning Benchmark Environment for Search Engine Marketing Optimization
Maziar Gomrokchi
Owen Levin
Jeffrey Roach
Jonah White
OffRL
90
1
0
21 Jun 2023
Practical First-Order Bayesian Optimization Algorithms
Utkarsh Prakash
Aryan Chollera
Kushagra Khatwani
P. K. J.
Tejas Bodas
70
1
0
19 Jun 2023
SeMAIL: Eliminating Distractors in Visual Imitation via Separated Models
Shenghua Wan
Yucen Wang
Minghao Shao
Ruying Chen
De-Chuan Zhan
91
8
0
19 Jun 2023
On Evolvability and Behavior Landscapes in Neuroevolutionary Divergent Search
Bruno Gašperov
Marko Đurasević
56
0
0
16 Jun 2023
Mimicking Better by Matching the Approximate Action Distribution
Joao A. Candido Ramos
Lionel Blondé
Naoya Takeishi
Alexandros Kalousis
71
2
0
16 Jun 2023
QuadSwarm: A Modular Multi-Quadrotor Simulator for Deep Reinforcement Learning with Direct Thrust Control
Zhehui Huang
Sumeet Batra
Tao Chen
Rahul Krupani
T. Kumar
Artem Molchanov
Aleksei Petrenko
James A. Preiss
Zhaojing Yang
Gaurav Sukhatme
73
6
0
15 Jun 2023
Simplified Temporal Consistency Reinforcement Learning
Yi Zhao
Wenshuai Zhao
Rinu Boney
Arno Solin
Joni Pajarinen
OffRL
72
13
0
15 Jun 2023
DiAReL: Reinforcement Learning with Disturbance Awareness for Robust Sim2Real Policy Transfer in Robot Control
M. Malmir
Josip Josifovski
Noah Klarmann
Alois C. Knoll
90
2
0
15 Jun 2023
Off-policy Evaluation in Doubly Inhomogeneous Environments
Zeyu Bian
C. Shi
Zhengling Qi
Lan Wang
OffRL
107
7
0
14 Jun 2023
Pruning the Way to Reliable Policies: A Multi-Objective Deep Q-Learning Approach to Critical Care
Ali Shirali
Alexander Schubert
Ahmed Alaa
OffRL
80
4
0
13 Jun 2023
Stepsize Learning for Policy Gradient Methods in Contextual Markov Decision Processes
Luca Sabbioni
Francesco Corda
Marcello Restelli
47
0
0
13 Jun 2023
Using Collision Momentum in Deep Reinforcement Learning Based Adversarial Pedestrian Modeling
Di Chen
Ekim Yurtsever
Keith A. Redmill
Ü. Özgüner
67
4
0
13 Jun 2023
Tuning Legged Locomotion Controllers via Safe Bayesian Optimization
Daniel Widmer
Dong-oh Kang
Bhavya Sukhija
Jonas Hübotter
Andreas Krause
Stelian Coros
98
14
0
12 Jun 2023
Reinforcement Learning with Parameterized Manipulation Primitives for Robotic Assembly
N. Vuong
Quang Pham
54
0
0
11 Jun 2023
PACER: A Fully Push-forward-based Distributional Reinforcement Learning Algorithm
Wensong Bai
Chao Zhang
Yichao Fu
Lingwei Peng
Hui Qian
Bin Dai
73
1
0
11 Jun 2023
Policy Regularization with Dataset Constraint for Offline Reinforcement Learning
Yuhang Ran
Yi-Chen Li
Fuxiang Zhang
Zongzhang Zhang
Yang Yu
OffRL
89
27
0
11 Jun 2023
Bring Your Own (Non-Robust) Algorithm to Solve Robust MDPs by Estimating The Worst Kernel
Kaixin Wang
Uri Gadot
Navdeep Kumar
Kfir Y. Levy
Shie Mannor
85
4
0
09 Jun 2023
Decoupled Prioritized Resampling for Offline RL
Yang Yue
Bingyi Kang
Xiao Ma
Qisen Yang
Gao Huang
S. Song
Shuicheng Yan
OffRL
88
0
0
08 Jun 2023
Active Inference in Hebbian Learning Networks
A. Safa
Tim Verbelen
Lars Keuninckx
I. Ocket
A. Bourdoux
F. Catthoor
Georges G. E. Gielen
Gert Cauwenberghs
71
2
0
08 Jun 2023
Boosting Offline Reinforcement Learning with Action Preference Query
Qisen Yang
Shenzhi Wang
Matthieu Lin
S. Song
Gao Huang
OffRL
87
10
0
06 Jun 2023
Learning Embeddings for Sequential Tasks Using Population of Agents
Mridul Mahajan
Georgios Tzannetos
Goran Radanović
Adish Singla
FedML
39
0
0
05 Jun 2023
Risk-Aware Reward Shaping of Reinforcement Learning Agents for Autonomous Driving
Linjin Wu
Zengjie Zhang
S. Haesaert
Zhiqiang Ma
Zhiyong Sun
OffRL
62
6
0
05 Jun 2023
Seizing Serendipity: Exploiting the Value of Past Success in Off-Policy Actor-Critic
Tianying Ji
Yuping Luo
Gang Hua
Xianyuan Zhan
Jianwei Zhang
Huazhe Xu
OffRL
OnRL
108
17
0
05 Jun 2023
For SALE: State-Action Representation Learning for Deep Reinforcement Learning
Scott Fujimoto
Wei-Di Chang
Edward James Smith
S. Gu
Doina Precup
David Meger
OffRL
95
55
0
04 Jun 2023
Reinforcement Learning with General Utilities: Simpler Variance Reduction and Large State-Action Space
Anas Barakat
Ilyas Fatkhullin
Niao He
93
12
0
02 Jun 2023
ReLU to the Rescue: Improve Your On-Policy Actor-Critic with Positive Advantages
Andrew Jesson
Chris Xiaoxuan Lu
Gunshi Gupta
Angelos Filos
Jakob N. Foerster
Y. Gal
OffRL
77
8
0
02 Jun 2023
Extracting Reward Functions from Diffusion Models
Felipe Nuti
Tim Franzmeyer
João F. Henriques
87
15
0
01 Jun 2023
Train Offline, Test Online: A Real Robot Learning Benchmark
G. Zhou
Victoria Dean
Mohan Kumar Srirama
Aravind Rajeswaran
Jyothish Pari
...
Tianhe Yu
Pieter Abbeel
Lerrel Pinto
Chelsea Finn
Abhi Gupta
OffRL
126
40
0
01 Jun 2023
Safe Offline Reinforcement Learning with Real-Time Budget Constraints
Qian Lin
Bo Tang
Zifan Wu
Chao Yu
Shangqin Mao
Qianlong Xie
Xingxing Wang
Dong Wang
OffRL
105
14
0
01 Jun 2023
NetHack is Hard to Hack
Ulyana Piterbarg
Lerrel Pinto
Rob Fergus
53
7
0
30 May 2023
IDToolkit: A Toolkit for Benchmarking and Developing Inverse Design Algorithms in Nanophotonics
Jia-Qi Yang
Yucheng Xu
Jianwei Shen
Ke-Bin Fan
De-Chuan Zhan
Yang Yang
55
1
0
30 May 2023
Off-Policy RL Algorithms Can be Sample-Efficient for Continuous Control via Sample Multiple Reuse
Jiafei Lyu
Le Wan
Zongqing Lu
Xiu Li
OffRL
63
9
0
29 May 2023
Self-Supervised Reinforcement Learning that Transfers using Random Features
Boyuan Chen
Chuning Zhu
Pulkit Agrawal
Kai Zhang
Abhishek Gupta
OffRL
SSL
84
9
0
26 May 2023
NASimEmu: Network Attack Simulator & Emulator for Training Agents Generalizing to Novel Scenarios
Jaromír Janisch
Tomáš Pevný
Viliam Lisý
82
19
0
26 May 2023
Counterfactual Explainer Framework for Deep Reinforcement Learning Models Using Policy Distillation
Amir Samadi
K. Koufos
Kurt Debattista
M. Dianati
OffRL
58
3
0
25 May 2023
Aerial Gym -- Isaac Gym Simulator for Aerial Robots
Mihir Kulkarni
Theodor J. L. Forgaard
Kostas Alexis
72
15
0
25 May 2023