ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1606.01540
  4. Cited By
OpenAI Gym

OpenAI Gym

5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
    OffRL
    ODL
ArXivPDFHTML

Papers citing "OpenAI Gym"

50 / 1,654 papers shown
Title
Hindsight-DICE: Stable Credit Assignment for Deep Reinforcement Learning
Hindsight-DICE: Stable Credit Assignment for Deep Reinforcement Learning
Akash Velu
Skanda Vaidyanath
Dilip Arumugam
OffRL
35
1
0
21 Jul 2023
Reparameterized Policy Learning for Multimodal Trajectory Optimization
Reparameterized Policy Learning for Multimodal Trajectory Optimization
Zhiao Huang
Litian Liang
Z. Ling
Xuanlin Li
Chuang Gan
H. Su
43
11
0
20 Jul 2023
Technical Challenges of Deploying Reinforcement Learning Agents for Game
  Testing in AAA Games
Technical Challenges of Deploying Reinforcement Learning Agents for Game Testing in AAA Games
Jonas Gillberg
Joakim Bergdahl
Alessandro Sestini
Andy Eakins
Linus Gisslén
OffRL
33
7
0
19 Jul 2023
PyTAG: Challenges and Opportunities for Reinforcement Learning in
  Tabletop Games
PyTAG: Challenges and Opportunities for Reinforcement Learning in Tabletop Games
Martin Balla
G. E. Long
Dominik Jeurissen
J. Goodman
Raluca D. Gaina
Diego Perez-Liebana
LMTD
OffRL
OnRL
33
1
0
19 Jul 2023
Reproducibility in Machine Learning-Driven Research
Reproducibility in Machine Learning-Driven Research
Harald Semmelrock
Simone Kopeinik
Dieter Theiler
Tony Ross-Hellauer
Dominik Kowald
AI4CE
33
15
0
19 Jul 2023
LuckyMera: a Modular AI Framework for Building Hybrid NetHack Agents
LuckyMera: a Modular AI Framework for Building Hybrid NetHack Agents
Luigi Quarantiello
Simone Marzeddu
Antonio Guzzi
Vincenzo Lomonaco
29
0
0
17 Jul 2023
Magnetic Field-Based Reward Shaping for Goal-Conditioned Reinforcement
  Learning
Magnetic Field-Based Reward Shaping for Goal-Conditioned Reinforcement Learning
Hongyu Ding
Yuan-Yan Tang
Qing Wu
Bo Wang
Chunlin Chen
Zhi Wang
40
4
0
16 Jul 2023
Probabilistic Constrained Reinforcement Learning with Formal
  Interpretability
Probabilistic Constrained Reinforcement Learning with Formal Interpretability
Yanran Wang
Qiuchen Qian
David E. Boyle
21
4
0
13 Jul 2023
Bag of Views: An Appearance-based Approach to Next-Best-View Planning
  for 3D Reconstruction
Bag of Views: An Appearance-based Approach to Next-Best-View Planning for 3D Reconstruction
Sara Hatami Gazani
Matthew Tucsok
I. Mantegh
Homayoun Najjaran
23
4
0
11 Jul 2023
Boosting Feedback Efficiency of Interactive Reinforcement Learning by
  Adaptive Learning from Scores
Boosting Feedback Efficiency of Interactive Reinforcement Learning by Adaptive Learning from Scores
Shukai Liu
Chenming Wu
Ying Li
Liang Zhang
42
0
0
11 Jul 2023
Pegasus Simulator: An Isaac Sim Framework for Multiple Aerial Vehicles
  Simulation
Pegasus Simulator: An Isaac Sim Framework for Multiple Aerial Vehicles Simulation
Marcelo Jacinto
Joao Pinto
Jay Patrikar
John Keller
R. Cunha
Sebastian Scherer
A. Pascoal
24
14
0
11 Jul 2023
Contextual Pre-planning on Reward Machine Abstractions for Enhanced
  Transfer in Deep Reinforcement Learning
Contextual Pre-planning on Reward Machine Abstractions for Enhanced Transfer in Deep Reinforcement Learning
Guy Azran
Mohamad H. Danesh
Stefano V. Albrecht
Sarah Keren
AI4CE
45
1
0
11 Jul 2023
Probabilistic Counterexample Guidance for Safer Reinforcement Learning
  (Extended Version)
Probabilistic Counterexample Guidance for Safer Reinforcement Learning (Extended Version)
Xiaotong Ji
Antonio Filieri
OffRL
27
1
0
10 Jul 2023
Procedurally generating rules to adapt difficulty for narrative puzzle
  games
Procedurally generating rules to adapt difficulty for narrative puzzle games
Thomas Vase Schultz Volden
Djordje Grbic
Paolo Burelli
10
1
0
07 Jul 2023
OmniBoost: Boosting Throughput of Heterogeneous Embedded Devices under
  Multi-DNN Workload
OmniBoost: Boosting Throughput of Heterogeneous Embedded Devices under Multi-DNN Workload
Andreas Karatzas
Iraklis Anagnostopoulos
21
20
0
06 Jul 2023
ContainerGym: A Real-World Reinforcement Learning Benchmark for Resource
  Allocation
ContainerGym: A Real-World Reinforcement Learning Benchmark for Resource Allocation
Abhijeet Pendyala
Justin Dettmer
Tobias Glasmachers
Asma Atamna
OffRL
19
6
0
06 Jul 2023
A Neuromorphic Architecture for Reinforcement Learning from Real-Valued
  Observations
A Neuromorphic Architecture for Reinforcement Learning from Real-Valued Observations
S. Chevtchenko
Y. Bethi
Teresa B Ludermir
Saeed Afshar
OffRL
16
1
0
06 Jul 2023
Dynamic Observation Policies in Observation Cost-Sensitive Reinforcement
  Learning
Dynamic Observation Policies in Observation Cost-Sensitive Reinforcement Learning
C. Bellinger
Mark Crowley
Isaac Tamblyn
22
3
0
05 Jul 2023
Hierarchical Planning and Policy Shaping Shared Autonomy for Articulated
  Robots
Hierarchical Planning and Policy Shaping Shared Autonomy for Articulated Robots
E. Yousefi
Mo Chen
I. Sharf
14
1
0
04 Jul 2023
Distributional Model Equivalence for Risk-Sensitive Reinforcement
  Learning
Distributional Model Equivalence for Risk-Sensitive Reinforcement Learning
Tyler Kastner
Murat A. Erdogdu
Amir-massoud Farahmand
OffRL
34
4
0
04 Jul 2023
Comparing Reinforcement Learning and Human Learning using the Game of
  Hidden Rules
Comparing Reinforcement Learning and Human Learning using the Game of Hidden Rules
Eric Pulick
Vladimir Menkov
Yonatan Dov Mintz
Paul B. Kantor
Vicki M. Bier
OffRL
19
0
0
30 Jun 2023
Zespol: A Lightweight Environment for Training Swarming Agents
Zespol: A Lightweight Environment for Training Swarming Agents
Shay Snyder
Kevin A. Zhu
Ricardo Vega
Cameron Nowzari
Maryam Parsa
25
2
0
30 Jun 2023
Probabilistic Constraint for Safety-Critical Reinforcement Learning
Probabilistic Constraint for Safety-Critical Reinforcement Learning
Weiqin Chen
D. Subramanian
Santiago Paternain
37
15
0
29 Jun 2023
RL4CO: an Extensive Reinforcement Learning for Combinatorial
  Optimization Benchmark
RL4CO: an Extensive Reinforcement Learning for Combinatorial Optimization Benchmark
Federico Berto
Chuanbo Hua
Junyoung Park
Laurin Luttmann
Yining Ma
...
Guojie Song
Changhyun Kwon
Kevin Tierney
Lin Xie
Jinkyoo Park
OffRL
34
27
0
29 Jun 2023
Learning Environment Models with Continuous Stochastic Dynamics
Learning Environment Models with Continuous Stochastic Dynamics
Martin Tappler
Edi Muškardin
B. Aichernig
Bettina Könighofer
AI4CE
38
1
0
29 Jun 2023
Eigensubspace of Temporal-Difference Dynamics and How It Improves Value
  Approximation in Reinforcement Learning
Eigensubspace of Temporal-Difference Dynamics and How It Improves Value Approximation in Reinforcement Learning
Qiang He
Dinesh Manocha
Meng Fang
S. Maghsudi
45
4
0
29 Jun 2023
Principles and Guidelines for Evaluating Social Robot Navigation
  Algorithms
Principles and Guidelines for Evaluating Social Robot Navigation Algorithms
Anthony G. Francis
Claudia Pérez-DÁrpino
Chengshu Li
Fei Xia
Alexandre Alahi
...
Xuesu Xiao
Peng Xu
Naoki Yokoyama
Alexander Toshev
Roberto Martin-Martin Logical Robotics
39
69
0
29 Jun 2023
MRHER: Model-based Relay Hindsight Experience Replay for Sequential
  Object Manipulation Tasks with Sparse Rewards
MRHER: Model-based Relay Hindsight Experience Replay for Sequential Object Manipulation Tasks with Sparse Rewards
Yuming Huang
Bin Ren
Ziming Xu
Lianghong Wu
OffRL
23
0
0
28 Jun 2023
Rethinking Closed-loop Training for Autonomous Driving
Rethinking Closed-loop Training for Autonomous Driving
Chris Zhang
R. Guo
Wenyuan Zeng
Yuwen Xiong
Binbin Dai
Rui Hu
Mengye Ren
R. Urtasun
OffRL
33
30
0
27 Jun 2023
Optimizing Credit Limit Adjustments Under Adversarial Goals Using
  Reinforcement Learning
Optimizing Credit Limit Adjustments Under Adversarial Goals Using Reinforcement Learning
Sherly Alfonso-Sánchez
Jesus Solano
Alejandro Correa-Bahnsen
Kristina P. Sendova
Cristián Bravo
18
7
0
27 Jun 2023
Creating Valid Adversarial Examples of Malware
Creating Valid Adversarial Examples of Malware
M. Kozák
M. Jureček
Mark Stamp
Fabio Di Troia
AAML
23
8
0
23 Jun 2023
Transferable Curricula through Difficulty Conditioned Generators
Transferable Curricula through Difficulty Conditioned Generators
Sidney Tio
Pradeep Varakantham
25
4
0
22 Jun 2023
Novelty Accommodating Multi-Agent Planning in High Fidelity Simulated Open World
Novelty Accommodating Multi-Agent Planning in High Fidelity Simulated Open World
James Chao
W. Piotrowski
Roni Stern
Héctor J. Ortiz-Peña
Mitch Manzanares
Shiwali Mohan
D. Lange
30
0
0
22 Jun 2023
Optimistic Active Exploration of Dynamical Systems
Optimistic Active Exploration of Dynamical Systems
Bhavya Sukhija
Lenart Treven
Cansu Sancaktar
Sebastian Blaes
Stelian Coros
Andreas Krause
42
17
0
21 Jun 2023
AdCraft: An Advanced Reinforcement Learning Benchmark Environment for
  Search Engine Marketing Optimization
AdCraft: An Advanced Reinforcement Learning Benchmark Environment for Search Engine Marketing Optimization
Maziar Gomrokchi
Owen Levin
Jeffrey Roach
Jonah White
OffRL
35
1
0
21 Jun 2023
Practical First-Order Bayesian Optimization Algorithms
Practical First-Order Bayesian Optimization Algorithms
Utkarsh Prakash
Aryan Chollera
Kushagra Khatwani
P. K. J.
Tejas Bodas
32
1
0
19 Jun 2023
SeMAIL: Eliminating Distractors in Visual Imitation via Separated Models
SeMAIL: Eliminating Distractors in Visual Imitation via Separated Models
Shenghua Wan
Yucen Wang
Minghao Shao
Ruying Chen
De-Chuan Zhan
61
7
0
19 Jun 2023
On Evolvability and Behavior Landscapes in Neuroevolutionary Divergent
  Search
On Evolvability and Behavior Landscapes in Neuroevolutionary Divergent Search
Bruno Gašperov
Marko Đurasević
26
0
0
16 Jun 2023
Mimicking Better by Matching the Approximate Action Distribution
Mimicking Better by Matching the Approximate Action Distribution
Joao A. Candido Ramos
Lionel Blondé
Naoya Takeishi
Alexandros Kalousis
48
2
0
16 Jun 2023
QuadSwarm: A Modular Multi-Quadrotor Simulator for Deep Reinforcement
  Learning with Direct Thrust Control
QuadSwarm: A Modular Multi-Quadrotor Simulator for Deep Reinforcement Learning with Direct Thrust Control
Zhehui Huang
Sumeet Batra
Tao Chen
Rahul Krupani
T. Kumar
Artem Molchanov
Aleksei Petrenko
James A. Preiss
Zhaojing Yang
Gaurav Sukhatme
28
6
0
15 Jun 2023
Simplified Temporal Consistency Reinforcement Learning
Simplified Temporal Consistency Reinforcement Learning
Yi Zhao
Wenshuai Zhao
Rinu Boney
Arno Solin
Joni Pajarinen
OffRL
35
13
0
15 Jun 2023
DiAReL: Reinforcement Learning with Disturbance Awareness for Robust
  Sim2Real Policy Transfer in Robot Control
DiAReL: Reinforcement Learning with Disturbance Awareness for Robust Sim2Real Policy Transfer in Robot Control
M. Malmir
Josip Josifovski
Noah Klarmann
Alois C. Knoll
42
2
0
15 Jun 2023
Off-policy Evaluation in Doubly Inhomogeneous Environments
Off-policy Evaluation in Doubly Inhomogeneous Environments
Zeyu Bian
C. Shi
Zhengling Qi
Lan Wang
OffRL
37
4
0
14 Jun 2023
Pruning the Way to Reliable Policies: A Multi-Objective Deep Q-Learning
  Approach to Critical Care
Pruning the Way to Reliable Policies: A Multi-Objective Deep Q-Learning Approach to Critical Care
Ali Shirali
Alexander Schubert
Ahmed Alaa
OffRL
38
3
0
13 Jun 2023
Stepsize Learning for Policy Gradient Methods in Contextual Markov
  Decision Processes
Stepsize Learning for Policy Gradient Methods in Contextual Markov Decision Processes
Luca Sabbioni
Francesco Corda
Marcello Restelli
29
0
0
13 Jun 2023
Using Collision Momentum in Deep Reinforcement Learning Based
  Adversarial Pedestrian Modeling
Using Collision Momentum in Deep Reinforcement Learning Based Adversarial Pedestrian Modeling
Di Chen
Ekim Yurtsever
Keith A. Redmill
Ü. Özgüner
33
4
0
13 Jun 2023
Tuning Legged Locomotion Controllers via Safe Bayesian Optimization
Tuning Legged Locomotion Controllers via Safe Bayesian Optimization
Daniel Widmer
Dong-oh Kang
Bhavya Sukhija
Jonas Hübotter
Andreas Krause
Stelian Coros
30
14
0
12 Jun 2023
Reinforcement Learning with Parameterized Manipulation Primitives for
  Robotic Assembly
Reinforcement Learning with Parameterized Manipulation Primitives for Robotic Assembly
N. Vuong
Quang Pham
32
0
0
11 Jun 2023
PACER: A Fully Push-forward-based Distributional Reinforcement Learning
  Algorithm
PACER: A Fully Push-forward-based Distributional Reinforcement Learning Algorithm
Wensong Bai
Chao Zhang
Yichao Fu
Lingwei Peng
Hui Qian
Bin Dai
32
1
0
11 Jun 2023
Policy Regularization with Dataset Constraint for Offline Reinforcement
  Learning
Policy Regularization with Dataset Constraint for Offline Reinforcement Learning
Yuhang Ran
Yi-Chen Li
Fuxiang Zhang
Zongzhang Zhang
Yang Yu
OffRL
37
23
0
11 Jun 2023
Previous
123...8910...323334
Next