ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1606.01540
  4. Cited By
OpenAI Gym

OpenAI Gym

5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
    OffRL
    ODL
ArXivPDFHTML

Papers citing "OpenAI Gym"

50 / 1,654 papers shown
Title
Sample and Computationally Efficient Continuous-Time Reinforcement Learning with General Function Approximation
Sample and Computationally Efficient Continuous-Time Reinforcement Learning with General Function Approximation
Runze Zhao
Yue Yu
Adams Yiyue Zhu
Chen Yang
Dongruo Zhou
7
0
0
20 May 2025
Temporal Distance-aware Transition Augmentation for Offline Model-based Reinforcement Learning
Temporal Distance-aware Transition Augmentation for Offline Model-based Reinforcement Learning
Dongsu Lee
Minhae Kwon
OffRL
11
0
0
19 May 2025
Learning Probabilistic Temporal Logic Specifications for Stochastic Systems
Learning Probabilistic Temporal Logic Specifications for Stochastic Systems
Rajarshi Roy
Yash Pote
David Parker
Marta Kwiatkowska
11
0
0
17 May 2025
Can Global XAI Methods Reveal Injected Bias in LLMs? SHAP vs Rule Extraction vs RuleSHAP
Can Global XAI Methods Reveal Injected Bias in LLMs? SHAP vs Rule Extraction vs RuleSHAP
Francesco Sovrano
22
0
0
16 May 2025
ReaCritic: Large Reasoning Transformer-based DRL Critic-model Scaling For Heterogeneous Networks
ReaCritic: Large Reasoning Transformer-based DRL Critic-model Scaling For Heterogeneous Networks
Feiran You
Hongyang Du
OffRL
LRM
29
0
0
16 May 2025
Visual Planning: Let's Think Only with Images
Visual Planning: Let's Think Only with Images
Yi Xu
Chengzu Li
Han Zhou
Xingchen Wan
Caiqi Zhang
Anna Korhonen
Ivan Vulić
LM&Ro
LRM
19
0
0
16 May 2025
ImagineBench: Evaluating Reinforcement Learning with Large Language Model Rollouts
ImagineBench: Evaluating Reinforcement Learning with Large Language Model Rollouts
Jing-Cheng Pang
Kaiyuan Li
Yansen Wang
Si-Hang Yang
Shengyi Jiang
Yang Yu
OffRL
LLMAG
LM&Ro
LRM
19
0
0
15 May 2025
Diffusion-SAFE: Shared Autonomy Framework with Diffusion for Safe Human-to-Robot Driving Handover
Yunxin Fan
Monroe Kennedy III
28
0
0
15 May 2025
Fine-tuning Diffusion Policies with Backpropagation Through Diffusion Timesteps
Fine-tuning Diffusion Policies with Backpropagation Through Diffusion Timesteps
Ningyuan Yang
Jiaxuan Gao
Feng Gao
Yi Wu
Chao Yu
41
0
0
15 May 2025
High-order Regularization for Machine Learning and Learning-based Control
High-order Regularization for Machine Learning and Learning-based Control
Xinghua Liu
Ming Cao
25
0
0
13 May 2025
Monte Carlo Beam Search for Actor-Critic Reinforcement Learning in Continuous Control
Monte Carlo Beam Search for Actor-Critic Reinforcement Learning in Continuous Control
Hazim Alzorgan
Abolfazl Razi
36
0
0
13 May 2025
Decentralized Distributed Proximal Policy Optimization (DD-PPO) for High Performance Computing Scheduling on Multi-User Systems
Decentralized Distributed Proximal Policy Optimization (DD-PPO) for High Performance Computing Scheduling on Multi-User Systems
Matthew Sgambati
Aleksandar Vakanski
Matthew Anderson
37
0
0
06 May 2025
TutorGym: A Testbed for Evaluating AI Agents as Tutors and Students
TutorGym: A Testbed for Evaluating AI Agents as Tutors and Students
Daniel Weitekamp
M. N. Siddiqui
Christopher MacLellan
LLMAG
ELM
37
0
0
02 May 2025
Towards Efficient Online Tuning of VLM Agents via Counterfactual Soft Reinforcement Learning
Towards Efficient Online Tuning of VLM Agents via Counterfactual Soft Reinforcement Learning
Lang Feng
Weihao Tan
Zhiyi Lyu
Longtao Zheng
Haiyang Xu
Ming Yan
Fei Huang
Jingyi Wang
31
0
0
01 May 2025
Return Capping: Sample-Efficient CVaR Policy Gradient Optimisation
Return Capping: Sample-Efficient CVaR Policy Gradient Optimisation
Harry Mead
Clarissa Costen
Bruno Lacerda
Nick Hawes
24
0
0
29 Apr 2025
DeeP-Mod: Deep Dynamic Programming based Environment Modelling using Feature Extraction
DeeP-Mod: Deep Dynamic Programming based Environment Modelling using Feature Extraction
Chris Child
Lam Ngo
49
0
0
29 Apr 2025
Fitness Landscape of Large Language Model-Assisted Automated Algorithm Search
Fitness Landscape of Large Language Model-Assisted Automated Algorithm Search
Fei Liu
Qingfu Zhang
Xialiang Tong
M. Yuan
K. Mao
77
0
0
28 Apr 2025
HyperController: A Hyperparameter Controller for Fast and Stable Training of Reinforcement Learning Neural Networks
HyperController: A Hyperparameter Controller for Fast and Stable Training of Reinforcement Learning Neural Networks
J. Gornet
Yiannis Kantaros
Bruno Sinopoli
206
0
0
27 Apr 2025
Recursive Deep Inverse Reinforcement Learning
Recursive Deep Inverse Reinforcement Learning
Paul Ghanem
Michael Potter
Owen Howell
Pau Closas
A. Ramezani
Deniz Erdogmus
Tales Imbiriba
32
0
0
17 Apr 2025
VIPO: Value Function Inconsistency Penalized Offline Reinforcement Learning
VIPO: Value Function Inconsistency Penalized Offline Reinforcement Learning
Xuyang Chen
Guojian Wang
Keyu Yan
Lin Zhao
OffRL
37
1
0
16 Apr 2025
pix2pockets: Shot Suggestions in 8-Ball Pool from a Single Image in the Wild
pix2pockets: Shot Suggestions in 8-Ball Pool from a Single Image in the Wild
Jonas Myhre Schiøtt
Viktor Sebastian Petersen
Dimitrios P. Papadopoulos
VLM
40
0
0
16 Apr 2025
Moderate Actor-Critic Methods: Controlling Overestimation Bias via Expectile Loss
Moderate Actor-Critic Methods: Controlling Overestimation Bias via Expectile Loss
Ukjo Hwang
Songnam Hong
OffRL
41
0
0
14 Apr 2025
TRATSS: Transformer-Based Task Scheduling System for Autonomous Vehicles
TRATSS: Transformer-Based Task Scheduling System for Autonomous Vehicles
Yazan Youssef
Paulo Ricardo Marques de Araujo
Aboelmagd Noureldin
Sidney Givigi
26
0
0
07 Apr 2025
Sim4EndoR: A Reinforcement Learning Centered Simulation Platform for Task Automation of Endovascular Robotics
Sim4EndoR: A Reinforcement Learning Centered Simulation Platform for Task Automation of Endovascular Robotics
Tianliang Yao
Madaoji Ban
Bo Lu
Zhiqiang Pei
Peng Qi
37
2
0
04 Apr 2025
A Constrained Multi-Agent Reinforcement Learning Approach to Autonomous Traffic Signal Control
A Constrained Multi-Agent Reinforcement Learning Approach to Autonomous Traffic Signal Control
Anirudh Satheesh
Keenan Powell
50
0
0
30 Mar 2025
On the Mistaken Assumption of Interchangeable Deep Reinforcement Learning Implementations
On the Mistaken Assumption of Interchangeable Deep Reinforcement Learning Implementations
Rajdeep Singh Hundal
Yan Xiao
Xiaochun Cao
Jin Song Dong
Manuel Rigger
51
0
0
28 Mar 2025
Bridging Evolutionary Multiobjective Optimization and GPU Acceleration via Tensorization
Bridging Evolutionary Multiobjective Optimization and GPU Acceleration via Tensorization
Zhenyu Liang
Hao Li
Naiwei Yu
Kebin Sun
Ran Cheng
71
1
0
26 Mar 2025
FF-SRL: High Performance GPU-Based Surgical Simulation For Robot Learning
FF-SRL: High Performance GPU-Based Surgical Simulation For Robot Learning
Diego DallÁlba
Michał Nasket
Sabina Kaminska
Przemysław Korzeniowski
OffRL
AI4CE
62
1
0
24 Mar 2025
Mining-Gym: A Configurable RL Benchmarking Environment for Truck Dispatch Scheduling
Mining-Gym: A Configurable RL Benchmarking Environment for Truck Dispatch Scheduling
C. Banerjee
Kien Nguyen
Clinton Fookes
OffRL
64
0
0
24 Mar 2025
Computationally and Sample Efficient Safe Reinforcement Learning Using Adaptive Conformal Prediction
Computationally and Sample Efficient Safe Reinforcement Learning Using Adaptive Conformal Prediction
Hao Zhou
Yanze Zhang
Wenhao Luo
39
0
0
22 Mar 2025
Survey on Evaluation of LLM-based Agents
Survey on Evaluation of LLM-based Agents
Asaf Yehudai
Lilach Eden
Alan Li
Guy Uziel
Yilun Zhao
Roy Bar-Haim
Arman Cohan
Michal Shmueli-Scheuer
LLMAG
ELM
Presented at ResearchTrend Connect | LLMAG on 07 May 2025
102
7
0
20 Mar 2025
APF+: Boosting adaptive-potential function reinforcement learning methods with a W-shaped network for high-dimensional games
APF+: Boosting adaptive-potential function reinforcement learning methods with a W-shaped network for high-dimensional games
Yifei Chen
Lambert Schomaker
41
0
0
17 Mar 2025
A nonlinear real time capable motion cueing algorithm based on deep reinforcement learning
A nonlinear real time capable motion cueing algorithm based on deep reinforcement learning
Hendrik Scheidel
Camilo Gonzalez
Houshyar Asadi
Tobias Bellmann
A. Seefried
Shady M. K. Mohamed
Saeid Nahavandi
55
0
0
13 Mar 2025
MarineGym: A High-Performance Reinforcement Learning Platform for Underwater Robotics
Shuguang Chu
Zebin Huang
Yutong Li
Mingwei Lin
Ignacio Carlucho
Y. Pétillot
Canjun Yang
OffRL
AI4CE
45
0
0
13 Mar 2025
Safe exploration in reproducing kernel Hilbert spaces
Abdullah Tokmak
Kiran G. Krishnan
Thomas B. Schon
Dominik Baumann
42
0
0
13 Mar 2025
RESTRAIN: Reinforcement Learning-Based Secure Framework for Trigger-Action IoT Environment
Md Morshed Alam
Lokesh Chandra Das
Sandip Roy
Sachin Shetty
Weichao Wang
AAML
OffRL
61
0
0
12 Mar 2025
Soft Actor-Critic-based Control Barrier Adaptation for Robust Autonomous Navigation in Unknown Environments
Nicholas Mohammad
Nicola Bezzo
52
1
0
11 Mar 2025
Guaranteeing Out-Of-Distribution Detection in Deep RL via Transition Estimation
Mohit Prashant
Arvind Easwaran
Suman Das
Michael Yuhas
OffRL
75
1
0
07 Mar 2025
Review of Machine Learning for Micro-Electronic Design Verification
Review of Machine Learning for Micro-Electronic Design Verification
Christopher Bennett
Kerstin Eder
36
0
0
05 Mar 2025
Actor-Critic Cooperative Compensation to Model Predictive Control for Off-Road Autonomous Vehicles Under Unknown Dynamics
Prakhar Gupta
J. Smereka
Yunyi Jia
47
0
0
01 Mar 2025
Equivariant Reinforcement Learning Frameworks for Quadrotor Low-Level Control
Equivariant Reinforcement Learning Frameworks for Quadrotor Low-Level Control
Beomyeol Yu
Taeyoung Lee
39
0
0
27 Feb 2025
Physics-Driven Data Generation for Contact-Rich Manipulation via Trajectory Optimization
Physics-Driven Data Generation for Contact-Rich Manipulation via Trajectory Optimization
Lujie Yang
H.J. Terry Suh
Tong Zhao
B. P. Graesdal
Tarik Kelestemur
Jiuguang Wang
Tao Pang
Russ Tedrake
88
3
0
27 Feb 2025
Hyperspherical Normalization for Scalable Deep Reinforcement Learning
Hyperspherical Normalization for Scalable Deep Reinforcement Learning
Hojoon Lee
Youngdo Lee
Takuma Seno
Donghu Kim
Peter Stone
Jaegul Choo
70
1
0
24 Feb 2025
SALSA-RL: Stability Analysis in the Latent Space of Actions for Reinforcement Learning
SALSA-RL: Stability Analysis in the Latent Space of Actions for Reinforcement Learning
Xuyang Li
Romit Maulik
48
0
0
24 Feb 2025
TAG: A Decentralized Framework for Multi-Agent Hierarchical Reinforcement Learning
TAG: A Decentralized Framework for Multi-Agent Hierarchical Reinforcement Learning
Giuseppe Paolo
Abdelhakim Benechehab
Hamza Cherkaoui
Albert Thomas
Balázs Kégl
48
0
0
21 Feb 2025
Flow-of-Options: Diversified and Improved LLM Reasoning by Thinking Through Options
Flow-of-Options: Diversified and Improved LLM Reasoning by Thinking Through Options
Lakshmi Nair
Ian Trase
Mark Kim
AIFin
LRM
AI4CE
53
1
0
18 Feb 2025
RobotIQ: Empowering Mobile Robots with Human-Level Planning for Real-World Execution
RobotIQ: Empowering Mobile Robots with Human-Level Planning for Real-World Execution
Emmanuel K. Raptis
Athanasios Ch. Kapoutsis
Elias B. Kosmatopoulos
LM&Ro
82
0
0
18 Feb 2025
Warm Starting of CMA-ES for Contextual Optimization Problems
Warm Starting of CMA-ES for Contextual Optimization Problems
Yuta Sekino
Kento Uchida
Shinichi Shirakawa
86
0
0
18 Feb 2025
MassSpecGym: A benchmark for the discovery and identification of molecules
MassSpecGym: A benchmark for the discovery and identification of molecules
Roman Bushuiev
Anton Bushuiev
Niek F. de Jonge
A. Young
Fleming Kretschmer
...
Justin J. J. van der Hooft
Michael A. Stravs
Sebastian Böcker
Josef Sivic
Tomáš Pluskal
54
4
0
17 Feb 2025
Stonefish: Supporting Machine Learning Research in Marine Robotics
Stonefish: Supporting Machine Learning Research in Marine Robotics
Michele Grimaldi
Patryk Cieslak
Eduardo Ochoa
Vibhav Bharti
Hayat Rajani
Ignacio Carlucho
Maria Koskinopoulou
Y. Pétillot
N. Gracias
AI4CE
63
3
0
17 Feb 2025
1234...323334
Next