ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1606.01540
  4. Cited By
OpenAI Gym

OpenAI Gym

5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
    OffRL
    ODL
ArXivPDFHTML

Papers citing "OpenAI Gym"

50 / 1,654 papers shown
Title
Settling Decentralized Multi-Agent Coordinated Exploration by Novelty
  Sharing
Settling Decentralized Multi-Agent Coordinated Exploration by Novelty Sharing
Haobin Jiang
Ziluo Ding
Zongqing Lu
34
2
0
03 Feb 2024
The RL/LLM Taxonomy Tree: Reviewing Synergies Between Reinforcement
  Learning and Large Language Models
The RL/LLM Taxonomy Tree: Reviewing Synergies Between Reinforcement Learning and Large Language Models
M. Pternea
Prerna Singh
Abir Chakraborty
Y. Oruganti
M. Milletarí
Sayli Bapat
Kebei Jiang
OffRL
36
7
0
02 Feb 2024
Two-Timescale Critic-Actor for Average Reward MDPs with Function
  Approximation
Two-Timescale Critic-Actor for Average Reward MDPs with Function Approximation
Prashansa Panda
Shalabh Bhatnagar
38
1
0
02 Feb 2024
Scalable Multi-modal Model Predictive Control via Duality-based Interaction Predictions
Scalable Multi-modal Model Predictive Control via Duality-based Interaction Predictions
Hansung Kim
Siddharth H. Nair
Francesco Borrelli
44
1
0
02 Feb 2024
Control in Stochastic Environment with Delays: A Model-based
  Reinforcement Learning Approach
Control in Stochastic Environment with Delays: A Model-based Reinforcement Learning Approach
Zhiyuan Yao
Ionuţ Florescu
Chihoon Lee
OffRL
14
2
0
01 Feb 2024
A Reinforcement Learning Based Controller to Minimize Forces on the
  Crutches of a Lower-Limb Exoskeleton
A Reinforcement Learning Based Controller to Minimize Forces on the Crutches of a Lower-Limb Exoskeleton
Aydin Emre Utku
S. E. Ada
Muhammet Hatipoglu
Mustafa Derman
Emre Ugur
Evren Samur
17
0
0
31 Jan 2024
Enhancing End-to-End Multi-Task Dialogue Systems: A Study on Intrinsic
  Motivation Reinforcement Learning Algorithms for Improved Training and
  Adaptability
Enhancing End-to-End Multi-Task Dialogue Systems: A Study on Intrinsic Motivation Reinforcement Learning Algorithms for Improved Training and Adaptability
Navin Kamuni
Hardik Shah
Sathishkumar Chintala
Naveen Kunchakuri
Sujatha Alla Old Dominion
42
19
0
31 Jan 2024
A comparison of RL-based and PID controllers for 6-DOF swimming robots:
  hybrid underwater object tracking
A comparison of RL-based and PID controllers for 6-DOF swimming robots: hybrid underwater object tracking
F. Lotfi
K. Virji
Nicholas Dudek
Gregory Dudek
27
0
0
29 Jan 2024
SERL: A Software Suite for Sample-Efficient Robotic Reinforcement Learning
SERL: A Software Suite for Sample-Efficient Robotic Reinforcement Learning
Jianlan Luo
Zheyuan Hu
Charles Xu
You Liang Tan
Jacob Berg
Archit Sharma
S. Schaal
Chelsea Finn
Abhishek Gupta
Sergey Levine
OffRL
OnRL
39
41
0
29 Jan 2024
DiffuserLite: Towards Real-time Diffusion Planning
DiffuserLite: Towards Real-time Diffusion Planning
Zibin Dong
Jianye Hao
Yifu Yuan
Fei Ni
Yitian Wang
Pengyi Li
Yan Zheng
83
15
0
27 Jan 2024
Regularized Q-Learning with Linear Function Approximation
Regularized Q-Learning with Linear Function Approximation
Jiachen Xi
Alfredo Garcia
P. Momcilovic
40
2
0
26 Jan 2024
Learning fast changing slow in spiking neural networks
Learning fast changing slow in spiking neural networks
Cristiano Capone
P. Muratore
OffRL
23
0
0
25 Jan 2024
Integrating Human Expertise in Continuous Spaces: A Novel Interactive
  Bayesian Optimization Framework with Preference Expected Improvement
Integrating Human Expertise in Continuous Spaces: A Novel Interactive Bayesian Optimization Framework with Preference Expected Improvement
Nikolaus Feith
Elmar Rueckert
37
1
0
23 Jan 2024
VRMN-bD: A Multi-modal Natural Behavior Dataset of Immersive Human Fear
  Responses in VR Stand-up Interactive Games
VRMN-bD: A Multi-modal Natural Behavior Dataset of Immersive Human Fear Responses in VR Stand-up Interactive Games
He Zhang
Xinyang Li
Yuanxi Sun
Xinyi Fu
Christine Qiu
John M. Carroll
40
4
0
22 Jan 2024
Information-Theoretic State Variable Selection for Reinforcement
  Learning
Information-Theoretic State Variable Selection for Reinforcement Learning
Charles Westphal
Stephen Hailes
Mirco Musolesi
26
3
0
21 Jan 2024
Synergistic Reinforcement and Imitation Learning for Vision-driven
  Autonomous Flight of UAV Along River
Synergistic Reinforcement and Imitation Learning for Vision-driven Autonomous Flight of UAV Along River
Zihan Wang
Jianwen Li
N. Mahmoudian
44
0
0
17 Jan 2024
IoTWarden: A Deep Reinforcement Learning Based Real-time Defense System
  to Mitigate Trigger-action IoT Attacks
IoTWarden: A Deep Reinforcement Learning Based Real-time Defense System to Mitigate Trigger-action IoT Attacks
Md Morshed Alam
Israt Jahan
Charlotte
AAML
14
2
0
16 Jan 2024
Learned Best-Effort LLM Serving
Learned Best-Effort LLM Serving
Siddharth Jha
Coleman Hooper
Xiaoxuan Liu
Sehoon Kim
Kurt Keutzer
18
2
0
15 Jan 2024
Towards Safe Load Balancing based on Control Barrier Functions and Deep
  Reinforcement Learning
Towards Safe Load Balancing based on Control Barrier Functions and Deep Reinforcement Learning
L. Dinh
Pham Tran Anh Quang
Jérémie Leguay
24
2
0
10 Jan 2024
A Minimaximalist Approach to Reinforcement Learning from Human Feedback
A Minimaximalist Approach to Reinforcement Learning from Human Feedback
Gokul Swamy
Christoph Dann
Rahul Kidambi
Zhiwei Steven Wu
Alekh Agarwal
OffRL
51
96
0
08 Jan 2024
Policy Optimization with Smooth Guidance Learned from State-Only
  Demonstrations
Policy Optimization with Smooth Guidance Learned from State-Only Demonstrations
Guojian Wang
Faguo Wu
Xiao Zhang
Tianyuan Chen
Zhiming Zheng
44
0
0
30 Dec 2023
Design Space Exploration of Approximate Computing Techniques with a
  Reinforcement Learning Approach
Design Space Exploration of Approximate Computing Techniques with a Reinforcement Learning Approach
Sepide Saeedi
A. Savino
S. Di Carlo
14
2
0
29 Dec 2023
Parameterized Projected Bellman Operator
Parameterized Projected Bellman Operator
Th´eo Vincent
Alberto Maria Metelli
Boris Belousov
Jan Peters
Marcello Restelli
Carlo DÉramo
30
3
0
20 Dec 2023
Model-Based Control with Sparse Neural Dynamics
Model-Based Control with Sparse Neural Dynamics
Ziang Liu
Genggeng Zhou
Jeff He
Tobia Marcucci
Fei-Fei Li
Jiajun Wu
Yunzhu Li
AI4CE
43
17
0
20 Dec 2023
Value Explicit Pretraining for Learning Transferable Representations
Value Explicit Pretraining for Learning Transferable Representations
Kiran Lekkala
Henghui Bao
Sumedh Anand Sontakke
Laurent Itti
SSL
45
0
0
19 Dec 2023
PPO-Clip Attains Global Optimality: Towards Deeper Understandings of
  Clipping
PPO-Clip Attains Global Optimality: Towards Deeper Understandings of Clipping
Nai-Chieh Huang
Ping-Chun Hsieh
Kuo-Hao Ho
I-Chen Wu
29
8
0
19 Dec 2023
XLand-MiniGrid: Scalable Meta-Reinforcement Learning Environments in JAX
XLand-MiniGrid: Scalable Meta-Reinforcement Learning Environments in JAX
Alexander Nikulin
Vladislav Kurenkov
Ilya Zisman
Artem Agarkov
Viacheslav Sinii
Sergey Kolesnikov
41
26
0
19 Dec 2023
TPTO: A Transformer-PPO based Task Offloading Solution for Edge
  Computing Environments
TPTO: A Transformer-PPO based Task Offloading Solution for Edge Computing Environments
N. Gholipour
M. D. Assunção
Pranav Agarwal
Julien Gascon-Samson
Rajkumar Buyya
27
1
0
18 Dec 2023
Human-Machine Teaming for UAVs: An Experimentation Platform
Human-Machine Teaming for UAVs: An Experimentation Platform
Laila El Moujtahid
S. Gottipati
Clodéric Mars
Matthew E. Taylor
31
1
0
18 Dec 2023
Solving the swing-up and balance task for the Acrobot and Pendubot with
  SAC
Solving the swing-up and balance task for the Acrobot and Pendubot with SAC
Chi Zhang
Akhil Sathuluri
Markus Zimmermann
23
3
0
18 Dec 2023
GO-DICE: Goal-Conditioned Option-Aware Offline Imitation Learning via
  Stationary Distribution Correction Estimation
GO-DICE: Goal-Conditioned Option-Aware Offline Imitation Learning via Stationary Distribution Correction Estimation
Abhinav Jain
Vaibhav Unhelkar
OffRL
27
4
0
17 Dec 2023
GraphRARE: Reinforcement Learning Enhanced Graph Neural Network with
  Relative Entropy
GraphRARE: Reinforcement Learning Enhanced Graph Neural Network with Relative Entropy
Tianhao Peng
Wenjun Wu
Haitao Yuan
Zhifeng Bao
Pengrui Zhao
Xin Yu
Xuetao Lin
Yu Liang
Yanjun Pu
48
10
0
15 Dec 2023
Improve Robustness of Reinforcement Learning against Observation
  Perturbations via $l_\infty$ Lipschitz Policy Networks
Improve Robustness of Reinforcement Learning against Observation Perturbations via l∞l_\inftyl∞​ Lipschitz Policy Networks
Buqing Nie
Jingtian Ji
Yangqing Fu
Yue Gao
51
4
0
14 Dec 2023
A Perspective of Q-value Estimation on Offline-to-Online Reinforcement
  Learning
A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning
Yinmin Zhang
Jie Liu
Chuming Li
Yazhe Niu
Yaodong Yang
Yu Liu
Wanli Ouyang
OffRL
OnRL
63
11
0
12 Dec 2023
A dynamical clipping approach with task feedback for Proximal Policy
  Optimization
A dynamical clipping approach with task feedback for Proximal Policy Optimization
Ziqi Zhang
Jingzehua Xu
Zifeng Zhuang
Jinxin Liu
Donglin Wang
Shuai Zhang
29
1
0
12 Dec 2023
Spreeze: High-Throughput Parallel Reinforcement Learning Framework
Spreeze: High-Throughput Parallel Reinforcement Learning Framework
Jing Hou
Guang Chen
Ruiqi Zhang
Zhijun Li
Shangding Gu
Changjun Jiang
OffRL
32
2
0
11 Dec 2023
Robotic Control of the Deformation of Soft Linear Objects Using Deep
  Reinforcement Learning
Robotic Control of the Deformation of Soft Linear Objects Using Deep Reinforcement Learning
Mélodie Hani Daniel Zakaria
Miguel Aranda
Laurent Lequievre
S. Lengagne
J. Corrales
Y. Mezouar
AI4CE
20
6
0
08 Dec 2023
Control of a pendulum system: From simulation to reality
Control of a pendulum system: From simulation to reality
Iyer Venkataraman Natarajan
11
0
0
08 Dec 2023
MIMo: A Multi-Modal Infant Model for Studying Cognitive Development
MIMo: A Multi-Modal Infant Model for Studying Cognitive Development
Dominik Mattern
Pierre Schumacher
F. M. López
Marcel C. Raabe
M. Ernst
A. Aubret
Jochen Triesch
31
4
0
07 Dec 2023
SDSRA: A Skill-Driven Skill-Recombination Algorithm for Efficient Policy
  Learning
SDSRA: A Skill-Driven Skill-Recombination Algorithm for Efficient Policy Learning
Eric Hanchen Jiang
Andrew Lizarraga
34
0
0
06 Dec 2023
Constrained Bayesian Optimization Under Partial Observations: Balanced
  Improvements and Provable Convergence
Constrained Bayesian Optimization Under Partial Observations: Balanced Improvements and Provable Convergence
Shengbo Wang
Ke Li
18
11
0
06 Dec 2023
Using Curiosity for an Even Representation of Tasks in Continual Offline
  Reinforcement Learning
Using Curiosity for an Even Representation of Tasks in Continual Offline Reinforcement Learning
Pathmanathan Pankayaraj
Natalia Díaz Rodríguez
Javier Del Ser
CLL
OffRL
43
0
0
05 Dec 2023
Contact Energy Based Hindsight Experience Prioritization
Contact Energy Based Hindsight Experience Prioritization
Erdi Sayar
Zhenshan Bing
Carlo DÉramo
Ozgur S. Oguz
Alois Knoll
29
3
0
05 Dec 2023
Domain Adaptive Imitation Learning with Visual Observation
Domain Adaptive Imitation Learning with Visual Observation
Sungho Choi
Seungyul Han
Woojun Kim
Jongseong Chae
Whiyoung Jung
Young-Jin Sung
OOD
29
6
0
01 Dec 2023
Data-efficient Deep Reinforcement Learning for Vehicle Trajectory
  Control
Data-efficient Deep Reinforcement Learning for Vehicle Trajectory Control
Bernd Frauenknecht
Tobias Ehlgen
Sebastian Trimpe
47
3
0
30 Nov 2023
Minimax Exploiter: A Data Efficient Approach for Competitive Self-Play
Minimax Exploiter: A Data Efficient Approach for Competitive Self-Play
Daniel Bairamian
Philippe Marcotte
Joshua Romoff
Gabriel Robert
Derek Nowrouzezahrai
21
0
0
28 Nov 2023
How to ensure a safe control strategy? Towards a SRL for urban transit
  autonomous operation
How to ensure a safe control strategy? Towards a SRL for urban transit autonomous operation
Zicong Zhao
23
1
0
24 Nov 2023
Resilient Control of Networked Microgrids using Vertical Federated
  Reinforcement Learning: Designs and Real-Time Test-Bed Validations
Resilient Control of Networked Microgrids using Vertical Federated Reinforcement Learning: Designs and Real-Time Test-Bed Validations
Sayak Mukherjee
Ramij-Raja Hossain
Sheik M. Mohiuddin
Yuan Liu
Wei Du
Veronica Adetola
Rohit A Jinsiwale
Qiuhua Huang
Tianzhixi Yin
Ankit Singhal
27
2
0
21 Nov 2023
Nav-Q: Quantum Deep Reinforcement Learning for Collision-Free Navigation
  of Self-Driving Cars
Nav-Q: Quantum Deep Reinforcement Learning for Collision-Free Navigation of Self-Driving Cars
Akash Sinha
A. Macaluso
Matthias Klusch
47
4
0
20 Nov 2023
Towards a Standardized Reinforcement Learning Framework for AAM
  Contingency Management
Towards a Standardized Reinforcement Learning Framework for AAM Contingency Management
Luis E. Alvarez
Marc W. Brittain
Kara Breeden
19
2
0
17 Nov 2023
Previous
123...567...323334
Next