OpenAI Gym

5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL, ODL

Papers citing "OpenAI Gym"

50 / 2,578 papers shown
Title
Unveiling the Significance of Toddler-Inspired Reward Transition in Goal-Oriented Reinforcement Learning
Junseok Park
Yoonsung Kim
Hee Bin Yoo
Min Whoo Lee
Kibeom Kim
Won-Seok Choi
Minsu Lee
Byoung-Tak Zhang
OffRL
68
1
0
11 Mar 2024
LitSim: A Conflict-aware Policy for Long-term Interactive Traffic Simulation
Haojie Xin
Xiaodong Zhang
Renzhi Tang
Songyang Yan
Qianrui Zhao
Chunze Yang
Wen Cui
Zijiang Yang
110
2
0
07 Mar 2024
RACE-SM: Reinforcement Learning Based Autonomous Control for Social On-Ramp Merging
Jordan Poots
84
0
0
05 Mar 2024
Behavior Generation with Latent Actions
Seungjae Lee
Yibin Wang
Haritheja Etukuru
H. J. Kim
Mahi Shafiullah
Lerrel Pinto
VGen, OffRL
123
80
0
05 Mar 2024
Deep Reinforcement Learning for Dynamic Algorithm Selection: A Proof-of-Principle Study on Differential Evolution
Hongshu Guo
Yining Ma
Zeyuan Ma
Jiacheng Chen
Xinglin Zhang
Zhiguang Cao
Jun Zhang
Yue-Jiao Gong
100
23
0
04 Mar 2024
Craftax: A Lightning-Fast Benchmark for Open-Ended Reinforcement Learning
Michael T. Matthews
Michael Beukman
Benjamin Ellis
Mikayel Samvelyan
Matthew Jackson
Samuel Coward
Jakob Foerster
OffRL
98
31
0
26 Feb 2024
DynaMITE-RL: A Dynamic Model for Improved Temporal Meta-Reinforcement Learning
Anthony Liang
Guy Tennenholtz
Chih-Wei Hsu
Yinlam Chow
Erdem Biyik
Craig Boutilier
OffRL
82
1
0
25 Feb 2024
Leveraging Demonstrator-perceived Precision for Safe Interactive Imitation Learning of Clearance-limited Tasks
Hanbit Oh
Takamitsu Matsubara
110
3
0
21 Feb 2024
Easy as ABCs: Unifying Boltzmann Q-Learning and Counterfactual Regret Minimization
Luca D'Amico-Wong
Hugh Zhang
Marc Lanctot
David C. Parkes
OffRL
28
1
0
19 Feb 2024
Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent
Quentin Gallouedec
E. Beeching
Clément Romac
Emmanuel Dellandrea
45
11
0
15 Feb 2024
Dataset Clustering for Improved Offline Policy Learning
Qiang Wang
Yixin Deng
Francisco Roldan Sanchez
Keru Wang
Kevin McGuinness
Noel E. O'Connor
Stephen J. Redmond
OffRL
89
2
0
14 Feb 2024
Learning Interpretable Policies in Hindsight-Observable POMDPs through Partially Supervised Reinforcement Learning
Michael Lanier
Ying Xu
Nathan Jacobs
Chongjie Zhang
Yevgeniy Vorobeychik
64
2
0
14 Feb 2024
Decision Theory-Guided Deep Reinforcement Learning for Fast Learning
Zelin Wan
Jin-Hee Cho
Mu Zhu
Ahmed H. Anwar
Charles A. Kamhoua
Munindar P. Singh
AI4CE
47
0
0
08 Feb 2024
Learning Uncertainty-Aware Temporally-Extended Actions
Joongkyu Lee
Seung Joon Park
Yunhao Tang
Min-hwan Oh
55
2
0
08 Feb 2024
Exploration Without Maps via Zero-Shot Out-of-Distribution Deep Reinforcement Learning
Shathushan Sivashangaran
Apoorva Khairnar
A. Eskandarian
OffRL
71
0
0
07 Feb 2024
Voronoi Candidates for Bayesian Optimization
Nathan Wycoff
John W. Smith
Annie S. Booth
R. Gramacy
85
2
0
07 Feb 2024
OIL-AD: An Anomaly Detection Framework for Sequential Decision Sequences
Chen Wang
S. Erfani
T. Alpcan
Christopher Leckie
OffRL
59
3
0
07 Feb 2024
Learning Diverse Policies with Soft Self-Generated Guidance
Guojian Wang
Faguo Wu
Xiao Zhang
Jianxiang Liu
OffRL
63
4
0
07 Feb 2024
A Deep Reinforcement Learning Approach for Adaptive Traffic Routing in Next-gen Networks
A. Abrol
Purnima Murali Mohan
Tram Truong-Huu
34
1
0
07 Feb 2024
An Architecture for Unattended Containerized (Deep) Reinforcement Learning with Webots
Tobias Haubold
Petra Linke
OffRL
26
0
0
06 Feb 2024
No-Regret Reinforcement Learning in Smooth MDPs
Davide Maran
Alberto Maria Metelli
Matteo Papini
Marcello Restelli
62
4
0
06 Feb 2024
RL-VLM-F: Reinforcement Learning from Vision Language Foundation Model Feedback
Yufei Wang
Zhanyi Sun
Jesse Zhang
Zhou Xian
Erdem Biyik
David Held
Zackory M. Erickson
VLM
122
59
0
06 Feb 2024
Frugal Actor-Critic: Sample Efficient Off-Policy Deep Reinforcement Learning Using Unique Experiences
Nikhil Kumar Singh
Indranil Saha
OffRL
35
0
0
05 Feb 2024
Gazebo Plants: Simulating Plant-Robot Interaction with Cosserat Rods
Junchen Deng
Samhita Marri
Jonathan Klein
Wojciech Palubicki
Soren Pirk
Girish Chowdhary
D. L. Michels
60
4
0
04 Feb 2024
Towards Optimal Adversarial Robust Q-learning with Bellman Infinity-error
Haoran Li
Zicheng Zhang
Wang Luo
Congying Han
Yudong Hu
Tiande Guo
Shichen Liao
AAML
135
2
0
03 Feb 2024
Settling Decentralized Multi-Agent Coordinated Exploration by Novelty Sharing
Haobin Jiang
Ziluo Ding
Zongqing Lu
82
3
0
03 Feb 2024
The RL/LLM Taxonomy Tree: Reviewing Synergies Between Reinforcement Learning and Large Language Models
M. Pternea
Prerna Singh
Abir Chakraborty
Y. Oruganti
M. Milletarí
Sayli Bapat
Kebei Jiang
OffRL
80
10
0
02 Feb 2024
Two-Timescale Critic-Actor for Average Reward MDPs with Function Approximation
Prashansa Panda
Shalabh Bhatnagar
112
2
0
02 Feb 2024
Scalable Multi-modal Model Predictive Control via Duality-based Interaction Predictions
Hansung Kim
Siddharth H. Nair
Francesco Borrelli
200
1
0
02 Feb 2024
Control in Stochastic Environment with Delays: A Model-based Reinforcement Learning Approach
Zhiyuan Yao
Ionuţ Florescu
Chihoon Lee
OffRL
43
2
0
01 Feb 2024
A Reinforcement Learning Based Controller to Minimize Forces on the Crutches of a Lower-Limb Exoskeleton
Aydin Emre Utku
S. E. Ada
Muhammet Hatipoglu
Mustafa Derman
Emre Ugur
Evren Samur
87
0
0
31 Jan 2024
Enhancing End-to-End Multi-Task Dialogue Systems: A Study on Intrinsic Motivation Reinforcement Learning Algorithms for Improved Training and Adaptability
Navin Kamuni
Hardik Shah
Sathishkumar Chintala
Naveen Kunchakuri
Sujatha Alla
79
19
0
31 Jan 2024
A comparison of RL-based and PID controllers for 6-DOF swimming robots: hybrid underwater object tracking
F. Lotfi
K. Virji
Nicholas Dudek
Gregory Dudek
59
0
0
29 Jan 2024
SERL: A Software Suite for Sample-Efficient Robotic Reinforcement Learning
Jianlan Luo
Zheyuan Hu
Charles Xu
You Liang Tan
Jacob Berg
Archit Sharma
S. Schaal
Chelsea Finn
Abhishek Gupta
Sergey Levine
OffRL, OnRL
172
49
0
29 Jan 2024
DiffuserLite: Towards Real-time Diffusion Planning
Zibin Dong
Jianye Hao
Yifu Yuan
Fei Ni
Yitian Wang
Pengyi Li
Yan Zheng
177
20
0
27 Jan 2024
Regularized Q-Learning with Linear Function Approximation
Jiachen Xi
Alfredo Garcia
P. Momcilovic
120
2
0
26 Jan 2024
Learning fast changing slow in spiking neural networks
Cristiano Capone
P. Muratore
OffRL
52
0
0
25 Jan 2024
Integrating Human Expertise in Continuous Spaces: A Novel Interactive Bayesian Optimization Framework with Preference Expected Improvement
Nikolaus Feith
Elmar Rueckert
121
1
0
23 Jan 2024
VRMN-bD: A Multi-modal Natural Behavior Dataset of Immersive Human Fear Responses in VR Stand-up Interactive Games
He Zhang
Xinyang Li
Yuanxi Sun
Xinyi Fu
Christine Qiu
John M. Carroll
60
4
0
22 Jan 2024
Information-Theoretic State Variable Selection for Reinforcement Learning
Charles Westphal
Stephen Hailes
Mirco Musolesi
76
3
0
21 Jan 2024
Synergistic Reinforcement and Imitation Learning for Vision-driven Autonomous Flight of UAV Along River
Zihan Wang
Jianwen Li
N. Mahmoudian
60
0
0
17 Jan 2024
IoTWarden: A Deep Reinforcement Learning Based Real-time Defense System to Mitigate Trigger-action IoT Attacks
Md Morshed Alam
Israt Jahan
Charlotte
AAML
113
2
0
16 Jan 2024
Learned Best-Effort LLM Serving
Siddharth Jha
Coleman Hooper
Xiaoxuan Liu
Sehoon Kim
Kurt Keutzer
43
2
0
15 Jan 2024
Towards Safe Load Balancing based on Control Barrier Functions and Deep Reinforcement Learning
L. Dinh
Pham Tran Anh Quang
Jérémie Leguay
36
2
0
10 Jan 2024
A Minimaximalist Approach to Reinforcement Learning from Human Feedback
Gokul Swamy
Christoph Dann
Rahul Kidambi
Zhiwei Steven Wu
Alekh Agarwal
OffRL
125
112
0
08 Jan 2024
Policy Optimization with Smooth Guidance Learned from State-Only Demonstrations
Guojian Wang
Faguo Wu
Xiao Zhang
Tianyuan Chen
Zhiming Zheng
95
0
0
30 Dec 2023
Design Space Exploration of Approximate Computing Techniques with a Reinforcement Learning Approach
Sepide Saeedi
A. Savino
S. Di Carlo
32
2
0
29 Dec 2023
Parameterized Projected Bellman Operator
Théo Vincent
Alberto Maria Metelli
Boris Belousov
Jan Peters
Marcello Restelli
Carlo D'Eramo
67
4
0
20 Dec 2023
Model-Based Control with Sparse Neural Dynamics
Ziang Liu
Genggeng Zhou
Jeff He
Tobia Marcucci
Fei-Fei Li
Jiajun Wu
Yunzhu Li
AI4CE
92
18
0
20 Dec 2023
Value Explicit Pretraining for Learning Transferable Representations
Kiran Lekkala
Henghui Bao
Sumedh Anand Sontakke
Laurent Itti
SSL
76
0
0
19 Dec 2023