1606.01540
OpenAI Gym
5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL
ODL
Papers citing "OpenAI Gym"
50 / 2,578 papers shown
Title
Unveiling the Significance of Toddler-Inspired Reward Transition in Goal-Oriented Reinforcement Learning
Junseok Park
Yoonsung Kim
Hee Bin Yoo
Min Whoo Lee
Kibeom Kim
Won-Seok Choi
Minsu Lee
Byoung-Tak Zhang
OffRL
68
1
0
11 Mar 2024
LitSim: A Conflict-aware Policy for Long-term Interactive Traffic Simulation
Haojie Xin
Xiaodong Zhang
Renzhi Tang
Songyang Yan
Qianrui Zhao
Chunze Yang
Wen Cui
Zijiang Yang
110
2
0
07 Mar 2024
RACE-SM: Reinforcement Learning Based Autonomous Control for Social On-Ramp Merging
Jordan Poots
84
0
0
05 Mar 2024
Behavior Generation with Latent Actions
Seungjae Lee
Yibin Wang
Haritheja Etukuru
H. J. Kim
Mahi Shafiullah
Lerrel Pinto
VGen
OffRL
123
80
0
05 Mar 2024
Deep Reinforcement Learning for Dynamic Algorithm Selection: A Proof-of-Principle Study on Differential Evolution
Hongshu Guo
Yining Ma
Zeyuan Ma
Jiacheng Chen
Xinglin Zhang
Zhiguang Cao
Jun Zhang
Yue-Jiao Gong
100
23
0
04 Mar 2024
Craftax: A Lightning-Fast Benchmark for Open-Ended Reinforcement Learning
Michael T. Matthews
Michael Beukman
Benjamin Ellis
Mikayel Samvelyan
Matthew Jackson
Samuel Coward
Jakob Foerster
OffRL
98
31
0
26 Feb 2024
DynaMITE-RL: A Dynamic Model for Improved Temporal Meta-Reinforcement Learning
Anthony Liang
Guy Tennenholtz
Chih-Wei Hsu
Yinlam Chow
Erdem Biyik
Craig Boutilier
OffRL
82
1
0
25 Feb 2024
Leveraging Demonstrator-perceived Precision for Safe Interactive Imitation Learning of Clearance-limited Tasks
Hanbit Oh
Takamitsu Matsubara
110
3
0
21 Feb 2024
Easy as ABCs: Unifying Boltzmann Q-Learning and Counterfactual Regret Minimization
Luca D'Amico-Wong
Hugh Zhang
Marc Lanctot
David C. Parkes
OffRL
28
1
0
19 Feb 2024
Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent
Quentin Gallouedec
E. Beeching
Clément Romac
Emmanuel Dellandrea
45
11
0
15 Feb 2024
Dataset Clustering for Improved Offline Policy Learning
Qiang Wang
Yixin Deng
Francisco Roldan Sanchez
Keru Wang
Kevin McGuinness
Noel E. O'Connor
Stephen J. Redmond
OffRL
89
2
0
14 Feb 2024
Learning Interpretable Policies in Hindsight-Observable POMDPs through Partially Supervised Reinforcement Learning
Michael Lanier
Ying Xu
Nathan Jacobs
Chongjie Zhang
Yevgeniy Vorobeychik
64
2
0
14 Feb 2024
Decision Theory-Guided Deep Reinforcement Learning for Fast Learning
Zelin Wan
Jin-Hee Cho
Mu Zhu
Ahmed H. Anwar
Charles A. Kamhoua
Munindar P. Singh
AI4CE
47
0
0
08 Feb 2024
Learning Uncertainty-Aware Temporally-Extended Actions
Joongkyu Lee
Seung Joon Park
Yunhao Tang
Min-hwan Oh
55
2
0
08 Feb 2024
Exploration Without Maps via Zero-Shot Out-of-Distribution Deep Reinforcement Learning
Shathushan Sivashangaran
Apoorva Khairnar
A. Eskandarian
OffRL
71
0
0
07 Feb 2024
Voronoi Candidates for Bayesian Optimization
Nathan Wycoff
John W. Smith
Annie S. Booth
R. Gramacy
85
2
0
07 Feb 2024
OIL-AD: An Anomaly Detection Framework for Sequential Decision Sequences
Chen Wang
S. Erfani
T. Alpcan
Christopher Leckie
OffRL
59
3
0
07 Feb 2024
Learning Diverse Policies with Soft Self-Generated Guidance
Guojian Wang
Faguo Wu
Xiao Zhang
Jianxiang Liu
OffRL
63
4
0
07 Feb 2024
A Deep Reinforcement Learning Approach for Adaptive Traffic Routing in Next-gen Networks
A. Abrol
Purnima Murali Mohan
Tram Truong-Huu
34
1
0
07 Feb 2024
An Architecture for Unattended Containerized (Deep) Reinforcement Learning with Webots
Tobias Haubold
Petra Linke
OffRL
26
0
0
06 Feb 2024
No-Regret Reinforcement Learning in Smooth MDPs
Davide Maran
Alberto Maria Metelli
Matteo Papini
Marcello Restelli
62
4
0
06 Feb 2024
RL-VLM-F: Reinforcement Learning from Vision Language Foundation Model Feedback
Yufei Wang
Zhanyi Sun
Jesse Zhang
Zhou Xian
Erdem Biyik
David Held
Zackory M. Erickson
VLM
122
59
0
06 Feb 2024
Frugal Actor-Critic: Sample Efficient Off-Policy Deep Reinforcement Learning Using Unique Experiences
Nikhil Kumar Singh
Indranil Saha
OffRL
35
0
0
05 Feb 2024
Gazebo Plants: Simulating Plant-Robot Interaction with Cosserat Rods
Junchen Deng
Samhita Marri
Jonathan Klein
Wojciech Palubicki
Soren Pirk
Girish Chowdhary
D. L. Michels
60
4
0
04 Feb 2024
Towards Optimal Adversarial Robust Q-learning with Bellman Infinity-error
Haoran Li
Zicheng Zhang
Wang Luo
Congying Han
Yudong Hu
Tiande Guo
Shichen Liao
AAML
135
2
0
03 Feb 2024
Settling Decentralized Multi-Agent Coordinated Exploration by Novelty Sharing
Haobin Jiang
Ziluo Ding
Zongqing Lu
82
3
0
03 Feb 2024
The RL/LLM Taxonomy Tree: Reviewing Synergies Between Reinforcement Learning and Large Language Models
M. Pternea
Prerna Singh
Abir Chakraborty
Y. Oruganti
M. Milletarí
Sayli Bapat
Kebei Jiang
OffRL
80
10
0
02 Feb 2024
Two-Timescale Critic-Actor for Average Reward MDPs with Function Approximation
Prashansa Panda
Shalabh Bhatnagar
112
2
0
02 Feb 2024
Scalable Multi-modal Model Predictive Control via Duality-based Interaction Predictions
Hansung Kim
Siddharth H. Nair
Francesco Borrelli
200
1
0
02 Feb 2024
Control in Stochastic Environment with Delays: A Model-based Reinforcement Learning Approach
Zhiyuan Yao
Ionuţ Florescu
Chihoon Lee
OffRL
43
2
0
01 Feb 2024
A Reinforcement Learning Based Controller to Minimize Forces on the Crutches of a Lower-Limb Exoskeleton
Aydin Emre Utku
S. E. Ada
Muhammet Hatipoglu
Mustafa Derman
Emre Ugur
Evren Samur
87
0
0
31 Jan 2024
Enhancing End-to-End Multi-Task Dialogue Systems: A Study on Intrinsic Motivation Reinforcement Learning Algorithms for Improved Training and Adaptability
Navin Kamuni
Hardik Shah
Sathishkumar Chintala
Naveen Kunchakuri
Sujatha Alla
79
19
0
31 Jan 2024
A comparison of RL-based and PID controllers for 6-DOF swimming robots: hybrid underwater object tracking
F. Lotfi
K. Virji
Nicholas Dudek
Gregory Dudek
59
0
0
29 Jan 2024
SERL: A Software Suite for Sample-Efficient Robotic Reinforcement Learning
Jianlan Luo
Zheyuan Hu
Charles Xu
You Liang Tan
Jacob Berg
Archit Sharma
S. Schaal
Chelsea Finn
Abhishek Gupta
Sergey Levine
OffRL
OnRL
172
49
0
29 Jan 2024
DiffuserLite: Towards Real-time Diffusion Planning
Zibin Dong
Jianye Hao
Yifu Yuan
Fei Ni
Yitian Wang
Pengyi Li
Yan Zheng
177
20
0
27 Jan 2024
Regularized Q-Learning with Linear Function Approximation
Jiachen Xi
Alfredo Garcia
P. Momcilovic
120
2
0
26 Jan 2024
Learning fast changing slow in spiking neural networks
Cristiano Capone
P. Muratore
OffRL
52
0
0
25 Jan 2024
Integrating Human Expertise in Continuous Spaces: A Novel Interactive Bayesian Optimization Framework with Preference Expected Improvement
Nikolaus Feith
Elmar Rueckert
121
1
0
23 Jan 2024
VRMN-bD: A Multi-modal Natural Behavior Dataset of Immersive Human Fear Responses in VR Stand-up Interactive Games
He Zhang
Xinyang Li
Yuanxi Sun
Xinyi Fu
Christine Qiu
John M. Carroll
60
4
0
22 Jan 2024
Information-Theoretic State Variable Selection for Reinforcement Learning
Charles Westphal
Stephen Hailes
Mirco Musolesi
76
3
0
21 Jan 2024
Synergistic Reinforcement and Imitation Learning for Vision-driven Autonomous Flight of UAV Along River
Zihan Wang
Jianwen Li
N. Mahmoudian
60
0
0
17 Jan 2024
IoTWarden: A Deep Reinforcement Learning Based Real-time Defense System to Mitigate Trigger-action IoT Attacks
Md Morshed Alam
Israt Jahan
AAML
113
2
0
16 Jan 2024
Learned Best-Effort LLM Serving
Siddharth Jha
Coleman Hooper
Xiaoxuan Liu
Sehoon Kim
Kurt Keutzer
43
2
0
15 Jan 2024
Towards Safe Load Balancing based on Control Barrier Functions and Deep Reinforcement Learning
L. Dinh
Pham Tran Anh Quang
Jérémie Leguay
36
2
0
10 Jan 2024
A Minimaximalist Approach to Reinforcement Learning from Human Feedback
Gokul Swamy
Christoph Dann
Rahul Kidambi
Zhiwei Steven Wu
Alekh Agarwal
OffRL
125
112
0
08 Jan 2024
Policy Optimization with Smooth Guidance Learned from State-Only Demonstrations
Guojian Wang
Faguo Wu
Xiao Zhang
Tianyuan Chen
Zhiming Zheng
95
0
0
30 Dec 2023
Design Space Exploration of Approximate Computing Techniques with a Reinforcement Learning Approach
Sepide Saeedi
A. Savino
S. Di Carlo
32
2
0
29 Dec 2023
Parameterized Projected Bellman Operator
Théo Vincent
Alberto Maria Metelli
Boris Belousov
Jan Peters
Marcello Restelli
Carlo D'Eramo
67
4
0
20 Dec 2023
Model-Based Control with Sparse Neural Dynamics
Ziang Liu
Genggeng Zhou
Jeff He
Tobia Marcucci
Fei-Fei Li
Jiajun Wu
Yunzhu Li
AI4CE
92
18
0
20 Dec 2023
Value Explicit Pretraining for Learning Transferable Representations
Kiran Lekkala
Henghui Bao
Sumedh Anand Sontakke
Laurent Itti
SSL
76
0
0
19 Dec 2023