Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1606.01540
Cited By
OpenAI Gym
5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL
ODL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"OpenAI Gym"
50 / 2,578 papers shown
Title
PPO-Clip Attains Global Optimality: Towards Deeper Understandings of Clipping
Nai-Chieh Huang
Ping-Chun Hsieh
Kuo-Hao Ho
I-Chen Wu
67
9
0
19 Dec 2023
XLand-MiniGrid: Scalable Meta-Reinforcement Learning Environments in JAX
Alexander Nikulin
Vladislav Kurenkov
Ilya Zisman
Artem Agarkov
Viacheslav Sinii
Sergey Kolesnikov
116
30
0
19 Dec 2023
TPTO: A Transformer-PPO based Task Offloading Solution for Edge Computing Environments
N. Gholipour
M. D. Assunção
Pranav Agarwal
Julien Gascon-Samson
Rajkumar Buyya
63
2
0
18 Dec 2023
Human-Machine Teaming for UAVs: An Experimentation Platform
Laila El Moujtahid
S. Gottipati
Clodéric Mars
Matthew E. Taylor
73
1
0
18 Dec 2023
Solving the swing-up and balance task for the Acrobot and Pendubot with SAC
Chi Zhang
Akhil Sathuluri
Markus Zimmermann
53
4
0
18 Dec 2023
GO-DICE: Goal-Conditioned Option-Aware Offline Imitation Learning via Stationary Distribution Correction Estimation
Abhinav Jain
Vaibhav Unhelkar
OffRL
70
7
0
17 Dec 2023
GraphRARE: Reinforcement Learning Enhanced Graph Neural Network with Relative Entropy
Tianhao Peng
Wenjun Wu
Haitao Yuan
Zhifeng Bao
Pengrui Zhao
Xin Yu
Xuetao Lin
Yu Liang
Yanjun Pu
142
10
0
15 Dec 2023
Improve Robustness of Reinforcement Learning against Observation Perturbations via
l
∞
l_\infty
l
∞
Lipschitz Policy Networks
Buqing Nie
Jingtian Ji
Yangqing Fu
Yue Gao
73
4
0
14 Dec 2023
A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning
Yinmin Zhang
Jie Liu
Chuming Li
Yazhe Niu
Yaodong Yang
Yu Liu
Wanli Ouyang
OffRL
OnRL
133
12
0
12 Dec 2023
A dynamical clipping approach with task feedback for Proximal Policy Optimization
Ziqi Zhang
Jingzehua Xu
Zifeng Zhuang
Jinxin Liu
Donglin Wang
Shuai Zhang
102
1
0
12 Dec 2023
Spreeze: High-Throughput Parallel Reinforcement Learning Framework
Jing Hou
Guang Chen
Ruiqi Zhang
Zhijun Li
Shangding Gu
Changjun Jiang
OffRL
80
2
0
11 Dec 2023
Robotic Control of the Deformation of Soft Linear Objects Using Deep Reinforcement Learning
Mélodie Hani Daniel Zakaria
Miguel Aranda
Laurent Lequievre
S. Lengagne
J. Corrales
Y. Mezouar
AI4CE
60
6
0
08 Dec 2023
Control of a pendulum system: From simulation to reality
Iyer Venkataraman Natarajan
23
0
0
08 Dec 2023
MIMo: A Multi-Modal Infant Model for Studying Cognitive Development
Dominik Mattern
Pierre Schumacher
F. M. López
Marcel C. Raabe
M. Ernst
A. Aubret
Jochen Triesch
58
4
0
07 Dec 2023
SDSRA: A Skill-Driven Skill-Recombination Algorithm for Efficient Policy Learning
Eric Hanchen Jiang
Andrew Lizarraga
64
0
0
06 Dec 2023
Constrained Bayesian Optimization Under Partial Observations: Balanced Improvements and Provable Convergence
Shengbo Wang
Ke Li
51
12
0
06 Dec 2023
Using Curiosity for an Even Representation of Tasks in Continual Offline Reinforcement Learning
Pathmanathan Pankayaraj
Natalia Díaz Rodríguez
Javier Del Ser
CLL
OffRL
151
0
0
05 Dec 2023
Contact Energy Based Hindsight Experience Prioritization
Erdi Sayar
Zhenshan Bing
Carlo DÉramo
Ozgur S. Oguz
Alois Knoll
92
3
0
05 Dec 2023
Domain Adaptive Imitation Learning with Visual Observation
Sungho Choi
Seungyul Han
Woojun Kim
Jongseong Chae
Whiyoung Jung
Young-Jin Sung
OOD
79
7
0
01 Dec 2023
Data-efficient Deep Reinforcement Learning for Vehicle Trajectory Control
Bernd Frauenknecht
Tobias Ehlgen
Sebastian Trimpe
85
4
0
30 Nov 2023
Minimax Exploiter: A Data Efficient Approach for Competitive Self-Play
Daniel Bairamian
Philippe Marcotte
Joshua Romoff
Gabriel Robert
Derek Nowrouzezahrai
79
0
0
28 Nov 2023
How to ensure a safe control strategy? Towards a SRL for urban transit autonomous operation
Zicong Zhao
43
1
0
24 Nov 2023
Resilient Control of Networked Microgrids using Vertical Federated Reinforcement Learning: Designs and Real-Time Test-Bed Validations
Sayak Mukherjee
Ramij-Raja Hossain
Sheik M. Mohiuddin
Yuan Liu
Wei Du
Veronica Adetola
Rohit A Jinsiwale
Qiuhua Huang
Tianzhixi Yin
Ankit Singhal
55
4
0
21 Nov 2023
Nav-Q: Quantum Deep Reinforcement Learning for Collision-Free Navigation of Self-Driving Cars
Akash Sinha
A. Macaluso
Matthias Klusch
82
6
0
20 Nov 2023
Towards a Standardized Reinforcement Learning Framework for AAM Contingency Management
Luis E. Alvarez
Marc W. Brittain
Kara Breeden
49
3
0
17 Nov 2023
Guaranteeing Control Requirements via Reward Shaping in Reinforcement Learning
F. D. Lellis
M. Coraggio
G. Russo
Mirco Musolesi
Mario di Bernardo
OffRL
84
4
0
16 Nov 2023
Safety Aware Autonomous Path Planning Using Model Predictive Reinforcement Learning for Inland Waterways
Astrid Vanneste
Simon Vanneste
O. Vasseur
R. Janssens
Mattias Billast
Ali Anwar
Kevin Mets
Tom De Schepper
Siegfried Mercelis
P. Hellinckx
24
4
0
16 Nov 2023
A Software-Hardware Co-Optimized Toolkit for Deep Reinforcement Learning on Heterogeneous Platforms
Yuan Meng
Michael Kinsner
Deshanand Singh
Mahesh Iyer
Viktor Prasanna
46
2
0
15 Nov 2023
A Central Motor System Inspired Pre-training Reinforcement Learning for Robotic Control
Pei Zhang
Zhaobo Hua
Jinliang Ding
68
0
0
14 Nov 2023
Data-Efficient Task Generalization via Probabilistic Model-based Meta Reinforcement Learning
Arjun Bhardwaj
Jonas Rothfuss
Bhavya Sukhija
Yarden As
Marco Hutter
Stelian Coros
Andreas Krause
90
5
0
13 Nov 2023
Controllability-Constrained Deep Network Models for Enhanced Control of Dynamical Systems
Suruchi Sharma
Volodymyr Makarenko
Gautam Kumar
Stas Tiomkin
AI4CE
30
0
0
11 Nov 2023
Learning-Augmented Scheduling for Solar-Powered Electric Vehicle Charging
Tongxin Li
46
0
0
10 Nov 2023
Bridging Dimensions: Confident Reachability for High-Dimensional Controllers
Yuang Geng
Jake Brandon Baldauf
Souradeep Dutta
Chao Huang
Ivan Ruchkin
122
5
0
08 Nov 2023
Deep Bayesian Reinforcement Learning for Spacecraft Proximity Maneuvers and Docking
Desong Du
Naiming Qi
Yanfang Liu
Wei Pan
64
0
0
07 Nov 2023
PcLast: Discovering Plannable Continuous Latent States
Anurag Koul
Shivakanth Sujit
Shaoru Chen
Ben Evans
Lili Wu
...
Yonathan Efroni
Lekan Molu
Miro Dudik
John Langford
Alex Lamb
OffRL
BDL
102
1
0
06 Nov 2023
Towards model-free RL algorithms that scale well with unstructured data
Joseph Modayil
Zaheer Abbas
OffRL
52
3
0
03 Nov 2023
Network Contention-Aware Cluster Scheduling with Reinforcement Learning
Junyeol Ryu
Jeongyoon Eo
GNN
32
0
0
31 Oct 2023
Efficient Exploration in Continuous-time Model-based Reinforcement Learning
Lenart Treven
Jonas Hübotter
Bhavya Sukhija
Florian Dorfler
Andreas Krause
79
6
0
30 Oct 2023
Online Decision Mediation
Daniel Jarrett
Alihan Huyuk
M. Schaar
97
4
0
28 Oct 2023
Bridging Distributionally Robust Learning and Offline RL: An Approach to Mitigate Distribution Shift and Partial Data Coverage
Kishan Panaganti
Zaiyan Xu
D. Kalathil
Mohammad Ghavamzadeh
OOD
OffRL
83
8
0
27 Oct 2023
Gen2Sim: Scaling up Robot Learning in Simulation with Generative Models
Pushkal Katara
Zhou Xian
Katerina Fragkiadaki
LM&Ro
123
44
0
27 Oct 2023
Dynamics Generalisation in Reinforcement Learning via Adaptive Context-Aware Policies
Michael Beukman
Devon Jarvis
Richard Klein
Steven D. James
Benjamin Rosman
105
13
0
25 Oct 2023
Absolute Policy Optimization
Weiye Zhao
Feihan Li
Yifan Sun
Rui Chen
Tianhao Wei
Changliu Liu
132
4
0
20 Oct 2023
UNav-Sim: A Visually Realistic Underwater Robotics Simulator and Synthetic Data-generation Framework
Abdelhakim Amer
Olaya Álvarez-Tunón
Halil Ibrahim Ugurlu
Jonas Le Fevre Sejersen
Yury Brodskiy
Erdal Kayacan
83
22
0
18 Oct 2023
Using Experience Classification for Training Non-Markovian Tasks
Ruixuan Miao
Xu Lu
Cong Tian
Bin Yu
Zhenhua Duan
OffRL
54
0
0
18 Oct 2023
Robust Multi-Agent Reinforcement Learning via Adversarial Regularization: Theoretical Foundation and Stable Algorithms
Alexander Bukharin
Yan Li
Yue Yu
Qingru Zhang
Zhehui Chen
Simiao Zuo
Chao Zhang
Songan Zhang
Tuo Zhao
OOD
AAML
80
19
0
16 Oct 2023
BayRnTune: Adaptive Bayesian Domain Randomization via Strategic Fine-tuning
Tianle Huang
Nitish Sontakke
K. N. Kumar
Irfan Essa
Stefanos Nikolaidis
Dennis W. Hong
Sehoon Ha
52
4
0
16 Oct 2023
Leveraging Knowledge Distillation for Efficient Deep Reinforcement Learning in Resource-Constrained Environments
Guanlin Meng
40
1
0
16 Oct 2023
Theory of Mind for Multi-Agent Collaboration via Large Language Models
Huao Li
Yu Quan Chong
Simon Stepputtis
Joseph Campbell
Dana Hughes
Michael Lewis
Katia Sycara
LLMAG
132
78
0
16 Oct 2023
AMAGO: Scalable In-Context Reinforcement Learning for Adaptive Agents
Jake Grigsby
Linxi Fan
Yuke Zhu
OffRL
LM&Ro
121
33
0
15 Oct 2023
Previous
1
2
3
...
6
7
8
...
50
51
52
Next