Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1606.01540
Cited By
OpenAI Gym
5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL
ODL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"OpenAI Gym"
50 / 1,657 papers shown
Title
Bring Your Own (Non-Robust) Algorithm to Solve Robust MDPs by Estimating The Worst Kernel
Kaixin Wang
Uri Gadot
Navdeep Kumar
Kfir Y. Levy
Shie Mannor
44
3
0
09 Jun 2023
Decoupled Prioritized Resampling for Offline RL
Yang Yue
Bingyi Kang
Xiao Ma
Qisen Yang
Gao Huang
S. Song
Shuicheng Yan
OffRL
29
0
0
08 Jun 2023
Active Inference in Hebbian Learning Networks
A. Safa
Tim Verbelen
Lars Keuninckx
I. Ocket
A. Bourdoux
F. Catthoor
Georges G. E. Gielen
Gert Cauwenberghs
38
2
0
08 Jun 2023
Boosting Offline Reinforcement Learning with Action Preference Query
Qisen Yang
Shenzhi Wang
Matthieu Lin
S. Song
Gao Huang
OffRL
24
10
0
06 Jun 2023
Learning Embeddings for Sequential Tasks Using Population of Agents
Mridul Mahajan
Georgios Tzannetos
Goran Radanović
Adish Singla
FedML
28
0
0
05 Jun 2023
Risk-Aware Reward Shaping of Reinforcement Learning Agents for Autonomous Driving
Linjin Wu
Zengjie Zhang
S. Haesaert
Zhiqiang Ma
Zhiyong Sun
OffRL
13
6
0
05 Jun 2023
Seizing Serendipity: Exploiting the Value of Past Success in Off-Policy Actor-Critic
Tianying Ji
Yuping Luo
Gang Hua
Xianyuan Zhan
Jianwei Zhang
Huazhe Xu
OffRL
OnRL
50
15
0
05 Jun 2023
For SALE: State-Action Representation Learning for Deep Reinforcement Learning
Scott Fujimoto
Wei-Di Chang
Edward James Smith
S. Gu
Doina Precup
David Meger
OffRL
30
46
0
04 Jun 2023
Reinforcement Learning with General Utilities: Simpler Variance Reduction and Large State-Action Space
Anas Barakat
Ilyas Fatkhullin
Niao He
36
11
0
02 Jun 2023
ReLU to the Rescue: Improve Your On-Policy Actor-Critic with Positive Advantages
Andrew Jesson
Chris Xiaoxuan Lu
Gunshi Gupta
Angelos Filos
Jakob N. Foerster
Y. Gal
OffRL
31
5
0
02 Jun 2023
Extracting Reward Functions from Diffusion Models
Felipe Nuti
Tim Franzmeyer
João F. Henriques
27
14
0
01 Jun 2023
Train Offline, Test Online: A Real Robot Learning Benchmark
G. Zhou
Victoria Dean
Mohan Kumar Srirama
Aravind Rajeswaran
Jyothish Pari
...
Tianhe Yu
Pieter Abbeel
Lerrel Pinto
Chelsea Finn
Abhi Gupta
OffRL
62
39
0
01 Jun 2023
Safe Offline Reinforcement Learning with Real-Time Budget Constraints
Qian Lin
Bo Tang
Zifan Wu
Chao Yu
Shangqin Mao
Qianlong Xie
Xingxing Wang
Dong Wang
OffRL
41
11
0
01 Jun 2023
NetHack is Hard to Hack
Ulyana Piterbarg
Lerrel Pinto
Rob Fergus
35
7
0
30 May 2023
IDToolkit: A Toolkit for Benchmarking and Developing Inverse Design Algorithms in Nanophotonics
Jia-Qi Yang
Yucheng Xu
Jianwei Shen
Ke-Bin Fan
De-Chuan Zhan
Yang Yang
39
1
0
30 May 2023
Off-Policy RL Algorithms Can be Sample-Efficient for Continuous Control via Sample Multiple Reuse
Jiafei Lyu
Le Wan
Zongqing Lu
Xiu Li
OffRL
38
9
0
29 May 2023
Self-Supervised Reinforcement Learning that Transfers using Random Features
Boyuan Chen
Chuning Zhu
Pulkit Agrawal
Kai Zhang
Abhishek Gupta
OffRL
SSL
41
6
0
26 May 2023
NASimEmu: Network Attack Simulator & Emulator for Training Agents Generalizing to Novel Scenarios
Jaromír Janisch
Tomávs Pevný
Viliam Lisý
26
14
0
26 May 2023
Counterfactual Explainer Framework for Deep Reinforcement Learning Models Using Policy Distillation
Amir Samadi
K. Koufos
Kurt Debattista
M. Dianati
OffRL
39
3
0
25 May 2023
Aerial Gym -- Isaac Gym Simulator for Aerial Robots
Mihir Kulkarni
Theodor J. L. Forgaard
Kostas Alexis
21
14
0
25 May 2023
Lucy-SKG: Learning to Play Rocket League Efficiently Using Deep Reinforcement Learning
V. Moschopoulos
Pantelis Kyriakidis
A. Lazaridis
I. Vlahavas
23
0
0
25 May 2023
Decision-Aware Actor-Critic with Function Approximation and Theoretical Guarantees
Sharan Vaswani
A. Kazemi
Reza Babanezhad
Nicolas Le Roux
OffRL
39
3
0
24 May 2023
Neural Lyapunov and Optimal Control
Daniel Layeghi
Steve Tonneau
M. Mistry
21
0
0
24 May 2023
Adaptive Policy Learning to Additional Tasks
Wenjian Hao
Zehui Lu
Zihao Liang
Tianyu Zhou
Shaoshuai Mou
37
0
0
24 May 2023
ByteSized32: A Corpus and Challenge Task for Generating Task-Specific World Models Expressed as Text Games
Ruoyao Wang
Graham Todd
Xingdi Yuan
Ziang Xiao
Marc-Alexandre Côté
Peter Alexander Jansen
LRM
29
13
0
24 May 2023
Inverse Reinforcement Learning with the Average Reward Criterion
Feiyang Wu
Jingyang Ke
Anqi Wu
37
9
0
24 May 2023
Constrained Reinforcement Learning for Dynamic Material Handling
Chengpeng Hu
Ziming Wang
Jialin Liu
J. Wen
Bifei Mao
Xinghu Yao
24
0
0
23 May 2023
XRoute Environment: A Novel Reinforcement Learning Environment for Routing
Zhanwen Zhou
H. Zhuo
Xiaowu Zhang
Qiyuan Deng
25
0
0
23 May 2023
Strategy Extraction in Single-Agent Games
Archana Vadakattu
Michelle L. Blom
A. Pearce
26
1
0
22 May 2023
Client Selection for Federated Policy Optimization with Environment Heterogeneity
Zhijie Xie
S. H. Song
35
3
0
18 May 2023
Model-Free Robust Average-Reward Reinforcement Learning
Yue Wang
Alvaro Velasquez
George Atia
Ashley Prater-Bennette
Shaofeng Zou
34
10
0
17 May 2023
Demonstration-free Autonomous Reinforcement Learning via Implicit and Bidirectional Curriculum
Jigang Kim
Daesol Cho
H. J. Kim
27
3
0
17 May 2023
RAMP: A Benchmark for Evaluating Robotic Assembly Manipulation and Planning
J. Collins
Mark Robson
Jun Yamada
Mohan Sridharan
Karol Janik
Ingmar Posner
45
14
0
16 May 2023
Trojan Playground: A Reinforcement Learning Framework for Hardware Trojan Insertion and Detection
Amin Sarihi
Ahmad Patooghy
Peter Jamieson
Abdel-Hameed A. Badawy
32
8
0
16 May 2023
An Offline Time-aware Apprenticeship Learning Framework for Evolving Reward Functions
Xi Yang
Ge Gao
Min Chi
OffRL
32
2
0
15 May 2023
Multi-Agent Reinforcement Learning for Network Routing in Integrated Access Backhaul Networks
Shahaf Yamin
Haim Permuter
27
3
0
12 May 2023
On Practical Robust Reinforcement Learning: Practical Uncertainty Set and Double-Agent Algorithm
Ukjo Hwang
Songnam Hong
28
0
0
11 May 2023
HoneyIoT: Adaptive High-Interaction Honeypot for IoT Devices Through Reinforcement Learning
Chong Guan
Heting Liu
Guohong Cao
Sencun Zhu
T. L. La Porta
17
5
0
10 May 2023
Reducing the Cost of Cycle-Time Tuning for Real-World Policy Optimization
Homayoon Farrahi
Rupam Mahmood
34
5
0
09 May 2023
Provable Preimage Under-Approximation for Neural Networks (Full Version)
Xiyue Zhang
Benjie Wang
Marta Z. Kwiatkowska
AAML
41
7
0
05 May 2023
Maximum Causal Entropy Inverse Constrained Reinforcement Learning
Mattijs Baert
Pietro Mazzaglia
Sam Leroux
Pieter Simoens
CML
48
10
0
04 May 2023
Explainable Reinforcement Learning via a Causal World Model
Zhongwei Yu
Jingqing Ruan
Dengpeng Xing
CML
40
15
0
04 May 2023
Sample Efficient Model-free Reinforcement Learning from LTL Specifications with Optimality Guarantees
Daqian Shao
Marta Kwiatkowska
OffRL
31
7
0
02 May 2023
X-RLflow: Graph Reinforcement Learning for Neural Network Subgraphs Transformation
Guoliang He
Sean Parker
Eiko Yoneki
32
2
0
28 Apr 2023
BCQQ: Batch-Constraint Quantum Q-Learning with Cyclic Data Re-uploading
Maniraman Periyasamy
Marc Hölle
Marco Wiedmann
Daniel D. Scherer
Axel Plinge
Christopher Mutschler
OffRL
54
6
0
27 Apr 2023
Learning Environment for the Air Domain (LEAD)
Andreas Strand
Patrick Ribu Gorton
M. Asprusten
K. Brathen
31
1
0
27 Apr 2023
A Control-Centric Benchmark for Video Prediction
Stephen Tian
Chelsea Finn
Jiajun Wu
47
10
0
26 Apr 2023
CROP: Towards Distributional-Shift Robust Reinforcement Learning using Compact Reshaped Observation Processing
Philipp Altmann
Fabian Ritz
Leonard Feuchtinger
Jonas Nusslein
Claudia Linnhoff-Popien
Thomy Phan
OOD
OffRL
29
5
0
26 Apr 2023
Games for Artificial Intelligence Research: A Review and Perspectives
Chengpeng Hu
Yunlong Zhao
Ziqi Wang
Haocheng Du
Jialin Liu
AI4CE
37
13
0
26 Apr 2023
Dynamic Datasets and Market Environments for Financial Reinforcement Learning
Xiao-Yang Liu
Ziyi Xia
Hongyang Yang
Jiechao Gao
Daochen Zha
Ming Zhu
Chris Wang
Zhaoran Wang
Jian Guo
OffRL
32
27
0
25 Apr 2023
Previous
1
2
3
...
9
10
11
...
32
33
34
Next