ResearchTrend.AI
OpenAI Gym

5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
    OffRL
    ODL

Papers citing "OpenAI Gym"

50 / 1,657 papers shown
Bring Your Own (Non-Robust) Algorithm to Solve Robust MDPs by Estimating The Worst Kernel
Kaixin Wang
Uri Gadot
Navdeep Kumar
Kfir Y. Levy
Shie Mannor
44
3
0
09 Jun 2023
Decoupled Prioritized Resampling for Offline RL
Yang Yue
Bingyi Kang
Xiao Ma
Qisen Yang
Gao Huang
S. Song
Shuicheng Yan
OffRL
29
0
0
08 Jun 2023
Active Inference in Hebbian Learning Networks
A. Safa
Tim Verbelen
Lars Keuninckx
I. Ocket
A. Bourdoux
F. Catthoor
Georges G. E. Gielen
Gert Cauwenberghs
38
2
0
08 Jun 2023
Boosting Offline Reinforcement Learning with Action Preference Query
Qisen Yang
Shenzhi Wang
Matthieu Lin
S. Song
Gao Huang
OffRL
24
10
0
06 Jun 2023
Learning Embeddings for Sequential Tasks Using Population of Agents
Mridul Mahajan
Georgios Tzannetos
Goran Radanović
Adish Singla
FedML
28
0
0
05 Jun 2023
Risk-Aware Reward Shaping of Reinforcement Learning Agents for Autonomous Driving
Linjin Wu
Zengjie Zhang
S. Haesaert
Zhiqiang Ma
Zhiyong Sun
OffRL
13
6
0
05 Jun 2023
Seizing Serendipity: Exploiting the Value of Past Success in Off-Policy Actor-Critic
Tianying Ji
Yuping Luo
Gang Hua
Xianyuan Zhan
Jianwei Zhang
Huazhe Xu
OffRL
OnRL
50
15
0
05 Jun 2023
For SALE: State-Action Representation Learning for Deep Reinforcement Learning
Scott Fujimoto
Wei-Di Chang
Edward James Smith
S. Gu
Doina Precup
David Meger
OffRL
30
46
0
04 Jun 2023
Reinforcement Learning with General Utilities: Simpler Variance Reduction and Large State-Action Space
Anas Barakat
Ilyas Fatkhullin
Niao He
36
11
0
02 Jun 2023
ReLU to the Rescue: Improve Your On-Policy Actor-Critic with Positive Advantages
Andrew Jesson
Chris Xiaoxuan Lu
Gunshi Gupta
Angelos Filos
Jakob N. Foerster
Y. Gal
OffRL
31
5
0
02 Jun 2023
Extracting Reward Functions from Diffusion Models
Felipe Nuti
Tim Franzmeyer
João F. Henriques
27
14
0
01 Jun 2023
Train Offline, Test Online: A Real Robot Learning Benchmark
G. Zhou
Victoria Dean
Mohan Kumar Srirama
Aravind Rajeswaran
Jyothish Pari
...
Tianhe Yu
Pieter Abbeel
Lerrel Pinto
Chelsea Finn
Abhi Gupta
OffRL
62
39
0
01 Jun 2023
Safe Offline Reinforcement Learning with Real-Time Budget Constraints
Qian Lin
Bo Tang
Zifan Wu
Chao Yu
Shangqin Mao
Qianlong Xie
Xingxing Wang
Dong Wang
OffRL
41
11
0
01 Jun 2023
NetHack is Hard to Hack
Ulyana Piterbarg
Lerrel Pinto
Rob Fergus
35
7
0
30 May 2023
IDToolkit: A Toolkit for Benchmarking and Developing Inverse Design Algorithms in Nanophotonics
Jia-Qi Yang
Yucheng Xu
Jianwei Shen
Ke-Bin Fan
De-Chuan Zhan
Yang Yang
39
1
0
30 May 2023
Off-Policy RL Algorithms Can be Sample-Efficient for Continuous Control via Sample Multiple Reuse
Jiafei Lyu
Le Wan
Zongqing Lu
Xiu Li
OffRL
38
9
0
29 May 2023
Self-Supervised Reinforcement Learning that Transfers using Random Features
Boyuan Chen
Chuning Zhu
Pulkit Agrawal
Kai Zhang
Abhishek Gupta
OffRL
SSL
41
6
0
26 May 2023
NASimEmu: Network Attack Simulator & Emulator for Training Agents Generalizing to Novel Scenarios
Jaromír Janisch
Tomáš Pevný
Viliam Lisý
26
14
0
26 May 2023
Counterfactual Explainer Framework for Deep Reinforcement Learning Models Using Policy Distillation
Amir Samadi
K. Koufos
Kurt Debattista
M. Dianati
OffRL
39
3
0
25 May 2023
Aerial Gym -- Isaac Gym Simulator for Aerial Robots
Mihir Kulkarni
Theodor J. L. Forgaard
Kostas Alexis
21
14
0
25 May 2023
Lucy-SKG: Learning to Play Rocket League Efficiently Using Deep Reinforcement Learning
V. Moschopoulos
Pantelis Kyriakidis
A. Lazaridis
I. Vlahavas
23
0
0
25 May 2023
Decision-Aware Actor-Critic with Function Approximation and Theoretical Guarantees
Sharan Vaswani
A. Kazemi
Reza Babanezhad
Nicolas Le Roux
OffRL
39
3
0
24 May 2023
Neural Lyapunov and Optimal Control
Daniel Layeghi
Steve Tonneau
M. Mistry
21
0
0
24 May 2023
Adaptive Policy Learning to Additional Tasks
Wenjian Hao
Zehui Lu
Zihao Liang
Tianyu Zhou
Shaoshuai Mou
37
0
0
24 May 2023
ByteSized32: A Corpus and Challenge Task for Generating Task-Specific World Models Expressed as Text Games
Ruoyao Wang
Graham Todd
Xingdi Yuan
Ziang Xiao
Marc-Alexandre Côté
Peter Alexander Jansen
LRM
29
13
0
24 May 2023
Inverse Reinforcement Learning with the Average Reward Criterion
Feiyang Wu
Jingyang Ke
Anqi Wu
37
9
0
24 May 2023
Constrained Reinforcement Learning for Dynamic Material Handling
Chengpeng Hu
Ziming Wang
Jialin Liu
J. Wen
Bifei Mao
Xinghu Yao
24
0
0
23 May 2023
XRoute Environment: A Novel Reinforcement Learning Environment for Routing
Zhanwen Zhou
H. Zhuo
Xiaowu Zhang
Qiyuan Deng
25
0
0
23 May 2023
Strategy Extraction in Single-Agent Games
Archana Vadakattu
Michelle L. Blom
A. Pearce
26
1
0
22 May 2023
Client Selection for Federated Policy Optimization with Environment Heterogeneity
Zhijie Xie
S. H. Song
35
3
0
18 May 2023
Model-Free Robust Average-Reward Reinforcement Learning
Yue Wang
Alvaro Velasquez
George Atia
Ashley Prater-Bennette
Shaofeng Zou
34
10
0
17 May 2023
Demonstration-free Autonomous Reinforcement Learning via Implicit and Bidirectional Curriculum
Jigang Kim
Daesol Cho
H. J. Kim
27
3
0
17 May 2023
RAMP: A Benchmark for Evaluating Robotic Assembly Manipulation and Planning
J. Collins
Mark Robson
Jun Yamada
Mohan Sridharan
Karol Janik
Ingmar Posner
45
14
0
16 May 2023
Trojan Playground: A Reinforcement Learning Framework for Hardware Trojan Insertion and Detection
Amin Sarihi
Ahmad Patooghy
Peter Jamieson
Abdel-Hameed A. Badawy
32
8
0
16 May 2023
An Offline Time-aware Apprenticeship Learning Framework for Evolving Reward Functions
Xi Yang
Ge Gao
Min Chi
OffRL
32
2
0
15 May 2023
Multi-Agent Reinforcement Learning for Network Routing in Integrated Access Backhaul Networks
Shahaf Yamin
Haim Permuter
27
3
0
12 May 2023
On Practical Robust Reinforcement Learning: Practical Uncertainty Set and Double-Agent Algorithm
Ukjo Hwang
Songnam Hong
28
0
0
11 May 2023
HoneyIoT: Adaptive High-Interaction Honeypot for IoT Devices Through Reinforcement Learning
Chong Guan
Heting Liu
Guohong Cao
Sencun Zhu
T. L. La Porta
17
5
0
10 May 2023
Reducing the Cost of Cycle-Time Tuning for Real-World Policy Optimization
Homayoon Farrahi
Rupam Mahmood
34
5
0
09 May 2023
Provable Preimage Under-Approximation for Neural Networks (Full Version)
Xiyue Zhang
Benjie Wang
Marta Z. Kwiatkowska
AAML
41
7
0
05 May 2023
Maximum Causal Entropy Inverse Constrained Reinforcement Learning
Mattijs Baert
Pietro Mazzaglia
Sam Leroux
Pieter Simoens
CML
48
10
0
04 May 2023
Explainable Reinforcement Learning via a Causal World Model
Zhongwei Yu
Jingqing Ruan
Dengpeng Xing
CML
40
15
0
04 May 2023
Sample Efficient Model-free Reinforcement Learning from LTL Specifications with Optimality Guarantees
Daqian Shao
Marta Kwiatkowska
OffRL
31
7
0
02 May 2023
X-RLflow: Graph Reinforcement Learning for Neural Network Subgraphs Transformation
Guoliang He
Sean Parker
Eiko Yoneki
32
2
0
28 Apr 2023
BCQQ: Batch-Constraint Quantum Q-Learning with Cyclic Data Re-uploading
Maniraman Periyasamy
Marc Hölle
Marco Wiedmann
Daniel D. Scherer
Axel Plinge
Christopher Mutschler
OffRL
54
6
0
27 Apr 2023
Learning Environment for the Air Domain (LEAD)
Andreas Strand
Patrick Ribu Gorton
M. Asprusten
K. Brathen
31
1
0
27 Apr 2023
A Control-Centric Benchmark for Video Prediction
Stephen Tian
Chelsea Finn
Jiajun Wu
47
10
0
26 Apr 2023
CROP: Towards Distributional-Shift Robust Reinforcement Learning using Compact Reshaped Observation Processing
Philipp Altmann
Fabian Ritz
Leonard Feuchtinger
Jonas Nusslein
Claudia Linnhoff-Popien
Thomy Phan
OOD
OffRL
29
5
0
26 Apr 2023
Games for Artificial Intelligence Research: A Review and Perspectives
Chengpeng Hu
Yunlong Zhao
Ziqi Wang
Haocheng Du
Jialin Liu
AI4CE
37
13
0
26 Apr 2023
Dynamic Datasets and Market Environments for Financial Reinforcement Learning
Xiao-Yang Liu
Ziyi Xia
Hongyang Yang
Jiechao Gao
Daochen Zha
Ming Zhu
Chris Wang
Zhaoran Wang
Jian Guo
OffRL
32
27
0
25 Apr 2023