ResearchTrend.AI
OpenAI Gym

5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL, ODL

Papers citing "OpenAI Gym"

50 / 2,578 papers shown
Title
Robust Reinforcement Learning as a Stackelberg Game via Adaptively-Regularized Adversarial Training
Peide Huang
Mengdi Xu
Fei Fang
Ding Zhao
155
38
0
19 Feb 2022
Transformation Coding: Simple Objectives for Equivariant Representations
Mehran Shakerinava
A. Mondal
Siamak Ravanbakhsh
OffRL
72
0
0
19 Feb 2022
Shaping Advice in Deep Reinforcement Learning
Baicen Xiao
Bhaskar Ramasubramanian
Radha Poovendran
46
0
0
19 Feb 2022
Design-Bench: Benchmarks for Data-Driven Offline Model-Based Optimization
Brandon Trabucco
Xinyang Geng
Aviral Kumar
Sergey Levine
OffRL
99
102
0
17 Feb 2022
Robust Reinforcement Learning via Genetic Curriculum
Yeeho Song
J. Schneider
71
9
0
17 Feb 2022
Deep Koopman Operator with Control for Nonlinear Systems
Hao-bin Shi
Max Meng
73
79
0
16 Feb 2022
Policy Learning and Evaluation with Randomized Quasi-Monte Carlo
Sébastien M. R. Arnold
P. L'Ecuyer
Liyu Chen
Yi-fan Chen
Fei Sha
OffRL
84
4
0
16 Feb 2022
CUP: A Conservative Update Policy Algorithm for Safe Reinforcement Learning
Long Yang
Jiaming Ji
Juntao Dai
Yu Zhang
Pengfei Li
Gang Pan
69
17
0
15 Feb 2022
L2C2: Locally Lipschitz Continuous Constraint towards Stable and Smooth Reinforcement Learning
Taisuke Kobayashi
94
16
0
15 Feb 2022
QuadSim: A Quadcopter Rotational Dynamics Simulation Framework For Reinforcement Learning Algorithms
Burak Han Demirbilek
24
0
0
14 Feb 2022
Strategy Discovery and Mixture in Lifelong Learning from Heterogeneous Demonstration
Sravan Jayanthi
Letian Chen
Matthew C. Gombolay
48
0
0
14 Feb 2022
Autonomous Vehicles on the Edge: A Survey on Autonomous Vehicle Racing
Johannes Betz
Hongrui Zheng
Alexander Liniger
Ugo Rosolia
Phillip Karle
Madhur Behl
Venkat Krovi
Rahul Mangharam
82
232
0
14 Feb 2022
Saute RL: Almost Surely Safe Reinforcement Learning Using State Augmentation
Aivar Sootla
Alexander I. Cowen-Rivers
Taher Jafferjee
Ziyan Wang
D. Mguni
Jun Wang
Haitham Bou-Ammar
130
54
0
14 Feb 2022
Evolving Neural Networks with Optimal Balance between Information Flow and Connections Cost
A. Khalili
A. Bouchachia
82
0
0
12 Feb 2022
End-to-end Reinforcement Learning of Robotic Manipulation with Robust Keypoints Representation
Tianying Wang
En Yen Puang
Marcus Lee
Yongpeng Wu
Wei Jing
SSL
67
5
0
12 Feb 2022
Uncertainty Aware System Identification with Universal Policies
B. L. Semage
Thommen George Karimpanal
Santu Rana
Svetha Venkatesh
105
3
0
11 Feb 2022
Fast Model-based Policy Search for Universal Policy Networks
B. L. Semage
Thommen George Karimpanal
Santu Rana
Svetha Venkatesh
77
1
0
11 Feb 2022
Uncovering Instabilities in Variational-Quantum Deep Q-Networks
Maja Franz
Lucas Wolf
Maniraman Periyasamy
Christian Ufrecht
Daniel D. Scherer
Axel Plinge
Christopher Mutschler
Wolfgang Mauerer
131
30
0
10 Feb 2022
Bayesian Nonparametrics for Offline Skill Discovery
Valentin Villecroze
H. Braviner
Panteha Naderian
Chris J. Maddison
Gabriel Loaiza-Ganem
BDL, OffRL
85
8
0
09 Feb 2022
Contextualize Me -- The Case for Context in Reinforcement Learning
C. Benjamins
Theresa Eimer
Frederik Schubert
Aditya Mohan
Sebastian Dohler
André Biedenkapp
Bodo Rosenhahn
Frank Hutter
Marius Lindauer
OffRL
104
32
0
09 Feb 2022
Scenario-Assisted Deep Reinforcement Learning
Raz Yerushalmi
Guy Amir
Achiya Elyasaf
D. Harel
Guy Katz
Assaf Marron
OffRL
55
13
0
09 Feb 2022
skrl: Modular and Flexible Library for Reinforcement Learning
Antonio Serrano-Muñoz
D. Chrysostomou
Simon Boegh
N. Arana-Arexolaleiba
89
32
0
08 Feb 2022
Approximating Gradients for Differentiable Quality Diversity in Reinforcement Learning
Bryon Tjanaka
Matthew C. Fontaine
Julian Togelius
Stefanos Nikolaidis
86
54
0
08 Feb 2022
Local Explanations for Reinforcement Learning
Ronny Luss
Amit Dhurandhar
Miao Liu
FAtt, OffRL
81
3
0
08 Feb 2022
Transfer Reinforcement Learning for Differing Action Spaces via Q-Network Representations
Nathan Beck
Abhiramon Rajasekharan
H. Tran
24
3
0
05 Feb 2022
Offline Reinforcement Learning for Mobile Notifications
Yiping Yuan
A. Muralidharan
Preetam Nandy
Miao Cheng
Prakruthi Prabhakar
OffRL
61
10
0
04 Feb 2022
SAFE-OCC: A Novelty Detection Framework for Convolutional Neural Network Sensors and its Application in Process Control
J. Pulsipher
Luke D. J. Coutinho
Tyler A. Soderstrom
Victor M. Zavala
HAI
48
7
0
03 Feb 2022
ExPoSe: Combining State-Based Exploration with Gradient-Based Online Search
Dixant Mittal
Siddharth Aravindan
W. Lee
OnRL
48
3
0
03 Feb 2022
A General, Evolution-Inspired Reward Function for Social Robotics
Thomas Kingsford
135
0
0
01 Feb 2022
Finding the optimal human strategy for Wordle using maximum correct letter probabilities and reinforcement learning
B. Anderson
Jesse G. Meyer
87
24
0
01 Feb 2022
Accelerating Deep Reinforcement Learning for Digital Twin Network Optimization with Evolutionary Strategies
Carlos Güemes-Palau
Paul Almasan
Shihan Xiao
Xiangle Cheng
Xiang Shi
Pere Barlet-Ros
A. Cabellos-Aparicio
58
9
0
01 Feb 2022
PAGE-PG: A Simple and Loopless Variance-Reduced Policy Gradient Method with Probabilistic Gradient Estimation
Matilde Gargiani
Andrea Zanelli
Andrea Martinelli
Tyler H. Summers
John Lygeros
78
14
0
01 Feb 2022
Adversarial Imitation Learning from Video using a State Observer
Haresh Karnan
Garrett A. Warnell
F. Torabi
Peter Stone
GAN
108
13
0
01 Feb 2022
CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery
Michael Laskin
Hao Liu
Xue Bin Peng
Denis Yarats
Aravind Rajeswaran
Pieter Abbeel
SSL
164
69
0
01 Feb 2022
You May Not Need Ratio Clipping in PPO
Mingfei Sun
Vitaly Kurin
Guoqing Liu
Sam Devlin
Tao Qin
Katja Hofmann
Shimon Whiteson
62
16
0
31 Jan 2022
ApolloRL: a Reinforcement Learning Platform for Autonomous Driving
Fei Gao
Peng Geng
Jiaqi Guo
YuanQiang Liu
Dingfeng Guo
Yabo Su
Jie Zhou
Xiao Wei
Jin Li
Xu Liu
41
9
0
29 Jan 2022
Bellman Meets Hawkes: Model-Based Reinforcement Learning via Temporal Point Processes
Chao Qu
Jue Chen
Siqiao Xue
Xiaoming Shi
James Y. Zhang
Hongyuan Mei
OffRL
74
18
0
29 Jan 2022
Zeroth-Order Actor-Critic: An Evolutionary Framework for Sequential Decision Problems
Yuheng Lei
Jianyu Chen
Guojian Zhan
Tao Zhang
Jiangtao Li
Jianyu Chen
Shengbo Eben Li
Sifa Zheng
OffRL
82
3
0
29 Jan 2022
Towards Safe Reinforcement Learning with a Safety Editor Policy
Haonan Yu
Wei Xu
Haichao Zhang
OffRL
149
31
0
28 Jan 2022
Using Deep Reinforcement Learning for Zero Defect Smart Forging
Yunpeng Ma
A. Kassler
Bestoun S. Ahmed
P. Krakhmalev
A. Thore
Arash Toyser
Hans Lindback
OffRL, AI4CE
21
4
0
25 Jan 2022
Evolution Gym: A Large-Scale Benchmark for Evolving Soft Robots
Jagdeep Bhatia
Holly Jackson
Yunsheng Tian
Jie Xu
Wojciech Matusik
88
82
0
24 Jan 2022
STOPS: Short-Term-based Volatility-controlled Policy Search and its Global Convergence
Liang Xu
Daoming Lyu
Yangchen Pan
Aiwen Jiang
Bo Liu
95
0
0
24 Jan 2022
Deep Q-learning: a robust control approach
B. Varga
Balázs Kulcsár
M. Chehreghani
OOD
55
11
0
21 Jan 2022
Tensor and Matrix Low-Rank Value-Function Approximation in Reinforcement Learning
Sergio Rozada
Santiago Paternain
A. Marques
101
15
0
21 Jan 2022
DROPO: Sim-to-Real Transfer with Offline Domain Randomization
Gabriele Tiboni
Karol Arndt
Ville Kyrki
63
28
0
20 Jan 2022
Goal-Conditioned Reinforcement Learning: Problems and Solutions
Minghuan Liu
Menghui Zhu
Weinan Zhang
104
144
0
20 Jan 2022
A Deep Learning Approach To Estimation Using Measurements Received Over a Network
S. Agarwal
S. Kaul
Saket Anand
P. B. Sujit
25
0
0
20 Jan 2022
AdaTerm: Adaptive T-Distribution Estimated Robust Moments for Noise-Robust Stochastic Gradient Optimization
Wendyam Eric Lionel Ilboudo
Taisuke Kobayashi
Takamitsu Matsubara
89
13
0
18 Jan 2022
Summarising and Comparing Agent Dynamics with Contrastive Spatiotemporal Abstraction
Tom Bewley
J. Lawry
Arthur G. Richards
46
3
0
17 Jan 2022
Parameterized Convex Universal Approximators for Decision-Making Problems
Jinrae Kim
Youdan Kim
54
5
0
17 Jan 2022