ResearchTrend.AI
OpenAI Gym

5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
    OffRL, ODL

Papers citing "OpenAI Gym"

50 / 2,578 papers shown
Title
Deep Reinforcement Learning based Evasion Generative Adversarial Network for Botnet Detection
Rizwan Hamid Randhawa
N. Aslam
Mohammad Alauthman
Muhammad Khalid
Husnain Rafiq
GAN
61
26
0
06 Oct 2022
Low-Thrust Orbital Transfer using Dynamics-Agnostic Reinforcement Learning
Carlos M. Casas
B. Carro
Antonio J. Sánchez-Esguevillas
24
2
0
06 Oct 2022
Training Diverse High-Dimensional Controllers by Scaling Covariance Matrix Adaptation MAP-Annealing
Bryon Tjanaka
Matthew C. Fontaine
David H. Lee
Aniruddha Kalkar
Stefanos Nikolaidis
124
10
0
06 Oct 2022
Option-Aware Adversarial Inverse Reinforcement Learning for Robotic Control
Jiayu Chen
Tian-Shing Lan
Vaneet Aggarwal
BDL
110
17
0
05 Oct 2022
CostNet: An End-to-End Framework for Goal-Directed Reinforcement Learning
Per-Arne Andersen
M. G. Olsen
Ole-Christoffer Granmo
3DV, OffRL
31
0
0
03 Oct 2022
Interpretable Option Discovery using Deep Q-Learning and Variational Autoencoders
Per-Arne Andersen
Ole-Christoffer Granmo
Morten Goodwin
OOD
56
0
0
03 Oct 2022
Hierarchical reinforcement learning for in-hand robotic manipulation using Davenport chained rotations
Francisco Roldan Sanchez
Qiang-qiang Wang
David Córdova Bulens
Kevin McGuinness
Stephen J. Redmond
Noel E. O'Connor
40
1
0
03 Oct 2022
Accelerate Reinforcement Learning with PID Controllers in the Pendulum Simulations
Liping Bai
23
0
0
03 Oct 2022
WorldGen: A Large Scale Generative Simulator
Chahat Deep Singh
R. Kumari
Cornelia Fermuller
N. Sanket
Yiannis Aloimonos
77
4
0
03 Oct 2022
Deep Intrinsically Motivated Exploration in Continuous Control
Baturay Saglam
Suleyman S. Kozat
61
4
0
01 Oct 2022
Boosting Exploration in Actor-Critic Algorithms by Incentivizing Plausible Novel States
C. Banerjee
Zhiyong Chen
N. Noman
60
3
0
01 Oct 2022
Efficiently Learning Small Policies for Locomotion and Manipulation
Shashank Hegde
Gaurav Sukhatme
100
3
0
30 Sep 2022
Midas: A Multi-Joint Robotics Simulator with Intersection-Free Frictional Contact
Yunuo Chen
Minchen Li
Wenlong Lu
Chuyuan Fu
Chenfanfu Jiang
71
4
0
30 Sep 2022
Online Weighted Q-Ensembles for Reduced Hyperparameter Tuning in Reinforcement Learning
R. G. Oliveira
W. Caarls
OffRL
58
0
0
29 Sep 2022
Ensemble Reinforcement Learning in Continuous Spaces -- A Hierarchical Multi-Step Approach for Policy Training
Gang Chen
Victoria Huang
OffRL
111
1
0
29 Sep 2022
MultiRoboLearn: An open-source Framework for Multi-robot Deep Reinforcement Learning
Junfeng Chen
Fuqin Deng
Yuan Gao
Junjie Hu
Xiyue Guo
Guanqi Liang
Tin Lun Lam
65
7
0
28 Sep 2022
MARLUI: Multi-Agent Reinforcement Learning for Adaptive UIs
T. Langerak
Sammy Christen
Mert Albaba
Christoph Gebhardt
Otmar Hilliges
OffRL
62
0
0
26 Sep 2022
Deep Reinforcement Learning for Adaptive Mesh Refinement
C. Foucart
A. Charous
Pierre FJ Lermusiaux
AI4CE
81
23
0
25 Sep 2022
Learn what matters: cross-domain imitation learning with task-relevant embeddings
Tim Franzmeyer
Philip Torr
João F. Henriques
OOD
90
22
0
24 Sep 2022
Explainable Reinforcement Learning via Model Transforms
Mira Finkelstein
Lucy Liu
Nitsan Levy Schlot
Y. Kolumbus
David C. Parkes
Jeffrey S. Rosenshein
Sarah Keren
85
14
0
24 Sep 2022
Fast Lifelong Adaptive Inverse Reinforcement Learning from Demonstrations
Letian Chen
Sravan Jayanthi
Rohan R. Paleja
Daniel Martin
Viacheslav Zakharov
Matthew C. Gombolay
131
16
0
24 Sep 2022
Quantification before Selection: Active Dynamics Preference for Robust Reinforcement Learning
Kang Xu
Yan Ma
Wei Li
97
0
0
23 Sep 2022
Learning Dexterous Manipulation from Exemplar Object Trajectories and Pre-Grasps
Sudeep Dasari
Abhi Gupta
Vikash Kumar
111
43
0
22 Sep 2022
Proximal Point Imitation Learning
Luca Viano
Angeliki Kamoutsi
Gergely Neu
Igor Krawczuk
Volkan Cevher
114
16
0
22 Sep 2022
Pretraining the Vision Transformer using self-supervised methods for vision based Deep Reinforcement Learning
Manuel Goulão
Arlindo L. Oliveira
ViT
111
6
0
22 Sep 2022
Hierarchical Decision Transformer
André Rosa de Sousa Porfírio Correia
L. A. Alexandre
OffRL
154
11
0
21 Sep 2022
On the Convergence Theory of Meta Reinforcement Learning with Personalized Policies
Haozhi Wang
Qing Wang
Yunfeng Shao
Dong Li
Jianye Hao
Yinchuan Li
73
0
0
21 Sep 2022
Lamarckian Platform: Pushing the Boundaries of Evolutionary Reinforcement Learning towards Asynchronous Commercial Games
Hui Bai
R. Shen
Yue Lin
Bo Xu
Ran Cheng
VLM
85
5
0
21 Sep 2022
Teaching Autonomous Systems Hands-On: Leveraging Modular Small-Scale Hardware in the Robotics Classroom
Johannes Betz
Hongrui Zheng
Zirui Zang
Florian Sauerbeck
Krzysztof Walas
...
Madhur Behl
Rosa Zheng
Joydeep Biswas
Venkat Krovi
Rahul Mangharam
129
7
0
21 Sep 2022
DiffTune: Auto-Tuning through Auto-Differentiation
Sheng Cheng
Minkyung Kim
Lin Song
Chengyu Yang
Zhuohuan Wu
Shenlong Wang
N. Hovakimyan
101
7
0
20 Sep 2022
Optimizing Crop Management with Reinforcement Learning and Imitation Learning
Ran Tao
Pan Zhao
Jing Wu
N. F. Martin
M. Harrison
C. Ferreira
Z. Kalantari
N. Hovakimyan
OffRL
61
26
0
20 Sep 2022
Soft Action Priors: Towards Robust Policy Transfer
M. Centa
Philippe Preux
OffRL, OnRL
22
1
0
20 Sep 2022
A Joint Imitation-Reinforcement Learning Framework for Reduced Baseline Regret
Sheelabhadra Dey
Sumedh Pendurkar
Guni Sharon
Josiah P. Hanna
68
10
0
20 Sep 2022
Locally Constrained Representations in Reinforcement Learning
Somjit Nath
Rushiv Arora
Samira Ebrahimi Kahou
OOD, OffRL
53
0
0
20 Sep 2022
A Transferable and Automatic Tuning of Deep Reinforcement Learning for Cost Effective Phishing Detection
Orel Lavie
A. Shabtai
Gilad Katz
AAML, OffRL
144
1
0
19 Sep 2022
Towards advanced robotic manipulation
Francisco Roldan Sanchez
Stephen J. Redmond
Kevin McGuinness
Noel E. O'Connor
54
1
0
19 Sep 2022
Evolutionary Deep Reinforcement Learning Using Elite Buffer: A Novel Approach Towards DRL Combined with EA in Continuous Control Tasks
Marzie Esmaeeli
H. Malek
66
2
0
18 Sep 2022
Value Summation: A Novel Scoring Function for MPC-based Model-based Reinforcement Learning
Mehran Raisi
Amirhossein Noohian
Lucy McCutcheon
Saber Fallah
49
3
0
16 Sep 2022
M$^2$DQN: A Robust Method for Accelerating Deep Q-learning Network
Zhe Zhang
Yukun Zou
Junjie Lai
Qinglong Xu
27
4
0
16 Sep 2022
Towards A Unified Policy Abstraction Theory and Representation Learning Approach in Markov Decision Processes
Hao Fei
Hongyao Tang
Jianye Hao
Yan Zheng
OffRL
79
1
0
16 Sep 2022
AssembleRL: Learning to Assemble Furniture from Their Point Clouds
Ozgur Aslan
Burak Bolat
Batuhan Bal
Tuğba Tümer
Erol Şahin
Sinan Kalkan
3DPC
65
6
0
15 Sep 2022
COOL-MC: A Comprehensive Tool for Reinforcement Learning and Model Checking
Dennis Gross
N. Jansen
Sebastian Junges
G. Pérez
71
9
0
15 Sep 2022
MIXRTs: Toward Interpretable Multi-Agent Reinforcement Learning via Mixing Recurrent Soft Decision Trees
Zichuan Liu
Zhi Wang
Yuanyang Zhu
Chunlin Chen
243
7
0
15 Sep 2022
Using Forwards-Backwards Models to Approximate MDP Homomorphisms
Augustine N. Mavor-Parker
Matthew J. Sargent
Christian Pehle
Andrea Banino
Lewis D. Griffin
Caswell Barry
71
1
0
14 Sep 2022
Deep Reinforcement Learning for Cryptocurrency Trading: Practical Approach to Address Backtest Overfitting
Berend Gort
Xiao-Yang Liu
Xinghang Sun
Jiechao Gao
Shuai Chen
Chris Wang
115
13
0
12 Sep 2022
Instruction-driven history-aware policies for robotic manipulations
Pierre-Louis Guhur
Shizhe Chen
Ricardo Garcia Pinel
Makarand Tapaswi
Ivan Laptev
Cordelia Schmid
LM&Ro
195
109
0
11 Sep 2022
Distilling Deep RL Models Into Interpretable Neuro-Fuzzy Systems
Arne Gevaert
Jonathan Peck
Yvan Saeys
60
2
0
07 Sep 2022
Finite-Time Error Bounds for Greedy-GQ
Yue Wang
Yi Zhou
Shaofeng Zou
108
2
0
06 Sep 2022
A Benchmark for Unsupervised Anomaly Detection in Multi-Agent Trajectories
Julian Wiederer
Julian Schmidt
U. Kressel
Klaus C. J. Dietmayer
Vasileios Belagiannis
AI4TS
78
7
0
05 Sep 2022
Indoor Path Planning for Multiple Unmanned Aerial Vehicles via Curriculum Learning
J. Park
Kwansik Park
36
2
0
05 Sep 2022