ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1606.01540
  4. Cited By
OpenAI Gym

OpenAI Gym

5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
    OffRLODL
ArXiv (abs)PDFHTML

Papers citing "OpenAI Gym"

50 / 2,578 papers shown
Title
Multi-Task Learning with Sequence-Conditioned Transporter Networks
Multi-Task Learning with Sequence-Conditioned Transporter Networks
M. H. Lim
Andy Zeng
Brian Ichter
Maryam Bandari
Erwin Coumans
Claire Tomlin
S. Schaal
Aleksandra Faust
65
15
0
15 Sep 2021
Learning Robot Structure and Motion Embeddings using Graph Neural
  Networks
Learning Robot Structure and Motion Embeddings using Graph Neural Networks
J. Kim
Jeongeun Park
Sungjoon Choi
Sehoon Ha
39
11
0
15 Sep 2021
DCUR: Data Curriculum for Teaching via Samples with Reinforcement
  Learning
DCUR: Data Curriculum for Teaching via Samples with Reinforcement Learning
Daniel Seita
Abhinav Gopal
Zhao Mandi
John F. Canny
OffRLOnRL
47
0
0
15 Sep 2021
GRiD: GPU-Accelerated Rigid Body Dynamics with Analytical Gradients
GRiD: GPU-Accelerated Rigid Body Dynamics with Analytical Gradients
Brian Plancher
Sabrina M. Neuman
Radhika Ghosal
S. Kuindersma
Vijay Janapa Reddi
AI4CEPINN
100
16
0
14 Sep 2021
HPOBench: A Collection of Reproducible Multi-Fidelity Benchmark Problems
  for HPO
HPOBench: A Collection of Reproducible Multi-Fidelity Benchmark Problems for HPO
Katharina Eggensperger
Philip Muller
Neeratyoy Mallik
Matthias Feurer
René Sass
Aaron Klein
Noor H. Awad
Marius Lindauer
Frank Hutter
243
104
0
14 Sep 2021
Exploration in Deep Reinforcement Learning: From Single-Agent to
  Multiagent Domain
Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain
Jianye Hao
Tianpei Yang
Hongyao Tang
Chenjia Bai
Jinyi Liu
Zhaopeng Meng
Peng Liu
Zhen Wang
OffRL
86
103
0
14 Sep 2021
safe-control-gym: a Unified Benchmark Suite for Safe Learning-based
  Control and Reinforcement Learning in Robotics
safe-control-gym: a Unified Benchmark Suite for Safe Learning-based Control and Reinforcement Learning in Robotics
Zhaocong Yuan
Adam W. Hall
Siqi Zhou
Lukas Brunke
Melissa Greeff
Jacopo Panerati
Angela P. Schoellig
OffRL
166
55
0
13 Sep 2021
OPIRL: Sample Efficient Off-Policy Inverse Reinforcement Learning via
  Distribution Matching
OPIRL: Sample Efficient Off-Policy Inverse Reinforcement Learning via Distribution Matching
Hanako Hoshino
Keita Ota
Asako Kanezaki
Rio Yokota
OffRLOOD
63
19
0
09 Sep 2021
Membership Inference Attacks Against Temporally Correlated Data in Deep
  Reinforcement Learning
Membership Inference Attacks Against Temporally Correlated Data in Deep Reinforcement Learning
Maziar Gomrokchi
Susan Amin
Hossein Aboutalebi
Alexander Wong
Doina Precup
MIACVAAML
99
3
0
08 Sep 2021
CyGIL: A Cyber Gym for Training Autonomous Agents over Emulated Network
  Systems
CyGIL: A Cyber Gym for Training Autonomous Agents over Emulated Network Systems
Li Li
Raed Fayad
Adrian Taylor
62
42
0
07 Sep 2021
Robust Predictable Control
Robust Predictable Control
Benjamin Eysenbach
Ruslan Salakhutdinov
Sergey Levine
OffRL
91
45
0
07 Sep 2021
Optimal Stroke Learning with Policy Gradient Approach for Robotic Table
  Tennis
Optimal Stroke Learning with Policy Gradient Approach for Robotic Table Tennis
Yapeng Gao
Jonas Tebbe
A. Zell
OffRL
96
14
0
07 Sep 2021
The Sensory Neuron as a Transformer: Permutation-Invariant Neural
  Networks for Reinforcement Learning
The Sensory Neuron as a Transformer: Permutation-Invariant Neural Networks for Reinforcement Learning
Yujin Tang
David R Ha
110
77
0
07 Sep 2021
ViSTA: a Framework for Virtual Scenario-based Testing of Autonomous
  Vehicles
ViSTA: a Framework for Virtual Scenario-based Testing of Autonomous Vehicles
A. Piazzoni
Jim Cherian
Mohamed Azhar
Jing Yew Yap
James Lee Wei Shung
Roshan Vijay
93
20
0
06 Sep 2021
Error Controlled Actor-Critic
Error Controlled Actor-Critic
Xingen Gao
Yong Li
Changle Zhou
Zhen Ge
Chih-Min Lin
Longzhi Yang
Xiang Chang
C. Shang
29
3
0
06 Sep 2021
Hindsight Reward Tweaking via Conditional Deep Reinforcement Learning
Hindsight Reward Tweaking via Conditional Deep Reinforcement Learning
Ning Wei
Jiahua Liang
Di Xie
Shiliang Pu
52
0
0
06 Sep 2021
Supervised DKRC with Images for Offline System Identification
Supervised DKRC with Images for Offline System Identification
Alexander Krolicki
P. Lavertu
52
1
0
06 Sep 2021
An Exploration of Deep Learning Methods in Hungry Geese
An Exploration of Deep Learning Methods in Hungry Geese
Nikzad Khani
Matthew Kluska
27
0
0
05 Sep 2021
Event-Based Communication in Distributed Q-Learning
Event-Based Communication in Distributed Q-Learning
Daniel Jarne Ornia
M. Mazo
65
2
0
03 Sep 2021
Variational Quantum Reinforcement Learning via Evolutionary Optimization
Variational Quantum Reinforcement Learning via Evolutionary Optimization
Samuel Yen-Chi Chen
Chih-Min Huang
Chia-Wei Hsing
H. Goan
Y. Kao
92
87
0
01 Sep 2021
Catastrophic Interference in Reinforcement Learning: A Solution Based on
  Context Division and Knowledge Distillation
Catastrophic Interference in Reinforcement Learning: A Solution Based on Context Division and Knowledge Distillation
Tiantian Zhang
Xueqian Wang
Bin Liang
Bo Yuan
OffRL
80
18
0
01 Sep 2021
Phy-Q as a measure for physical reasoning intelligence
Phy-Q as a measure for physical reasoning intelligence
Cheng Xue
Vimukthini Pinto
C. Gamage
Ekaterina Nikonova
Peng Zhang
Jochen Renz
LRM
77
12
0
31 Aug 2021
SurRoL: An Open-source Reinforcement Learning Centered and dVRK
  Compatible Platform for Surgical Robot Learning
SurRoL: An Open-source Reinforcement Learning Centered and dVRK Compatible Platform for Surgical Robot Learning
Jiaqi Xu
Bin Li
Bo Lu
Yunhui Liu
Qi Dou
Pheng-Ann Heng
156
78
0
30 Aug 2021
Photonic Quantum Policy Learning in OpenAI Gym
Photonic Quantum Policy Learning in OpenAI Gym
D. Nagy
Zsolt I. Tabi
Péter Hága
Zsófia Kallus
Z. Zimborás
96
8
0
29 Aug 2021
Influence-Based Reinforcement Learning for Intrinsically-Motivated
  Agents
Influence-Based Reinforcement Learning for Intrinsically-Motivated Agents
Ammar Fayad
M. Ibrahim
62
5
0
28 Aug 2021
Entropy-Aware Model Initialization for Effective Exploration in Deep
  Reinforcement Learning
Entropy-Aware Model Initialization for Effective Exploration in Deep Reinforcement Learning
Sooyoung Jang
Hyungil Kim
55
5
0
24 Aug 2021
Indoor Path Planning for an Unmanned Aerial Vehicle via Curriculum
  Learning
Indoor Path Planning for an Unmanned Aerial Vehicle via Curriculum Learning
Jongmin Park
Soo-beom Jang
Y. Shin
SSL
52
11
0
23 Aug 2021
CybORG: A Gym for the Development of Autonomous Cyber Agents
CybORG: A Gym for the Development of Autonomous Cyber Agents
Maxwell Standen
Martin Lucas
David Bowman
Toby J. Richer
Junae Kim
Damian A. Marriott
55
79
0
20 Aug 2021
Diversity-based Trajectory and Goal Selection with Hindsight Experience
  Replay
Diversity-based Trajectory and Goal Selection with Hindsight Experience Replay
Tianhong Dai
Hengyan Liu
Kai Arulkumaran
Guangyu Ren
Anil Anthony Bharath
70
11
0
17 Aug 2021
APReL: A Library for Active Preference-based Reward Learning Algorithms
APReL: A Library for Active Preference-based Reward Learning Algorithms
Erdem Biyik
Aditi Talati
Dorsa Sadigh
68
37
0
16 Aug 2021
Introduction to Quantum Reinforcement Learning: Theory and
  PennyLane-based Implementation
Introduction to Quantum Reinforcement Learning: Theory and PennyLane-based Implementation
Yunseok Kwak
Won Joon Yun
Soyi Jung
Jong-Kook Kim
Joongheon Kim
69
49
0
16 Aug 2021
A general class of surrogate functions for stable and efficient
  reinforcement learning
A general class of surrogate functions for stable and efficient reinforcement learning
Sharan Vaswani
Olivier Bachem
Simone Totaro
Robert Mueller
Shivam Garg
Matthieu Geist
Marlos C. Machado
Pablo Samuel Castro
Nicolas Le Roux
OffRL
94
16
0
12 Aug 2021
Neural Network Repair with Reachability Analysis
Neural Network Repair with Reachability Analysis
Xiaodong Yang
Tomochika Yamaguchi
Hoang-Dung Tran
Bardh Hoxha
Taylor T. Johnson
Danil Prokhorov
AAML
62
30
0
09 Aug 2021
Modified Double DQN: addressing stability
Modified Double DQN: addressing stability
Shervin Halat
M. Ebadzadeh
27
2
0
09 Aug 2021
Online Bootstrap Inference For Policy Evaluation in Reinforcement
  Learning
Online Bootstrap Inference For Policy Evaluation in Reinforcement Learning
Pratik Ramprasad
Yuantong Li
Zhuoran Yang
Zhaoran Wang
W. Sun
Guang Cheng
OffRL
140
28
0
08 Aug 2021
BEHAVIOR: Benchmark for Everyday Household Activities in Virtual,
  Interactive, and Ecological Environments
BEHAVIOR: Benchmark for Everyday Household Activities in Virtual, Interactive, and Ecological Environments
S. Srivastava
Chengshu Li
Michael Lingelbach
Roberto Martín-Martín
Fei Xia
...
Chenxi Liu
Silvio Savarese
H. Gweon
Jiajun Wu
Li Fei-Fei
LM&Ro
251
168
0
06 Aug 2021
Active Reinforcement Learning over MDPs
Qi Yang
Peng Yang
K. Tang
89
0
0
05 Aug 2021
Learning Task Agnostic Skills with Data-driven Guidance
Learning Task Agnostic Skills with Data-driven Guidance
E. Klemsdal
Sverre Herland
Abdulmajid Murad
39
1
0
04 Aug 2021
A Pragmatic Look at Deep Imitation Learning
A Pragmatic Look at Deep Imitation Learning
Kai Arulkumaran
D. Lillrank
59
9
0
04 Aug 2021
Learning Barrier Certificates: Towards Safe Reinforcement Learning with
  Zero Training-time Violations
Learning Barrier Certificates: Towards Safe Reinforcement Learning with Zero Training-time Violations
Yuping Luo
Tengyu Ma
OffRL
101
44
0
04 Aug 2021
Offline Decentralized Multi-Agent Reinforcement Learning
Offline Decentralized Multi-Agent Reinforcement Learning
Jiechuan Jiang
Zongqing Lu
OffRL
91
39
0
04 Aug 2021
Hierarchical Representations and Explicit Memory: Learning Effective
  Navigation Policies on 3D Scene Graphs using Graph Neural Networks
Hierarchical Representations and Explicit Memory: Learning Effective Navigation Policies on 3D Scene Graphs using Graph Neural Networks
Zachary Ravichandran
Lisa Peng
Nathan Hughes
J. D. Griffith
Luca Carlone
130
71
0
02 Aug 2021
ManiSkill: Generalizable Manipulation Skill Benchmark with Large-Scale
  Demonstrations
ManiSkill: Generalizable Manipulation Skill Benchmark with Large-Scale Demonstrations
Tongzhou Mu
Z. Ling
Fanbo Xiang
Derek Yang
Xuanlin Li
Stone Tao
Zhiao Huang
Zhiwei Jia
Hao Su
158
138
0
30 Jul 2021
Hindsight Value Function for Variance Reduction in Stochastic Dynamic
  Environment
Hindsight Value Function for Variance Reduction in Stochastic Dynamic Environment
Jiaming Guo
Rui Zhang
Xishan Zhang
Shaohui Peng
Qiaomin Yi
Zidong Du
Xing Hu
Qi Guo
Yunji Chen
59
7
0
26 Jul 2021
Model-based micro-data reinforcement learning: what are the crucial
  model properties and which model to choose?
Model-based micro-data reinforcement learning: what are the crucial model properties and which model to choose?
Balázs Kégl
Gabriel Hurtado
Albert Thomas
76
12
0
24 Jul 2021
MarsExplorer: Exploration of Unknown Terrains via Deep Reinforcement
  Learning and Procedurally Generated Environments
MarsExplorer: Exploration of Unknown Terrains via Deep Reinforcement Learning and Procedurally Generated Environments
Dimitrios I. Koutras
Athanasios Ch. Kapoutsis
A. Amanatiadis
Elias B. Kosmatopoulos
59
10
0
21 Jul 2021
An Empirical Analysis of Measure-Valued Derivatives for Policy Gradients
An Empirical Analysis of Measure-Valued Derivatives for Policy Gradients
João Carvalho
Davide Tateo
Fabio Muratore
Jan Peters
OffRL
56
7
0
20 Jul 2021
Using reinforcement learning to autonomously identify sources of error
  for agents in group missions
Using reinforcement learning to autonomously identify sources of error for agents in group missions
Keishu Utimula
Ken-taro Hayaschi
Trevor Bihl
K. Hongo
R. Maezono
43
0
0
20 Jul 2021
Constrained Policy Gradient Method for Safe and Fast Reinforcement
  Learning: a Neural Tangent Kernel Based Approach
Constrained Policy Gradient Method for Safe and Fast Reinforcement Learning: a Neural Tangent Kernel Based Approach
B. Varga
Balázs Kulcsár
M. Chehreghani
79
1
0
19 Jul 2021
High-Accuracy Model-Based Reinforcement Learning, a Survey
High-Accuracy Model-Based Reinforcement Learning, a Survey
Aske Plaat
W. Kosters
Mike Preuss
OffRL
75
37
0
17 Jul 2021
Previous
123...242526...505152
Next