ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1606.01540
  4. Cited By
OpenAI Gym

OpenAI Gym

5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
    OffRLODL
ArXiv (abs)PDFHTML

Papers citing "OpenAI Gym"

50 / 2,578 papers shown
Title
From Eye-blinks to State Construction: Diagnostic Benchmarks for Online
  Representation Learning
From Eye-blinks to State Construction: Diagnostic Benchmarks for Online Representation Learning
Banafsheh Rafiee
Zaheer Abbas
Sina Ghiassian
Raksha Kumaraswamy
R. Sutton
Elliot A. Ludvig
Adam White
OffRL
67
17
0
09 Nov 2020
Universal Activation Function For Machine Learning
Universal Activation Function For Machine Learning
Brosnan Yuen
Minh Tu Hoang
Xiaodai Dong
Tao Lu
33
43
0
07 Nov 2020
Adversarial Skill Learning for Robust Manipulation
Adversarial Skill Learning for Robust Manipulation
Pingcheng Jian
Chao Yang
Di Guo
Huaping Liu
F. Sun
AAML
68
7
0
06 Nov 2020
Sample-efficient Reinforcement Learning in Robotic Table Tennis
Sample-efficient Reinforcement Learning in Robotic Table Tennis
Jonas Tebbe
Lukas Krauch
Yapeng Gao
A. Zell
76
34
0
06 Nov 2020
RealAnt: An Open-Source Low-Cost Quadruped for Education and Research in
  Real-World Reinforcement Learning
RealAnt: An Open-Source Low-Cost Quadruped for Education and Research in Real-World Reinforcement Learning
Rinu Boney
Jussi Sainio
M. Kaivola
Arno Solin
Arno Solin
58
5
0
05 Nov 2020
Playing optical tweezers with deep reinforcement learning: in virtual,
  physical and augmented environments
Playing optical tweezers with deep reinforcement learning: in virtual, physical and augmented environments
M. Praeger
Yunhui Xie
J. Grant-Jacob
R. Eason
B. Mills
64
12
0
05 Nov 2020
Harnessing Distribution Ratio Estimators for Learning Agents with
  Quality and Diversity
Harnessing Distribution Ratio Estimators for Learning Agents with Quality and Diversity
Tanmay Gangwani
Jian Peng
Yuanshuo Zhou
80
11
0
05 Nov 2020
Federated Knowledge Distillation
Federated Knowledge Distillation
Hyowoon Seo
Jihong Park
Seungeun Oh
M. Bennis
Seong-Lyun Kim
FedML
103
92
0
04 Nov 2020
Rearrangement: A Challenge for Embodied AI
Rearrangement: A Challenge for Embodied AI
Dhruv Batra
Angel X. Chang
Sonia Chernova
Andrew J. Davison
Jia Deng
...
Jitendra Malik
Igor Mordatch
Roozbeh Mottaghi
Manolis Savva
Hao Su
LM&Ro
119
220
0
03 Nov 2020
Control with adaptive Q-learning
Control with adaptive Q-learning
J. Araújo
Mário A. T. Figueiredo
M. Botto
92
2
0
03 Nov 2020
Generalization to New Actions in Reinforcement Learning
Generalization to New Actions in Reinforcement Learning
Ayush Jain
Andrew Szot
Joseph J. Lim
AI4CE
94
35
0
03 Nov 2020
Intrinsic Robotic Introspection: Learning Internal States From Neuron
  Activations
Intrinsic Robotic Introspection: Learning Internal States From Neuron Activations
N. Pitsillos
Ameya Pore
B. S. Jensen
G. Aragon-Camarasa
70
4
0
03 Nov 2020
Episodic Linear Quadratic Regulators with Low-rank Transitions
Episodic Linear Quadratic Regulators with Low-rank Transitions
Tianyu Wang
Lin F. Yang
45
3
0
03 Nov 2020
Sim-to-Real Learning of All Common Bipedal Gaits via Periodic Reward
  Composition
Sim-to-Real Learning of All Common Bipedal Gaits via Periodic Reward Composition
J. Siekmann
Yesh Godse
Alan Fern
J. Hurst
108
159
0
02 Nov 2020
Useful Policy Invariant Shaping from Arbitrary Advice
Useful Policy Invariant Shaping from Arbitrary Advice
Paniz Behboudian
Yash Satsangi
Matthew E. Taylor
Anna Harutyunyan
Michael Bowling
OffRL
28
7
0
02 Nov 2020
Information-theoretic Task Selection for Meta-Reinforcement Learning
Information-theoretic Task Selection for Meta-Reinforcement Learning
Ricardo Luna Gutierrez
Matteo Leonetti
74
18
0
02 Nov 2020
NEARL: Non-Explicit Action Reinforcement Learning for Robotic Control
NEARL: Non-Explicit Action Reinforcement Learning for Robotic Control
Nanlin. Lin
Yuxuan Li
Yujun Zhu
Ruolin Wang
Xiayu Zhang
Jianmin Ji
Keke Tang
Xiaoping Chen
Xinming Zhang
OffRL
21
0
0
02 Nov 2020
Learning Sequences of Manipulation Primitives for Robotic Assembly
Learning Sequences of Manipulation Primitives for Robotic Assembly
N. Vuong
H. Pham
Quang Pham
81
26
0
02 Nov 2020
Observation Space Matters: Benchmark and Optimization Algorithm
Observation Space Matters: Benchmark and Optimization Algorithm
J. Kim
Sehoon Ha
OODOffRL
49
11
0
02 Nov 2020
Fast Reinforcement Learning with Incremental Gaussian Mixture Models
Fast Reinforcement Learning with Incremental Gaussian Mixture Models
R. Pinto
24
1
0
02 Nov 2020
Methods for Pruning Deep Neural Networks
Methods for Pruning Deep Neural Networks
S. Vadera
Salem Ameen
3DPC
76
131
0
31 Oct 2020
An interactive sequential-decision benchmark from geosteering
An interactive sequential-decision benchmark from geosteering
S. Alyaev
R. Bratvold
Sofija Ivanova
Andrew Holsaeter
M. Bendiksen
29
10
0
30 Oct 2020
Designing Interpretable Approximations to Deep Reinforcement Learning
Designing Interpretable Approximations to Deep Reinforcement Learning
Nathan Dahlin
K. C. Kalagarla
Nikhil Naik
Rahul Jain
Pierluigi Nuzzo
64
10
0
28 Oct 2020
Learning to Represent Action Values as a Hypergraph on the Action
  Vertices
Learning to Represent Action Values as a Hypergraph on the Action Vertices
Arash Tavakoli
Mehdi Fatemi
Petar Kormushev
83
23
0
28 Oct 2020
Implicit Under-Parameterization Inhibits Data-Efficient Deep
  Reinforcement Learning
Implicit Under-Parameterization Inhibits Data-Efficient Deep Reinforcement Learning
Aviral Kumar
Rishabh Agarwal
Dibya Ghosh
Sergey Levine
OffRL
88
123
0
27 Oct 2020
Batch Reinforcement Learning with a Nonparametric Off-Policy Policy
  Gradient
Batch Reinforcement Learning with a Nonparametric Off-Policy Policy Gradient
Samuele Tosatto
João Carvalho
Jan Peters
OffRL
62
7
0
27 Oct 2020
Hamilton-Jacobi Deep Q-Learning for Deterministic Continuous-Time
  Systems with Lipschitz Continuous Controls
Hamilton-Jacobi Deep Q-Learning for Deterministic Continuous-Time Systems with Lipschitz Continuous Controls
Jeongho Kim
Jaeuk Shin
Insoon Yang
61
35
0
27 Oct 2020
VisualHints: A Visual-Lingual Environment for Multimodal Reinforcement
  Learning
VisualHints: A Visual-Lingual Environment for Multimodal Reinforcement Learning
Thomas Carta
Subhajit Chaudhury
Kartik Talamadupula
Michiaki Tatsubori
32
3
0
26 Oct 2020
Deep reinforced learning enables solving rich discrete-choice life cycle
  models to analyze social security reforms
Deep reinforced learning enables solving rich discrete-choice life cycle models to analyze social security reforms
A. Tanskanen
11
1
0
26 Oct 2020
Expert Selection in High-Dimensional Markov Decision Processes
Expert Selection in High-Dimensional Markov Decision Processes
Vicenç Rúbies Royo
Eric Mazumdar
Roy Dong
Claire Tomlin
S. Shankar Sastry
OffRL
10
0
0
26 Oct 2020
Improving the Exploration of Deep Reinforcement Learning in Continuous
  Domains using Planning for Policy Search
Improving the Exploration of Deep Reinforcement Learning in Continuous Domains using Planning for Policy Search
Jakob J. Hollenstein
Erwan Renaudo
Matteo Saveriano
J. Piater
43
2
0
24 Oct 2020
Stabilizing Transformer-Based Action Sequence Generation For Q-Learning
Stabilizing Transformer-Based Action Sequence Generation For Q-Learning
Gideon Stein
Andrey Filchenkov
Arip Asadulaev
OffRL
99
2
0
23 Oct 2020
Error Bounds of Imitating Policies and Environments
Error Bounds of Imitating Policies and Environments
Tian Xu
Ziniu Li
Yang Yu
97
121
0
22 Oct 2020
Logistic Q-Learning
Logistic Q-Learning
Joan Bas-Serrano
Sebastian Curi
Andreas Krause
Gergely Neu
105
40
0
21 Oct 2020
Iterative Amortized Policy Optimization
Iterative Amortized Policy Optimization
Joseph Marino
Alexandre Piché
Alessandro Davide Ialongo
Yisong Yue
OffRL
117
21
0
20 Oct 2020
Reinforcement Learning for Optimization of COVID-19 Mitigation policies
Reinforcement Learning for Optimization of COVID-19 Mitigation policies
Varun Kompella
Roberto Capobianco
Stacy Jong
Jonathan Browne
S. Fox
L. Meyers
Peter R. Wurman
Peter Stone
128
50
0
20 Oct 2020
Proximal Policy Gradient: PPO with Policy Gradient
Proximal Policy Gradient: PPO with Policy Gradient
Ju-Seung Byun
Byungmoon Kim
Huamin Wang
OffRL
46
8
0
20 Oct 2020
How much progress have we made in neural network training? A New
  Evaluation Protocol for Benchmarking Optimizers
How much progress have we made in neural network training? A New Evaluation Protocol for Benchmarking Optimizers
Yuanhao Xiong
Xuanqing Liu
Li-Cheng Lan
Yang You
Si Si
Cho-Jui Hsieh
OOD
99
1
0
19 Oct 2020
SMARTS: Scalable Multi-Agent Reinforcement Learning Training School for
  Autonomous Driving
SMARTS: Scalable Multi-Agent Reinforcement Learning Training School for Autonomous Driving
Ming Zhou
Jun Luo
Julian Villela
Yaodong Yang
David Rusu
...
H. Ammar
Hongbo Zhang
Wulong Liu
Jianye Hao
Jun Wang
187
198
0
19 Oct 2020
Learning by Competition of Self-Interested Reinforcement Learning Agents
Learning by Competition of Self-Interested Reinforcement Learning Agents
Stephen Chung
60
5
0
19 Oct 2020
Model-based Policy Optimization with Unsupervised Model Adaptation
Model-based Policy Optimization with Unsupervised Model Adaptation
Jian Shen
Han Zhao
Weinan Zhang
Yong Yu
114
28
0
19 Oct 2020
What About Inputing Policy in Value Function: Policy Representation and
  Policy-extended Value Function Approximator
What About Inputing Policy in Value Function: Policy Representation and Policy-extended Value Function Approximator
Hongyao Tang
Zhaopeng Meng
Jianye Hao
Chong Chen
D. Graves
...
Hangyu Mao
Wulong Liu
Yaodong Yang
Wenyuan Tao
Li Wang
OffRL
82
7
0
19 Oct 2020
Chance-Constrained Control with Lexicographic Deep Reinforcement
  Learning
Chance-Constrained Control with Lexicographic Deep Reinforcement Learning
Alessandro Giuseppi
A. Pietrabissa
33
7
0
19 Oct 2020
Evaluating the Safety of Deep Reinforcement Learning Models using
  Semi-Formal Verification
Evaluating the Safety of Deep Reinforcement Learning Models using Semi-Formal Verification
Davide Corsi
Enrico Marchesini
Alessandro Farinelli
OffRL
29
2
0
19 Oct 2020
Softmax Deep Double Deterministic Policy Gradients
Softmax Deep Double Deterministic Policy Gradients
Ling Pan
Qingpeng Cai
Longbo Huang
118
93
0
19 Oct 2020
D2RL: Deep Dense Architectures in Reinforcement Learning
D2RL: Deep Dense Architectures in Reinforcement Learning
Samarth Sinha
Homanga Bharadhwaj
A. Srinivas
Animesh Garg
OffRLAI4CE
122
56
0
19 Oct 2020
Robot Navigation in Constrained Pedestrian Environments using
  Reinforcement Learning
Robot Navigation in Constrained Pedestrian Environments using Reinforcement Learning
Claudia Pérez-DÁrpino
Can Liu
P. Goebel
Roberto Martín-Martín
Silvio Savarese
103
68
0
16 Oct 2020
Few-shot model-based adaptation in noisy conditions
Few-shot model-based adaptation in noisy conditions
Karol Arndt
Ali Ghadirzadeh
Murtaza Hazara
Ville Kyrki
62
8
0
16 Oct 2020
On the Guaranteed Almost Equivalence between Imitation Learning from
  Observation and Demonstration
On the Guaranteed Almost Equivalence between Imitation Learning from Observation and Demonstration
Zhihao Cheng
Liu Liu
Aishan Liu
Hao Sun
Meng Fang
Dacheng Tao
40
10
0
16 Oct 2020
An Empowerment-based Solution to Robotic Manipulation Tasks with Sparse
  Rewards
An Empowerment-based Solution to Robotic Manipulation Tasks with Sparse Rewards
Siyu Dai
Wenyuan Xu
Andreas G. Hofmann
B. Williams
97
8
0
15 Oct 2020
Previous
123...313233...505152
Next