ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1606.01540
  4. Cited By
OpenAI Gym

OpenAI Gym

5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
    OffRLODL
ArXiv (abs)PDFHTML

Papers citing "OpenAI Gym"

50 / 2,578 papers shown
Title
AvE: Assistance via Empowerment
AvE: Assistance via Empowerment
Yuqing Du
Stas Tiomkin
Emre Kıcıman
Daniel Polani
Pieter Abbeel
Anca Dragan
113
35
0
26 Jun 2020
A Closer Look at Invalid Action Masking in Policy Gradient Algorithms
A Closer Look at Invalid Action Masking in Policy Gradient Algorithms
Shengyi Huang
Santiago Ontañón
108
329
0
25 Jun 2020
Strictly Batch Imitation Learning by Energy-based Distribution Matching
Strictly Batch Imitation Learning by Energy-based Distribution Matching
Daniel Jarrett
Ioana Bica
M. Schaar
OffRL
84
63
0
25 Jun 2020
Learning Reward Functions from Diverse Sources of Human Feedback:
  Optimally Integrating Demonstrations and Preferences
Learning Reward Functions from Diverse Sources of Human Feedback: Optimally Integrating Demonstrations and Preferences
Erdem Biyik
Dylan P. Losey
Malayandi Palan
Nicholas C. Landolfi
Gleb Shevchuk
Dorsa Sadigh
88
118
0
24 Jun 2020
Off-Dynamics Reinforcement Learning: Training for Transfer with Domain
  Classifiers
Off-Dynamics Reinforcement Learning: Training for Transfer with Domain Classifiers
Benjamin Eysenbach
Swapnil Asawa
Shreyas Chaudhari
Sergey Levine
Ruslan Salakhutdinov
108
94
0
24 Jun 2020
Experience Replay with Likelihood-free Importance Weights
Experience Replay with Likelihood-free Importance Weights
Samarth Sinha
Jiaming Song
Animesh Garg
Stefano Ermon
OffRL
105
58
0
23 Jun 2020
The Effect of Multi-step Methods on Overestimation in Deep Reinforcement
  Learning
The Effect of Multi-step Methods on Overestimation in Deep Reinforcement Learning
Lingheng Meng
R. Gorbet
Dana Kulic
OffRL
76
27
0
23 Jun 2020
Neural Dynamical Systems: Balancing Structure and Flexibility in
  Physical Prediction
Neural Dynamical Systems: Balancing Structure and Flexibility in Physical Prediction
Viraj Mehta
I. Char
Willie Neiswanger
Youngseog Chung
A. Nelson
M. Boyer
E. Kolemen
J. Schneider
AI4CE
45
28
0
23 Jun 2020
Ecological Reinforcement Learning
Ecological Reinforcement Learning
John D. Co-Reyes
Suvansh Sanjeev
Glen Berseth
Abhishek Gupta
Sergey Levine
OffRL
105
23
0
22 Jun 2020
dm_control: Software and Tasks for Continuous Control
dm_control: Software and Tasks for Continuous Control
Yuval Tassa
S. Tunyasuvunakool
Alistair Muldal
Yotam Doron
Piotr Trochim
...
Steven Bohez
J. Merel
Tom Erez
Timothy Lillicrap
N. Heess
LM&Ro
153
419
0
22 Jun 2020
Safe Reinforcement Learning via Curriculum Induction
Safe Reinforcement Learning via Curriculum Induction
M. Turchetta
Andrey Kolobov
S. Shah
Andreas Krause
Alekh Agarwal
66
93
0
22 Jun 2020
Sample Factory: Egocentric 3D Control from Pixels at 100000 FPS with
  Asynchronous Reinforcement Learning
Sample Factory: Egocentric 3D Control from Pixels at 100000 FPS with Asynchronous Reinforcement Learning
Aleksei Petrenko
Zhehui Huang
T. Kumar
Gaurav Sukhatme
V. Koltun
103
105
0
21 Jun 2020
Accelerating Safe Reinforcement Learning with Constraint-mismatched
  Policies
Accelerating Safe Reinforcement Learning with Constraint-mismatched Policies
Tsung-Yen Yang
Justinian P. Rosca
Karthik Narasimhan
Peter J. Ramadge
103
19
0
20 Jun 2020
Robust Reinforcement Learning using Least Squares Policy Iteration with
  Provable Performance Guarantees
Robust Reinforcement Learning using Least Squares Policy Iteration with Provable Performance Guarantees
Kishan Panaganti
D. Kalathil
119
4
0
20 Jun 2020
Task-Agnostic Online Reinforcement Learning with an Infinite Mixture of
  Gaussian Processes
Task-Agnostic Online Reinforcement Learning with an Infinite Mixture of Gaussian Processes
Mengdi Xu
Wenhao Ding
Jiacheng Zhu
Zuxin Liu
Baiming Chen
Ding Zhao
CLLOffRL
102
35
0
19 Jun 2020
NROWAN-DQN: A Stable Noisy Network with Noise Reduction and Online
  Weight Adjustment for Exploration
NROWAN-DQN: A Stable Noisy Network with Noise Reduction and Online Weight Adjustment for Exploration
Shuai Han
Wenbo Zhou
Jing Liu
Shuai Lu
43
28
0
19 Jun 2020
Learn to Earn: Enabling Coordination within a Ride Hailing Fleet
Learn to Earn: Enabling Coordination within a Ride Hailing Fleet
Harshal A. Chaudhari
J. Byers
Evimaria Terzi
77
6
0
19 Jun 2020
An adaptive stochastic gradient-free approach for high-dimensional
  blackbox optimization
An adaptive stochastic gradient-free approach for high-dimensional blackbox optimization
Anton Dereventsov
Clayton Webster
Joseph Daws
65
10
0
18 Jun 2020
Reparameterized Variational Divergence Minimization for Stable Imitation
Reparameterized Variational Divergence Minimization for Stable Imitation
Dilip Arumugam
Debadeepta Dey
Alekh Agarwal
Asli Celikyilmaz
E. Nouri
W. Dolan
50
3
0
18 Jun 2020
Deep Reinforcement Learning amidst Lifelong Non-Stationarity
Deep Reinforcement Learning amidst Lifelong Non-Stationarity
Annie Xie
James Harrison
Chelsea Finn
CLLOffRL
94
65
0
18 Jun 2020
Learning to Track Dynamic Targets in Partially Known Environments
Learning to Track Dynamic Targets in Partially Known Environments
Heejin Jeong
Hamed Hassani
M. Morari
Daniel D. Lee
George J. Pappas
73
11
0
17 Jun 2020
Forgetful Experience Replay in Hierarchical Reinforcement Learning from
  Demonstrations
Forgetful Experience Replay in Hierarchical Reinforcement Learning from Demonstrations
Alexey Skrynnik
A. Staroverov
Ermek Aitygulov
Kirill Aksenov
Vasilii Davydov
Aleksandr I. Panov
OffRL
66
4
0
17 Jun 2020
Deep Reinforcement Learning Controller for 3D Path-following and
  Collision Avoidance by Autonomous Underwater Vehicles
Deep Reinforcement Learning Controller for 3D Path-following and Collision Avoidance by Autonomous Underwater Vehicles
Simen Theie Havenstrom
Adil Rasheed
Omer San
AI4CE
16
39
0
17 Jun 2020
Data Driven Control with Learned Dynamics: Model-Based versus Model-Free
  Approach
Data Driven Control with Learned Dynamics: Model-Based versus Model-Free Approach
Wenjian Hao
Yiqiang Han
45
6
0
16 Jun 2020
COLREG-Compliant Collision Avoidance for Unmanned Surface Vehicle using
  Deep Reinforcement Learning
COLREG-Compliant Collision Avoidance for Unmanned Surface Vehicle using Deep Reinforcement Learning
Eivind Meyer
Amalie Heiberg
Adil Rasheed
Omer San
76
75
0
16 Jun 2020
Untangling tradeoffs between recurrence and self-attention in neural
  networks
Untangling tradeoffs between recurrence and self-attention in neural networks
Giancarlo Kerg
Bhargav Kanuparthi
Anirudh Goyal
Kyle Goyette
Yoshua Bengio
Guillaume Lajoie
57
9
0
16 Jun 2020
Model Embedding Model-Based Reinforcement Learning
Model Embedding Model-Based Reinforcement Learning
Xiao Tan
Chao Qu
Junwu Xiong
James Y. Zhang
OffRL
32
0
0
16 Jun 2020
$Q$-learning with Logarithmic Regret
QQQ-learning with Logarithmic Regret
Kunhe Yang
Lin F. Yang
S. Du
102
62
0
16 Jun 2020
Model-based Adversarial Meta-Reinforcement Learning
Model-based Adversarial Meta-Reinforcement Learning
Zichuan Lin
G. Thomas
Guangwen Yang
Tengyu Ma
OOD
74
52
0
16 Jun 2020
Designing high-fidelity multi-qubit gates for semiconductor quantum dots
  through deep reinforcement learning
Designing high-fidelity multi-qubit gates for semiconductor quantum dots through deep reinforcement learning
Sahar Daraeizadeh
S. Premaratne
A. Matsuura
43
5
0
15 Jun 2020
Analytic Manifold Learning: Unifying and Evaluating Representations for
  Continuous Control
Analytic Manifold Learning: Unifying and Evaluating Representations for Continuous Control
Rika Antonova
Maksim Maydanskiy
Danica Kragic
Sam Devlin
Katja Hofmann
76
9
0
15 Jun 2020
Provably Efficient Model-based Policy Adaptation
Provably Efficient Model-based Policy Adaptation
Yuda Song
Aditi Mavalankar
Wen Sun
Sicun Gao
TTAOffRL
74
10
0
14 Jun 2020
Optimistic Distributionally Robust Policy Optimization
Optimistic Distributionally Robust Policy Optimization
Jun Song
Chaoyue Zhao
46
12
0
14 Jun 2020
Reinforcement Learning with Supervision from Noisy Demonstrations
Reinforcement Learning with Supervision from Noisy Demonstrations
Kun-Peng Ning
Sheng-Jun Huang
49
7
0
14 Jun 2020
Online Hyper-parameter Tuning in Off-policy Learning via Evolutionary
  Strategies
Online Hyper-parameter Tuning in Off-policy Learning via Evolutionary Strategies
Yunhao Tang
K. Choromanski
OffRL
38
14
0
13 Jun 2020
Hindsight Expectation Maximization for Goal-conditioned Reinforcement
  Learning
Hindsight Expectation Maximization for Goal-conditioned Reinforcement Learning
Yunhao Tang
A. Kucukelbir
OffRL
68
16
0
13 Jun 2020
Human and Multi-Agent collaboration in a human-MARL teaming framework
Human and Multi-Agent collaboration in a human-MARL teaming framework
N. Navidi
Francois Chabot
Sagar Kurandwad
Irv Lustigman
Vincent Robert
Gregory Szriftgiser
Andrea Schuch
33
2
0
12 Jun 2020
Data-driven Koopman Operators for Model-based Shared Control of
  Human-Machine Systems
Data-driven Koopman Operators for Model-based Shared Control of Human-Machine Systems
Alexander Broad
Ian Abraham
Todd Murphey
B. Argall
77
37
0
12 Jun 2020
Meta-Reinforcement Learning Robust to Distributional Shift via Model
  Identification and Experience Relabeling
Meta-Reinforcement Learning Robust to Distributional Shift via Model Identification and Experience Relabeling
Russell Mendonca
Xinyang Geng
Chelsea Finn
Sergey Levine
OODOffRL
93
40
0
12 Jun 2020
SAMBA: Safe Model-Based & Active Reinforcement Learning
SAMBA: Safe Model-Based & Active Reinforcement Learning
Alexander I. Cowen-Rivers
Daniel Palenicek
Vincent Moens
Mohammed Abdullah
Aivar Sootla
Jun Wang
Haitham Bou-Ammar
75
45
0
12 Jun 2020
From proprioception to long-horizon planning in novel environments: A
  hierarchical RL model
From proprioception to long-horizon planning in novel environments: A hierarchical RL model
Nishad Gothoskar
Miguel Lázaro-Gredilla
Dileep George
33
0
0
11 Jun 2020
Zeroth-Order Supervised Policy Improvement
Zeroth-Order Supervised Policy Improvement
Hao Sun
Ziping Xu
Yuhang Song
Meng Fang
Jiechao Xiong
Bo Dai
Bolei Zhou
OffRL
59
9
0
11 Jun 2020
Exploration by Maximizing Rényi Entropy for Reward-Free RL Framework
Exploration by Maximizing Rényi Entropy for Reward-Free RL Framework
Wei Shen
Yuanying Cai
Longbo Huang
Jian Li
OffRL
50
1
0
11 Jun 2020
Borrowing From the Future: Addressing Double Sampling in Model-free
  Control
Borrowing From the Future: Addressing Double Sampling in Model-free Control
Yuhua Zhu
Zachary Izzo
Lexing Ying
32
4
0
11 Jun 2020
Report from the NSF Future Directions Workshop, Toward User-Oriented
  Agents: Research Directions and Challenges
Report from the NSF Future Directions Workshop, Toward User-Oriented Agents: Research Directions and Challenges
M. Eskénazi
Tiancheng Zhao
LLMAGAI4TSAI4CE
90
9
0
10 Jun 2020
What Matters In On-Policy Reinforcement Learning? A Large-Scale
  Empirical Study
What Matters In On-Policy Reinforcement Learning? A Large-Scale Empirical Study
Marcin Andrychowicz
Anton Raichuk
Piotr Stańczyk
Manu Orsini
Sertan Girgin
...
Matthieu Geist
Olivier Pietquin
Marcin Michalski
Sylvain Gelly
Olivier Bachem
OffRL
92
225
0
10 Jun 2020
Neuroevolution in Deep Neural Networks: Current Trends and Future
  Challenges
Neuroevolution in Deep Neural Networks: Current Trends and Future Challenges
E. Galván
P. Mooney
111
136
0
09 Jun 2020
Distributed Learning on Heterogeneous Resource-Constrained Devices
Distributed Learning on Heterogeneous Resource-Constrained Devices
Martin Rapp
R. Khalili
J. Henkel
FedML
68
7
0
09 Jun 2020
AR-DAE: Towards Unbiased Neural Entropy Gradient Estimation
AR-DAE: Towards Unbiased Neural Entropy Gradient Estimation
Jae Hyun Lim
Aaron Courville
C. Pal
Chin-Wei Huang
DRL
71
23
0
09 Jun 2020
Primal Wasserstein Imitation Learning
Primal Wasserstein Imitation Learning
Robert Dadashi
Léonard Hussenot
Matthieu Geist
Olivier Pietquin
105
129
0
08 Jun 2020
Previous
123...353637...505152
Next