OpenAI Gym

5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL, ODL

Papers citing "OpenAI Gym"

50 / 2,578 papers shown
Title
Performance-Weighed Policy Sampling for Meta-Reinforcement Learning
Ibrahim Ahmed
Marcos Quiñones-Grueiro
G. Biswas
47
2
0
10 Dec 2020
Robust Domain Randomised Reinforcement Learning through Peer-to-Peer Distillation
Chenyang Zhao
Timothy M. Hospedales
OOD
49
16
0
09 Dec 2020
Emergence of Different Modes of Tool Use in a Reaching and Dragging Task
K. Nguyen
Yoonsuck Choe
21
0
0
08 Dec 2020
Resolving Implicit Coordination in Multi-Agent Deep Reinforcement Learning with Deep Q-Networks & Game Theory
Griffin Adams
Sarguna Padmanabhan
S. Shekhar
39
1
0
08 Dec 2020
MAP-Elites enables Powerful Stepping Stones and Diversity for Modular Robotics
Jørgen Nordmoen
Frank Veenstra
K. Ellefsen
K. Glette
40
34
0
08 Dec 2020
Proximal Policy Optimization Smoothed Algorithm
Wangshu Zhu
A. Rosendo
37
2
0
04 Dec 2020
Are Gradient-based Saliency Maps Useful in Deep Reinforcement Learning?
Matthias Rosynski
Frank Kirchner
Matias Valdenegro-Toro
FAtt
52
13
0
02 Dec 2020
Revisiting Maximum Entropy Inverse Reinforcement Learning: New Perspectives and Algorithms
Aaron J. Snoswell
Surya P. N. Singh
N. Ye
OOD
22
13
0
01 Dec 2020
Assessing and Accelerating Coverage in Deep Reinforcement Learning
Arpan Kusari
23
2
0
01 Dec 2020
IV-Posterior: Inverse Value Estimation for Interpretable Policy Certificates
Tatiana Lopez-Guevara
Michael G. Burke
Nick K. Taylor
Kartic Subr
OffRL
53
0
0
30 Nov 2020
UniCon: Universal Neural Controller For Physics-based Character Motion
Tingwu Wang
Yunrong Guo
Maria Shugrina
Sanja Fidler
80
55
0
30 Nov 2020
Optimizing the Neural Architecture of Reinforcement Learning Agents
Nina Mazyavkina
S. Moustafa
I. Trofimov
Evgeny Burnaev
AI4CE
78
4
0
30 Nov 2020
Applied Machine Learning for Games: A Graduate School Course
Yilei Zeng
Aayush Shah
Jameson Thai
M. Zyda
AI4CE
69
3
0
30 Nov 2020
Continuous Transition: Improving Sample Efficiency for Continuous Control Problems via MixUp
Junfan Lin
Zhongzhan Huang
Keze Wang
Xiaodan Liang
Weiwei Chen
Liang Lin
30
11
0
30 Nov 2020
Hybrid Imitation Learning for Real-Time Service Restoration in Resilient Distribution Systems
Yichen Zhang
F. Qiu
Tianqi Hong
Zhaoyu Wang
F. Li
23
26
0
29 Nov 2020
A survey of benchmarking frameworks for reinforcement learning
B. Stapelberg
K. Malan
OffRL
46
3
0
27 Nov 2020
Learning from Simulation, Racing in Reality
Eugenio Chisari
Alexander Liniger
Alisa Rupenyan
Luc Van Gool
John Lygeros
83
25
0
26 Nov 2020
Reinforcement Learning for Robust Missile Autopilot Design
Bernardo Cortez
20
2
0
26 Nov 2020
C-Learning: Horizon-Aware Cumulative Accessibility Estimation
Panteha Naderian
Gabriel Loaiza-Ganem
H. Braviner
Anthony L. Caterini
Jesse C. Cresswell
Tong Li
Animesh Garg
80
1
0
24 Nov 2020
Solving The Lunar Lander Problem under Uncertainty using Reinforcement Learning
Soham Gadgil
Yunfeng Xin
Chengzhe Xu
27
12
0
24 Nov 2020
An analysis of Reinforcement Learning applied to Coach task in IEEE Very Small Size Soccer
Carlos H. C. Pena
Mateus G. Machado
M. S. Barros
José D. P. Silva
Lucas D. Maciel
Ing Ren Tsang
Edna N. S. Barros
Pedro H. M. Braga
H. Bassani
57
2
0
23 Nov 2020
Generative Adversarial Simulator
Jonathan Raiman
GAN
19
0
0
23 Nov 2020
Double Meta-Learning for Data Efficient Policy Optimization in Non-Stationary Environments
Elahe Aghapour
Nora Ayanian
OffRL
42
4
0
21 Nov 2020
Inverse Constrained Reinforcement Learning
Usman Anwar
Shehryar Malik
Alireza Aghasi
Ali Ahmed
103
59
0
19 Nov 2020
FinRL: A Deep Reinforcement Learning Library for Automated Stock Trading in Quantitative Finance
Xiao-Yang Liu
Hongyang Yang
Qian Chen
Runjia Zhang
Liuqing Yang
Bowen Xiao
Chris Wang
AIFin, OffRL
88
127
0
19 Nov 2020
SAFARI: Safe and Active Robot Imitation Learning with Imagination
Norman Di Palo
Edward Johns
74
8
0
18 Nov 2020
Weighted Entropy Modification for Soft Actor-Critic
Yizhou Zhao
Song-Chun Zhu
35
1
0
18 Nov 2020
An analytical diabolo model for robotic learning and control
Felix von Drigalski
D. Joshi
Takayuki Murooka
Kazutoshi Tanaka
Masashi Hamaya
Yoshihisa Ijiri
60
7
0
18 Nov 2020
A User's Guide to Calibrating Robotics Simulators
Bhairav Mehta
Ankur Handa
Dieter Fox
F. Ramos
55
12
0
17 Nov 2020
Efficient Exploration of Reward Functions in Inverse Reinforcement Learning via Bayesian Optimization
Sreejith Balakrishnan
Q. Nguyen
Bryan Kian Hsiang Low
Harold Soh
85
26
0
17 Nov 2020
NLPGym -- A toolkit for evaluating RL agents on Natural Language Processing Tasks
Rajkumar Ramamurthy
R. Sifa
Christian Bauckhage
54
5
0
16 Nov 2020
Hierarchical clustering in particle physics through reinforcement learning
Johann Brehmer
S. Macaluso
D. Pappadopulo
Kyle Cranmer
39
6
0
16 Nov 2020
Blind Decision Making: Reinforcement Learning with Delayed Observations
Mridul Agarwal
Vaneet Aggarwal
OffRL
39
23
0
16 Nov 2020
Stein Variational Model Predictive Control
Alexander Lambert
Adam Fishman
Dieter Fox
Byron Boots
F. Ramos
110
62
0
15 Nov 2020
CDT: Cascading Decision Trees for Explainable Reinforcement Learning
Zihan Ding
Pablo Hernandez-Leal
G. Ding
Changjian Li
Ruitong Huang
65
22
0
15 Nov 2020
Tonic: A Deep Reinforcement Learning Library for Fast Prototyping and Benchmarking
Fabio Pardo
OffRL
60
31
0
15 Nov 2020
Convex Optimization with an Interpolation-based Projection and its Application to Deep Learning
R. Akrour
Asma Atamna
Jan Peters
19
3
0
13 Nov 2020
Reinforcement Learning Control of Constrained Dynamic Systems with Uniformly Ultimate Boundedness Stability Guarantee
Minghao Han
Yuan Tian
Lixian Zhang
Jun Wang
Wei Pan
68
49
0
13 Nov 2020
Critic PI2: Master Continuous Planning via Policy Improvement with Path Integrals and Deep Actor-Critic Reinforcement Learning
Jiajun Fan
He Ba
Xian Guo
Jianye Hao
OffRL
49
5
0
13 Nov 2020
Steady State Analysis of Episodic Reinforcement Learning
Bojun Huang
OffRL
54
23
0
12 Nov 2020
Learning Latent Representations to Influence Multi-Agent Interaction
Annie Xie
Dylan P. Losey
R. Tolsma
Chelsea Finn
Dorsa Sadigh
DRL
192
134
0
12 Nov 2020
Reinforcement Learning with Videos: Combining Offline Observations with Interaction
Karl Schmeckpeper
Oleh Rybkin
Kostas Daniilidis
Sergey Levine
Chelsea Finn
OffRL
111
107
0
12 Nov 2020
Joint Space Control via Deep Reinforcement Learning
Visak C. V. Kumar
David Hoeller
Balakumar Sundaralingam
Jonathan Tremblay
Stan Birchfield
DRL
85
16
0
12 Nov 2020
I Know What You Meant: Learning Human Objectives by (Under)estimating Their Choice Set
Ananth Jonnavittula
Dylan P. Losey
86
16
0
11 Nov 2020
Ecole: A Gym-like Library for Machine Learning in Combinatorial Optimization Solvers
Antoine Prouvost
Justin Dumouchelle
Lara Scavuzzo
Maxime Gasse
Didier Chételat
Andrea Lodi
OffRL
106
50
0
11 Nov 2020
Accounting for Human Learning when Inferring Human Preferences
Harry Giles
Lawrence Chan
OffRL
12
0
0
11 Nov 2020
What Did You Think Would Happen? Explaining Agent Behaviour Through Intended Outcomes
Herman Yau
Chris Russell
Simon Hadfield
FAtt, LRM
58
38
0
10 Nov 2020
Model-based Reinforcement Learning from Signal Temporal Logic Specifications
Parv Kapoor
Anand Balakrishnan
Jyotirmoy V. Deshmukh
74
21
0
10 Nov 2020
f-IRL: Inverse Reinforcement Learning via State Marginal Matching
Tianwei Ni
Harshit S. Sikchi
Yufei Wang
Tejus Gupta
Lisa Lee
Benjamin Eysenbach
93
73
0
09 Nov 2020
Deep reinforcement learning for RAN optimization and control
Yu Chen
Jie Chen
G. Krishnamurthi
Huijing Yang
Huahui Wang
Wenjie Zhao
39
1
0
09 Nov 2020