ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1606.01540
  4. Cited By
OpenAI Gym

OpenAI Gym

5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
    OffRLODL
ArXiv (abs)PDFHTML

Papers citing "OpenAI Gym"

50 / 2,578 papers shown
Title
Generalized Hindsight for Reinforcement Learning
Generalized Hindsight for Reinforcement Learning
Alexander C. Li
Lerrel Pinto
Pieter Abbeel
67
70
0
26 Feb 2020
Whole-Body Control of a Mobile Manipulator using End-to-End
  Reinforcement Learning
Whole-Body Control of a Mobile Manipulator using End-to-End Reinforcement Learning
Julien Kindle
Fadri Furrer
Tonci Novkovic
Jen Jen Chung
Roland Siegwart
Juan I. Nieto
OffRL
96
30
0
25 Feb 2020
TanksWorld: A Multi-Agent Environment for AI Safety Research
TanksWorld: A Multi-Agent Environment for AI Safety Research
Corban G. Rivera
Olivia Lyons
Arielle Summitt
Ayman Fatima
J. Pak
...
R. Chalmers
Aryeh Englander
Edward W. Staley
I. Wang
Ashley J. Llorens
23
2
0
25 Feb 2020
Simultaneously Evolving Deep Reinforcement Learning Models using
  Multifactorial Optimization
Simultaneously Evolving Deep Reinforcement Learning Models using Multifactorial Optimization
Aritz D. Martinez
E. Osaba
Javier Del Ser
Francisco Herrera
71
10
0
25 Feb 2020
Off-Policy Deep Reinforcement Learning with Analogous Disentangled
  Exploration
Off-Policy Deep Reinforcement Learning with Analogous Disentangled Exploration
Hoang Trung-Dung
Yitao Liang
Guy Van den Broeck
OffRL
71
3
0
25 Feb 2020
CybORG: An Autonomous Cyber Operations Research Gym
CybORG: An Autonomous Cyber Operations Research Gym
Callum Baillie
Maxwell Standen
Jonathon Schwartz
Michael Docking
David Bowman
Junae Kim
42
30
0
25 Feb 2020
Safe reinforcement learning for probabilistic reachability and safety
  specifications: A Lyapunov-based approach
Safe reinforcement learning for probabilistic reachability and safety specifications: A Lyapunov-based approach
Subin Huh
Insoon Yang
76
20
0
24 Feb 2020
Behavior Cloning in OpenAI using Case Based Reasoning
Behavior Cloning in OpenAI using Case Based Reasoning
C. Peters
B. Esfandiari
Mohamad Zalat
Robert West
OffRL
17
0
0
23 Feb 2020
Estimating Q(s,s') with Deep Deterministic Dynamics Gradients
Estimating Q(s,s') with Deep Deterministic Dynamics Gradients
Ashley D. Edwards
Himanshu Sahni
Rosanne Liu
Jane Hung
Ankit Jain
Rui Wang
Adrien Ecoffet
Thomas Miconi
Charles Isbell
J. Yosinski
OffRL
48
18
0
21 Feb 2020
On the Search for Feedback in Reinforcement Learning
On the Search for Feedback in Reinforcement Learning
Ran A. Wang
Karthikeya S. Parunandi
Aayushman Sharma
R. Goyal
S. Chakravorty
53
9
0
21 Feb 2020
Accelerating Reinforcement Learning with a
  Directional-Gaussian-Smoothing Evolution Strategy
Accelerating Reinforcement Learning with a Directional-Gaussian-Smoothing Evolution Strategy
Jiaxing Zhang
Hoang Tran
Guannan Zhang
68
9
0
21 Feb 2020
Support-weighted Adversarial Imitation Learning
Support-weighted Adversarial Imitation Learning
Ruohan Wang
C. Ciliberto
P. Amadori
Y. Demiris
39
4
0
20 Feb 2020
Adaptive Temporal Difference Learning with Linear Function Approximation
Adaptive Temporal Difference Learning with Linear Function Approximation
Tao Sun
Han Shen
Tianyi Chen
Dongsheng Li
77
23
0
20 Feb 2020
Using AI for Mitigating the Impact of Network Delay in Cloud-based
  Intelligent Traffic Signal Control
Using AI for Mitigating the Impact of Network Delay in Cloud-based Intelligent Traffic Signal Control
Rusheng Zhang
Xinze Zhou
Ozan K. Tonguz
36
1
0
19 Feb 2020
Informative Path Planning for Mobile Sensing with Reinforcement Learning
Informative Path Planning for Mobile Sensing with Reinforcement Learning
Yongyong Wei
Rong Zheng
81
34
0
18 Feb 2020
Adaptive Estimator Selection for Off-Policy Evaluation
Adaptive Estimator Selection for Off-Policy Evaluation
Yi-Hsun Su
Pavithra Srinath
A. Krishnamurthy
OffRL
62
48
0
18 Feb 2020
DISCO: Double Likelihood-free Inference Stochastic Control
DISCO: Double Likelihood-free Inference Stochastic Control
Lucas Barcelos
Rafael Oliveira
Rafael Possas
Lionel Ott
Fabio Ramos
33
11
0
18 Feb 2020
Kalman meets Bellman: Improving Policy Evaluation through Value Tracking
Kalman meets Bellman: Improving Policy Evaluation through Value Tracking
Shirli Di-Castro Shashua
Shie Mannor
OffRL
50
12
0
17 Feb 2020
Adaptive Experience Selection for Policy Gradient
Adaptive Experience Selection for Policy Gradient
S. Mohamad
Giovanni Montana
104
0
0
17 Feb 2020
Universal Value Density Estimation for Imitation Learning and
  Goal-Conditioned Reinforcement Learning
Universal Value Density Estimation for Imitation Learning and Goal-Conditioned Reinforcement Learning
Yannick Schroecker
Charles Isbell
OffRL
88
13
0
15 Feb 2020
PDDLGym: Gym Environments from PDDL Problems
PDDLGym: Gym Environments from PDDL Problems
Tom Silver
Rohan Chitnis
AI4CE
105
57
0
15 Feb 2020
Applying Depth-Sensing to Automated Surgical Manipulation with a da
  Vinci Robot
Applying Depth-Sensing to Automated Surgical Manipulation with a da Vinci Robot
M. Hwang
Daniel Seita
Brijen Thananjeyan
Jeffrey Ichnowski
Samuel Paradis
Danyal Fer
Thomas Low
Ken Goldberg
125
31
0
15 Feb 2020
Robust Reinforcement Learning via Adversarial training with Langevin
  Dynamics
Robust Reinforcement Learning via Adversarial training with Langevin Dynamics
Parameswaran Kamalaruban
Yu-ting Huang
Ya-Ping Hsieh
Paul Rolland
C. Shi
Volkan Cevher
103
61
0
14 Feb 2020
Learning Functionally Decomposed Hierarchies for Continuous Control
  Tasks with Path Planning
Learning Functionally Decomposed Hierarchies for Continuous Control Tasks with Path Planning
Sammy Christen
Lukás Jendele
Emre Aksan
Otmar Hilliges
OffRL
78
25
0
14 Feb 2020
A Framework for End-to-End Learning on Semantic Tree-Structured Data
A Framework for End-to-End Learning on Semantic Tree-Structured Data
William Woof
Ke Chen
40
3
0
13 Feb 2020
XCS Classifier System with Experience Replay
XCS Classifier System with Experience Replay
Anthony Stein
Roland Maier
Lukas Rosenbauer
J. Hähner
BDL
35
21
0
13 Feb 2020
Effective Reinforcement Learning through Evolutionary Surrogate-Assisted
  Prescription
Effective Reinforcement Learning through Evolutionary Surrogate-Assisted Prescription
Olivier Francon
Santiago Gonzalez
Babak Hodjat
Elliot Meyerson
Risto Miikkulainen
Xin Qiu
Hormoz Shahrzad
80
17
0
13 Feb 2020
Learning to Generate Levels From Nothing
Learning to Generate Levels From Nothing
Philip Bontrager
Julian Togelius
GAN
61
22
0
12 Feb 2020
Multi-task Reinforcement Learning with a Planning Quasi-Metric
Multi-task Reinforcement Learning with a Planning Quasi-Metric
Vincent Micheli
Karthigan Sinnathamby
Franccois Fleuret
70
2
0
08 Feb 2020
Capsule Network Performance with Autonomous Navigation
Capsule Network Performance with Autonomous Navigation
Tom Molnar
Eugenio Culurciello
3DPC
25
2
0
08 Feb 2020
BitPruning: Learning Bitlengths for Aggressive and Accurate Quantization
BitPruning: Learning Bitlengths for Aggressive and Accurate Quantization
Milovs Nikolić
G. B. Hacene
Ciaran Bannon
Alberto Delmas Lascorz
Matthieu Courbariaux
Yoshua Bengio
Vincent Gripon
Andreas Moshovos
MQ
66
25
0
08 Feb 2020
Representation of Reinforcement Learning Policies in Reproducing Kernel
  Hilbert Spaces
Representation of Reinforcement Learning Policies in Reproducing Kernel Hilbert Spaces
Bogdan Mazoure
T. Doan
Tianyu Li
V. Makarenkov
Joelle Pineau
Doina Precup
Guillaume Rabusseau
OffRL
67
1
0
07 Feb 2020
Off-policy Maximum Entropy Reinforcement Learning : Soft Actor-Critic
  with Advantage Weighted Mixture Policy(SAC-AWMP)
Off-policy Maximum Entropy Reinforcement Learning : Soft Actor-Critic with Advantage Weighted Mixture Policy(SAC-AWMP)
Zhimin Hou
Kuangen Zhang
Yi Wan
Dongyu Li
Chenglong Fu
Haoyong Yu
103
15
0
07 Feb 2020
Accelerating Reinforcement Learning for Reaching using Continuous
  Curriculum Learning
Accelerating Reinforcement Learning for Reaching using Continuous Curriculum Learning
Sha Luo
Hamidreza Kasaei
Lambert Schomaker
CLL
92
46
0
07 Feb 2020
Ready Policy One: World Building Through Active Learning
Ready Policy One: World Building Through Active Learning
Philip J. Ball
Jack Parker-Holder
Aldo Pacchiano
K. Choromanski
Stephen J. Roberts
OffRL
92
49
0
07 Feb 2020
Mutual Information-based State-Control for Intrinsically Motivated
  Reinforcement Learning
Mutual Information-based State-Control for Intrinsically Motivated Reinforcement Learning
Rui Zhao
Yang Gao
Pieter Abbeel
Volker Tresp
Wenyuan Xu
SSL
57
4
0
05 Feb 2020
Deep Radial-Basis Value Functions for Continuous Control
Deep Radial-Basis Value Functions for Continuous Control
Kavosh Asadi
Neev Parikh
Ronald E. Parr
George Konidaris
Michael L. Littman
44
4
0
05 Feb 2020
Learning rewards for robotic ultrasound scanning using probabilistic
  temporal ranking
Learning rewards for robotic ultrasound scanning using probabilistic temporal ranking
Michael G. Burke
Katie Lu
Daniel Angelov
Artūras Straižys
Craig Innes
Kartic Subr
S. Ramamoorthy
54
11
0
04 Feb 2020
Effective Diversity in Population Based Reinforcement Learning
Effective Diversity in Population Based Reinforcement Learning
Jack Parker-Holder
Aldo Pacchiano
K. Choromanski
Stephen J. Roberts
130
165
0
03 Feb 2020
Evolving Neural Networks through a Reverse Encoding Tree
Evolving Neural Networks through a Reverse Encoding Tree
Haoling Zhang
Chao-Han Huck Yang
Hector Zenil
N. Kiani
Yue-Hong Shen
Jesper N. Tegnér
55
5
0
03 Feb 2020
GradientDICE: Rethinking Generalized Offline Estimation of Stationary
  Values
GradientDICE: Rethinking Generalized Offline Estimation of Stationary Values
Shangtong Zhang
Bo Liu
Shimon Whiteson
OffRL
116
103
0
29 Jan 2020
An Upper Bound of the Bias of Nadaraya-Watson Kernel Regression under
  Lipschitz Assumptions
An Upper Bound of the Bias of Nadaraya-Watson Kernel Regression under Lipschitz Assumptions
Samuele Tosatto
R. Akrour
Jan Peters
64
4
0
29 Jan 2020
Rotation, Translation, and Cropping for Zero-Shot Generalization
Rotation, Translation, and Cropping for Zero-Shot Generalization
Chang Ye
Ahmed Khalifa
Philip Bontrager
Julian Togelius
93
38
0
27 Jan 2020
Challenges and Countermeasures for Adversarial Attacks on Deep
  Reinforcement Learning
Challenges and Countermeasures for Adversarial Attacks on Deep Reinforcement Learning
Inaam Ilahi
Muhammad Usama
Junaid Qadir
M. Janjua
Ala I. Al-Fuqaha
D. Hoang
Dusit Niyato
AAML
147
137
0
27 Jan 2020
PCGRL: Procedural Content Generation via Reinforcement Learning
PCGRL: Procedural Content Generation via Reinforcement Learning
Ahmed Khalifa
Philip Bontrager
Sam Earle
Julian Togelius
73
146
0
24 Jan 2020
On Simple Reactive Neural Networks for Behaviour-Based Reinforcement
  Learning
On Simple Reactive Neural Networks for Behaviour-Based Reinforcement Learning
Ameya Pore
G. Aragon-Camarasa
53
11
0
22 Jan 2020
Continuous-action Reinforcement Learning for Playing Racing Games:
  Comparing SPG to PPO
Continuous-action Reinforcement Learning for Playing Racing Games: Comparing SPG to PPO
Mario S. Holubar
M. Wiering
50
10
0
15 Jan 2020
PoPS: Policy Pruning and Shrinking for Deep Reinforcement Learning
PoPS: Policy Pruning and Shrinking for Deep Reinforcement Learning
Dor Livne
Kobi Cohen
61
52
0
14 Jan 2020
Multi-Robot Formation Control Using Reinforcement Learning
Multi-Robot Formation Control Using Reinforcement Learning
Abhay Rawat
K. Karlapalem
36
4
0
13 Jan 2020
Improving Image Autoencoder Embeddings with Perceptual Loss
Improving Image Autoencoder Embeddings with Perceptual Loss
G. Pihlgren
Fredrik Sandin
Marcus Liwicki
72
34
0
10 Jan 2020
Previous
123...383940...505152
Next