OpenAI Gym

5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
    OffRL, ODL

Papers citing "OpenAI Gym"

50 / 2,578 papers shown
Title
Regularizing Model-Based Planning with Energy-Based Models
Rinu Boney
Arno Solin
Alexander Ilin
76
18
0
12 Oct 2019
Assistive Gym: A Physics Simulation Framework for Assistive Robotics
Zackory M. Erickson
Vamsee Gangaram
Ariel Kapusta
Chenxi Liu
Charles C. Kemp
106
111
0
10 Oct 2019
Imitation Learning from Observations by Minimizing Inverse Dynamics Disagreement
Chao Yang
Xiaojian Ma
Wenbing Huang
F. Sun
Huaping Liu
Junzhou Huang
Chuang Gan
106
71
0
10 Oct 2019
RLCard: A Toolkit for Reinforcement Learning in Card Games
Daochen Zha
Kwei-Herng Lai
Yuanpu Cao
Songyi Huang
Ruzhe Wei
Junyu Guo
Helen Zhou
OffRL
82
58
0
10 Oct 2019
Asking Easy Questions: A User-Friendly Approach to Active Reward Learning
Erdem Biyik
Malayandi Palan
Nicholas C. Landolfi
Dylan P. Losey
Dorsa Sadigh
53
116
0
10 Oct 2019
Backpropagation Algorithms and Reservoir Computing in Recurrent Neural Networks for the Forecasting of Complex Spatiotemporal Dynamics
Pantelis R. Vlachas
Jaideep Pathak
Brian R. Hunt
T. Sapsis
M. Girvan
Edward Ott
Petros Koumoutsakos
AI4TS
94
400
0
09 Oct 2019
MVFST-RL: An Asynchronous RL Framework for Congestion Control with Delayed Actions
V. Sivakumar
Olivier Delalleau
Tim Rocktaschel
Alexander H. Miller
Heinrich Küttler
Nantas Nardelli
Michael G. Rabbat
Joelle Pineau
Sebastian Riedel
85
36
0
09 Oct 2019
Hierarchical Deep Double Q-Routing
Ramy E. Ali
B. Erman
Ejder Bastug
Bruce Cilli
46
17
0
09 Oct 2019
Ctrl-Z: Recovering from Instability in Reinforcement Learning
Vibhavari Dasagi
Jake Bruce
T. Peynot
Jurgen Leitner
53
10
0
09 Oct 2019
Tactical Reward Shaping: Bypassing Reinforcement Learning with Strategy-Based Goals
Yizheng Zhang
A. Rosendo
128
6
0
08 Oct 2019
Action-conditioned Benchmarking of Robotic Video Prediction Models: a Comparative Study
Manuel S. Nunes
Atabak Dehban
Plinio Moreno
J. Santos-Victor
61
12
0
07 Oct 2019
If MaxEnt RL is the Answer, What is the Question?
Benjamin Eysenbach
Sergey Levine
77
59
0
04 Oct 2019
Benchmarking Batch Deep Reinforcement Learning Algorithms
Shih-Han Chou
Wen-Yen Chang
W. Hsu
Jianlong Fu
OffRL
74
185
0
03 Oct 2019
Relationship Explainable Multi-objective Optimization Via Vector Value Function Based Reinforcement Learning
Huixin Zhan
Yongcan Cao
66
7
0
02 Oct 2019
Deep Reinforcement Learning for Single-Shot Diagnosis and Adaptation in Damaged Robots
Shresth Verma
A. Mustafa
Gaurav Agarwal
E. Imre
A. Hilton
13
7
0
02 Oct 2019
Attacking Vision-based Perception in End-to-End Autonomous Driving Models
Adith Boloor
Karthik Garimella
Xin He
C. Gill
Yevgeniy Vorobeychik
Xuan Zhang
AAML
75
108
0
02 Oct 2019
CWAE-IRL: Formulating a supervised approach to Inverse Reinforcement Learning problem
Arpan Kusari
BDL
22
0
0
02 Oct 2019
Advantage-Weighted Regression: Simple and Scalable Off-Policy Reinforcement Learning
Xue Bin Peng
Aviral Kumar
Grace Zhang
Sergey Levine
OffRL
174
570
0
01 Oct 2019
DualSMC: Tunneling Differentiable Filtering and Planning under Continuous POMDPs
Yunbo Wang
Bo Liu
Jiajun Wu
Yuke Zhu
Simon S. Du
Fei-Fei Li
Joshua B. Tenenbaum
54
8
0
28 Sep 2019
SURREAL-System: Fully-Integrated Stack for Distributed Deep Reinforcement Learning
Linxi Fan
Yuke Zhu
Jiren Zhu
Zihua Liu
Orien Zeng
Anchit Gupta
Joan Creus-Costa
Silvio Savarese
Li Fei-Fei
OffRL, GNN
89
3
0
27 Sep 2019
Playing Atari Ball Games with Hierarchical Reinforcement Learning
Hua Huang
Adrian Barbu
34
0
0
27 Sep 2019
RLBench: The Robot Learning Benchmark & Learning Environment
Stephen James
Z. Ma
David Rovick Arrojo
Andrew J. Davison
SSL, VLM, OffRL
145
563
0
26 Sep 2019
Relationship Explainable Multi-objective Reinforcement Learning with Semantic Explainability Generation
Huixin Zhan
Yongcan Cao
61
2
0
26 Sep 2019
Can $Q$-Learning with Graph Networks Learn a Generalizable Branching Heuristic for a SAT Solver?
Vitaly Kurin
Saad Godil
Shimon Whiteson
Bryan Catanzaro
NAI
76
28
0
26 Sep 2019
Invariant Transform Experience Replay: Data Augmentation for Deep Reinforcement Learning
Yijiong Lin
Jiancong Huang
Matthieu Zimmer
Yisheng Guan
Juan Rojas
Paul Weng
OffRL
51
0
0
24 Sep 2019
Deep Imitation Learning of Sequential Fabric Smoothing From an Algorithmic Supervisor
Daniel Seita
Aditya Ganapathi
Ryan Hoque
M. Hwang
Edward Cen
...
Nawid Jamali
K. Yamane
Soshi Iba
John F. Canny
Ken Goldberg
58
120
0
23 Sep 2019
Multi-task Learning and Catastrophic Forgetting in Continual Reinforcement Learning
Joao G. Ribeiro
Francisco S. Melo
João Dias
CLL
56
12
0
22 Sep 2019
How Much Do Unstated Problem Constraints Limit Deep Robotic Reinforcement Learning?
W. Lewis
Mark Moll
Lydia E. Kavraki
OffRL
45
5
0
20 Sep 2019
Visualizing Movement Control Optimization Landscapes
Perttu Hämäläinen
Juuso Toikka
Amin Babadi
Karen Liu
59
7
0
17 Sep 2019
Meta Reinforcement Learning for Sim-to-real Domain Adaptation
Karol Arndt
Murtaza Hazara
Ali Ghadirzadeh
Ville Kyrki
173
106
0
16 Sep 2019
Model Based Planning with Energy Based Models
Yilun Du
Toru Lin
Igor Mordatch
97
38
0
15 Sep 2019
Wield: Systematic Reinforcement Learning With Progressive Randomization
Michael Schaarschmidt
Kai Fricke
Eiko Yoneki
51
2
0
15 Sep 2019
State Representation Learning from Demonstration
Astrid Merckling
Michael Pearce
Loic Cressot
Stéphane Doncieux
Matthias Poloczek
OffRL
63
8
0
15 Sep 2019
Policy Prediction Network: Model-Free Behavior Policy with Model-Based Learning in Continuous Action Space
Zac Wellmer
James T. Kwok
26
0
0
15 Sep 2019
Torchmeta: A Meta-Learning library for PyTorch
T. Deleu
Tobias Würfl
Mandana Samiei
Joseph Paul Cohen
Yoshua Bengio
OffRL
74
85
0
14 Sep 2019
Selfie Drone Stick: A Natural Interface for Quadcopter Photography
Saif Alabachi
G. Sukthankar
Rahul Sukthankar
23
0
0
14 Sep 2019
A Stochastic Proximal Point Algorithm for Saddle-Point Problems
Luo Luo
Cheng Chen
Yujun Li
Guangzeng Xie
Zhihua Zhang
142
16
0
13 Sep 2019
Deep Learned Path Planning via Randomized Reward-Linked-Goals and Potential Space Applications
Tamir Blum
William Jones
Kazuya Yoshida
34
8
0
13 Sep 2019
Modeling Sensorimotor Coordination as Multi-Agent Reinforcement Learning with Differentiable Communication
Bowen Jing
William Yin
26
1
0
12 Sep 2019
Interactive Fiction Games: A Colossal Adventure
Matthew J. Hausknecht
Prithviraj Ammanabrolu
Marc-Alexandre Côté
Xingdi Yuan
LLMAG, LM&Ro, AI4CE
89
197
0
11 Sep 2019
FAT Forensics: A Python Toolbox for Algorithmic Fairness, Accountability and Transparency
Kacper Sokol
Raúl Santos-Rodríguez
Peter A. Flach
55
37
0
11 Sep 2019
Predicting optimal value functions by interpolating reward functions in scalarized multi-objective reinforcement learning
Arpan Kusari
Jonathan P. How
16
3
0
11 Sep 2019
Learning Transferable Domain Priors for Safe Exploration in Reinforcement Learning
Thommen George Karimpanal
Santu Rana
Sunil R. Gupta
T. Tran
Svetha Venkatesh
OffRL, OnRL
64
10
0
10 Sep 2019
Recommendation System-based Upper Confidence Bound for Online Advertising
Nhan Nguyen-Thanh
D. Marinca
K. Khawam
D. Rohde
Flavian Vasile
E. Lohan
Steven Martin
Dominique Quadri
OffRL
56
13
0
09 Sep 2019
Fixed-Horizon Temporal Difference Methods for Stable Reinforcement Learning
Kristopher De Asis
Alan Chan
Silviu Pitis
R. Sutton
D. Graves
64
32
0
09 Sep 2019
Partner Approximating Learners (PAL): Simulation-Accelerated Learning with Explicit Partner Modeling in Multi-Agent Domains
Florian Köpf
A. Nitsch
M. Flad
Sören Hohmann
54
2
0
09 Sep 2019
Imitation Learning from Pixel-Level Demonstrations by HashReward
Xin-Qiang Cai
Yao-Xiang Ding
Yuan Jiang
Zhi Zhou
41
10
0
09 Sep 2019
A Survey on Reproducibility by Evaluating Deep Reinforcement Learning Algorithms on Real-World Robots
Nicolai A. Lynnerup
Laura Nolling
Rasmus Hasle
J. Hallam
46
18
0
09 Sep 2019
DEAR: Deep Reinforcement Learning for Online Advertising Impression in Recommender Systems
Xiangyu Zhao
Changsheng Gu
Haoshenglun Zhang
Xiwang Yang
Xiaobing Liu
Jiliang Tang
Hui Liu
OffRL
81
102
0
09 Sep 2019
Mature GAIL: Imitation Learning for Low-level and High-dimensional Input using Global Encoder and Cost Transformation
Wonsup Shin
Hyolim Kang
Sunghoon Hong
15
0
0
07 Sep 2019