ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1606.01540
  4. Cited By
OpenAI Gym

OpenAI Gym

5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
    OffRLODL
ArXiv (abs)PDFHTML

Papers citing "OpenAI Gym"

50 / 2,578 papers shown
Title
A Novel Traffic Simulation Framework for Testing Autonomous Vehicles
  Using SUMO and CARLA
A Novel Traffic Simulation Framework for Testing Autonomous Vehicles Using SUMO and CARLA
P. Li
Arpan Kusari
D. Leblanc
43
15
0
14 Oct 2021
Extending Environments To Measure Self-Reflection In Reinforcement
  Learning
Extending Environments To Measure Self-Reflection In Reinforcement Learning
S. Alexander
Michael Castaneda
K. Compher
Oscar Martinez
68
6
0
13 Oct 2021
Next-Best-View Estimation based on Deep Reinforcement Learning for
  Active Object Classification
Next-Best-View Estimation based on Deep Reinforcement Learning for Active Object Classification
Christian Korbach
M. Solbach
Raphael Memmesheimer
Dietrich Paulus
John K. Tsotsos
EgoV
40
0
0
13 Oct 2021
A Review of the Deep Sea Treasure problem as a Multi-Objective
  Reinforcement Learning Benchmark
A Review of the Deep Sea Treasure problem as a Multi-Objective Reinforcement Learning Benchmark
Thomas Cassimon
Reinout Eyckerman
Siegfried Mercelis
Steven Latré
P. Hellinckx
59
0
0
13 Oct 2021
GridLearn: Multiagent Reinforcement Learning for Grid-Aware Building
  Energy Management
GridLearn: Multiagent Reinforcement Learning for Grid-Aware Building Energy Management
Aisling Pigott
Constance Crozier
K. Baker
Zoltán Nagy
AI4CE
177
41
0
12 Oct 2021
Learning to Coordinate in Multi-Agent Systems: A Coordinated
  Actor-Critic Algorithm and Finite-Time Guarantees
Learning to Coordinate in Multi-Agent Systems: A Coordinated Actor-Critic Algorithm and Finite-Time Guarantees
Siliang Zeng
Tianyi Chen
Alfredo García
Mingyi Hong
92
11
0
11 Oct 2021
Cooperative Assistance in Robotic Surgery through Multi-Agent
  Reinforcement Learning
Cooperative Assistance in Robotic Surgery through Multi-Agent Reinforcement Learning
Paul Maria Scheikl
B. Gyenes
Tornike Davitashvili
Rayan Younis
A. Schulze
Beat P. Müller-Stich
Gerhard Neumann
M. Wagner
F. Mathis-Ullrich
63
13
0
10 Oct 2021
DCT: Dynamic Compressive Transformer for Modeling Unbounded Sequence
DCT: Dynamic Compressive Transformer for Modeling Unbounded Sequence
Kai-Po Chang
Wei-Yun Ma
21
0
0
10 Oct 2021
Theoretically Principled Deep RL Acceleration via Nearest Neighbor
  Function Approximation
Theoretically Principled Deep RL Acceleration via Nearest Neighbor Function Approximation
Junhong Shen
Lin F. Yang
OffRL
51
18
0
09 Oct 2021
Improving Kinodynamic Planners for Vehicular Navigation with Learned
  Goal-Reaching Controllers
Improving Kinodynamic Planners for Vehicular Navigation with Learned Goal-Reaching Controllers
Aravind Sivaramakrishnan
Edgar Granados
Seth Karten
T. McMahon
Kostas E. Bekris
51
7
0
08 Oct 2021
Explaining Reward Functions to Humans for Better Human-Robot
  Collaboration
Explaining Reward Functions to Humans for Better Human-Robot Collaboration
Lindsay M. Sanneman
J. Shah
45
5
0
08 Oct 2021
Learning Pessimism for Robust and Efficient Off-Policy Reinforcement
  Learning
Learning Pessimism for Robust and Efficient Off-Policy Reinforcement Learning
Edoardo Cetin
Oya Celiktutan
OffRL
85
17
0
07 Oct 2021
Optimized Recommender Systems with Deep Reinforcement Learning
Optimized Recommender Systems with Deep Reinforcement Learning
Lucas Farris
OffRL
25
0
0
06 Oct 2021
Multi-Agent Constrained Policy Optimisation
Multi-Agent Constrained Policy Optimisation
Shangding Gu
J. Kuba
Munning Wen
Ruiqing Chen
Ziyan Wang
Zheng Tian
Jun Wang
Alois Knoll
Yaodong Yang
161
49
0
06 Oct 2021
Mismatched No More: Joint Model-Policy Optimization for Model-Based RL
Mismatched No More: Joint Model-Policy Optimization for Model-Based RL
Benjamin Eysenbach
Alexander Khazatsky
Sergey Levine
Ruslan Salakhutdinov
OffRL
262
46
0
06 Oct 2021
Replay-Guided Adversarial Environment Design
Replay-Guided Adversarial Environment Design
Minqi Jiang
Michael Dennis
Jack Parker-Holder
Jakob N. Foerster
Edward Grefenstette
Tim Rocktaschel
218
102
0
06 Oct 2021
Imaginary Hindsight Experience Replay: Curious Model-based Learning for
  Sparse Reward Tasks
Imaginary Hindsight Experience Replay: Curious Model-based Learning for Sparse Reward Tasks
Robert McCarthy
Qiang Wang
S. Redmond
OffRL
72
15
0
05 Oct 2021
OTTR: Off-Road Trajectory Tracking using Reinforcement Learning
OTTR: Off-Road Trajectory Tracking using Reinforcement Learning
Akhil Nagariya
D. Kalathil
Srikanth Saripalli
OffRL
47
1
0
05 Oct 2021
You Only Evaluate Once: a Simple Baseline Algorithm for Offline RL
You Only Evaluate Once: a Simple Baseline Algorithm for Offline RL
Wonjoon Goo
S. Niekum
OffRL
88
8
0
05 Oct 2021
Influencing Towards Stable Multi-Agent Interactions
Influencing Towards Stable Multi-Agent Interactions
Woodrow Z. Wang
Andy Shih
Annie Xie
Dorsa Sadigh
127
35
0
05 Oct 2021
CARL: A Benchmark for Contextual and Adaptive Reinforcement Learning
CARL: A Benchmark for Contextual and Adaptive Reinforcement Learning
C. Benjamins
Theresa Eimer
Frederik Schubert
André Biedenkapp
Bodo Rosenhahn
Frank Hutter
Marius Lindauer
OffRL
94
23
0
05 Oct 2021
Parallel Actors and Learners: A Framework for Generating Scalable RL
  Implementations
Parallel Actors and Learners: A Framework for Generating Scalable RL Implementations
Chi Zhang
S. Kuppannagari
Viktor Prasanna
OffRL
34
8
0
03 Oct 2021
Exploration of Artificial Intelligence-oriented Power System Dynamic
  Simulators
Exploration of Artificial Intelligence-oriented Power System Dynamic Simulators
Tannan Xiao
Ying-Cong Chen
Jianquan Wang
Shaowei Huang
Weilin Tong
Tirui He
66
15
0
03 Oct 2021
BRAC+: Improved Behavior Regularized Actor Critic for Offline
  Reinforcement Learning
BRAC+: Improved Behavior Regularized Actor Critic for Offline Reinforcement Learning
Chi Zhang
S. Kuppannagari
Viktor Prasanna
OffRL
99
17
0
02 Oct 2021
Cycle-Consistent World Models for Domain Independent Latent Imagination
Cycle-Consistent World Models for Domain Independent Latent Imagination
Sidney Bender
Tim Joseph
Marius Zoellner
57
0
0
02 Oct 2021
Stanford Pupper: A Low-Cost Agile Quadruped Robot for Benchmarking and
  Education
Stanford Pupper: A Low-Cost Agile Quadruped Robot for Benchmarking and Education
Nathan Kau
89
20
0
02 Oct 2021
Neural Network Verification in Control
Neural Network Verification in Control
M. Everett
AAML
70
17
0
30 Sep 2021
Untangling Braids with Multi-agent Q-Learning
Untangling Braids with Multi-agent Q-Learning
Abdullah Khan
A. Vernitski
A. Lisitsa
AI4CE
43
6
0
29 Sep 2021
On the Estimation Bias in Double Q-Learning
On the Estimation Bias in Double Q-Learning
Zhizhou Ren
Guangxiang Zhu
Haotian Hu
Beining Han
Jian-Hai Chen
Chongjie Zhang
82
17
0
29 Sep 2021
Improving Safety in Deep Reinforcement Learning using Unsupervised
  Action Planning
Improving Safety in Deep Reinforcement Learning using Unsupervised Action Planning
Hao-Lun Hsu
Qiuhua Huang
Sehoon Ha
OffRL
91
12
0
29 Sep 2021
A First-Occupancy Representation for Reinforcement Learning
A First-Occupancy Representation for Reinforcement Learning
Theodore H. Moskovitz
S. Wilson
M. Sahani
83
16
0
28 Sep 2021
Exploratory State Representation Learning
Exploratory State Representation Learning
Astrid Merckling
Nicolas Perrin-Gilbert
Alexandre Coninx
Stéphane Doncieux
OffRL
79
6
0
28 Sep 2021
Solving Challenging Control Problems Using Two-Staged Deep Reinforcement
  Learning
Solving Challenging Control Problems Using Two-Staged Deep Reinforcement Learning
Nitish Sontakke
Sehoon Ha
76
1
0
27 Sep 2021
Learning Multimodal Rewards from Rankings
Learning Multimodal Rewards from Rankings
Vivek Myers
Erdem Biyik
Nima Anari
Dorsa Sadigh
OffRL
88
51
0
27 Sep 2021
Stackelberg Actor-Critic: Game-Theoretic Reinforcement Learning
  Algorithms
Stackelberg Actor-Critic: Game-Theoretic Reinforcement Learning Algorithms
Liyuan Zheng
Tanner Fiez
Zane Alumbaugh
Benjamin J. Chasnov
Lillian J. Ratliff
OffRL
99
42
0
25 Sep 2021
NICE: Robust Scheduling through Reinforcement Learning-Guided Integer
  Programming
NICE: Robust Scheduling through Reinforcement Learning-Guided Integer Programming
Luke Kenworthy
Siddharth Nayak
Christopher R. Chin
H. Balakrishnan
118
8
0
24 Sep 2021
Parameter-free Reduction of the Estimation Bias in Deep Reinforcement
  Learning for Deterministic Policy Gradients
Parameter-free Reduction of the Estimation Bias in Deep Reinforcement Learning for Deterministic Policy Gradients
Baturay Saglam
Furkan B. Mutlu
Dogan C. Cicek
Suleyman S. Kozat
OffRL
53
3
0
24 Sep 2021
Improved Soft Actor-Critic: Mixing Prioritized Off-Policy Samples with
  On-Policy Experience
Improved Soft Actor-Critic: Mixing Prioritized Off-Policy Samples with On-Policy Experience
C. Banerjee
Zhiyong Chen
N. Noman
56
34
0
24 Sep 2021
Enhancing SUMO simulator for simulation based testing and validation of
  autonomous vehicles
Enhancing SUMO simulator for simulation based testing and validation of autonomous vehicles
Arpan Kusari
P. Li
Hanzhi Yang
Nikhil Punshi
Michelle Rasulis
S. Bogard
D. Leblanc
65
32
0
23 Sep 2021
PredictionNet: Real-Time Joint Probabilistic Traffic Prediction for
  Planning, Control, and Simulation
PredictionNet: Real-Time Joint Probabilistic Traffic Prediction for Planning, Control, and Simulation
A. Kamenev
Lirui Wang
Ollin Boer Bohan
Ishwar Kulkarni
Bilal Kartal
Artem Molchanov
Stan Birchfield
David Nistér
Nikolai Smolyanskiy
116
40
0
23 Sep 2021
ENERO: Efficient Real-Time WAN Routing Optimization with Deep
  Reinforcement Learning
ENERO: Efficient Real-Time WAN Routing Optimization with Deep Reinforcement Learning
Paul Almasan
Shihan Xiao
Xiangle Cheng
Xiang Shi
Pere Barlet-Ros
A. Cabellos-Aparicio
108
20
0
22 Sep 2021
Estimation Error Correction in Deep Reinforcement Learning for
  Deterministic Actor-Critic Methods
Estimation Error Correction in Deep Reinforcement Learning for Deterministic Actor-Critic Methods
Baturay Saglam
Enes Duran
Dogan C. Cicek
Furkan B. Mutlu
Suleyman S. Kozat
OffRL
73
12
0
22 Sep 2021
Locality Matters: A Scalable Value Decomposition Approach for
  Cooperative Multi-Agent Reinforcement Learning
Locality Matters: A Scalable Value Decomposition Approach for Cooperative Multi-Agent Reinforcement Learning
Roy Zohar
Shie Mannor
Guy Tennenholtz
57
10
0
22 Sep 2021
Context-Specific Representation Abstraction for Deep Option Learning
Context-Specific Representation Abstraction for Deep Option Learning
Marwa Abdulhai
Dong-Ki Kim
Matthew D Riemer
Miao Liu
Gerald Tesauro
Jonathan P. How
OffRL
92
10
0
20 Sep 2021
Density-based Curriculum for Multi-goal Reinforcement Learning with
  Sparse Rewards
Density-based Curriculum for Multi-goal Reinforcement Learning with Sparse Rewards
Deyu Yang
Hanbo Zhang
Xuguang Lan
Jishiyu Ding
OffRL
77
2
0
18 Sep 2021
Decentralized Global Connectivity Maintenance for Multi-Robot
  Navigation: A Reinforcement Learning Approach
Decentralized Global Connectivity Maintenance for Multi-Robot Navigation: A Reinforcement Learning Approach
Minghao Li
Yingrui Jie
Yang Kong
Hui Cheng
60
9
0
17 Sep 2021
Soft Actor-Critic With Integer Actions
Soft Actor-Critic With Integer Actions
Ting-Han Fan
Yubo Wang
69
15
0
17 Sep 2021
Dropout's Dream Land: Generalization from Learned Simulators to Reality
Dropout's Dream Land: Generalization from Learned Simulators to Reality
Zac Wellmer
James T. Kwok
SyDa
69
9
0
17 Sep 2021
Automated Testing with Temporal Logic Specifications for Robotic
  Controllers using Adaptive Experiment Design
Automated Testing with Temporal Logic Specifications for Robotic Controllers using Adaptive Experiment Design
Craig Innes
S. Ramamoorthy
65
5
0
16 Sep 2021
ROS-X-Habitat: Bridging the ROS Ecosystem with Embodied AI
ROS-X-Habitat: Bridging the ROS Ecosystem with Embodied AI
Guanxiong Chen
Haoyu Yang
Ian M. Mitchell
LM&Ro
83
8
0
16 Sep 2021
Previous
123...232425...505152
Next