ResearchTrend.AI
OpenAI Gym

5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
    OffRL
    ODL

Papers citing "OpenAI Gym"

50 / 1,705 papers shown
Learning Physically Realizable Skills for Online Packing of General 3D Shapes
Hang Zhao
Zherong Pan
Yang Yu
Kai Xu
OffRL
45
14
0
05 Dec 2022
Differentiated Federated Reinforcement Learning Based Traffic Offloading on Space-Air-Ground Integrated Networks
Yeguang Qin
Yilin Yang
Fengxiao Tang
Xin Yao
Mingde Zhao
Nei Kato
37
6
0
05 Dec 2022
Automata Learning meets Shielding
Martin Tappler
Stefan Pranger
Bettina Könighofer
Edi Muškardin
Roderick Bloem
Kim G. Larsen
38
4
0
04 Dec 2022
RLogist: Fast Observation Strategy on Whole-slide Images with Deep Reinforcement Learning
Boxuan Zhao
Jun Zhang
Deheng Ye
Jiancheng Cao
Xiao Han
Qiang Fu
Wei Yang
OffRL
31
9
0
04 Dec 2022
Selecting Mechanical Parameters of a Monopode Jumping System with Reinforcement Learning
Andrew S. Albright
J. Vaughan
26
1
0
02 Dec 2022
Utilizing Prior Solutions for Reward Shaping and Composition in Entropy-Regularized Reinforcement Learning
Jacob Adamczyk
A. Arriojas
Stas Tiomkin
R. Kulkarni
45
8
0
02 Dec 2022
STL-Based Synthesis of Feedback Controllers Using Reinforcement Learning
Nikhil Kumar Singh
Indranil Saha
23
6
0
02 Dec 2022
Karolos: An Open-Source Reinforcement Learning Framework for Robot-Task Environments
Christian Bitter
Timo Thun
Tobias Meisen
36
1
0
01 Dec 2022
Symmetry Detection in Trajectory Data for More Meaningful Reinforcement Learning Representations
Marissa D'Alonzo
Rebecca L. Russell
25
0
0
29 Nov 2022
Learning and Understanding a Disentangled Feature Representation for Hidden Parameters in Reinforcement Learning
Christopher P. Reale
Rebecca L. Russell
25
1
0
29 Nov 2022
Closing the gap between SVRG and TD-SVRG with Gradient Splitting
Arsenii Mustafin
Alexander Olshevsky
I. Paschalidis
24
1
0
29 Nov 2022
Learning from Good Trajectories in Offline Multi-Agent Reinforcement Learning
Qiangxing Tian
Kun Kuang
Furui Liu
Baoxiang Wang
OffRL
34
9
0
28 Nov 2022
Inapplicable Actions Learning for Knowledge Transfer in Reinforcement Learning
Leo Ardon
Alberto Pozanco
Daniel Borrajo
Sumitra Ganesh
OffRL
23
0
0
28 Nov 2022
Autonomous Assessment of Demonstration Sufficiency via Bayesian Inverse Reinforcement Learning
Tuan-Duong Trinh
Haoyu Chen
Daniel S. Brown
OffRL
41
7
0
28 Nov 2022
Causal Deep Reinforcement Learning Using Observational Data
Wenxuan Zhu
Chao Yu
Qiaosheng Zhang
CML
OffRL
26
5
0
28 Nov 2022
Quantile Constrained Reinforcement Learning: A Reinforcement Learning Framework Constraining Outage Probability
Whiyoung Jung
Myungsik Cho
Jongeui Park
Young-Jin Sung
43
4
0
28 Nov 2022
QLAMMP: A Q-Learning Agent for Optimizing Fees on Automated Market Making Protocols
Dev Churiwala
Bhaskar Krishnamachari
6
4
0
28 Nov 2022
Domain Generalization for Robust Model-Based Offline Reinforcement Learning
Alan Clark
Shoaib Ahmed Siddiqui
Robert Kirk
Usman Anwar
Stephen Chung
David M. Krueger
OOD
OffRL
39
0
0
27 Nov 2022
Generalizing Gaussian Smoothing for Random Search
Katelyn Gao
Ozan Sener
44
14
0
27 Nov 2022
How Crucial is Transformer in Decision Transformer?
Max Siebenborn
Boris Belousov
Junning Huang
Jan Peters
24
15
0
26 Nov 2022
Software Simulation and Visualization of Quantum Multi-Drone Reinforcement Learning
C. Park
Jae Pyoung Kim
Won Joon Yun
Soohyun Park
Soyi Jung
Joongheon Kim
39
0
0
24 Nov 2022
Simultaneously Updating All Persistence Values in Reinforcement Learning
Luca Sabbioni
Luca Al Daire
L. Bisi
Alberto Maria Metelli
Marcello Restelli
35
2
0
21 Nov 2022
Data-Driven Offline Decision-Making via Invariant Representation Learning
Qi
Yi-Hsun Su
Aviral Kumar
Sergey Levine
OffRL
54
19
0
21 Nov 2022
A Low Latency Adaptive Coding Spiking Framework for Deep Reinforcement Learning
Lang Qin
Rui Yan
Huajin Tang
OffRL
18
6
0
21 Nov 2022
Automating Rigid Origami Design
Jeremia Geiger
Karolis Martinkus
Oliver Richter
Roger Wattenhofer
26
1
0
20 Nov 2022
LibSignal: An Open Library for Traffic Signal Control
Hao Mei
Xiaoliang Lei
Longchao Da
Bin Shi
Hua Wei
AI4TS
24
18
0
19 Nov 2022
A Neural Active Inference Model of Perceptual-Motor Learning
Zhizhuo Yang
Gabriel J. Diaz
B. Fajen
Reynold J. Bailey
Alexander Ororbia
29
1
0
16 Nov 2022
Boosting Object Representation Learning via Motion and Object Continuity
Quentin Delfosse
Wolfgang Stammer
Thomas Rothenbacher
Dwarak Vittal
Kristian Kersting
OCL
54
20
0
16 Nov 2022
pyRDDLGym: From RDDL to Gym Environments
Ayal Taitler
Michael Gimelfarb
Jihwan Jeong
Sriram Gopalakrishnan
Martin Mladenov
Xiaotian Liu
Scott Sanner
25
8
0
11 Nov 2022
Leveraging Sequentiality in Reinforcement Learning from a Single Demonstration
Alexandre Chenu
Olivier Serris
Olivier Sigaud
Nicolas Perrin-Gilbert
33
4
0
09 Nov 2022
Detecting and Accommodating Novel Types and Concepts in an Embodied Simulation Environment
Sadaf Ghaffari
Nikhil Krishnaswamy
19
7
0
08 Nov 2022
ProtoX: Explaining a Reinforcement Learning Agent via Prototyping
Ronilo Ragodos
Tong Wang
Qihang Lin
Xun Zhou
31
7
0
06 Nov 2022
lilGym: Natural Language Visual Reasoning with Reinforcement Learning
Anne Wu
Kianté Brantley
Noriyuki Kojima
Yoav Artzi
ReLM
OffRL
LRM
49
3
0
03 Nov 2022
Leveraging Fully Observable Policies for Learning under Partial Observability
Hai V. Nguyen
Andrea Baisero
Dian Wang
Chris Amato
Robert Platt
OffRL
50
19
0
03 Nov 2022
Learning Control by Iterative Inversion
Gal Leibovich
Guy Jacob
Or Avner
Gal Novik
Aviv Tamar
26
0
0
03 Nov 2022
Benefits of Monotonicity in Safe Exploration with Gaussian Processes
Arpan Losalka
Jonathan Scarlett
31
1
0
03 Nov 2022
Event Tables for Efficient Experience Replay
Varun Kompella
Thomas J. Walsh
Samuel Barrett
Peter R. Wurman
Peter Stone
OffRL
35
2
0
01 Nov 2022
Safe and Efficient Manoeuvring for Emergency Vehicles in Autonomous Traffic using Multi-Agent Proximal Policy Optimisation
L. Parada
Eduardo Candela
Luís Marques
Panagiotis Angeloudis
27
11
0
31 Oct 2022
A Unified Blockchain-Semantic Framework for Wireless Edge Intelligence Enabled Web 3.0
Yi-Lan Lin
Zhipeng Gao
Hongyang Du
Dusit Niyato
Jiawen Kang
Ruilong Deng
X. Shen
23
42
0
27 Oct 2022
Quantum deep recurrent reinforcement learning
Samuel Yen-Chi Chen
69
19
0
26 Oct 2022
Adaptive Behavior Cloning Regularization for Stable Offline-to-Online Reinforcement Learning
Yi Zhao
Rinu Boney
Alexander Ilin
Arno Solin
Joni Pajarinen
OffRL
OnRL
28
39
0
25 Oct 2022
Causal Explanation for Reinforcement Learning: Quantifying State and Temporal Importance
Xiaoxiao Wang
Fanyu Meng
Xin Liu
Z. Kong
Xin Chen
XAI
CML
FAtt
66
4
0
24 Oct 2022
Empirical analysis of PGA-MAP-Elites for Neuroevolution in Uncertain Domains
Manon Flageat
Félix Chalumeau
Antoine Cully
36
26
0
24 Oct 2022
DaXBench: Benchmarking Deformable Object Manipulation with Differentiable Physics
Siwei Chen
Yiqing Xu
Cunjun Yu
Linfeng Li
Xiao Ma
Zhongwen Xu
David Hsu
AI4CE
36
16
0
24 Oct 2022
Ares: A System-Oriented Wargame Framework for Adversarial ML
Farhan Ahmed
Pratik Vaishnavi
Kevin Eykholt
Amir Rahmati
AAML
30
7
0
24 Oct 2022
Active Exploration for Robotic Manipulation
Tim Schneider
Boris Belousov
Georgia Chalvatzaki
Diego Romeres
Devesh K. Jha
Jan Peters
65
10
0
23 Oct 2022
Augmentative Topology Agents For Open-Ended Learning
Muhammad Umair Nasir
Michael Beukman
Steven D. James
C. Cleghorn
42
3
0
20 Oct 2022
RCareWorld: A Human-centric Simulation World for Caregiving Robots
Ruolin Ye
Wenqiang Xu
Haoyuan Fu
Rajat Kumar Jenamani
V. Nguyen
Cewu Lu
Katherine Dimitropoulou
Tapomayukh Bhattacharjee
LM&Ro
32
39
0
19 Oct 2022
CLUTR: Curriculum Learning via Unsupervised Task Representation Learning
Abdus Salam Azad
Izzeddin Gur
Jasper Emhoff
Nathaniel Alexis
Aleksandra Faust
Pieter Abbeel
Ion Stoica
SSL
34
12
0
19 Oct 2022
Planning for Sample Efficient Imitation Learning
Zhao-Heng Yin
Weirui Ye
Qifeng Chen
Yang Gao
OffRL
44
21
0
18 Oct 2022