Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1606.01540
Cited By
OpenAI Gym
5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL
ODL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"OpenAI Gym"
50 / 2,578 papers shown
Title
Reinforcement Learning for Generative AI: A Survey
Yuanjiang Cao
Quan.Z Sheng
Julian McAuley
Lina Yao
SyDa
198
13
0
28 Aug 2023
Efficient Epistemic Uncertainty Estimation in Regression Ensemble Models Using Pairwise-Distance Estimators
Lucas Berry
David Meger
UD
84
2
0
25 Aug 2023
Conditional Kernel Imitation Learning for Continuous State Environments
Rishabh Agrawal
Nathan Dahlin
Rahul Jain
A. Nayyar
68
0
0
24 Aug 2023
An Intentional Forgetting-Driven Self-Healing Method For Deep Reinforcement Learning Systems
Ahmed Haj Yahmed
Rached Bouchoucha
Houssem Ben Braiek
Foutse Khomh
CLL
AI4CE
64
0
0
23 Aug 2023
Deploying Deep Reinforcement Learning Systems: A Taxonomy of Challenges
Ahmed Haj Yahmed
Altaf Allah Abbassi
Amin Nikanjam
Heng Li
Foutse Khomh
OffRL
72
5
0
23 Aug 2023
How Safe Am I Given What I See? Calibrated Prediction of Safety Chances for Image-Controlled Autonomy
Zhenjiang Mao
Carson Sobolewski
I. Ruchkin
129
9
0
23 Aug 2023
DFWLayer: Differentiable Frank-Wolfe Optimization Layer
Zixuan Liu
Liu Liu
Xueqian Wang
P. Zhao
AI4CE
101
0
0
21 Aug 2023
Soft Decomposed Policy-Critic: Bridging the Gap for Effective Continuous Control with Discrete RL
Ye Zhang
Jian Sun
G. Wang
Zhuoxian Li
Wei Chen
OffRL
39
0
0
20 Aug 2023
Safety Filter Design for Neural Network Systems via Convex Optimization
Shaoru Chen
K. Y. Chee
Nikolai Matni
M. A. Hsieh
George J. Pappas
84
3
0
16 Aug 2023
Formally-Sharp DAgger for MCTS: Lower-Latency Monte Carlo Tree Search using Data Aggregation with Formal Methods
Debraj Chakraborty
Damien Busatto-Gaston
Jean-François Raskin
G. Pérez
20
1
0
15 Aug 2023
Adaptive Tracking of a Single-Rigid-Body Character in Various Environments
Tae-Joung Kwon
Taehong Gu
Jaewon Ahn
Yoonsang Lee
67
3
0
14 Aug 2023
Learning Control Policies for Variable Objectives from Offline Data
Marc Weber
Phillip Swazinna
D. Hein
Steffen Udluft
V. Sterzing
OffRL
74
8
0
11 Aug 2023
Generalized Early Stopping in Evolutionary Direct Policy Search
Etor Arza
Léni K. Le Goff
E. Hart
OffRL
58
3
0
07 Aug 2023
Vehicles Control: Collision Avoidance using Federated Deep Reinforcement Learning
Badr Ben Elallid
Amine Abouaomar
N. Benamar
A. Kobbane
108
6
0
04 Aug 2023
UniSim: A Neural Closed-Loop Sensor Simulator
Ze Yang
Yun Chen
Jingkang Wang
S. Manivasagam
Wei-Chiu Ma
A. Yang
R. Urtasun
114
202
0
03 Aug 2023
qgym: A Gym for Training and Benchmarking RL-Based Quantum Compilation
S. V. D. Linde
Willem de Kok
T. Bontekoe
Sebastian Feld
78
13
0
01 Aug 2023
BiERL: A Meta Evolutionary Reinforcement Learning Framework via Bilevel Optimization
Junyi Wang
Yuanyang Zhu
Zhi Wang
Yan Zheng
Jianye Hao
Chun-Han Chen
OffRL
65
0
0
01 Aug 2023
Towards Building AI-CPS with NVIDIA Isaac Sim: An Industrial Benchmark and Case Study for Robotics Manipulation
Zhehua Zhou
Jiayang Song
Xuan Xie
Zhan Shu
Lei Ma
Dikai Liu
Jianxiong Yin
Simon See
64
20
0
31 Jul 2023
Discovering Adaptable Symbolic Algorithms from Scratch
Stephen Kelly
Daniel S. Park
Xingyou Song
Mitchell McIntire
Pranav Nashikkar
...
W. Banzhaf
Kalyanmoy Deb
Vishnu Boddeti
Jie Tan
Esteban Real
74
5
0
31 Jul 2023
End-to-End Reinforcement Learning for Torque Based Variable Height Hopping
Raghav Soni
Daniel Harnack
Hauke Isermann
Sotaro Fushimi
Shivesh Kumar
Frank Kirchner
77
9
0
31 Jul 2023
Variance Control for Distributional Reinforcement Learning
Qi Kuang
Zhoufan Zhu
Liwen Zhang
Fan Zhou
OffRL
143
3
0
30 Jul 2023
Initial State Interventions for Deconfounded Imitation Learning
Samuel Pfrommer
Yatong Bai
Hyunin Lee
Somayeh Sojoudi
CML
79
2
0
29 Jul 2023
Dynamic deep-reinforcement-learning algorithm in Partially Observed Markov Decision Processes
Saki Omi
Hyo-Sang Shin
Namhoon Cho
Antonios Tsourdos
40
3
0
29 Jul 2023
trajdata: A Unified Interface to Multiple Human Trajectory Datasets
Boris Ivanovic
G. Song
Igor Gilitschenski
Marco Pavone
AI4TS
73
21
0
26 Jul 2023
WebArena: A Realistic Web Environment for Building Autonomous Agents
Shuyan Zhou
Frank F. Xu
Hao Zhu
Xuhui Zhou
Robert Lo
...
Tianyue Ou
Yonatan Bisk
Daniel Fried
Uri Alon
Graham Neubig
LLMAG
216
496
0
25 Jul 2023
MAEA: Multimodal Attribution for Embodied AI
Vidhi Jain
Jayant Sravan Tamarapalli
Sahiti Yerramilli
Yonatan Bisk
108
0
0
25 Jul 2023
Towards Sim2Real Transfer of Autonomy Algorithms using AutoDRIVE Ecosystem
Chinmay Vilas Samak
Tanmay Vilas Samak
Venkat Krovi
87
6
0
25 Jul 2023
Policy Gradient Optimal Correlation Search for Variance Reduction in Monte Carlo simulation and Maximum Optimal Transport
Pierre Bras
Gilles Pagès
56
1
0
24 Jul 2023
Pyrus Base: An Open Source Python Framework for the RoboCup 2D Soccer Simulation
Nader Zare
Aref Sayareh
Omid Amini
Mahtab Sarvmaili
Arad Firouzkouhi
Stan Matwin
Amilcar Soares
GP
23
1
0
22 Jul 2023
Hindsight-DICE: Stable Credit Assignment for Deep Reinforcement Learning
Akash Velu
Skanda Vaidyanath
Dilip Arumugam
OffRL
68
1
0
21 Jul 2023
Reparameterized Policy Learning for Multimodal Trajectory Optimization
Zhiao Huang
Litian Liang
Z. Ling
Xuanlin Li
Chuang Gan
H. Su
110
11
0
20 Jul 2023
Technical Challenges of Deploying Reinforcement Learning Agents for Game Testing in AAA Games
Jonas Gillberg
Joakim Bergdahl
Alessandro Sestini
Andy Eakins
Linus Gisslén
OffRL
158
7
0
19 Jul 2023
PyTAG: Challenges and Opportunities for Reinforcement Learning in Tabletop Games
Martin Balla
G. E. Long
Dominik Jeurissen
J. Goodman
Raluca D. Gaina
Diego Perez-Liebana
LMTD
OffRL
OnRL
82
1
0
19 Jul 2023
Reproducibility in Machine Learning-Driven Research
Harald Semmelrock
Simone Kopeinik
Dieter Theiler
Tony Ross-Hellauer
Dominik Kowald
AI4CE
87
18
0
19 Jul 2023
LuckyMera: a Modular AI Framework for Building Hybrid NetHack Agents
Luigi Quarantiello
Simone Marzeddu
Antonio Guzzi
Vincenzo Lomonaco
46
0
0
17 Jul 2023
Magnetic Field-Based Reward Shaping for Goal-Conditioned Reinforcement Learning
Hongyu Ding
Yuan-Yan Tang
Qing Wu
Bo Wang
Chunlin Chen
Zhi Wang
125
5
0
16 Jul 2023
Probabilistic Constrained Reinforcement Learning with Formal Interpretability
Yanran Wang
Qiuchen Qian
David E. Boyle
60
4
0
13 Jul 2023
Bag of Views: An Appearance-based Approach to Next-Best-View Planning for 3D Reconstruction
Sara Hatami Gazani
Matthew Tucsok
I. Mantegh
Homayoun Najjaran
53
4
0
11 Jul 2023
Boosting Feedback Efficiency of Interactive Reinforcement Learning by Adaptive Learning from Scores
Shukai Liu
Chenming Wu
Ying Li
Liang Zhang
77
0
0
11 Jul 2023
Pegasus Simulator: An Isaac Sim Framework for Multiple Aerial Vehicles Simulation
Marcelo Jacinto
Joao Pinto
Jay Patrikar
John Keller
R. Cunha
Sebastian Scherer
A. Pascoal
69
16
0
11 Jul 2023
Contextual Pre-planning on Reward Machine Abstractions for Enhanced Transfer in Deep Reinforcement Learning
Guy Azran
Mohamad H. Danesh
Stefano V. Albrecht
Sarah Keren
AI4CE
130
2
0
11 Jul 2023
Probabilistic Counterexample Guidance for Safer Reinforcement Learning (Extended Version)
Xiaotong Ji
Antonio Filieri
OffRL
96
1
0
10 Jul 2023
Diffusion Policies for Out-of-Distribution Generalization in Offline Reinforcement Learning
S. E. Ada
Erhan Öztop
Emre Ugur
OffRL
155
23
0
10 Jul 2023
Procedurally generating rules to adapt difficulty for narrative puzzle games
Thomas Vase Schultz Volden
Djordje Grbic
Paolo Burelli
33
1
0
07 Jul 2023
OmniBoost: Boosting Throughput of Heterogeneous Embedded Devices under Multi-DNN Workload
Andreas Karatzas
Iraklis Anagnostopoulos
62
22
0
06 Jul 2023
ContainerGym: A Real-World Reinforcement Learning Benchmark for Resource Allocation
Abhijeet Pendyala
Justin Dettmer
Tobias Glasmachers
Asma Atamna
OffRL
46
6
0
06 Jul 2023
A Neuromorphic Architecture for Reinforcement Learning from Real-Valued Observations
Sergio Chevtchenko
Y. Bethi
Teresa B Ludermir
Saeed Afshar
OffRL
62
1
0
06 Jul 2023
Dynamic Observation Policies in Observation Cost-Sensitive Reinforcement Learning
C. Bellinger
Mark Crowley
Isaac Tamblyn
22
3
0
05 Jul 2023
Hierarchical Planning and Policy Shaping Shared Autonomy for Articulated Robots
E. Yousefi
Mo Chen
I. Sharf
42
1
0
04 Jul 2023
Distributional Model Equivalence for Risk-Sensitive Reinforcement Learning
Tyler Kastner
Murat A. Erdogdu
Amir-massoud Farahmand
OffRL
98
4
0
04 Jul 2023
Previous
1
2
3
...
8
9
10
...
50
51
52
Next