Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1606.01540
Cited By
OpenAI Gym
5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL
ODL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"OpenAI Gym"
50 / 1,654 papers shown
Title
gym-saturation: Gymnasium environments for saturation provers (System description)
Boris Shminke
44
1
0
16 Sep 2023
DOMAIN: MilDly COnservative Model-BAsed OfflINe Reinforcement Learning
Xiao-Yin Liu
Xiao-Hu Zhou
Xiaoliang Xie
Shiqi Liu
Zhen-Qiu Feng
Hao Li
Mei-Jiang Gui
Tian-Yu Xiang
De-Xing Huang
Zeng-Guang Hou
OffRL
OOD
24
5
0
16 Sep 2023
Deep Multi-Agent Reinforcement Learning for Decentralized Active Hypothesis Testing
Hadar Szostak
Kobi Cohen
38
3
0
14 Sep 2023
Stable In-hand Manipulation with Finger Specific Multi-agent Shadow Reward
Lingfeng Tao
Jiucai Zhang
Xiaoli Zhang
26
0
0
13 Sep 2023
Investigating the Impact of Action Representations in Policy Gradient Algorithms
Jan Schneider-Barnes
Pierre Schumacher
Daniel Haeufle
Bernhard Scholkopf
Le Chen
OffRL
24
1
0
13 Sep 2023
Attention Loss Adjusted Prioritized Experience Replay
Zhuoying Chen
Huiping Li
Rizhong Wang
19
2
0
13 Sep 2023
Fitness Approximation through Machine Learning
Itai Tzruia
Tomer Halperin
Moshe Sipper
Achiya Elyasaf
28
2
0
06 Sep 2023
ORL-AUDITOR: Dataset Auditing in Offline Deep Reinforcement Learning
L. Du
Min Chen
Mingyang Sun
Shouling Ji
Peng Cheng
Jiming Chen
Zhikun Zhang
OffRL
53
8
0
06 Sep 2023
Representation Learning for Sequential Volumetric Design Tasks
Md Ferdous Alam
Yi Wang
Linh Tran
Chin-Yi Cheng
Jieliang Luo
3DV
34
2
0
05 Sep 2023
Distributionally Robust Model-based Reinforcement Learning with Large State Spaces
Shyam Sundhar Ramesh
Pier Giuseppe Sessa
Yifan Hu
Andreas Krause
Ilija Bogunovic
OOD
47
10
0
05 Sep 2023
Marginalized Importance Sampling for Off-Environment Policy Evaluation
Pulkit Katdare
Nan Jiang
Katherine Driggs-Campbell
OffRL
30
4
0
04 Sep 2023
Leveraging Reward Consistency for Interpretable Feature Discovery in Reinforcement Learning
Qisen Yang
Huanqian Wang
Mukun Tong
Wenjie Shi
Gao Huang
Shiji Song
40
5
0
04 Sep 2023
Hundreds Guide Millions: Adaptive Offline Reinforcement Learning with Expert Guidance
Qisen Yang
Shenzhi Wang
Qihang Zhang
Gao Huang
Shiji Song
OffRL
OnRL
32
8
0
04 Sep 2023
Solving Non-Rectangular Reward-Robust MDPs via Frequency Regularization
Uri Gadot
E. Derman
Navdeep Kumar
Maxence Mohamed Elfatihi
Kfir Y. Levy
Shie Mannor
38
5
0
03 Sep 2023
Neurosymbolic Reinforcement Learning and Planning: A Survey
Kamal Acharya
Waleed Raza
Carlos Dourado
Alvaro Velasquez
Houbing Song
NAI
OffRL
37
16
0
02 Sep 2023
Suicidal Pedestrian: Generation of Safety-Critical Scenarios for Autonomous Vehicles
Yuhang Yang
Kalle Kujanpää
Amin Babadi
Joni Pajarinen
Alexander Ilin
27
3
0
01 Sep 2023
The Power of MEME: Adversarial Malware Creation with Model-Based Reinforcement Learning
M. Rigaki
Sebastian Garcia
AAML
30
4
0
31 Aug 2023
DRL-Based Trajectory Tracking for Motion-Related Modules in Autonomous Driving
Yinda Xu
Lidong Yu
9
6
0
30 Aug 2023
R^3: On-device Real-Time Deep Reinforcement Learning for Autonomous Robotics
Zexin Li
Aritra Samanta
Yufei Li
Andrea Soltoggio
Hyoseung Kim
Cong Liu
39
6
0
29 Aug 2023
Target-independent XLA optimization using Reinforcement Learning
Milan Ganai
Haichen Li
Theodore Enns
Yida Wang
Randy Huang
44
0
0
28 Aug 2023
Reinforcement Learning for Generative AI: A Survey
Yuanjiang Cao
Quan.Z Sheng
Julian McAuley
Lina Yao
SyDa
53
10
0
28 Aug 2023
Distributionally Robust Statistical Verification with Imprecise Neural Networks
Souradeep Dutta
Michele Caprio
Vivian Lin
Matthew Cleaveland
Kuk Jin Jang
I. Ruchkin
O. Sokolsky
Insup Lee
OOD
AAML
54
7
0
28 Aug 2023
Efficient Epistemic Uncertainty Estimation in Regression Ensemble Models Using Pairwise-Distance Estimators
Lucas Berry
David Meger
UD
30
2
0
25 Aug 2023
Conditional Kernel Imitation Learning for Continuous State Environments
Rishabh Agrawal
Nathan Dahlin
Rahul Jain
A. Nayyar
32
0
0
24 Aug 2023
An Intentional Forgetting-Driven Self-Healing Method For Deep Reinforcement Learning Systems
Ahmed Haj Yahmed
Rached Bouchoucha
Houssem Ben Braiek
Foutse Khomh
CLL
AI4CE
36
0
0
23 Aug 2023
Deploying Deep Reinforcement Learning Systems: A Taxonomy of Challenges
Ahmed Haj Yahmed
Altaf Allah Abbassi
Amin Nikanjam
Heng Li
Foutse Khomh
OffRL
39
5
0
23 Aug 2023
How Safe Am I Given What I See? Calibrated Prediction of Safety Chances for Image-Controlled Autonomy
Zhenjiang Mao
Carson Sobolewski
I. Ruchkin
37
8
0
23 Aug 2023
DFWLayer: Differentiable Frank-Wolfe Optimization Layer
Zixuan Liu
Liu Liu
Xueqian Wang
P. Zhao
AI4CE
29
0
0
21 Aug 2023
Soft Decomposed Policy-Critic: Bridging the Gap for Effective Continuous Control with Discrete RL
Ye Zhang
Jian Sun
G. Wang
Zhuoxian Li
Wei Chen
OffRL
34
0
0
20 Aug 2023
Safety Filter Design for Neural Network Systems via Convex Optimization
Shaoru Chen
K. Y. Chee
Nikolai Matni
M. A. Hsieh
George J. Pappas
41
3
0
16 Aug 2023
Formally-Sharp DAgger for MCTS: Lower-Latency Monte Carlo Tree Search using Data Aggregation with Formal Methods
Debraj Chakraborty
Damien Busatto-Gaston
Jean-François Raskin
G. Pérez
13
1
0
15 Aug 2023
Adaptive Tracking of a Single-Rigid-Body Character in Various Environments
Tae-Joung Kwon
Taehong Gu
Jaewon Ahn
Yoonsang Lee
41
3
0
14 Aug 2023
Learning Control Policies for Variable Objectives from Offline Data
Marc Weber
Phillip Swazinna
D. Hein
Steffen Udluft
V. Sterzing
OffRL
29
8
0
11 Aug 2023
Generalized Early Stopping in Evolutionary Direct Policy Search
Etor Arza
Léni K. Le Goff
E. Hart
OffRL
36
3
0
07 Aug 2023
Vehicles Control: Collision Avoidance using Federated Deep Reinforcement Learning
Badr Ben Elallid
Amine Abouaomar
N. Benamar
A. Kobbane
11
5
0
04 Aug 2023
UniSim: A Neural Closed-Loop Sensor Simulator
Ze Yang
Yun Chen
Jingkang Wang
S. Manivasagam
Wei-Chiu Ma
A. Yang
R. Urtasun
59
188
0
03 Aug 2023
qgym: A Gym for Training and Benchmarking RL-Based Quantum Compilation
S. V. D. Linde
Willem de Kok
T. Bontekoe
Sebastian Feld
8
10
0
01 Aug 2023
BiERL: A Meta Evolutionary Reinforcement Learning Framework via Bilevel Optimization
Junyi Wang
Yuanyang Zhu
Zhi Wang
Yan Zheng
Jianye Hao
Chun-Han Chen
OffRL
19
0
0
01 Aug 2023
Towards Building AI-CPS with NVIDIA Isaac Sim: An Industrial Benchmark and Case Study for Robotics Manipulation
Zhehua Zhou
Jiayang Song
Xuan Xie
Zhan Shu
Lei Ma
Dikai Liu
Jianxiong Yin
Simon See
35
15
0
31 Jul 2023
Discovering Adaptable Symbolic Algorithms from Scratch
Stephen Kelly
Daniel S. Park
Xingyou Song
Mitchell McIntire
Pranav Nashikkar
...
W. Banzhaf
Kalyanmoy Deb
Vishnu Boddeti
Jie Tan
Esteban Real
33
3
0
31 Jul 2023
End-to-End Reinforcement Learning for Torque Based Variable Height Hopping
Raghav Soni
Daniel Harnack
Hauke Isermann
Sotaro Fushimi
Shivesh Kumar
Frank Kirchner
32
8
0
31 Jul 2023
Variance Control for Distributional Reinforcement Learning
Qi Kuang
Zhoufan Zhu
Liwen Zhang
Fan Zhou
OffRL
28
3
0
30 Jul 2023
Initial State Interventions for Deconfounded Imitation Learning
Samuel Pfrommer
Yatong Bai
Hyunin Lee
Somayeh Sojoudi
CML
35
2
0
29 Jul 2023
Dynamic deep-reinforcement-learning algorithm in Partially Observed Markov Decision Processes
Saki Omi
Hyo-Sang Shin
Namhoon Cho
Antonios Tsourdos
29
3
0
29 Jul 2023
trajdata: A Unified Interface to Multiple Human Trajectory Datasets
Boris Ivanovic
G. Song
Igor Gilitschenski
Marco Pavone
AI4TS
37
17
0
26 Jul 2023
WebArena: A Realistic Web Environment for Building Autonomous Agents
Shuyan Zhou
Frank F. Xu
Hao Zhu
Xuhui Zhou
Robert Lo
...
Tianyue Ou
Yonatan Bisk
Daniel Fried
Uri Alon
Graham Neubig
LLMAG
41
392
0
25 Jul 2023
MAEA: Multimodal Attribution for Embodied AI
Vidhi Jain
Jayant Sravan Tamarapalli
Sahiti Yerramilli
Yonatan Bisk
50
0
0
25 Jul 2023
Towards Sim2Real Transfer of Autonomy Algorithms using AutoDRIVE Ecosystem
Chinmay Vilas Samak
Tanmay Vilas Samak
Venkat Krovi
41
6
0
25 Jul 2023
Policy Gradient Optimal Correlation Search for Variance Reduction in Monte Carlo simulation and Maximum Optimal Transport
Pierre Bras
Gilles Pagès
18
1
0
24 Jul 2023
Pyrus Base: An Open Source Python Framework for the RoboCup 2D Soccer Simulation
Nader Zare
Aref Sayareh
Omid Amini
Mahtab Sarvmaili
Arad Firouzkouhi
Stan Matwin
Amilcar Soares
GP
19
1
0
22 Jul 2023
Previous
1
2
3
...
7
8
9
...
32
33
34
Next