Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1606.01540
Cited By
OpenAI Gym
5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL
ODL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"OpenAI Gym"
50 / 2,578 papers shown
Title
WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents
Shunyu Yao
Howard Chen
John Yang
Karthik Narasimhan
LLMAG
LM&Ro
176
522
0
04 Jul 2022
Renaissance Robot: Optimal Transport Policy Fusion for Learning Diverse Skills
Julia Tan
Ransalu Senanayake
Fabio Ramos
77
2
0
03 Jul 2022
A Survey on Active Simultaneous Localization and Mapping: State of the Art and New Frontiers
Julio A. Placed
Jared Strader
Henry Carrillo
Nikolay Atanasov
Vadim Indelman
Luca Carlone
J. A. Castellanos
132
191
0
01 Jul 2022
Watch and Match: Supercharging Imitation with Regularized Optimal Transport
Siddhant Haldar
Vaibhav Mathur
Denis Yarats
Lerrel Pinto
116
67
0
30 Jun 2022
Lagrangian Density Space-Time Deep Neural Network Topology
B. Bishnoi
PINN
75
1
0
30 Jun 2022
Online vs. Offline Adaptive Domain Randomization Benchmark
Gabriele Tiboni
Karol Arndt
Giuseppe Averta
Ville Kyrki
Tatiana Tommasi
OffRL
51
5
0
29 Jun 2022
Fleet-DAgger: Interactive Robot Fleet Learning with Scalable Human Supervision
Ryan Hoque
Lawrence Yunliang Chen
Satvik Sharma
K. Dharmarajan
Brijen Thananjeyan
Pieter Abbeel
Ken Goldberg
190
32
0
29 Jun 2022
Guided Exploration in Reinforcement Learning via Monte Carlo Critic Optimization
Igor Kuznetsov
89
2
0
25 Jun 2022
Value Function Decomposition for Iterative Design of Reinforcement Learning Agents
J. MacGlashan
Evan Archer
A. Devlic
Takuma Seno
Craig Sherstan
Peter R. Wurman
AI PeterStoneSony
56
6
0
24 Jun 2022
Reinforcement Learning under Partial Observability Guided by Learned Environment Models
Edi Muškardin
Martin Tappler
B. Aichernig
Ingo Pill
OffRL
41
7
0
23 Jun 2022
Recursive Reinforcement Learning
E. M. Hahn
Mateo Perez
S. Schewe
Fabio Somenzi
Ashutosh Trivedi
D. Wojtczak
LRM
33
0
0
23 Jun 2022
Latent Policies for Adversarial Imitation Learning
Tianyu Wang
Nikhil Karnwal
Nikolay Atanasov
50
5
0
22 Jun 2022
POGEMA: Partially Observable Grid Environment for Multiple Agents
Alexey Skrynnik
Anton Andreychuk
Konstantin Yakovlev
Aleksandr I. Panov
24
6
0
22 Jun 2022
Generative Pretraining for Black-Box Optimization
S. Krishnamoorthy
Satvik Mashkaria
Aditya Grover
OffRL
AI4CE
146
31
0
22 Jun 2022
EnvPool: A Highly Parallel Reinforcement Learning Environment Execution Engine
Jiayi Weng
Min Lin
Shengyi Huang
Bo Liu
Denys Makoviichuk
...
Yufan Song
Ting Luo
Yukun Jiang
Zhongwen Xu
Shuicheng Yan
MoE
110
63
0
21 Jun 2022
Uncertainty Quantification for Competency Assessment of Autonomous Agents
Aastha Acharya
Rebecca L. Russell
Nisar R. Ahmed
50
3
0
21 Jun 2022
Model-Based Imitation Learning Using Entropy Regularization of Model and Policy
E. Uchibe
53
4
0
21 Jun 2022
Guided Safe Shooting: model based reinforcement learning with safety constraints
Giuseppe Paolo
Jonas Gonzalez-Billandon
Albert Thomas
Balázs Kégl
97
3
0
20 Jun 2022
Benchmarking Constraint Inference in Inverse Reinforcement Learning
Guiliang Liu
Yudong Luo
A. Gaurav
K. Rezaee
Pascal Poupart
135
24
0
20 Jun 2022
Sampling Efficient Deep Reinforcement Learning through Preference-Guided Stochastic Exploration
Wenhui Huang
Cong Zhang
Jingda Wu
Xiangkun He
Jie Zhang
Chengqi Lv
54
9
0
20 Jun 2022
From Multi-agent to Multi-robot: A Scalable Training and Evaluation Platform for Multi-robot Reinforcement Learning
Zhiuxan Liang
Jiannong Cao
Shan Jiang
Divya Saxena
Jinlin Chen
Huafeng Xu
57
10
0
20 Jun 2022
Learning Multi-Task Transferable Rewards via Variational Inverse Reinforcement Learning
Se-Wook Yoo
Seung-Woo Seo
DRL
47
5
0
19 Jun 2022
From Universal Humanoid Control to Automatic Physically Valid Character Creation
Zhengyi Luo
Ye Yuan
Kris Kitani
AI4CE
52
5
0
18 Jun 2022
Interactive Visual Reasoning under Uncertainty
Manjie Xu
Guangyuan Jiang
Wei Liang
Song-Chun Zhu
Yixin Zhu
LRM
108
5
0
18 Jun 2022
SMPL: Simulated Industrial Manufacturing and Process Control Learning Environments
Mohan Zhang
Xiaozhou Wang
Benjamin Decardi-Nelson
Bo Song
A. Zhang
...
Jiayi Cheng
Xiaohong Liu
DengDeng Yu
Matthew Poon
Animesh Garg
69
4
0
17 Jun 2022
Double Check Your State Before Trusting It: Confidence-Aware Bidirectional Offline Model-Based Imagination
Jiafei Lyu
Xiu Li
Zongqing Lu
OffRL
88
26
0
16 Jun 2022
Mean-Semivariance Policy Optimization via Risk-Averse Reinforcement Learning
Xiaoteng Ma
Shuai Ma
Li Xia
Qianchuan Zhao
92
3
0
15 Jun 2022
Variance Reduction for Policy-Gradient Methods via Empirical Variance Minimization
Maxim Kaledin
Alexander Golubev
Denis Belomestny
OffRL
85
4
0
14 Jun 2022
Open-Ended Learning Strategies for Learning Complex Locomotion Skills
Fangqin Zhou
Joaquin Vanschoren
101
2
0
14 Jun 2022
Analysis of Randomization Effects on Sim2Real Transfer in Reinforcement Learning for Robotic Manipulation Tasks
Josip Josifovski
M. Malmir
Noah Klarmann
B. L. Žagar
Nicolás Navarro-Guerrero
Alois C. Knoll
72
18
0
13 Jun 2022
Specifying and Testing
k
k
k
-Safety Properties for Machine-Learning Models
M. Christakis
Hasan Ferit Eniser
Jörg Hoffmann
Adish Singla
Valentin Wüstholz
56
6
0
13 Jun 2022
Intrinsically motivated option learning: a comparative study of recent methods
Đorđe Božić
Predrag Tadić
Mladen Nikolic
42
1
0
13 Jun 2022
Arena-Bench: A Benchmarking Suite for Obstacle Avoidance Approaches in Highly Dynamic Environments
Linh Kästner
Teham Bhuiyan
T. Le
Elias Treis
Johanne Cox
...
Reyk Carstens
Duc Pichel
Bassel Fatloun
Niloufar Khorsandi
Jens Lambrecht
101
37
0
12 Jun 2022
Dealing with Sparse Rewards in Continuous Control Robotics via Heavy-Tailed Policies
Souradip Chakraborty
Amrit Singh Bedi
Alec Koppel
Pratap Tokekar
Tianyi Zhou
81
8
0
12 Jun 2022
Anchor-Changing Regularized Natural Policy Gradient for Multi-Objective Reinforcement Learning
Ruida Zhou
Tao-Wen Liu
D. Kalathil
P. R. Kumar
Chao Tian
70
15
0
10 Jun 2022
Overcoming the Spectral Bias of Neural Value Approximation
Ge Yang
Anurag Ajay
Pulkit Agrawal
103
26
0
09 Jun 2022
AAM-Gym: Artificial Intelligence Testbed for Advanced Air Mobility
Marc Brittain
Luis E. Alvarez
Kara Breeden
Ian Jessen
49
8
0
09 Jun 2022
Receding Horizon Inverse Reinforcement Learning
Yiqing Xu
Wei Gao
David Hsu
77
14
0
09 Jun 2022
Simplifying Polylogarithms with Machine Learning
Aurélien Dersy
M. Schwartz
Xiao-Yan Zhang
AI4CE
210
16
0
08 Jun 2022
A Study of Continual Learning Methods for Q-Learning
Benedikt Bagus
A. Gepperth
CLL
OffRL
103
4
0
08 Jun 2022
Action Noise in Off-Policy Deep Reinforcement Learning: Impact on Exploration and Performance
Jakob J. Hollenstein
Sayantan Auddy
Matteo Saveriano
Erwan Renaudo
J. Piater
86
21
0
08 Jun 2022
Introspective Experience Replay: Look Back When Surprised
Ramnath Kumar
Dheeraj M. Nagaraj
OffRL
65
2
0
07 Jun 2022
Neuro-Nav: A Library for Neurally-Plausible Reinforcement Learning
Arthur Juliani
Samuel A. Barnett
Brandon Davis
Margaret E. Sereno
Ida Momennejad
OffRL
65
10
0
06 Jun 2022
Achieving Goals using Reward Shaping and Curriculum Learning
M. Anca
Jonathan D. Thomas
Dabal Pedamonti
M. Studley
Mark Hansen
41
1
0
06 Jun 2022
ARC - Actor Residual Critic for Adversarial Imitation Learning
A. Deka
Changliu Liu
Katia Sycara
108
5
0
05 Jun 2022
Interpolating Between Softmax Policy Gradient and Neural Replicator Dynamics with Capped Implicit Exploration
Dustin Morrill
Esraá Saleh
Michael Bowling
Amy Greenwald
45
0
0
04 Jun 2022
FishGym: A High-Performance Physics-based Simulation Framework for Underwater Robot Learning
Wenji Liu
Kai-Yi Bai
Xuming He
Shuran Song
Changxi Zheng
Xiaopei Liu
AI4CE
85
14
0
03 Jun 2022
Disentangling Epistemic and Aleatoric Uncertainty in Reinforcement Learning
Bertrand Charpentier
Ransalu Senanayake
Mykel Kochenderfer
Stephan Günnemann
PER
UD
85
26
0
03 Jun 2022
Equivariant Reinforcement Learning for Quadrotor UAV
Beomyeol Yu
Taeyoung Lee
95
8
0
02 Jun 2022
Posterior Coreset Construction with Kernelized Stein Discrepancy for Model-Based Reinforcement Learning
Souradip Chakraborty
Amrit Singh Bedi
Alec Koppel
Brian M. Sadler
Furong Huang
Pratap Tokekar
Tianyi Zhou
79
9
0
02 Jun 2022
Previous
1
2
3
...
17
18
19
...
50
51
52
Next