Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1606.01540
Cited By
OpenAI Gym
5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL
ODL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"OpenAI Gym"
50 / 1,654 papers shown
Title
Data Poisoning Attacks on Off-Policy Policy Evaluation Methods
Elita Lobo
Harvineet Singh
Marek Petrik
Cynthia Rudin
Himabindu Lakkaraju
36
3
0
06 Apr 2024
Embodied Neuromorphic Artificial Intelligence for Robotics: Perspectives, Challenges, and Research Development Stack
Rachmad Vidya Wicaksana Putra
Alberto Marchisio
F. Zayer
Jorge Dias
Mohamed Bennai
41
10
0
04 Apr 2024
Integrating Explanations in Learning LTL Specifications from Demonstrations
Ashutosh Gupta
John Komp
Abhay Singh Rajput
Shankaranarayanan Krishna
Ashutosh Trivedi
Namrita Varshney
21
0
0
03 Apr 2024
EV2Gym: A Flexible V2G Simulator for EV Smart Charging Research and Benchmarking
Stavros Orfanoudakis
C. Diaz-Londono
Yunus E. Yilmaz
Peter Palensky
Pedro P. Vergara
20
5
0
02 Apr 2024
Extremum-Seeking Action Selection for Accelerating Policy Optimization
Ya-Chien Chang
Sicun Gao
45
0
0
02 Apr 2024
Game-Theoretic Deep Reinforcement Learning to Minimize Carbon Emissions and Energy Costs for AI Inference Workloads in Geo-Distributed Data Centers
Ninad Hogade
S. Pasricha
AI4CE
14
2
0
01 Apr 2024
Zero-shot Safety Prediction for Autonomous Robots with Foundation World Models
Zhenjiang Mao
Siqi Dai
Yuang Geng
Ivan Ruchkin
48
3
0
30 Mar 2024
Efficient Automatic Tuning for Data-driven Model Predictive Control via Meta-Learning
Baoyu Li
William Edwards
Kris Hauser
34
0
0
30 Mar 2024
Biologically-Plausible Topology Improved Spiking Actor Network for Efficient Deep Reinforcement Learning
Duzhen Zhang
Qingyu Wang
Tielin Zhang
Bo Xu
164
1
0
29 Mar 2024
CAESAR: Enhancing Federated RL in Heterogeneous MDPs through Convergence-Aware Sampling with Screening
Hei Yi Mak
Flint Xiaofeng Fan
Luca A. Lanzendörfer
Cheston Tan
Wei Tsang Ooi
Roger Wattenhofer
FedML
32
2
0
29 Mar 2024
Decision Mamba: Reinforcement Learning via Sequence Modeling with Selective State Spaces
Toshihiro Ota
Mamba
46
16
0
29 Mar 2024
Application-Driven Innovation in Machine Learning
David Rolnick
Alán Aspuru-Guzik
Sara Beery
B. Dilkina
P. Donti
...
Hannah Kerner
C. Monteleoni
Esther Rolf
Milind Tambe
Adam White
41
8
0
26 Mar 2024
Scenario-Based Curriculum Generation for Multi-Agent Autonomous Driving
Axel Brunnbauer
Luigi Berducci
P. Priller
D. Ničković
Radu Grosu
55
1
0
26 Mar 2024
Active Learning of Dynamics Using Prior Domain Knowledge in the Sampling Process
Kevin S. Miller
Adam J. Thorpe
Ufuk Topcu
29
0
0
25 Mar 2024
A Comparative Analysis of Visual Odometry in Virtual and Real-World Railways Environments
G. D’Amico
Mauro Marinoni
Giorgio Buttazzo
OffRL
40
1
0
25 Mar 2024
Deep Gaussian Covariance Network with Trajectory Sampling for Data-Efficient Policy Search
Can Bogoclu
Robert Vosshall
K. Cremanns
Dirk Roos
BDL
25
1
0
23 Mar 2024
ARO: Large Language Model Supervised Robotics Text2Skill Autonomous Learning
Yiwen Chen
Yuyao Ye
Ziyi Chen
Chuheng Zhang
Marcelo H. Ang
52
0
0
23 Mar 2024
Rethinking Adversarial Inverse Reinforcement Learning: Policy Imitation, Transferable Reward Recovery and Algebraic Equilibrium Proof
Yangchun Zhang
Qiang Liu
Weiming Li
Yirui Zhou
43
0
0
21 Mar 2024
On Predictive planning and counterfactual learning in active inference
Aswin Paul
Takuya Isomura
Adeel Razi
AI4CE
41
2
0
19 Mar 2024
Decomposing Control Lyapunov Functions for Efficient Reinforcement Learning
Antonio Lopez
David Fridovich-Keil
38
1
0
18 Mar 2024
Bridging the Gap between Discrete Agent Strategies in Game Theory and Continuous Motion Planning in Dynamic Environments
Hongrui Zheng
Zhijun Zhuang
Stephanie Wu
Shuo Yang
Rahul Mangharam
32
1
0
17 Mar 2024
HumanoidBench: Simulated Humanoid Benchmark for Whole-Body Locomotion and Manipulation
Carmelo Sferrazza
Dun-Ming Huang
Xingyu Lin
Youngwoon Lee
Pieter Abbeel
60
37
0
15 Mar 2024
AD3: Implicit Action is the Key for World Models to Distinguish the Diverse Visual Distractors
Yucen Wang
Shenghua Wan
Le Gan
Shuai Feng
De-Chuan Zhan
VGen
27
4
0
15 Mar 2024
Towards Efficient Risk-Sensitive Policy Gradient: An Iteration Complexity Analysis
Rui Liu
Erfaun Noorani
Pratap Tokekar
John S. Baras
39
1
0
13 Mar 2024
A Holistic Framework Towards Vision-based Traffic Signal Control with Microscopic Simulation
Pan He
Quanyi Li
Xiaoyong Yuan
Bolei Zhou
44
0
0
11 Mar 2024
Unveiling the Significance of Toddler-Inspired Reward Transition in Goal-Oriented Reinforcement Learning
Junseok Park
Yoonsung Kim
Hee Bin Yoo
Min Whoo Lee
Kibeom Kim
Won-Seok Choi
Minsu Lee
Byoung-Tak Zhang
OffRL
45
1
0
11 Mar 2024
LitSim: A Conflict-aware Policy for Long-term Interactive Traffic Simulation
haojie xin
Xiaodong Zhang
Renzhi Tang
Songyang Yan
Qianrui Zhao
Chunze Yang
Wen Cui
Zijiang Yang
58
2
0
07 Mar 2024
RACE-SM: Reinforcement Learning Based Autonomous Control for Social On-Ramp Merging
Jordan Poots
27
0
0
05 Mar 2024
Behavior Generation with Latent Actions
Seungjae Lee
Yibin Wang
Haritheja Etukuru
H. J. Kim
Mahi Shafiullah
Lerrel Pinto
VGen
OffRL
35
66
0
05 Mar 2024
Deep Reinforcement Learning for Dynamic Algorithm Selection: A Proof-of-Principle Study on Differential Evolution
Hongshu Guo
Yining Ma
Zeyuan Ma
Jiacheng Chen
Xinglin Zhang
Zhiguang Cao
Jun Zhang
Yue-jiao Gong
49
18
0
04 Mar 2024
Craftax: A Lightning-Fast Benchmark for Open-Ended Reinforcement Learning
Michael T. Matthews
Michael Beukman
Benjamin Ellis
Mikayel Samvelyan
Matthew Jackson
Samuel Coward
Jakob Foerster
OffRL
42
26
0
26 Feb 2024
DynaMITE-RL: A Dynamic Model for Improved Temporal Meta-Reinforcement Learning
Anthony Liang
Guy Tennenholtz
Chih-Wei Hsu
Yinlam Chow
Erdem Biyik
Craig Boutilier
OffRL
45
1
0
25 Feb 2024
Leveraging Demonstrator-perceived Precision for Safe Interactive Imitation Learning of Clearance-limited Tasks
Hanbit Oh
Takamitsu Matsubara
66
3
0
21 Feb 2024
Easy as ABCs: Unifying Boltzmann Q-Learning and Counterfactual Regret Minimization
Luca DÁmico-Wong
Hugh Zhang
Marc Lanctot
David C. Parkes
OffRL
16
0
0
19 Feb 2024
Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent
Quentin Gallouedec
E. Beeching
Clément Romac
Emmanuel Dellandrea
37
11
0
15 Feb 2024
Dataset Clustering for Improved Offline Policy Learning
Qiang Wang
Yixin Deng
Francisco Roldan Sanchez
Keru Wang
Kevin McGuinness
Noel E. O'Connor
Stephen J. Redmond
OffRL
34
2
0
14 Feb 2024
Learning Interpretable Policies in Hindsight-Observable POMDPs through Partially Supervised Reinforcement Learning
Michael Lanier
Ying Xu
Nathan Jacobs
Chongjie Zhang
Yevgeniy Vorobeychik
26
2
0
14 Feb 2024
Decision Theory-Guided Deep Reinforcement Learning for Fast Learning
Zelin Wan
Jin-Hee Cho
Mu Zhu
Ahmed H. Anwar
Charles A. Kamhoua
Munindar P. Singh
AI4CE
23
0
0
08 Feb 2024
Learning Uncertainty-Aware Temporally-Extended Actions
Joongkyu Lee
Seung Joon Park
Yunhao Tang
Min-hwan Oh
24
2
0
08 Feb 2024
Exploration Without Maps via Zero-Shot Out-of-Distribution Deep Reinforcement Learning
Shathushan Sivashangaran
Apoorva Khairnar
A. Eskandarian
OffRL
45
0
0
07 Feb 2024
Voronoi Candidates for Bayesian Optimization
Nathan Wycoff
John W. Smith
Annie S. Booth
R. Gramacy
39
0
0
07 Feb 2024
OIL-AD: An Anomaly Detection Framework for Sequential Decision Sequences
Chen Wang
S. Erfani
T. Alpcan
Christopher Leckie
OffRL
38
2
0
07 Feb 2024
Learning Diverse Policies with Soft Self-Generated Guidance
Guojian Wang
Faguo Wu
Xiao Zhang
Jianxiang Liu
OffRL
31
4
0
07 Feb 2024
A Deep Reinforcement Learning Approach for Adaptive Traffic Routing in Next-gen Networks
A. Abrol
Purnima Murali Mohan
Tram Truong-Huu
32
1
0
07 Feb 2024
An Architecture for Unattended Containerized (Deep) Reinforcement Learning with Webots
Tobias Haubold
Petra Linke
OffRL
13
0
0
06 Feb 2024
No-Regret Reinforcement Learning in Smooth MDPs
Davide Maran
Alberto Maria Metelli
Matteo Papini
Marcello Restell
38
4
0
06 Feb 2024
RL-VLM-F: Reinforcement Learning from Vision Language Foundation Model Feedback
Yufei Wang
Zhanyi Sun
Jesse Zhang
Zhou Xian
Erdem Biyik
David Held
Zackory M. Erickson
VLM
55
51
0
06 Feb 2024
Frugal Actor-Critic: Sample Efficient Off-Policy Deep Reinforcement Learning Using Unique Experiences
Nikhil Kumar Singh
Indranil Saha
OffRL
27
0
0
05 Feb 2024
Gazebo Plants: Simulating Plant-Robot Interaction with Cosserat Rods
Junchen Deng
Samhita Marri
Jonathan Klein
Wojciech Palubicki
Soren Pirk
Girish Chowdhary
D. L. Michels
22
2
0
04 Feb 2024
Towards Optimal Adversarial Robust Q-learning with Bellman Infinity-error
Haoran Li
Zicheng Zhang
Wang Luo
Congying Han
Yudong Hu
Tiande Guo
Shichen Liao
AAML
41
2
0
03 Feb 2024
Previous
1
2
3
4
5
6
...
32
33
34
Next