Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1606.01540
Cited By
OpenAI Gym
5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL
ODL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"OpenAI Gym"
50 / 2,578 papers shown
Title
Computational Thought Experiments for a More Rigorous Philosophy and Science of the Mind
Iris Oved
Nikhil Krishnaswamy
James Pustejovsky
Joshua Hartshorne
LM&Ro
60
0
0
14 May 2024
OpenBot-Fleet: A System for Collective Learning with Real Robots
Matthias M¨uller
Samarth Brahmbhatt
Ankur Deka
Quentin Leboutet
David Hafner
V. Koltun
82
0
0
13 May 2024
CAGES: Cost-Aware Gradient Entropy Search for Efficient Local Multi-Fidelity Bayesian Optimization
Wei-Ting Tang
J. Paulson
65
1
0
13 May 2024
Offline Model-Based Optimization via Policy-Guided Gradient Search
Yassine Chemingui
Aryan Deshwal
Trong Nghia Hoang
J. Doppa
OffRL
110
14
0
08 May 2024
An Improved Finite-time Analysis of Temporal Difference Learning with Deep Neural Networks
Zhifa Ke
Zaiwen Wen
Junyu Zhang
88
0
0
07 May 2024
SwiftRL: Towards Efficient Reinforcement Learning on Real Processing-In-Memory Systems
Kailash Gogineni
Sai Santosh Dayapule
Juan Gómez Luna
Karthikeya Gogineni
Peng Wei
Tian-Shing Lan
Mohammad Sadrosadati
Onur Mutlu
Guru Venkataramani
99
11
0
07 May 2024
Off-OAB: Off-Policy Policy Gradient Method with Optimal Action-Dependent Baseline
Wenjia Meng
Qian Zheng
Long Yang
Yilong Yin
Gang Pan
OffRL
80
0
0
04 May 2024
CityLearn v2: Energy-flexible, resilient, occupant-centric, and carbon-aware management of grid-interactive communities
Kingsley Nweye
Kathryn Kaspar
Giacomo Buscemi
Tiago Fonseca
G. Pinto
...
Luis Lino Ferreira
Tianzhen Hong
Mohamed Ouf
Alfonso Capozzoli
Zoltán Nagy
71
11
0
02 May 2024
Constrained Reinforcement Learning Under Model Mismatch
Zhongchang Sun
Sihong He
Fei Miao
Shaofeng Zou
78
6
0
02 May 2024
Queue-based Eco-Driving at Roundabouts with Reinforcement Learning
Anna-Lena Schlamp
Werner Huber
Stefanie Schmidtner
34
0
0
01 May 2024
Employing Federated Learning for Training Autonomous HVAC Systems
Fredrik Hagström
Vikas Garg
Fabricio Oliveira
AI4CE
153
0
0
01 May 2024
SAFE-RL: Saliency-Aware Counterfactual Explainer for Deep Reinforcement Learning Policies
Amir Samadi
K. Koufos
Kurt Debattista
M. Dianati
85
5
0
28 Apr 2024
Knowledge Transfer for Cross-Domain Reinforcement Learning: A Systematic Review
Sergio A. Serrano
J. Martínez-Carranza
L. Sucar
84
1
0
26 Apr 2024
Timely Communications for Remote Inference
Md Kamran Chowdhury Shisher
Yin Sun
I-Hong Hou
59
13
0
25 Apr 2024
Deep Reinforcement Learning for Bipedal Locomotion: A Brief Survey
Lingfan Bao
Josephine N. Humphreys
Tianhu Peng
Chengxu Zhou
121
9
0
25 Apr 2024
Guided-SPSA: Simultaneous Perturbation Stochastic Approximation assisted by the Parameter Shift Rule
Maniraman Periyasamy
Axel Plinge
Christopher Mutschler
Daniel D. Scherer
Wolfgang Mauerer
95
13
0
24 Apr 2024
GeoLLM-Engine: A Realistic Environment for Building Geospatial Copilots
Simranjit Singh
Michael Fore
Dimitrios Stamoulis
LLMAG
77
12
0
23 Apr 2024
TrajDeleter: Enabling Trajectory Forgetting in Offline Reinforcement Learning Agents
Chen Gong
Kecen Li
Jin Yao
Tianhao Wang
OnRL
75
1
0
18 Apr 2024
MPC of Uncertain Nonlinear Systems with Meta-Learning for Fast Adaptation of Neural Predictive Models
Jiaqi Yan
Ankush Chakrabarty
Alisa Rupenyan
John Lygeros
87
2
0
18 Apr 2024
Compact Multi-Object Placement Using Adjacency-Aware Reinforcement Learning
Benedikt Kreis
Nils Dengler
Jorge de Heuvel
Rohit Menon
Hamsa Datta Perur
Maren Bennewitz
71
0
0
16 Apr 2024
Warm-Start Variational Quantum Policy Iteration
Nico Meyer
Jakob Murauer
Alexander Popov
Christian Ufrecht
Axel Plinge
Christopher Mutschler
Daniel D. Scherer
60
3
0
16 Apr 2024
Offline Trajectory Generalization for Offline Reinforcement Learning
Ziqi Zhao
Zhaochun Ren
Liu Yang
Fajie Yuan
Pengjie Ren
Zhumin Chen
Jun Ma
Xin Xin
OffRL
75
1
0
16 Apr 2024
Generalized Population-Based Training for Hyperparameter Optimization in Reinforcement Learning
Hui Bai
Ran Cheng
88
6
0
12 Apr 2024
Qiskit-Torch-Module: Fast Prototyping of Quantum Neural Networks
Nico Meyer
Christian Ufrecht
Maniraman Periyasamy
Axel Plinge
Christopher Mutschler
Daniel D. Scherer
Andreas R. Maier
79
6
0
09 Apr 2024
EVLearn: Extending the CityLearn Framework with Electric Vehicle Simulation
Tiago Fonseca
Luis Lino Ferreira
Bernardo Cabral
Ricardo Severino
Kingsley Nweye
Dipanjan Ghose
Zoltán Nagy
74
4
0
08 Apr 2024
Data Poisoning Attacks on Off-Policy Policy Evaluation Methods
Elita Lobo
Harvineet Singh
Marek Petrik
Cynthia Rudin
Himabindu Lakkaraju
78
3
0
06 Apr 2024
Embodied Neuromorphic Artificial Intelligence for Robotics: Perspectives, Challenges, and Research Development Stack
Rachmad Vidya Wicaksana Putra
Alberto Marchisio
F. Zayer
Jorge Dias
Mohamed Bennai
73
11
0
04 Apr 2024
Integrating Explanations in Learning LTL Specifications from Demonstrations
Ashutosh Gupta
John Komp
Abhay Singh Rajput
Shankaranarayanan Krishna
Ashutosh Trivedi
Namrita Varshney
43
0
0
03 Apr 2024
EV2Gym: A Flexible V2G Simulator for EV Smart Charging Research and Benchmarking
Stavros Orfanoudakis
C. Diaz-Londono
Yunus E. Yilmaz
Peter Palensky
Pedro P. Vergara
59
7
0
02 Apr 2024
Extremum-Seeking Action Selection for Accelerating Policy Optimization
Ya-Chien Chang
Sicun Gao
123
0
0
02 Apr 2024
Game-Theoretic Deep Reinforcement Learning to Minimize Carbon Emissions and Energy Costs for AI Inference Workloads in Geo-Distributed Data Centers
Ninad Hogade
S. Pasricha
AI4CE
32
3
0
01 Apr 2024
Zero-shot Safety Prediction for Autonomous Robots with Foundation World Models
Zhenjiang Mao
Siqi Dai
Yuang Geng
Ivan Ruchkin
107
3
0
30 Mar 2024
Efficient Automatic Tuning for Data-driven Model Predictive Control via Meta-Learning
Baoyu Li
William Edwards
Kris Hauser
72
0
0
30 Mar 2024
Biologically-Plausible Topology Improved Spiking Actor Network for Efficient Deep Reinforcement Learning
Duzhen Zhang
Qingyu Wang
Tielin Zhang
Bo Xu
324
2
0
29 Mar 2024
CAESAR: Enhancing Federated RL in Heterogeneous MDPs through Convergence-Aware Sampling with Screening
Hei Yi Mak
Flint Xiaofeng Fan
Luca A. Lanzendörfer
Cheston Tan
Wei Tsang Ooi
Roger Wattenhofer
FedML
73
2
0
29 Mar 2024
Decision Mamba: Reinforcement Learning via Sequence Modeling with Selective State Spaces
Toshihiro Ota
Mamba
92
18
0
29 Mar 2024
Application-Driven Innovation in Machine Learning
David Rolnick
Alán Aspuru-Guzik
Sara Beery
B. Dilkina
P. Donti
...
Hannah Kerner
C. Monteleoni
Esther Rolf
Milind Tambe
Adam White
77
10
0
26 Mar 2024
Scenario-Based Curriculum Generation for Multi-Agent Autonomous Driving
Axel Brunnbauer
Luigi Berducci
P. Priller
D. Ničković
Radu Grosu
117
2
0
26 Mar 2024
Active Learning of Dynamics Using Prior Domain Knowledge in the Sampling Process
Kevin S. Miller
Adam J. Thorpe
Ufuk Topcu
55
0
0
25 Mar 2024
A Comparative Analysis of Visual Odometry in Virtual and Real-World Railways Environments
G. D’Amico
Mauro Marinoni
Giorgio Buttazzo
OffRL
78
1
0
25 Mar 2024
Deep Gaussian Covariance Network with Trajectory Sampling for Data-Efficient Policy Search
Can Bogoclu
Robert Vosshall
K. Cremanns
Dirk Roos
BDL
47
1
0
23 Mar 2024
ARO: Large Language Model Supervised Robotics Text2Skill Autonomous Learning
Yiwen Chen
Yuyao Ye
Ziyi Chen
Chuheng Zhang
Marcelo H. Ang
54
0
0
23 Mar 2024
Rethinking Adversarial Inverse Reinforcement Learning: Policy Imitation, Transferable Reward Recovery and Algebraic Equilibrium Proof
Yangchun Zhang
Qiang Liu
Weiming Li
Yirui Zhou
76
0
0
21 Mar 2024
On Predictive planning and counterfactual learning in active inference
Aswin Paul
Takuya Isomura
Adeel Razi
AI4CE
69
2
0
19 Mar 2024
Decomposing Control Lyapunov Functions for Efficient Reinforcement Learning
Antonio Lopez
David Fridovich-Keil
87
1
0
18 Mar 2024
Bridging the Gap between Discrete Agent Strategies in Game Theory and Continuous Motion Planning in Dynamic Environments
Hongrui Zheng
Zhijun Zhuang
Stephanie Wu
Shuo Yang
Rahul Mangharam
61
1
0
17 Mar 2024
HumanoidBench: Simulated Humanoid Benchmark for Whole-Body Locomotion and Manipulation
Carmelo Sferrazza
Dun-Ming Huang
Xingyu Lin
Youngwoon Lee
Pieter Abbeel
128
48
0
15 Mar 2024
AD3: Implicit Action is the Key for World Models to Distinguish the Diverse Visual Distractors
Yucen Wang
Shenghua Wan
Le Gan
Shuai Feng
De-Chuan Zhan
VGen
72
6
0
15 Mar 2024
Towards Efficient Risk-Sensitive Policy Gradient: An Iteration Complexity Analysis
Rui Liu
Erfaun Noorani
Pratap Tokekar
John S. Baras
104
1
0
13 Mar 2024
A Holistic Framework Towards Vision-based Traffic Signal Control with Microscopic Simulation
Pan He
Quanyi Li
Xiaoyong Yuan
Bolei Zhou
55
0
0
11 Mar 2024
Previous
1
2
3
4
5
6
...
50
51
52
Next