1606.01540
Cited By
OpenAI Gym
5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL
ODL
Papers citing
"OpenAI Gym"
50 / 1,654 papers shown
Title
Quantum Architecture Search: A Survey
Darya Martyniuk
Johannes Jung
Adrian Paschke
AI4CE
41
8
0
10 Jun 2024
Sim-To-Real Transfer for Visual Reinforcement Learning of Deformable Object Manipulation for Robot-Assisted Surgery
Paul Maria Scheikl
E. Tagliabue
B. Gyenes
M. Wagner
Diego Dall'Alba
Paolo Fiorini
Franziska Mathis-Ullrich
MedIm
46
47
0
10 Jun 2024
Cross Language Soccer Framework: An Open Source Framework for the RoboCup 2D Soccer Simulation
Nader Zare
Aref Sayareh
Alireza Sadraii
Arad Firouzkouhi
Amilcar Soares
34
0
0
09 Jun 2024
Algorithms for learning value-aligned policies considering admissibility relaxation
Andrés Holgado-Sánchez
Joaquín Arias
Holger Billhardt
Sascha Ossowski
29
0
0
07 Jun 2024
Reflective Policy Optimization
Yaozhong Gan
Renye Yan
Zhe Wu
Junliang Xing
40
1
0
06 Jun 2024
Adaptive Preference Scaling for Reinforcement Learning with Human Feedback
Ilgee Hong
Zichong Li
Alexander Bukharin
Yixiao Li
Haoming Jiang
Tianbao Yang
Tuo Zhao
40
4
0
04 Jun 2024
Power Mean Estimation in Stochastic Monte-Carlo Tree Search
Tuan Dam
Odalric-Ambrym Maillard
Emilie Kaufmann
34
0
0
04 Jun 2024
FightLadder: A Benchmark for Competitive Multi-Agent Reinforcement Learning
Wenzhe Li
Zihan Ding
Seth Karten
Chi Jin
40
1
0
04 Jun 2024
NeoRL: Efficient Exploration for Nonepisodic RL
Bhavya Sukhija
Lenart Treven
Florian Dorfler
Stelian Coros
Andreas Krause
OffRL
41
0
0
03 Jun 2024
Adaptive Advantage-Guided Policy Regularization for Offline Reinforcement Learning
Tenglong Liu
Yang Li
Yixing Lan
Hao Gao
Wei Pan
Xin Xu
OffRL
36
5
0
30 May 2024
OMPO: A Unified Framework for RL under Policy and Dynamics Shifts
Yu-Juan Luo
Tianying Ji
Gang Hua
Jianwei Zhang
Huazhe Xu
Xianyuan Zhan
OffRL
64
3
0
29 May 2024
FDQN: A Flexible Deep Q-Network Framework for Game Automation
Prabhath Reddy Gujavarthy
OffRL
22
0
0
29 May 2024
Efficient Learning in Chinese Checkers: Comparing Parameter Sharing in Multi-Agent Reinforcement Learning
Noah Adhikari
Allen Gu
49
0
0
29 May 2024
DTR-Bench: An in silico Environment and Benchmark Platform for Reinforcement Learning Based Dynamic Treatment Regime
Zhiyao Luo
Mingcheng Zhu
Fenglin Liu
Jiali Li
Yangchen Pan
Jiandong Zhou
Tingting Zhu
OffRL
43
3
0
28 May 2024
PyTAG: Tabletop Games for Multi-Agent Reinforcement Learning
Martin Balla
G. E. Long
J. Goodman
Raluca D. Gaina
Diego Perez-Liebana
OffRL
GP
15
1
0
28 May 2024
Mollification Effects of Policy Gradient Methods
Tao Wang
Sylvia Herbert
Sicun Gao
54
0
0
28 May 2024
Matrix Low-Rank Approximation For Policy Gradient Methods
Sergio Rozada
A. Marques
42
2
0
27 May 2024
Matrix Low-Rank Trust Region Policy Optimization
Sergio Rozada
Antonio G. Marques
45
0
0
27 May 2024
Grounding Stylistic Domain Generalization with Quantitative Domain Shift Measures and Synthetic Scene Images
Yiran Luo
Joshua Forster Feinglass
Tejas Gokhale
Kuan-Cheng Lee
Chitta Baral
Yezhou Yang
39
0
0
24 May 2024
Human-in-the-loop Reinforcement Learning for Data Quality Monitoring in Particle Physics Experiments
Olivia Jullian Parra
J. G. Pardiñas
Lorenzo Del Pianta Pérez
Maximilian Janisch
S. Klaver
Thomas Lehéricy
N. Serra
OffRL
39
1
0
24 May 2024
Momentum-Based Federated Reinforcement Learning with Interaction and Communication Efficiency
Sheng Yue
Xingyuan Hua
Lili Chen
Ju Ren
28
1
0
24 May 2024
A Survey on Vision-Language-Action Models for Embodied AI
Yueen Ma
Zixing Song
Yuzheng Zhuang
Jianye Hao
Irwin King
LM&Ro
82
45
0
23 May 2024
BenchNav: Simulation Platform for Benchmarking Off-road Navigation Algorithms with Probabilistic Traversability
Masafumi Endo
Kohei Honda
Genya Ishigami
37
2
0
22 May 2024
Investigating the Impact of Choice on Deep Reinforcement Learning for Space Controls
Nathaniel P. Hamilton
Kyle Dunlap
Kerianne L. Hobbs
42
0
0
20 May 2024
Stochastic Q-learning for Large Discrete Action Spaces
Fares Fourati
Vaneet Aggarwal
Mohamed-Slim Alouini
OffRL
44
2
0
16 May 2024
Computational Thought Experiments for a More Rigorous Philosophy and Science of the Mind
Iris Oved
Nikhil Krishnaswamy
James Pustejovsky
Joshua Hartshorne
LM&Ro
34
0
0
14 May 2024
CAGES: Cost-Aware Gradient Entropy Search for Efficient Local Multi-Fidelity Bayesian Optimization
Wei-Ting Tang
J. Paulson
22
1
0
13 May 2024
OpenBot-Fleet: A System for Collective Learning with Real Robots
Matthias Müller
Samarth Brahmbhatt
Ankur Deka
Quentin Leboutet
David Hafner
V. Koltun
53
0
0
13 May 2024
Offline Model-Based Optimization via Policy-Guided Gradient Search
Yassine Chemingui
Aryan Deshwal
Trong Nghia Hoang
J. Doppa
OffRL
53
9
0
08 May 2024
An Improved Finite-time Analysis of Temporal Difference Learning with Deep Neural Networks
Zhifa Ke
Zaiwen Wen
Junyu Zhang
37
0
0
07 May 2024
SwiftRL: Towards Efficient Reinforcement Learning on Real Processing-In-Memory Systems
Kailash Gogineni
Sai Santosh Dayapule
Juan Gómez Luna
Karthikeya Gogineni
Peng Wei
Tian-Shing Lan
Mohammad Sadrosadati
Onur Mutlu
Guru Venkataramani
55
11
0
07 May 2024
Off-OAB: Off-Policy Policy Gradient Method with Optimal Action-Dependent Baseline
Wenjia Meng
Qian Zheng
Long Yang
Yilong Yin
Gang Pan
OffRL
34
0
0
04 May 2024
CityLearn v2: Energy-flexible, resilient, occupant-centric, and carbon-aware management of grid-interactive communities
Kingsley Nweye
Kathryn Kaspar
Giacomo Buscemi
Tiago Fonseca
G. Pinto
...
Luis Lino Ferreira
Tianzhen Hong
Mohamed Ouf
Alfonso Capozzoli
Zoltán Nagy
27
7
0
02 May 2024
Constrained Reinforcement Learning Under Model Mismatch
Zhongchang Sun
Sihong He
Fei Miao
Shaofeng Zou
48
4
0
02 May 2024
Queue-based Eco-Driving at Roundabouts with Reinforcement Learning
Anna-Lena Schlamp
Werner Huber
Stefanie Schmidtner
27
0
0
01 May 2024
Employing Federated Learning for Training Autonomous HVAC Systems
Fredrik Hagström
Vikas K. Garg
Fabricio Oliveira
AI4CE
76
0
0
01 May 2024
SAFE-RL: Saliency-Aware Counterfactual Explainer for Deep Reinforcement Learning Policies
Amir Samadi
K. Koufos
Kurt Debattista
M. Dianati
48
4
0
28 Apr 2024
Knowledge Transfer for Cross-Domain Reinforcement Learning: A Systematic Review
Sergio A. Serrano
J. Martínez-Carranza
L. Sucar
41
0
0
26 Apr 2024
Timely Communications for Remote Inference
Md Kamran Chowdhury Shisher
Yin Sun
I-Hong Hou
23
13
0
25 Apr 2024
Deep Reinforcement Learning for Bipedal Locomotion: A Brief Survey
Lingfan Bao
Josephine N. Humphreys
Tianhu Peng
Chengxu Zhou
73
5
0
25 Apr 2024
Guided-SPSA: Simultaneous Perturbation Stochastic Approximation assisted by the Parameter Shift Rule
Maniraman Periyasamy
Axel Plinge
Christopher Mutschler
Daniel D. Scherer
Wolfgang Mauerer
54
6
0
24 Apr 2024
GeoLLM-Engine: A Realistic Environment for Building Geospatial Copilots
Simranjit Singh
Michael Fore
Dimitrios Stamoulis
LLMAG
35
12
0
23 Apr 2024
TrajDeleter: Enabling Trajectory Forgetting in Offline Reinforcement Learning Agents
Chen Gong
Kecen Li
Jin Yao
Tianhao Wang
OnRL
41
0
0
18 Apr 2024
MPC of Uncertain Nonlinear Systems with Meta-Learning for Fast Adaptation of Neural Predictive Models
Jiaqi Yan
Ankush Chakrabarty
Alisa Rupenyan
John Lygeros
37
2
0
18 Apr 2024
Compact Multi-Object Placement Using Adjacency-Aware Reinforcement Learning
Benedikt Kreis
Nils Dengler
Jorge de Heuvel
Rohit Menon
Hamsa Datta Perur
Maren Bennewitz
24
0
0
16 Apr 2024
Warm-Start Variational Quantum Policy Iteration
Nico Meyer
Jakob Murauer
Alexander Popov
Christian Ufrecht
Axel Plinge
Christopher Mutschler
Daniel D. Scherer
41
2
0
16 Apr 2024
Offline Trajectory Generalization for Offline Reinforcement Learning
Ziqi Zhao
Zhaochun Ren
Liu Yang
Fajie Yuan
Pengjie Ren
Zhumin Chen
Jun Ma
Xin Xin
OffRL
29
1
0
16 Apr 2024
Generalized Population-Based Training for Hyperparameter Optimization in Reinforcement Learning
Hui Bai
Ran Cheng
58
4
0
12 Apr 2024
Qiskit-Torch-Module: Fast Prototyping of Quantum Neural Networks
Nico Meyer
Christian Ufrecht
Maniraman Periyasamy
Axel Plinge
Christopher Mutschler
Daniel D. Scherer
Andreas R. Maier
44
5
0
09 Apr 2024
EVLearn: Extending the CityLearn Framework with Electric Vehicle Simulation
Tiago Fonseca
Luis Lino Ferreira
Bernardo Cabral
Ricardo Severino
Kingsley Nweye
Dipanjan Ghose
Zoltán Nagy
27
3
0
08 Apr 2024