ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1606.01540
  4. Cited By
OpenAI Gym

OpenAI Gym

5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
    OffRLODL
ArXiv (abs)PDFHTML

Papers citing "OpenAI Gym"

50 / 2,578 papers shown
Title
Computational Thought Experiments for a More Rigorous Philosophy and
  Science of the Mind
Computational Thought Experiments for a More Rigorous Philosophy and Science of the Mind
Iris Oved
Nikhil Krishnaswamy
James Pustejovsky
Joshua Hartshorne
LM&Ro
60
0
0
14 May 2024
OpenBot-Fleet: A System for Collective Learning with Real Robots
OpenBot-Fleet: A System for Collective Learning with Real Robots
Matthias M¨uller
Samarth Brahmbhatt
Ankur Deka
Quentin Leboutet
David Hafner
V. Koltun
82
0
0
13 May 2024
CAGES: Cost-Aware Gradient Entropy Search for Efficient Local Multi-Fidelity Bayesian Optimization
CAGES: Cost-Aware Gradient Entropy Search for Efficient Local Multi-Fidelity Bayesian Optimization
Wei-Ting Tang
J. Paulson
65
1
0
13 May 2024
Offline Model-Based Optimization via Policy-Guided Gradient Search
Offline Model-Based Optimization via Policy-Guided Gradient Search
Yassine Chemingui
Aryan Deshwal
Trong Nghia Hoang
J. Doppa
OffRL
110
14
0
08 May 2024
An Improved Finite-time Analysis of Temporal Difference Learning with
  Deep Neural Networks
An Improved Finite-time Analysis of Temporal Difference Learning with Deep Neural Networks
Zhifa Ke
Zaiwen Wen
Junyu Zhang
88
0
0
07 May 2024
SwiftRL: Towards Efficient Reinforcement Learning on Real
  Processing-In-Memory Systems
SwiftRL: Towards Efficient Reinforcement Learning on Real Processing-In-Memory Systems
Kailash Gogineni
Sai Santosh Dayapule
Juan Gómez Luna
Karthikeya Gogineni
Peng Wei
Tian-Shing Lan
Mohammad Sadrosadati
Onur Mutlu
Guru Venkataramani
99
11
0
07 May 2024
Off-OAB: Off-Policy Policy Gradient Method with Optimal Action-Dependent
  Baseline
Off-OAB: Off-Policy Policy Gradient Method with Optimal Action-Dependent Baseline
Wenjia Meng
Qian Zheng
Long Yang
Yilong Yin
Gang Pan
OffRL
80
0
0
04 May 2024
CityLearn v2: Energy-flexible, resilient, occupant-centric, and
  carbon-aware management of grid-interactive communities
CityLearn v2: Energy-flexible, resilient, occupant-centric, and carbon-aware management of grid-interactive communities
Kingsley Nweye
Kathryn Kaspar
Giacomo Buscemi
Tiago Fonseca
G. Pinto
...
Luis Lino Ferreira
Tianzhen Hong
Mohamed Ouf
Alfonso Capozzoli
Zoltán Nagy
71
11
0
02 May 2024
Constrained Reinforcement Learning Under Model Mismatch
Constrained Reinforcement Learning Under Model Mismatch
Zhongchang Sun
Sihong He
Fei Miao
Shaofeng Zou
78
6
0
02 May 2024
Queue-based Eco-Driving at Roundabouts with Reinforcement Learning
Queue-based Eco-Driving at Roundabouts with Reinforcement Learning
Anna-Lena Schlamp
Werner Huber
Stefanie Schmidtner
34
0
0
01 May 2024
Employing Federated Learning for Training Autonomous HVAC Systems
Employing Federated Learning for Training Autonomous HVAC Systems
Fredrik Hagström
Vikas Garg
Fabricio Oliveira
AI4CE
153
0
0
01 May 2024
SAFE-RL: Saliency-Aware Counterfactual Explainer for Deep Reinforcement
  Learning Policies
SAFE-RL: Saliency-Aware Counterfactual Explainer for Deep Reinforcement Learning Policies
Amir Samadi
K. Koufos
Kurt Debattista
M. Dianati
85
5
0
28 Apr 2024
Knowledge Transfer for Cross-Domain Reinforcement Learning: A Systematic
  Review
Knowledge Transfer for Cross-Domain Reinforcement Learning: A Systematic Review
Sergio A. Serrano
J. Martínez-Carranza
L. Sucar
84
1
0
26 Apr 2024
Timely Communications for Remote Inference
Timely Communications for Remote Inference
Md Kamran Chowdhury Shisher
Yin Sun
I-Hong Hou
59
13
0
25 Apr 2024
Deep Reinforcement Learning for Bipedal Locomotion: A Brief Survey
Deep Reinforcement Learning for Bipedal Locomotion: A Brief Survey
Lingfan Bao
Josephine N. Humphreys
Tianhu Peng
Chengxu Zhou
121
9
0
25 Apr 2024
Guided-SPSA: Simultaneous Perturbation Stochastic Approximation assisted by the Parameter Shift Rule
Guided-SPSA: Simultaneous Perturbation Stochastic Approximation assisted by the Parameter Shift Rule
Maniraman Periyasamy
Axel Plinge
Christopher Mutschler
Daniel D. Scherer
Wolfgang Mauerer
95
13
0
24 Apr 2024
GeoLLM-Engine: A Realistic Environment for Building Geospatial Copilots
GeoLLM-Engine: A Realistic Environment for Building Geospatial Copilots
Simranjit Singh
Michael Fore
Dimitrios Stamoulis
LLMAG
77
12
0
23 Apr 2024
TrajDeleter: Enabling Trajectory Forgetting in Offline Reinforcement
  Learning Agents
TrajDeleter: Enabling Trajectory Forgetting in Offline Reinforcement Learning Agents
Chen Gong
Kecen Li
Jin Yao
Tianhao Wang
OnRL
75
1
0
18 Apr 2024
MPC of Uncertain Nonlinear Systems with Meta-Learning for Fast
  Adaptation of Neural Predictive Models
MPC of Uncertain Nonlinear Systems with Meta-Learning for Fast Adaptation of Neural Predictive Models
Jiaqi Yan
Ankush Chakrabarty
Alisa Rupenyan
John Lygeros
87
2
0
18 Apr 2024
Compact Multi-Object Placement Using Adjacency-Aware Reinforcement
  Learning
Compact Multi-Object Placement Using Adjacency-Aware Reinforcement Learning
Benedikt Kreis
Nils Dengler
Jorge de Heuvel
Rohit Menon
Hamsa Datta Perur
Maren Bennewitz
71
0
0
16 Apr 2024
Warm-Start Variational Quantum Policy Iteration
Warm-Start Variational Quantum Policy Iteration
Nico Meyer
Jakob Murauer
Alexander Popov
Christian Ufrecht
Axel Plinge
Christopher Mutschler
Daniel D. Scherer
60
3
0
16 Apr 2024
Offline Trajectory Generalization for Offline Reinforcement Learning
Offline Trajectory Generalization for Offline Reinforcement Learning
Ziqi Zhao
Zhaochun Ren
Liu Yang
Fajie Yuan
Pengjie Ren
Zhumin Chen
Jun Ma
Xin Xin
OffRL
75
1
0
16 Apr 2024
Generalized Population-Based Training for Hyperparameter Optimization in
  Reinforcement Learning
Generalized Population-Based Training for Hyperparameter Optimization in Reinforcement Learning
Hui Bai
Ran Cheng
88
6
0
12 Apr 2024
Qiskit-Torch-Module: Fast Prototyping of Quantum Neural Networks
Qiskit-Torch-Module: Fast Prototyping of Quantum Neural Networks
Nico Meyer
Christian Ufrecht
Maniraman Periyasamy
Axel Plinge
Christopher Mutschler
Daniel D. Scherer
Andreas R. Maier
79
6
0
09 Apr 2024
EVLearn: Extending the CityLearn Framework with Electric Vehicle
  Simulation
EVLearn: Extending the CityLearn Framework with Electric Vehicle Simulation
Tiago Fonseca
Luis Lino Ferreira
Bernardo Cabral
Ricardo Severino
Kingsley Nweye
Dipanjan Ghose
Zoltán Nagy
74
4
0
08 Apr 2024
Data Poisoning Attacks on Off-Policy Policy Evaluation Methods
Data Poisoning Attacks on Off-Policy Policy Evaluation Methods
Elita Lobo
Harvineet Singh
Marek Petrik
Cynthia Rudin
Himabindu Lakkaraju
78
3
0
06 Apr 2024
Embodied Neuromorphic Artificial Intelligence for Robotics:
  Perspectives, Challenges, and Research Development Stack
Embodied Neuromorphic Artificial Intelligence for Robotics: Perspectives, Challenges, and Research Development Stack
Rachmad Vidya Wicaksana Putra
Alberto Marchisio
F. Zayer
Jorge Dias
Mohamed Bennai
73
11
0
04 Apr 2024
Integrating Explanations in Learning LTL Specifications from
  Demonstrations
Integrating Explanations in Learning LTL Specifications from Demonstrations
Ashutosh Gupta
John Komp
Abhay Singh Rajput
Shankaranarayanan Krishna
Ashutosh Trivedi
Namrita Varshney
43
0
0
03 Apr 2024
EV2Gym: A Flexible V2G Simulator for EV Smart Charging Research and
  Benchmarking
EV2Gym: A Flexible V2G Simulator for EV Smart Charging Research and Benchmarking
Stavros Orfanoudakis
C. Diaz-Londono
Yunus E. Yilmaz
Peter Palensky
Pedro P. Vergara
59
7
0
02 Apr 2024
Extremum-Seeking Action Selection for Accelerating Policy Optimization
Extremum-Seeking Action Selection for Accelerating Policy Optimization
Ya-Chien Chang
Sicun Gao
123
0
0
02 Apr 2024
Game-Theoretic Deep Reinforcement Learning to Minimize Carbon Emissions
  and Energy Costs for AI Inference Workloads in Geo-Distributed Data Centers
Game-Theoretic Deep Reinforcement Learning to Minimize Carbon Emissions and Energy Costs for AI Inference Workloads in Geo-Distributed Data Centers
Ninad Hogade
S. Pasricha
AI4CE
32
3
0
01 Apr 2024
Zero-shot Safety Prediction for Autonomous Robots with Foundation World
  Models
Zero-shot Safety Prediction for Autonomous Robots with Foundation World Models
Zhenjiang Mao
Siqi Dai
Yuang Geng
Ivan Ruchkin
107
3
0
30 Mar 2024
Efficient Automatic Tuning for Data-driven Model Predictive Control via
  Meta-Learning
Efficient Automatic Tuning for Data-driven Model Predictive Control via Meta-Learning
Baoyu Li
William Edwards
Kris Hauser
72
0
0
30 Mar 2024
Biologically-Plausible Topology Improved Spiking Actor Network for
  Efficient Deep Reinforcement Learning
Biologically-Plausible Topology Improved Spiking Actor Network for Efficient Deep Reinforcement Learning
Duzhen Zhang
Qingyu Wang
Tielin Zhang
Bo Xu
324
2
0
29 Mar 2024
CAESAR: Enhancing Federated RL in Heterogeneous MDPs through
  Convergence-Aware Sampling with Screening
CAESAR: Enhancing Federated RL in Heterogeneous MDPs through Convergence-Aware Sampling with Screening
Hei Yi Mak
Flint Xiaofeng Fan
Luca A. Lanzendörfer
Cheston Tan
Wei Tsang Ooi
Roger Wattenhofer
FedML
73
2
0
29 Mar 2024
Decision Mamba: Reinforcement Learning via Sequence Modeling with
  Selective State Spaces
Decision Mamba: Reinforcement Learning via Sequence Modeling with Selective State Spaces
Toshihiro Ota
Mamba
92
18
0
29 Mar 2024
Application-Driven Innovation in Machine Learning
Application-Driven Innovation in Machine Learning
David Rolnick
Alán Aspuru-Guzik
Sara Beery
B. Dilkina
P. Donti
...
Hannah Kerner
C. Monteleoni
Esther Rolf
Milind Tambe
Adam White
77
10
0
26 Mar 2024
Scenario-Based Curriculum Generation for Multi-Agent Autonomous Driving
Scenario-Based Curriculum Generation for Multi-Agent Autonomous Driving
Axel Brunnbauer
Luigi Berducci
P. Priller
D. Ničković
Radu Grosu
117
2
0
26 Mar 2024
Active Learning of Dynamics Using Prior Domain Knowledge in the Sampling
  Process
Active Learning of Dynamics Using Prior Domain Knowledge in the Sampling Process
Kevin S. Miller
Adam J. Thorpe
Ufuk Topcu
55
0
0
25 Mar 2024
A Comparative Analysis of Visual Odometry in Virtual and Real-World
  Railways Environments
A Comparative Analysis of Visual Odometry in Virtual and Real-World Railways Environments
G. D’Amico
Mauro Marinoni
Giorgio Buttazzo
OffRL
78
1
0
25 Mar 2024
Deep Gaussian Covariance Network with Trajectory Sampling for
  Data-Efficient Policy Search
Deep Gaussian Covariance Network with Trajectory Sampling for Data-Efficient Policy Search
Can Bogoclu
Robert Vosshall
K. Cremanns
Dirk Roos
BDL
47
1
0
23 Mar 2024
ARO: Large Language Model Supervised Robotics Text2Skill Autonomous
  Learning
ARO: Large Language Model Supervised Robotics Text2Skill Autonomous Learning
Yiwen Chen
Yuyao Ye
Ziyi Chen
Chuheng Zhang
Marcelo H. Ang
54
0
0
23 Mar 2024
Rethinking Adversarial Inverse Reinforcement Learning: Policy Imitation,
  Transferable Reward Recovery and Algebraic Equilibrium Proof
Rethinking Adversarial Inverse Reinforcement Learning: Policy Imitation, Transferable Reward Recovery and Algebraic Equilibrium Proof
Yangchun Zhang
Qiang Liu
Weiming Li
Yirui Zhou
76
0
0
21 Mar 2024
On Predictive planning and counterfactual learning in active inference
On Predictive planning and counterfactual learning in active inference
Aswin Paul
Takuya Isomura
Adeel Razi
AI4CE
69
2
0
19 Mar 2024
Decomposing Control Lyapunov Functions for Efficient Reinforcement
  Learning
Decomposing Control Lyapunov Functions for Efficient Reinforcement Learning
Antonio Lopez
David Fridovich-Keil
87
1
0
18 Mar 2024
Bridging the Gap between Discrete Agent Strategies in Game Theory and Continuous Motion Planning in Dynamic Environments
Bridging the Gap between Discrete Agent Strategies in Game Theory and Continuous Motion Planning in Dynamic Environments
Hongrui Zheng
Zhijun Zhuang
Stephanie Wu
Shuo Yang
Rahul Mangharam
61
1
0
17 Mar 2024
HumanoidBench: Simulated Humanoid Benchmark for Whole-Body Locomotion
  and Manipulation
HumanoidBench: Simulated Humanoid Benchmark for Whole-Body Locomotion and Manipulation
Carmelo Sferrazza
Dun-Ming Huang
Xingyu Lin
Youngwoon Lee
Pieter Abbeel
128
48
0
15 Mar 2024
AD3: Implicit Action is the Key for World Models to Distinguish the
  Diverse Visual Distractors
AD3: Implicit Action is the Key for World Models to Distinguish the Diverse Visual Distractors
Yucen Wang
Shenghua Wan
Le Gan
Shuai Feng
De-Chuan Zhan
VGen
72
6
0
15 Mar 2024
Towards Efficient Risk-Sensitive Policy Gradient: An Iteration Complexity Analysis
Towards Efficient Risk-Sensitive Policy Gradient: An Iteration Complexity Analysis
Rui Liu
Erfaun Noorani
Pratap Tokekar
John S. Baras
104
1
0
13 Mar 2024
A Holistic Framework Towards Vision-based Traffic Signal Control with
  Microscopic Simulation
A Holistic Framework Towards Vision-based Traffic Signal Control with Microscopic Simulation
Pan He
Quanyi Li
Xiaoyong Yuan
Bolei Zhou
55
0
0
11 Mar 2024
Previous
123456...505152
Next