ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1606.01540
  4. Cited By
OpenAI Gym

OpenAI Gym

5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
    OffRL
    ODL
ArXivPDFHTML

Papers citing "OpenAI Gym"

50 / 1,654 papers shown
Title
Data Poisoning Attacks on Off-Policy Policy Evaluation Methods
Data Poisoning Attacks on Off-Policy Policy Evaluation Methods
Elita Lobo
Harvineet Singh
Marek Petrik
Cynthia Rudin
Himabindu Lakkaraju
36
3
0
06 Apr 2024
Embodied Neuromorphic Artificial Intelligence for Robotics:
  Perspectives, Challenges, and Research Development Stack
Embodied Neuromorphic Artificial Intelligence for Robotics: Perspectives, Challenges, and Research Development Stack
Rachmad Vidya Wicaksana Putra
Alberto Marchisio
F. Zayer
Jorge Dias
Mohamed Bennai
41
10
0
04 Apr 2024
Integrating Explanations in Learning LTL Specifications from
  Demonstrations
Integrating Explanations in Learning LTL Specifications from Demonstrations
Ashutosh Gupta
John Komp
Abhay Singh Rajput
Shankaranarayanan Krishna
Ashutosh Trivedi
Namrita Varshney
21
0
0
03 Apr 2024
EV2Gym: A Flexible V2G Simulator for EV Smart Charging Research and
  Benchmarking
EV2Gym: A Flexible V2G Simulator for EV Smart Charging Research and Benchmarking
Stavros Orfanoudakis
C. Diaz-Londono
Yunus E. Yilmaz
Peter Palensky
Pedro P. Vergara
20
5
0
02 Apr 2024
Extremum-Seeking Action Selection for Accelerating Policy Optimization
Extremum-Seeking Action Selection for Accelerating Policy Optimization
Ya-Chien Chang
Sicun Gao
45
0
0
02 Apr 2024
Game-Theoretic Deep Reinforcement Learning to Minimize Carbon Emissions
  and Energy Costs for AI Inference Workloads in Geo-Distributed Data Centers
Game-Theoretic Deep Reinforcement Learning to Minimize Carbon Emissions and Energy Costs for AI Inference Workloads in Geo-Distributed Data Centers
Ninad Hogade
S. Pasricha
AI4CE
14
2
0
01 Apr 2024
Zero-shot Safety Prediction for Autonomous Robots with Foundation World
  Models
Zero-shot Safety Prediction for Autonomous Robots with Foundation World Models
Zhenjiang Mao
Siqi Dai
Yuang Geng
Ivan Ruchkin
48
3
0
30 Mar 2024
Efficient Automatic Tuning for Data-driven Model Predictive Control via
  Meta-Learning
Efficient Automatic Tuning for Data-driven Model Predictive Control via Meta-Learning
Baoyu Li
William Edwards
Kris Hauser
34
0
0
30 Mar 2024
Biologically-Plausible Topology Improved Spiking Actor Network for
  Efficient Deep Reinforcement Learning
Biologically-Plausible Topology Improved Spiking Actor Network for Efficient Deep Reinforcement Learning
Duzhen Zhang
Qingyu Wang
Tielin Zhang
Bo Xu
164
1
0
29 Mar 2024
CAESAR: Enhancing Federated RL in Heterogeneous MDPs through
  Convergence-Aware Sampling with Screening
CAESAR: Enhancing Federated RL in Heterogeneous MDPs through Convergence-Aware Sampling with Screening
Hei Yi Mak
Flint Xiaofeng Fan
Luca A. Lanzendörfer
Cheston Tan
Wei Tsang Ooi
Roger Wattenhofer
FedML
32
2
0
29 Mar 2024
Decision Mamba: Reinforcement Learning via Sequence Modeling with
  Selective State Spaces
Decision Mamba: Reinforcement Learning via Sequence Modeling with Selective State Spaces
Toshihiro Ota
Mamba
46
16
0
29 Mar 2024
Application-Driven Innovation in Machine Learning
Application-Driven Innovation in Machine Learning
David Rolnick
Alán Aspuru-Guzik
Sara Beery
B. Dilkina
P. Donti
...
Hannah Kerner
C. Monteleoni
Esther Rolf
Milind Tambe
Adam White
41
8
0
26 Mar 2024
Scenario-Based Curriculum Generation for Multi-Agent Autonomous Driving
Scenario-Based Curriculum Generation for Multi-Agent Autonomous Driving
Axel Brunnbauer
Luigi Berducci
P. Priller
D. Ničković
Radu Grosu
55
1
0
26 Mar 2024
Active Learning of Dynamics Using Prior Domain Knowledge in the Sampling
  Process
Active Learning of Dynamics Using Prior Domain Knowledge in the Sampling Process
Kevin S. Miller
Adam J. Thorpe
Ufuk Topcu
29
0
0
25 Mar 2024
A Comparative Analysis of Visual Odometry in Virtual and Real-World
  Railways Environments
A Comparative Analysis of Visual Odometry in Virtual and Real-World Railways Environments
G. D’Amico
Mauro Marinoni
Giorgio Buttazzo
OffRL
40
1
0
25 Mar 2024
Deep Gaussian Covariance Network with Trajectory Sampling for
  Data-Efficient Policy Search
Deep Gaussian Covariance Network with Trajectory Sampling for Data-Efficient Policy Search
Can Bogoclu
Robert Vosshall
K. Cremanns
Dirk Roos
BDL
25
1
0
23 Mar 2024
ARO: Large Language Model Supervised Robotics Text2Skill Autonomous
  Learning
ARO: Large Language Model Supervised Robotics Text2Skill Autonomous Learning
Yiwen Chen
Yuyao Ye
Ziyi Chen
Chuheng Zhang
Marcelo H. Ang
52
0
0
23 Mar 2024
Rethinking Adversarial Inverse Reinforcement Learning: Policy Imitation,
  Transferable Reward Recovery and Algebraic Equilibrium Proof
Rethinking Adversarial Inverse Reinforcement Learning: Policy Imitation, Transferable Reward Recovery and Algebraic Equilibrium Proof
Yangchun Zhang
Qiang Liu
Weiming Li
Yirui Zhou
43
0
0
21 Mar 2024
On Predictive planning and counterfactual learning in active inference
On Predictive planning and counterfactual learning in active inference
Aswin Paul
Takuya Isomura
Adeel Razi
AI4CE
41
2
0
19 Mar 2024
Decomposing Control Lyapunov Functions for Efficient Reinforcement
  Learning
Decomposing Control Lyapunov Functions for Efficient Reinforcement Learning
Antonio Lopez
David Fridovich-Keil
38
1
0
18 Mar 2024
Bridging the Gap between Discrete Agent Strategies in Game Theory and
  Continuous Motion Planning in Dynamic Environments
Bridging the Gap between Discrete Agent Strategies in Game Theory and Continuous Motion Planning in Dynamic Environments
Hongrui Zheng
Zhijun Zhuang
Stephanie Wu
Shuo Yang
Rahul Mangharam
32
1
0
17 Mar 2024
HumanoidBench: Simulated Humanoid Benchmark for Whole-Body Locomotion
  and Manipulation
HumanoidBench: Simulated Humanoid Benchmark for Whole-Body Locomotion and Manipulation
Carmelo Sferrazza
Dun-Ming Huang
Xingyu Lin
Youngwoon Lee
Pieter Abbeel
60
37
0
15 Mar 2024
AD3: Implicit Action is the Key for World Models to Distinguish the
  Diverse Visual Distractors
AD3: Implicit Action is the Key for World Models to Distinguish the Diverse Visual Distractors
Yucen Wang
Shenghua Wan
Le Gan
Shuai Feng
De-Chuan Zhan
VGen
27
4
0
15 Mar 2024
Towards Efficient Risk-Sensitive Policy Gradient: An Iteration Complexity Analysis
Towards Efficient Risk-Sensitive Policy Gradient: An Iteration Complexity Analysis
Rui Liu
Erfaun Noorani
Pratap Tokekar
John S. Baras
39
1
0
13 Mar 2024
A Holistic Framework Towards Vision-based Traffic Signal Control with
  Microscopic Simulation
A Holistic Framework Towards Vision-based Traffic Signal Control with Microscopic Simulation
Pan He
Quanyi Li
Xiaoyong Yuan
Bolei Zhou
44
0
0
11 Mar 2024
Unveiling the Significance of Toddler-Inspired Reward Transition in
  Goal-Oriented Reinforcement Learning
Unveiling the Significance of Toddler-Inspired Reward Transition in Goal-Oriented Reinforcement Learning
Junseok Park
Yoonsung Kim
Hee Bin Yoo
Min Whoo Lee
Kibeom Kim
Won-Seok Choi
Minsu Lee
Byoung-Tak Zhang
OffRL
45
1
0
11 Mar 2024
LitSim: A Conflict-aware Policy for Long-term Interactive Traffic
  Simulation
LitSim: A Conflict-aware Policy for Long-term Interactive Traffic Simulation
haojie xin
Xiaodong Zhang
Renzhi Tang
Songyang Yan
Qianrui Zhao
Chunze Yang
Wen Cui
Zijiang Yang
58
2
0
07 Mar 2024
RACE-SM: Reinforcement Learning Based Autonomous Control for Social
  On-Ramp Merging
RACE-SM: Reinforcement Learning Based Autonomous Control for Social On-Ramp Merging
Jordan Poots
27
0
0
05 Mar 2024
Behavior Generation with Latent Actions
Behavior Generation with Latent Actions
Seungjae Lee
Yibin Wang
Haritheja Etukuru
H. J. Kim
Mahi Shafiullah
Lerrel Pinto
VGen
OffRL
35
66
0
05 Mar 2024
Deep Reinforcement Learning for Dynamic Algorithm Selection: A
  Proof-of-Principle Study on Differential Evolution
Deep Reinforcement Learning for Dynamic Algorithm Selection: A Proof-of-Principle Study on Differential Evolution
Hongshu Guo
Yining Ma
Zeyuan Ma
Jiacheng Chen
Xinglin Zhang
Zhiguang Cao
Jun Zhang
Yue-jiao Gong
49
18
0
04 Mar 2024
Craftax: A Lightning-Fast Benchmark for Open-Ended Reinforcement
  Learning
Craftax: A Lightning-Fast Benchmark for Open-Ended Reinforcement Learning
Michael T. Matthews
Michael Beukman
Benjamin Ellis
Mikayel Samvelyan
Matthew Jackson
Samuel Coward
Jakob Foerster
OffRL
42
26
0
26 Feb 2024
DynaMITE-RL: A Dynamic Model for Improved Temporal Meta-Reinforcement
  Learning
DynaMITE-RL: A Dynamic Model for Improved Temporal Meta-Reinforcement Learning
Anthony Liang
Guy Tennenholtz
Chih-Wei Hsu
Yinlam Chow
Erdem Biyik
Craig Boutilier
OffRL
45
1
0
25 Feb 2024
Leveraging Demonstrator-perceived Precision for Safe Interactive
  Imitation Learning of Clearance-limited Tasks
Leveraging Demonstrator-perceived Precision for Safe Interactive Imitation Learning of Clearance-limited Tasks
Hanbit Oh
Takamitsu Matsubara
66
3
0
21 Feb 2024
Easy as ABCs: Unifying Boltzmann Q-Learning and Counterfactual Regret
  Minimization
Easy as ABCs: Unifying Boltzmann Q-Learning and Counterfactual Regret Minimization
Luca DÁmico-Wong
Hugh Zhang
Marc Lanctot
David C. Parkes
OffRL
16
0
0
19 Feb 2024
Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent
Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent
Quentin Gallouedec
E. Beeching
Clément Romac
Emmanuel Dellandrea
37
11
0
15 Feb 2024
Dataset Clustering for Improved Offline Policy Learning
Dataset Clustering for Improved Offline Policy Learning
Qiang Wang
Yixin Deng
Francisco Roldan Sanchez
Keru Wang
Kevin McGuinness
Noel E. O'Connor
Stephen J. Redmond
OffRL
34
2
0
14 Feb 2024
Learning Interpretable Policies in Hindsight-Observable POMDPs through
  Partially Supervised Reinforcement Learning
Learning Interpretable Policies in Hindsight-Observable POMDPs through Partially Supervised Reinforcement Learning
Michael Lanier
Ying Xu
Nathan Jacobs
Chongjie Zhang
Yevgeniy Vorobeychik
26
2
0
14 Feb 2024
Decision Theory-Guided Deep Reinforcement Learning for Fast Learning
Decision Theory-Guided Deep Reinforcement Learning for Fast Learning
Zelin Wan
Jin-Hee Cho
Mu Zhu
Ahmed H. Anwar
Charles A. Kamhoua
Munindar P. Singh
AI4CE
23
0
0
08 Feb 2024
Learning Uncertainty-Aware Temporally-Extended Actions
Learning Uncertainty-Aware Temporally-Extended Actions
Joongkyu Lee
Seung Joon Park
Yunhao Tang
Min-hwan Oh
24
2
0
08 Feb 2024
Exploration Without Maps via Zero-Shot Out-of-Distribution Deep
  Reinforcement Learning
Exploration Without Maps via Zero-Shot Out-of-Distribution Deep Reinforcement Learning
Shathushan Sivashangaran
Apoorva Khairnar
A. Eskandarian
OffRL
45
0
0
07 Feb 2024
Voronoi Candidates for Bayesian Optimization
Voronoi Candidates for Bayesian Optimization
Nathan Wycoff
John W. Smith
Annie S. Booth
R. Gramacy
39
0
0
07 Feb 2024
OIL-AD: An Anomaly Detection Framework for Sequential Decision Sequences
OIL-AD: An Anomaly Detection Framework for Sequential Decision Sequences
Chen Wang
S. Erfani
T. Alpcan
Christopher Leckie
OffRL
38
2
0
07 Feb 2024
Learning Diverse Policies with Soft Self-Generated Guidance
Learning Diverse Policies with Soft Self-Generated Guidance
Guojian Wang
Faguo Wu
Xiao Zhang
Jianxiang Liu
OffRL
31
4
0
07 Feb 2024
A Deep Reinforcement Learning Approach for Adaptive Traffic Routing in
  Next-gen Networks
A Deep Reinforcement Learning Approach for Adaptive Traffic Routing in Next-gen Networks
A. Abrol
Purnima Murali Mohan
Tram Truong-Huu
32
1
0
07 Feb 2024
An Architecture for Unattended Containerized (Deep) Reinforcement
  Learning with Webots
An Architecture for Unattended Containerized (Deep) Reinforcement Learning with Webots
Tobias Haubold
Petra Linke
OffRL
13
0
0
06 Feb 2024
No-Regret Reinforcement Learning in Smooth MDPs
No-Regret Reinforcement Learning in Smooth MDPs
Davide Maran
Alberto Maria Metelli
Matteo Papini
Marcello Restell
38
4
0
06 Feb 2024
RL-VLM-F: Reinforcement Learning from Vision Language Foundation Model
  Feedback
RL-VLM-F: Reinforcement Learning from Vision Language Foundation Model Feedback
Yufei Wang
Zhanyi Sun
Jesse Zhang
Zhou Xian
Erdem Biyik
David Held
Zackory M. Erickson
VLM
55
51
0
06 Feb 2024
Frugal Actor-Critic: Sample Efficient Off-Policy Deep Reinforcement
  Learning Using Unique Experiences
Frugal Actor-Critic: Sample Efficient Off-Policy Deep Reinforcement Learning Using Unique Experiences
Nikhil Kumar Singh
Indranil Saha
OffRL
27
0
0
05 Feb 2024
Gazebo Plants: Simulating Plant-Robot Interaction with Cosserat Rods
Gazebo Plants: Simulating Plant-Robot Interaction with Cosserat Rods
Junchen Deng
Samhita Marri
Jonathan Klein
Wojciech Palubicki
Soren Pirk
Girish Chowdhary
D. L. Michels
22
2
0
04 Feb 2024
Towards Optimal Adversarial Robust Q-learning with Bellman
  Infinity-error
Towards Optimal Adversarial Robust Q-learning with Bellman Infinity-error
Haoran Li
Zicheng Zhang
Wang Luo
Congying Han
Yudong Hu
Tiande Guo
Shichen Liao
AAML
41
2
0
03 Feb 2024
Previous
123456...323334
Next