1606.01540
Cited By
OpenAI Gym
5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL
ODL
Papers citing "OpenAI Gym"
50 / 1,673 papers shown
Title
Learning to Backdoor Federated Learning
Henger Li
Chen Wu
Senchun Zhu
Zizhan Zheng
FedML
36
10
0
06 Mar 2023
Ensemble Reinforcement Learning: A Survey
Yanjie Song
Ponnuthurai Nagaratnam Suganthan
Witold Pedrycz
Junwei Ou
Yongming He
Y. Chen
Yutong Wu
OffRL
59
38
0
05 Mar 2023
Demonstration-guided Deep Reinforcement Learning for Coordinated Ramp Metering and Perimeter Control in Large Scale Networks
Zijian Hu
Wei-Ying Ma
27
5
0
04 Mar 2023
CoRL: Environment Creation and Management Focused on System Integration
J. D. Merrick
Benjamin K. Heiner
Cameron Long
Brian Stieber
Steve Fierro
Vardaan Gangal
Madison Blake
Joshua Blackburn
AI4CE
30
2
0
03 Mar 2023
Synthetic Data Generator for Adaptive Interventions in Global Health
Aditya Rastogi
J. F. Garamendi
Ana Fernández del Río
Anna Guitart
Moiz Hassan Khan
Dexian Tang
África Periánez
42
0
0
03 Mar 2023
Dynamic Competency Self-Assessment for Autonomous Agents
Nicholas Conlon
Nisar R. Ahmed
D. Szafir
34
3
0
03 Mar 2023
The Ladder in Chaos: A Simple and Effective Improvement to General DRL Algorithms by Policy Path Trimming and Boosting
Hongyao Tang
Hao Fei
Jianye Hao
28
1
0
02 Mar 2023
Reinforced Labels: Multi-Agent Deep Reinforcement Learning for Point-Feature Label Placement
Petr Bobák
Ladislav Čmolík
Martin Čadík
OffRL
40
3
0
02 Mar 2023
The Virtues of Laziness in Model-based RL: A Unified Objective and Algorithms
Anirudh Vemula
Yuda Song
Aarti Singh
J. Andrew Bagnell
Sanjiban Choudhury
OffRL
46
13
0
01 Mar 2023
Policy Dispersion in Non-Markovian Environment
B. Qu
Xiaofeng Cao
Jielong Yang
Hechang Chen
Chang Yi
Ivor W. Tsang
Yew-Soon Ong
27
0
0
28 Feb 2023
Taylor TD-learning
Michele Garibbo
Maxime Robeyns
Laurence Aitchison
OffRL
23
1
0
27 Feb 2023
CrystalBox: Future-Based Explanations for Input-Driven Deep RL Systems
Sagar Patel
Sangeetha Abdu Jyothi
Nina Narodytska
OffRL
27
0
0
27 Feb 2023
High-Precise Robot Arm Manipulation based on Online Iterative Learning and Forward Simulation with Positioning Error Below End-Effector Physical Minimum Displacement
Weiming Qu
Tianlin Liu
D. Luo
26
2
0
26 Feb 2023
Q-Cogni: An Integrated Causal Reinforcement Learning Framework
C. Cunha
Wen Liu
T. French
Ajmal Mian
38
1
0
26 Feb 2023
DeepCPG Policies for Robot Locomotion
Aditya M. Deshpande
Eric Hurd
A. Minai
Manish Kumar
29
9
0
25 Feb 2023
Autonomous Exploration and Mapping for Mobile Robots via Cumulative Curriculum Reinforcement Learning
Zehan Li
Jinghao Xin
Ning Li
31
4
0
25 Feb 2023
EvoTorch: Scalable Evolutionary Computation in Python
N. E. Toklu
Timothy James Atkinson
Vojtěch Micka
Paweł Liskowski
R. Srivastava
22
12
0
24 Feb 2023
Why Target Networks Stabilise Temporal Difference Methods
Matt Fellows
Matthew Smith
Shimon Whiteson
OOD
AAML
21
7
0
24 Feb 2023
Improving Deep Policy Gradients with Value Function Search
Enrico Marchesini
Chris Amato
31
9
0
20 Feb 2023
Neural Optimal Control using Learned System Dynamics
Selim Engin
Volkan Isler
26
3
0
20 Feb 2023
Leveraging Prior Knowledge in Reinforcement Learning via Double-Sided Bounds on the Value Function
Jacob Adamczyk
Stas Tiomkin
R. Kulkarni
OffRL
22
0
0
19 Feb 2023
A State Augmentation based approach to Reinforcement Learning from Human Preferences
Mudit Verma
Subbarao Kambhampati
35
2
0
17 Feb 2023
Learning to Forecast Aleatoric and Epistemic Uncertainties over Long Horizon Trajectories
Aastha Acharya
Rebecca L. Russell
Nisar R. Ahmed
34
6
0
17 Feb 2023
Conservative State Value Estimation for Offline Reinforcement Learning
Liting Chen
Jie Yan
Zhengdao Shao
Lu Wang
Qingwei Lin
Saravan Rajmohan
Thomas Moscibroda
Dongmei Zhang
OffRL
26
6
0
14 Feb 2023
Universal Agent Mixtures and the Geometry of Intelligence
S. Alexander
David Quarel
Len Du
Marcus Hutter
23
1
0
13 Feb 2023
Graph Learning Based Decision Support for Multi-Aircraft Take-Off and Landing at Urban Air Mobility Vertiports
Prajit K. Kumar
Jhoel Witter
Steve Paul
Karthik Dantu
Souma Chowdhury
27
3
0
12 Feb 2023
UGAE: A Novel Approach to Non-exponential Discounting
Ariel Kwiatkowski
Vicky Kalogeiton
Julien Pettré
Marie-Paule Cani
OffRL
30
2
0
11 Feb 2023
Learning cooperative behaviours in adversarial multi-agent systems
Ni Wang
Gautham P. Das
Alan G. Millard
21
0
0
10 Feb 2023
A SWAT-based Reinforcement Learning Framework for Crop Management
Malvern Madondo
Muneeza Azmat
Kelsey L. DiPietro
R. Horesh
Michael Jacobs
Arun Bawa
Raghavan Srinivasan
Fearghal O'Donncha
11
8
0
10 Feb 2023
Scaling Goal-based Exploration via Pruning Proto-goals
Akhil Bagaria
Ray Jiang
Ramana Kumar
Tom Schaul
LRM
16
2
0
09 Feb 2023
A Systematic Performance Analysis of Deep Perceptual Loss Networks: Breaking Transfer Learning Conventions
G. Pihlgren
Konstantina Nikolaidou
Prakash Chandra Chhipa
Nosheen Abid
Rajkumar Saini
Fredrik Sandin
Marcus Liwicki
29
10
0
08 Feb 2023
Online Reinforcement Learning with Uncertain Episode Lengths
Debmalya Mandal
Goran Radanović
Jiarui Gan
Adish Singla
R. Majumdar
OffRL
38
5
0
07 Feb 2023
Two Losses Are Better Than One: Faster Optimization Using a Cheaper Proxy
Blake E. Woodworth
Konstantin Mishchenko
Francis R. Bach
47
6
0
07 Feb 2023
Object-Centric Scene Representations using Active Inference
Toon Van de Maele
Tim Verbelen
Pietro Mazzaglia
Stefano Ferraro
Bart Dhoedt
OCL
BDL
48
5
0
07 Feb 2023
Diversity Induced Environment Design via Self-Play
Dexun Li
Wenjun Li
Pradeep Varakantham
29
0
0
04 Feb 2023
Deep Reinforcement Learning for Cyber System Defense under Dynamic Adversarial Uncertainties
Ashutosh Dutta
Samrat Chatterjee
A. Bhattacharya
M. Halappanavar
32
8
0
03 Feb 2023
DiSProD: Differentiable Symbolic Propagation of Distributions for Planning
Palash Chatterjee
Ashutosh Chapagain
Weizhe (Wesley) Chen
Roni Khardon
21
1
0
03 Feb 2023
Accelerating Policy Gradient by Estimating Value Function from Prior Computation in Deep Reinforcement Learning
Hassam Sheikh
Mariano Phielipp
OffRL
24
6
0
02 Feb 2023
Normalizing Flow Ensembles for Rich Aleatoric and Epistemic Uncertainty Modeling
Lucas Berry
David Meger
26
7
0
02 Feb 2023
Task Placement and Resource Allocation for Edge Machine Learning: A GNN-based Multi-Agent Reinforcement Learning Paradigm
Yihong Li
Xiaoxi Zhang
Tian Zeng
Jingpu Duan
Chuanxi Wu
Di Wu
Xu Chen
34
16
0
01 Feb 2023
Reducing Blackwell and Average Optimality to Discounted MDPs via the Blackwell Discount Factor
Julien Grand-Clément
Marko Petrik
35
14
0
31 Jan 2023
Learning Vision-based Robotic Manipulation Tasks Sequentially in Offline Reinforcement Learning Settings
Sudhir Pratap Yadav
R. Nagar
S. Shah
OffRL
29
3
0
31 Jan 2023
Sharp Variance-Dependent Bounds in Reinforcement Learning: Best of Both Worlds in Stochastic and Deterministic Environments
Runlong Zhou
Zihan Zhang
S. Du
49
10
0
31 Jan 2023
Enabling surrogate-assisted evolutionary reinforcement learning via policy embedding
Lan Tang
Xiaxi Li
Jinyuan Zhang
Guiying Li
Peng Yang
Ke Tang
18
0
0
31 Jan 2023
PAC-Bayesian Soft Actor-Critic Learning
Bahareh Tasdighi
Abdullah Akgul
Manuel Haussmann
Kenny Kazimirzak Brink
M. Kandemir
41
3
0
30 Jan 2023
EvoX: A Distributed GPU-accelerated Framework for Scalable Evolutionary Computation
Beichen Huang
Ran Cheng
Zhuozhao Li
Yaochu Jin
Kay Chen Tan
18
26
0
29 Jan 2023
Improving Behavioural Cloning with Positive Unlabeled Learning
Qiang-qiang Wang
Robert McCarthy
David Córdova Bulens
Kevin McGuinness
Noel E. O'Connor
Nico Gürtler
Felix Widmaier
Francisco Roldan Sanchez
S. Redmond
OffRL
OnRL
36
8
0
27 Jan 2023
Single-Trajectory Distributionally Robust Reinforcement Learning
Zhipeng Liang
Xiaoteng Ma
Jose H. Blanchet
Jiheng Zhang
Zhengyuan Zhou
OOD
OffRL
39
11
0
27 Jan 2023
Certifiably Robust Reinforcement Learning through Model-Based Abstract Interpretation
Chenxi Yang
Greg Anderson
Swarat Chaudhuri
37
1
0
26 Jan 2023
Which Experiences Are Influential for Your Agent? Policy Iteration with Turn-over Dropout
Takuya Hiraoka
Takashi Onishi
Yoshimasa Tsuruoka
OffRL
29
0
0
26 Jan 2023