Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2108.13264
Cited By
Deep Reinforcement Learning at the Edge of the Statistical Precipice
30 August 2021
Rishabh Agarwal
Max Schwarzer
Pablo Samuel Castro
Aaron Courville
Marc G. Bellemare
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Deep Reinforcement Learning at the Edge of the Statistical Precipice"
50 / 453 papers shown
Title
Bring Your Own (Non-Robust) Algorithm to Solve Robust MDPs by Estimating The Worst Kernel
Kaixin Wang
Uri Gadot
Navdeep Kumar
Kfir Y. Levy
Shie Mannor
34
2
0
09 Jun 2023
On the Importance of Feature Decorrelation for Unsupervised Representation Learning in Reinforcement Learning
Hojoon Lee
Ko-tik Lee
Dongyoon Hwang
Hyunho Lee
ByungKun Lee
Jaegul Choo
SSL
OOD
26
5
0
09 Jun 2023
On the Importance of Exploration for Generalization in Reinforcement Learning
Yiding Jiang
J. Zico Kolter
Roberta Raileanu
UQCV
OffRL
30
20
0
08 Jun 2023
GEO-Bench: Toward Foundation Models for Earth Monitoring
Alexandre Lacoste
Nils Lehmann
Pau Rodríguez López
Evan D. Sherwin
Hannah Kerner
...
David Vazquez
Dava Newman
Yoshua Bengio
Stefano Ermon
Xiao Xiang Zhu
SSL
ALM
AI4CE
14
56
0
06 Jun 2023
RLtools: A Fast, Portable Deep Reinforcement Learning Library for Continuous Control
Jonas Eschmann
Dario Albani
Giuseppe Loianno
OffRL
34
5
0
06 Jun 2023
Boosting Offline Reinforcement Learning with Action Preference Query
Qisen Yang
Shenzhi Wang
Matthieu Lin
S. Song
Gao Huang
OffRL
11
9
0
06 Jun 2023
A Study of Global and Episodic Bonuses for Exploration in Contextual MDPs
Mikael Henaff
Minqi Jiang
Roberta Raileanu
38
13
0
05 Jun 2023
Explore to Generalize in Zero-Shot RL
E. Zisselman
Itai Lavie
Daniel Soudry
Aviv Tamar
23
15
0
05 Jun 2023
Seizing Serendipity: Exploiting the Value of Past Success in Off-Policy Actor-Critic
Tianying Ji
Yuping Luo
Gang Hua
Xianyuan Zhan
Jianwei Zhang
Huazhe Xu
OffRL
OnRL
37
14
0
05 Jun 2023
Learning to Stabilize Online Reinforcement Learning in Unbounded State Spaces
Brahma S. Pavse
M. Zurek
Yudong Chen
Qiaomin Xie
Josiah P. Hanna
OffRL
33
1
0
02 Jun 2023
ReLU to the Rescue: Improve Your On-Policy Actor-Critic with Positive Advantages
Andrew Jesson
Chris Xiaoxuan Lu
Gunshi Gupta
Angelos Filos
Jakob N. Foerster
Y. Gal
OffRL
25
5
0
02 Jun 2023
Hyperparameters in Reinforcement Learning and How To Tune Them
Theresa Eimer
Marius Lindauer
Roberta Raileanu
OffRL
27
34
0
02 Jun 2023
Improving and Benchmarking Offline Reinforcement Learning Algorithms
Bingyi Kang
Xiao Ma
Yi-Ren Wang
Yang Yue
Shuicheng Yan
OffRL
8
9
0
01 Jun 2023
Bigger, Better, Faster: Human-level Atari with human-level efficiency
Max Schwarzer
J. Obando-Ceron
Aaron C. Courville
Marc G. Bellemare
Rishabh Agarwal
Pablo Samuel Castro
OffRL
48
82
0
30 May 2023
VA-learning as a more efficient alternative to Q-learning
Yunhao Tang
Rémi Munos
Mark Rowland
Michal Valko
OffRL
13
6
0
29 May 2023
Pre-training Contextualized World Models with In-the-wild Videos for Reinforcement Learning
Jialong Wu
Haoyu Ma
Chao Deng
Mingsheng Long
OffRL
28
24
0
29 May 2023
Towards a Better Understanding of Representation Dynamics under TD-learning
Yunhao Tang
Rémi Munos
OffRL
23
1
0
29 May 2023
Learning Better with Less: Effective Augmentation for Sample-Efficient Visual Reinforcement Learning
Guozheng Ma
Linrui Zhang
Haoyu Wang
Lu Li
Zilin Wang
Zhen Wang
Li Shen
Xueqian Wang
Dacheng Tao
42
10
0
25 May 2023
Deep Reinforcement Learning with Plasticity Injection
Evgenii Nikishin
Junhyuk Oh
Georg Ostrovski
Clare Lyle
Razvan Pascanu
Will Dabney
André Barreto
OffRL
20
49
0
24 May 2023
Replicable Reinforcement Learning
Eric Eaton
Marcel Hussing
Michael Kearns
Jessica Sorrell
OffRL
27
13
0
24 May 2023
Revisiting the Minimalist Approach to Offline Reinforcement Learning
Denis Tarasov
Vladislav Kurenkov
Alexander Nikulin
Sergey Kolesnikov
OffRL
33
36
0
16 May 2023
Policy Gradient Methods in the Presence of Symmetries and State Abstractions
Prakash Panangaden
S. Rezaei-Shoshtari
Rosie Zhao
D. Meger
Doina Precup
25
2
0
09 May 2023
Behavior Contrastive Learning for Unsupervised Skill Discovery
Rushuai Yang
Chenjia Bai
Hongyi Guo
Siyuan Li
Bin Zhao
Zhen Wang
Peng Liu
Xuelong Li
SSL
29
16
0
08 May 2023
Simple Noisy Environment Augmentation for Reinforcement Learning
Raad Khraishi
Ramin Okhrati
OffRL
11
1
0
04 May 2023
Proto-Value Networks: Scaling Representation Learning with Auxiliary Tasks
Jesse Farebrother
Joshua Greaves
Rishabh Agarwal
Charline Le Lan
Ross Goroshin
Pablo Samuel Castro
Marc G. Bellemare
51
25
0
25 Apr 2023
Efficient Deep Reinforcement Learning Requires Regulating Overfitting
Qiyang Li
Aviral Kumar
Ilya Kostrikov
Sergey Levine
OffRL
24
31
0
20 Apr 2023
Learning Representative Trajectories of Dynamical Systems via Domain-Adaptive Imitation
Edgardo Solano-Carrillo
Jannis Stoppe
13
0
0
19 Apr 2023
AutoRL Hyperparameter Landscapes
Aditya Mohan
C. Benjamins
Konrad Wienecke
A. Dockhorn
Marius Lindauer
29
7
0
05 Apr 2023
Empirical Design in Reinforcement Learning
Andrew Patterson
Samuel Neumann
Martha White
Adam White
9
21
0
03 Apr 2023
PyFlyt -- UAV Simulation Environments for Reinforcement Learning Research
Jun Jet Tai
J. Wong
M. Innocente
N. Horri
J. Brusey
S. K. Phang
14
10
0
03 Apr 2023
Swarm Reinforcement Learning For Adaptive Mesh Refinement
Niklas Freymuth
Philipp Dahlinger
Tobias Würth
Simon Reisch
Luise Kärger
Gerhard Neumann
29
13
0
03 Apr 2023
Reduce, Reuse, Recycle: Selective Reincarnation in Multi-Agent Reinforcement Learning
Claude Formanek
C. Tilbury
Jonathan P. Shock
Kale-ab Tessera
Arnu Pretorius
26
3
0
31 Mar 2023
Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization
Haoran Xu
Li Jiang
Jianxiong Li
Zhuoran Yang
Zhaoran Wang
Victor Chan
Xianyuan Zhan
OffRL
36
71
0
28 Mar 2023
Inverse Reinforcement Learning without Reinforcement Learning
Gokul Swamy
Sanjiban Choudhury
J. Andrew Bagnell
Zhiwei Steven Wu
21
34
0
26 Mar 2023
Interpretable Reinforcement Learning via Neural Additive Models for Inventory Management
Julien N. Siems
Maximilian Schambach
Sebastian Schulze
Johannes Otterbach
14
2
0
18 Mar 2023
Dynamic Update-to-Data Ratio: Minimizing World Model Overfitting
Nicolai Dorka
Tim Welschehold
Wolfram Burgard
16
3
0
17 Mar 2023
Sample-efficient Adversarial Imitation Learning
Dahuin Jung
Hyungyu Lee
Sung-Hoon Yoon
SSL
18
2
0
14 Mar 2023
Transformer-based World Models Are Happy With 100k Interactions
Jan Robine
Marc Höftmann
Tobias Uelwer
Stefan Harmeling
OffRL
16
68
0
13 Mar 2023
Synthetic Experience Replay
Cong Lu
Philip J. Ball
Yee Whye Teh
Jack Parker-Holder
OffRL
94
67
0
12 Mar 2023
Learning Exploration Strategies to Solve Real-World Marble Runs
Alisa Allaire
C. Atkeson
29
0
0
08 Mar 2023
The Dormant Neuron Phenomenon in Deep Reinforcement Learning
Ghada Sokar
Rishabh Agarwal
Pablo Samuel Castro
Utku Evci
CLL
51
88
0
24 Feb 2023
Towards Computationally Efficient Responsibility Attribution in Decentralized Partially Observable MDPs
Stelios Triantafyllou
Goran Radanović
11
5
0
24 Feb 2023
Self-supervised network distillation: an effective approach to exploration in sparse reward environments
Matej Pecháč
M. Chovanec
Igor Farkaš
29
3
0
22 Feb 2023
Demonstration-Guided Reinforcement Learning with Efficient Exploration for Task Automation of Surgical Robot
Tao Huang
Kai-xiang Chen
Bin Li
Yunhui Liu
Qingxu Dou
35
23
0
20 Feb 2023
Guiding Pretraining in Reinforcement Learning with Large Language Models
Yuqing Du
Olivia Watkins
Zihan Wang
Cédric Colas
Trevor Darrell
Pieter Abbeel
Abhishek Gupta
Jacob Andreas
LM&Ro
21
174
0
13 Feb 2023
Combining Reconstruction and Contrastive Methods for Multimodal Representations in RL
P. Becker
Sebastian Mossburger
Fabian Otto
Gerhard Neumann
SSL
34
2
0
10 Feb 2023
Ensemble Value Functions for Efficient Exploration in Multi-Agent Reinforcement Learning
Lukas Schafer
Oliver Slumbers
Stephen Marcus McAleer
Yali Du
Stefano V. Albrecht
D. Mguni
74
7
0
07 Feb 2023
A Strong Baseline for Batch Imitation Learning
Matthew Smith
Lucas Maystre
Zhenwen Dai
K. Ciosek
OffRL
17
4
0
06 Feb 2023
DITTO: Offline Imitation Learning with World Models
Branton DeMoss
Paul Duckworth
Nick Hawes
Ingmar Posner
Ingmar Posner
OffRL
21
18
0
06 Feb 2023
Off-the-Grid MARL: Datasets with Baselines for Offline Multi-Agent Reinforcement Learning
Claude Formanek
Asad Jeewa
Jonathan P. Shock
Arnu Pretorius
OffRL
34
1
0
01 Feb 2023
Previous
1
2
3
...
10
5
6
7
8
9
Next