ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2108.13264
  4. Cited By
Deep Reinforcement Learning at the Edge of the Statistical Precipice

Deep Reinforcement Learning at the Edge of the Statistical Precipice

30 August 2021
Rishabh Agarwal
Max Schwarzer
Pablo Samuel Castro
Aaron Courville
Marc G. Bellemare
    OffRL
ArXivPDFHTML

Papers citing "Deep Reinforcement Learning at the Edge of the Statistical Precipice"

50 / 453 papers shown
Title
Neuro-Inspired Fragmentation and Recall to Overcome Catastrophic
  Forgetting in Curiosity
Neuro-Inspired Fragmentation and Recall to Overcome Catastrophic Forgetting in Curiosity
Jaedong Hwang
Zhang-Wei Hong
Eric Chen
Akhilan Boopathy
Pulkit Agrawal
Ila Fiete
CLL
35
5
0
26 Oct 2023
Understanding and Addressing the Pitfalls of Bisimulation-based
  Representations in Offline Reinforcement Learning
Understanding and Addressing the Pitfalls of Bisimulation-based Representations in Offline Reinforcement Learning
Hongyu Zang
Xin-hui Li
Leiji Zhang
Yang Liu
Baigui Sun
Riashat Islam
Rémi Tachet des Combes
Romain Laroche
OffRL
29
5
0
26 Oct 2023
Towards Control-Centric Representations in Reinforcement Learning from
  Images
Towards Control-Centric Representations in Reinforcement Learning from Images
Chen Liu
Hongyu Zang
Xin Li
Yong Heng
Yifei Wang
Zhen Fang
Yisen Wang
Mingzhong Wang
20
0
0
25 Oct 2023
Mind the Model, Not the Agent: The Primacy Bias in Model-based RL
Mind the Model, Not the Agent: The Primacy Bias in Model-based RL
Zhongjian Qiao
Jiafei Lyu
Xiu Li
16
3
0
23 Oct 2023
Eureka: Human-Level Reward Design via Coding Large Language Models
Eureka: Human-Level Reward Design via Coding Large Language Models
Yecheng Jason Ma
William Liang
Guanzhi Wang
De-An Huang
Osbert Bastani
Dinesh Jayaraman
Yuke Zhu
Linxi Fan
A. Anandkumar
19
291
0
19 Oct 2023
Uncertainty-aware transfer across tasks using hybrid model-based
  successor feature reinforcement learning
Uncertainty-aware transfer across tasks using hybrid model-based successor feature reinforcement learning
Parvin Malekzadeh
Ming Hou
Konstantinos N. Plataniotis
41
1
0
16 Oct 2023
Bootstrap Your Own Skills: Learning to Solve New Tasks with Large
  Language Model Guidance
Bootstrap Your Own Skills: Learning to Solve New Tasks with Large Language Model Guidance
Jesse Zhang
Jiahui Zhang
Karl Pertsch
Ziyi Liu
Xiang Ren
Minsuk Chang
Shao-Hua Sun
Joseph J. Lim
LLMAG
LM&Ro
97
60
0
16 Oct 2023
An Open-Loop Baseline for Reinforcement Learning Locomotion Tasks
An Open-Loop Baseline for Reinforcement Learning Locomotion Tasks
Antonin Raffin
Olivier Sigaud
Jens Kober
Alin Albu-Schäffer
João Silvério
F. Stulp
32
2
0
09 Oct 2023
ZSC-Eval: An Evaluation Toolkit and Benchmark for Multi-agent Zero-shot
  Coordination
ZSC-Eval: An Evaluation Toolkit and Benchmark for Multi-agent Zero-shot Coordination
Xihuai Wang
Shao Zhang
Wenhao Zhang
Wentao Dong
Jingxiao Chen
Ying Wen
Weinan Zhang
28
8
0
08 Oct 2023
Hieros: Hierarchical Imagination on Structured State Space Sequence
  World Models
Hieros: Hierarchical Imagination on Structured State Space Sequence World Models
Paul Mattes
Rainer Schlosser
R. Herbrich
21
4
0
08 Oct 2023
Improving Offline-to-Online Reinforcement Learning with Q Conditioned
  State Entropy Exploration
Improving Offline-to-Online Reinforcement Learning with Q Conditioned State Entropy Exploration
Ziqi Zhang
Xiao Xiong
Zifeng Zhuang
Jinxin Liu
Donglin Wang
OffRL
OnRL
40
0
0
07 Oct 2023
Beyond Uniform Sampling: Offline Reinforcement Learning with Imbalanced
  Datasets
Beyond Uniform Sampling: Offline Reinforcement Learning with Imbalanced Datasets
Zhang-Wei Hong
Aviral Kumar
Sathwik Karnik
Abhishek Bhandwaldar
Akash Srivastava
Joni Pajarinen
Romain Laroche
Abhishek Gupta
Pulkit Agrawal
OffRL
38
19
0
06 Oct 2023
A Kernel Perspective on Behavioural Metrics for Markov Decision
  Processes
A Kernel Perspective on Behavioural Metrics for Markov Decision Processes
Pablo Samuel Castro
Tyler Kastner
Prakash Panangaden
Mark Rowland
33
4
0
05 Oct 2023
Small batch deep reinforcement learning
Small batch deep reinforcement learning
J. Obando-Ceron
Marc G. Bellemare
Pablo Samuel Castro
VLM
34
14
0
05 Oct 2023
How the level sampling process impacts zero-shot generalisation in deep
  reinforcement learning
How the level sampling process impacts zero-shot generalisation in deep reinforcement learning
Samuel Garcin
James Doran
Shangmin Guo
Christopher G. Lucas
Stefano V. Albrecht
46
0
0
05 Oct 2023
Discovering General Reinforcement Learning Algorithms with Adversarial
  Environment Design
Discovering General Reinforcement Learning Algorithms with Adversarial Environment Design
Matthew Jackson
Minqi Jiang
Jack Parker-Holder
Risto Vuorio
Chris Xiaoxuan Lu
Gregory Farquhar
Shimon Whiteson
Jakob N. Foerster
OOD
11
9
0
04 Oct 2023
Cleanba: A Reproducible and Efficient Distributed Reinforcement Learning
  Platform
Cleanba: A Reproducible and Efficient Distributed Reinforcement Learning Platform
Shengyi Huang
Jiayi Weng
Rujikorn Charakorn
Min-Bin Lin
Zhongwen Xu
Santiago Ontañón
12
3
0
29 Sep 2023
Memory Gym: Towards Endless Tasks to Benchmark Memory Capabilities of
  Agents
Memory Gym: Towards Endless Tasks to Benchmark Memory Capabilities of Agents
Marco Pleines
Matthias Pallasch
Frank Zimmer
Mike Preuss
OffRL
29
0
0
29 Sep 2023
RLLTE: Long-Term Evolution Project of Reinforcement Learning
RLLTE: Long-Term Evolution Project of Reinforcement Learning
Tao Lv
Zequn Zhang
Yang Xu
Shihao Luo
Bo Li
Xin Jin
Wenjun Zeng
OffRL
29
1
0
28 Sep 2023
Zero-Shot Reinforcement Learning from Low Quality Data
Zero-Shot Reinforcement Learning from Low Quality Data
Scott Jeen
Tom Bewley
Jonathan M. Cullen
OffRL
OnRL
36
0
0
26 Sep 2023
Policy Optimization in a Noisy Neighborhood: On Return Landscapes in
  Continuous Control
Policy Optimization in a Noisy Neighborhood: On Return Landscapes in Continuous Control
Nate Rahn
P. DÓro
Harley Wiltzer
Pierre-Luc Bacon
Marc G. Bellemare
17
3
0
26 Sep 2023
Diagnosing and exploiting the computational demands of videos games for
  deep reinforcement learning
Diagnosing and exploiting the computational demands of videos games for deep reinforcement learning
L. Govindarajan
Rex G Liu
Drew Linsley
A. Ashok
Max Reuter
M. Frank
Thomas Serre
OffRL
13
0
0
22 Sep 2023
Adversarial Style Transfer for Robust Policy Optimization in Deep
  Reinforcement Learning
Adversarial Style Transfer for Robust Policy Optimization in Deep Reinforcement Learning
Md Masudur Rahman
Yexiang Xue
29
4
0
29 Aug 2023
Stabilizing Unsupervised Environment Design with a Learned Adversary
Stabilizing Unsupervised Environment Design with a Learned Adversary
Ishita Mediratta
Minqi Jiang
Jack Parker-Holder
Michael Dennis
Eugene Vinitsky
Tim Rocktaschel
34
14
0
21 Aug 2023
Towards Few-shot Coordination: Revisiting Ad-hoc Teamplay Challenge In
  the Game of Hanabi
Towards Few-shot Coordination: Revisiting Ad-hoc Teamplay Challenge In the Game of Hanabi
Hadi Nekoei
Xutong Zhao
Janarthanan Rajendran
Miao Liu
Sarath Chandar
15
4
0
20 Aug 2023
A Survey on Fairness in Large Language Models
A Survey on Fairness in Large Language Models
Yingji Li
Mengnan Du
Rui Song
Xin Wang
Ying Wang
ALM
49
59
0
20 Aug 2023
Value-Distributional Model-Based Reinforcement Learning
Value-Distributional Model-Based Reinforcement Learning
Carlos E. Luis
A. Bottero
Julia Vinogradska
Felix Berkenkamp
Jan Peters
OffRL
24
4
0
12 Aug 2023
BarlowRL: Barlow Twins for Data-Efficient Reinforcement Learning
BarlowRL: Barlow Twins for Data-Efficient Reinforcement Learning
Omer Veysel Cagatan
Barış Akgün
BDL
OffRL
27
3
0
08 Aug 2023
Deep Reinforcement Learning for Autonomous Spacecraft Inspection using
  Illumination
Deep Reinforcement Learning for Autonomous Spacecraft Inspection using Illumination
David van Wijk
Kyle Dunlap
M. Majji
Kerianne L. Hobbs
11
11
0
04 Aug 2023
HIQL: Offline Goal-Conditioned RL with Latent States as Actions
HIQL: Offline Goal-Conditioned RL with Latent States as Actions
Seohong Park
Dibya Ghosh
Benjamin Eysenbach
Sergey Levine
OffRL
30
44
0
22 Jul 2023
SafeDreamer: Safe Reinforcement Learning with World Models
SafeDreamer: Safe Reinforcement Learning with World Models
Weidong Huang
Jiaming Ji
Borong Zhang
Chunhe Xia
Yao-Chun Yang
OffRL
30
19
0
14 Jul 2023
Contextual Pre-planning on Reward Machine Abstractions for Enhanced
  Transfer in Deep Reinforcement Learning
Contextual Pre-planning on Reward Machine Abstractions for Enhanced Transfer in Deep Reinforcement Learning
Guy Azran
Mohamad H. Danesh
Stefano V. Albrecht
Sarah Keren
AI4CE
31
1
0
11 Jul 2023
Goal-Conditioned Predictive Coding for Offline Reinforcement Learning
Goal-Conditioned Predictive Coding for Offline Reinforcement Learning
Zilai Zeng
Ce Zhang
Shijie Wang
Chen Sun
OffRL
29
5
0
07 Jul 2023
Offline Reinforcement Learning with Imbalanced Datasets
Offline Reinforcement Learning with Imbalanced Datasets
Li Jiang
Sijie Cheng
Jielin Qiu
Haoran Xu
Wai Kin Victor Chan
Zhao Ding
OffRL
34
3
0
06 Jul 2023
SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand
  Cores
SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand Cores
Zhiyu Mei
Wei Fu
Jiaxuan Gao
Guang Wang
Huanchen Zhang
Yi Wu
OffRL
LRM
24
5
0
29 Jun 2023
Learning to Modulate pre-trained Models in RL
Learning to Modulate pre-trained Models in RL
Thomas Schmied
M. Hofmarcher
Fabian Paischer
Razvan Pascanu
Sepp Hochreiter
CLL
OffRL
24
14
0
26 Jun 2023
CEIL: Generalized Contextual Imitation Learning
CEIL: Generalized Contextual Imitation Learning
Jinxin Liu
Li He
Yachen Kang
Zifeng Zhuang
Donglin Wang
Huazhe Xu
31
18
0
26 Jun 2023
Design from Policies: Conservative Test-Time Adaptation for Offline
  Policy Optimization
Design from Policies: Conservative Test-Time Adaptation for Offline Policy Optimization
Jinxin Liu
Hongyin Zhang
Zifeng Zhuang
Yachen Kang
Donglin Wang
Bin Wang
OffRL
42
8
0
26 Jun 2023
Waypoint Transformer: Reinforcement Learning via Supervised Learning
  with Intermediate Targets
Waypoint Transformer: Reinforcement Learning via Supervised Learning with Intermediate Targets
Anirudhan Badrinath
Yannis Flet-Berliac
Allen Nie
Emma Brunskill
OffRL
25
16
0
24 Jun 2023
Harnessing Mixed Offline Reinforcement Learning Datasets via Trajectory
  Weighting
Harnessing Mixed Offline Reinforcement Learning Datasets via Trajectory Weighting
Zhang-Wei Hong
Pulkit Agrawal
Rémi Tachet des Combes
Romain Laroche
OffRL
29
17
0
22 Jun 2023
Can Differentiable Decision Trees Enable Interpretable Reward Learning
  from Human Feedback?
Can Differentiable Decision Trees Enable Interpretable Reward Learning from Human Feedback?
Akansha Kalra
Daniel S. Brown
16
0
0
22 Jun 2023
MP3: Movement Primitive-Based (Re-)Planning Policy
MP3: Movement Primitive-Based (Re-)Planning Policy
Fabian Otto
Hongyi Zhou
Onur Celik
Ge Li
Rudolf Lioutikov
Gerhard Neumann
21
5
0
22 Jun 2023
Efficient Dynamics Modeling in Interactive Environments with Koopman
  Theory
Efficient Dynamics Modeling in Interactive Environments with Koopman Theory
Arnab Kumar Mondal
Siba Smarak Panigrahi
Sai Rajeswar
K. Siddiqi
Siamak Ravanbakhsh
26
3
0
20 Jun 2023
AdaStop: adaptive statistical testing for sound comparisons of Deep RL
  agents
AdaStop: adaptive statistical testing for sound comparisons of Deep RL agents
Timothée Mathieu
R. D. Vecchia
Alena Shilova
M. Centa
Hector Kohler
Odalric-Ambrym Maillard
Philippe Preux
22
0
0
19 Jun 2023
PLASTIC: Improving Input and Label Plasticity for Sample Efficient
  Reinforcement Learning
PLASTIC: Improving Input and Label Plasticity for Sample Efficient Reinforcement Learning
Hojoon Lee
Hanseul Cho
Hyunseung Kim
Daehoon Gwak
Joonkee Kim
Jaegul Choo
Se-Young Yun
Chulhee Yun
OffRL
82
25
0
19 Jun 2023
Semantic HELM: A Human-Readable Memory for Reinforcement Learning
Semantic HELM: A Human-Readable Memory for Reinforcement Learning
Fabian Paischer
Thomas Adler
M. Hofmarcher
Sepp Hochreiter
21
9
0
15 Jun 2023
Reward-Free Curricula for Training Robust World Models
Reward-Free Curricula for Training Robust World Models
Marc Rigter
Minqi Jiang
Ingmar Posner
VLM
OffRL
31
6
0
15 Jun 2023
Katakomba: Tools and Benchmarks for Data-Driven NetHack
Katakomba: Tools and Benchmarks for Data-Driven NetHack
Vladislav Kurenkov
Alexander Nikulin
Denis Tarasov
Sergey Kolesnikov
OffRL
30
5
0
14 Jun 2023
OCAtari: Object-Centric Atari 2600 Reinforcement Learning Environments
OCAtari: Object-Centric Atari 2600 Reinforcement Learning Environments
Quentin Delfosse
Jannis Blüml
Bjarne Gregori
Sebastian Sztwiertnia
Kristian Kersting
40
17
0
14 Jun 2023
VIBR: Learning View-Invariant Value Functions for Robust Visual Control
VIBR: Learning View-Invariant Value Functions for Robust Visual Control
Tom Dupuis
Jaonary Rabarisoa
Q. C. Pham
David Filliat
36
0
0
14 Jun 2023
Previous
123456...8910
Next