ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2108.13264
  4. Cited By
Deep Reinforcement Learning at the Edge of the Statistical Precipice

Deep Reinforcement Learning at the Edge of the Statistical Precipice

30 August 2021
Rishabh Agarwal
Max Schwarzer
P. S. Castro
Aaron Courville
Marc G. Bellemare
    OffRL
ArXivPDFHTML

Papers citing "Deep Reinforcement Learning at the Edge of the Statistical Precipice"

50 / 453 papers shown
Title
Uncovering RL Integration in SSL Loss: Objective-Specific Implications for Data-Efficient RL
Uncovering RL Integration in SSL Loss: Objective-Specific Implications for Data-Efficient RL
Ömer Veysel Çağatan
Barış Akgün
OffRL
34
0
0
22 Oct 2024
FlickerFusion: Intra-trajectory Domain Generalizing Multi-Agent RL
FlickerFusion: Intra-trajectory Domain Generalizing Multi-Agent RL
Woosung Koh
Wonbeen Oh
S. Kim
Suhin Shin
Hyeongjin Kim
Jaein Jang
Junghyun Lee
Se-Young Yun
30
0
0
21 Oct 2024
Non-invasive Neural Decoding in Source Reconstructed Brain Space
Non-invasive Neural Decoding in Source Reconstructed Brain Space
Yonatan Gideoni
Ryan Charles Timms
Oiwi Parker Jones
40
1
0
20 Oct 2024
Offline-to-online Reinforcement Learning for Image-based Grasping with Scarce Demonstrations
Offline-to-online Reinforcement Learning for Image-based Grasping with Scarce Demonstrations
Bryan Chan
Anson Leung
James Bergstra
OffRL
OnRL
54
0
0
19 Oct 2024
Mitigating Suboptimality of Deterministic Policy Gradients in Complex
  Q-functions
Mitigating Suboptimality of Deterministic Policy Gradients in Complex Q-functions
Ayush Jain
Norio Kosaka
Xinhu Li
Kyung-Min Kim
Erdem Bıyık
Joseph J. Lim
OffRL
16
0
0
15 Oct 2024
Large Language Model Evaluation via Matrix Nuclear-Norm
Large Language Model Evaluation via Matrix Nuclear-Norm
Y. Li
Tingyu Xia
Yi-Ju Chang
Yuan Wu
27
1
0
14 Oct 2024
Improving Generalization on the ProcGen Benchmark with Simple
  Architectural Changes and Scale
Improving Generalization on the ProcGen Benchmark with Simple Architectural Changes and Scale
Andrew Jesson
Yiding Jiang
OffRL
29
1
0
13 Oct 2024
TOP-ERL: Transformer-based Off-Policy Episodic Reinforcement Learning
TOP-ERL: Transformer-based Off-Policy Episodic Reinforcement Learning
Ge Li
Dong Tian
Hongyi Zhou
Xinkai Jiang
Rudolf Lioutikov
Gerhard Neumann
OffRL
169
3
0
12 Oct 2024
Can we hop in general? A discussion of benchmark selection and design
  using the Hopper environment
Can we hop in general? A discussion of benchmark selection and design using the Hopper environment
C. Voelcker
Marcel Hussing
Eric Eaton
OffRL
23
3
0
11 Oct 2024
MAD-TD: Model-Augmented Data stabilizes High Update Ratio RL
MAD-TD: Model-Augmented Data stabilizes High Update Ratio RL
C. Voelcker
Marcel Hussing
Eric Eaton
Amir-massoud Farahmand
Igor Gilitschenski
39
1
0
11 Oct 2024
Masked Generative Priors Improve World Models Sequence Modelling Capabilities
Masked Generative Priors Improve World Models Sequence Modelling Capabilities
Cristian Meo
Mircea Lica
Zarif Ikram
Akihiro Nakano
Vedant Shah
Aniket Didolkar
Dianbo Liu
Anirudh Goyal
Justin Dauwels
OffRL
90
0
0
10 Oct 2024
Retrieval-Augmented Decision Transformer: External Memory for In-context
  RL
Retrieval-Augmented Decision Transformer: External Memory for In-context RL
Thomas Schmied
Fabian Paischer
Vihang Patil
M. Hofmarcher
Razvan Pascanu
Sepp Hochreiter
OffRL
42
6
0
09 Oct 2024
Robust Offline Imitation Learning from Diverse Auxiliary Data
Robust Offline Imitation Learning from Diverse Auxiliary Data
Udita Ghosh
Dripta S. Raychaudhuri
Jiachen Li
Konstantinos Karydis
A. Roy-Chowdhury
OffRL
24
1
0
04 Oct 2024
Evaluating Robustness of Reward Models for Mathematical Reasoning
Evaluating Robustness of Reward Models for Mathematical Reasoning
Sunghwan Kim
Dongjin Kang
Taeyoon Kwon
Hyungjoo Chae
Jungsoo Won
Dongha Lee
Jinyoung Yeo
30
4
0
02 Oct 2024
Don't flatten, tokenize! Unlocking the key to SoftMoE's efficacy in deep RL
Don't flatten, tokenize! Unlocking the key to SoftMoE's efficacy in deep RL
Ghada Sokar
J. Obando-Ceron
Aaron C. Courville
Hugo Larochelle
Pablo Samuel Castro
MoE
127
2
0
02 Oct 2024
Scaling Offline Model-Based RL via Jointly-Optimized World-Action Model Pretraining
Scaling Offline Model-Based RL via Jointly-Optimized World-Action Model Pretraining
Jie Cheng
Ruixi Qiao
Gang Xiong
Binhua Li
Yingwei Ma
Binhua Li
Yongbin Li
Yisheng Lv
OffRL
OnRL
LM&Ro
50
3
0
01 Oct 2024
LiRA: Light-Robust Adversary for Model-based Reinforcement Learning in Real World
LiRA: Light-Robust Adversary for Model-based Reinforcement Learning in Real World
Taisuke Kobayashi
71
2
0
29 Sep 2024
The Price of Pessimism for Automated Defense
The Price of Pessimism for Automated Defense
Erick Galinkin
Emmanouil Pountourakis
Spiros Mancoridis
AAML
21
1
0
28 Sep 2024
Putting Data at the Centre of Offline Multi-Agent Reinforcement Learning
Putting Data at the Centre of Offline Multi-Agent Reinforcement Learning
Claude Formanek
Louise Beyers
C. Tilbury
Jonathan P. Shock
Arnu Pretorius
OffRL
34
0
0
18 Sep 2024
SEAL: Towards Safe Autonomous Driving via Skill-Enabled Adversary Learning for Closed-Loop Scenario Generation
SEAL: Towards Safe Autonomous Driving via Skill-Enabled Adversary Learning for Closed-Loop Scenario Generation
Benjamin Stoler
Ingrid Navarro
Jonathan M Francis
Jean Oh
AAML
60
4
0
16 Sep 2024
Robot Learning as an Empirical Science: Best Practices for Policy
  Evaluation
Robot Learning as an Empirical Science: Best Practices for Policy Evaluation
H. Kress-Gazit
Kunimatsu Hashimoto
Naveen Kuppuswamy
Paarth Shah
Phoebe Horgan
Gordon Richardson
Siyuan Feng
Benjamin Burchfiel
29
5
0
14 Sep 2024
The Role of Deep Learning Regularizations on Actors in Offline RL
The Role of Deep Learning Regularizations on Actors in Offline RL
Denis Tarasov
Anja Surina
Çağlar Gülçehre
OffRL
AI4CE
50
1
0
11 Sep 2024
Improving Deep Reinforcement Learning by Reducing the Chain Effect of
  Value and Policy Churn
Improving Deep Reinforcement Learning by Reducing the Chain Effect of Value and Policy Churn
Hongyao Tang
Glen Berseth
OffRL
40
1
0
07 Sep 2024
Enhancing Reinforcement Learning Through Guided Search
Enhancing Reinforcement Learning Through Guided Search
Jérôme Arjonilla
Abdallah Saffidine
Tristan Cazenave
OffRL
91
0
0
19 Aug 2024
D5RL: Diverse Datasets for Data-Driven Deep Reinforcement Learning
D5RL: Diverse Datasets for Data-Driven Deep Reinforcement Learning
Rafael Rafailov
Kyle Hatch
Anikait Singh
Laura Smith
Aviral Kumar
...
Victor Kolev
Philip J. Ball
Jiajun Wu
Chelsea Finn
Sergey Levine
OffRL
34
3
0
15 Aug 2024
Explaining an Agent's Future Beliefs through Temporally Decomposing
  Future Reward Estimators
Explaining an Agent's Future Beliefs through Temporally Decomposing Future Reward Estimators
Mark Towers
Yali Du
Christopher T. Freeman
Timothy J. Norman
29
0
0
15 Aug 2024
Mitigating Metropolitan Carbon Emissions with Dynamic Eco-driving at
  Scale
Mitigating Metropolitan Carbon Emissions with Dynamic Eco-driving at Scale
Vindula Jayawardana
Baptiste Freydt
Ao Qu
Cameron Hickert
E. Sanchez
Catherine Tang
Mark Taylor
Blaine Leonard
Cathy Wu
34
1
0
10 Aug 2024
The Cross-environment Hyperparameter Setting Benchmark for Reinforcement
  Learning
The Cross-environment Hyperparameter Setting Benchmark for Reinforcement Learning
Andrew Patterson
Samuel Neumann
Raksha Kumaraswamy
Martha White
Adam White
18
2
0
26 Jul 2024
Instance Selection for Dynamic Algorithm Configuration with
  Reinforcement Learning: Improving Generalization
Instance Selection for Dynamic Algorithm Configuration with Reinforcement Learning: Improving Generalization
C. Benjamins
Gjorgjina Cenikj
Ana Nikolikj
Aditya Mohan
T. Eftimov
Marius Lindauer
AI4CE
15
0
0
18 Jul 2024
Random Latent Exploration for Deep Reinforcement Learning
Random Latent Exploration for Deep Reinforcement Learning
Srinath Mahankali
Zhang-Wei Hong
Ayush Sekhari
Alexander Rakhlin
Pulkit Agrawal
33
3
0
18 Jul 2024
Can Learned Optimization Make Reinforcement Learning Less Difficult?
Can Learned Optimization Make Reinforcement Learning Less Difficult?
Alexander David Goldie
Chris Xiaoxuan Lu
Matthew Jackson
Shimon Whiteson
Jakob N. Foerster
42
3
0
09 Jul 2024
CANDID DAC: Leveraging Coupled Action Dimensions with Importance
  Differences in DAC
CANDID DAC: Leveraging Coupled Action Dimensions with Importance Differences in DAC
Philipp Bordne
M. A. Hasan
Eddie Bergman
Noor H. Awad
André Biedenkapp
31
1
0
08 Jul 2024
Generalizing soft actor-critic algorithms to discrete action spaces
Generalizing soft actor-critic algorithms to discrete action spaces
Le Zhang
Yong Gu
Xin Zhao
Yanshuo Zhang
Shu Zhao
Yifei Jin
Xinxin Wu
26
0
0
08 Jul 2024
Physics-Informed Model and Hybrid Planning for Efficient Dyna-Style
  Reinforcement Learning
Physics-Informed Model and Hybrid Planning for Efficient Dyna-Style Reinforcement Learning
Zakariae El Asri
Olivier Sigaud
Nicolas Thome
37
0
0
02 Jul 2024
Normalization and effective learning rates in reinforcement learning
Normalization and effective learning rates in reinforcement learning
Clare Lyle
Zeyu Zheng
Khimya Khetarpal
James Martens
H. V. Hasselt
Razvan Pascanu
Will Dabney
19
7
0
01 Jul 2024
Coordination Failure in Cooperative Offline MARL
Coordination Failure in Cooperative Offline MARL
C. Tilbury
Claude Formanek
Louise Beyers
Jonathan P. Shock
Arnu Pretorius
OffRL
33
1
0
01 Jul 2024
PUZZLES: A Benchmark for Neural Algorithmic Reasoning
PUZZLES: A Benchmark for Neural Algorithmic Reasoning
Benjamin Estermann
Luca A. Lanzendörfer
Yannick Niedermayr
Roger Wattenhofer
50
2
0
29 Jun 2024
External Model Motivated Agents: Reinforcement Learning for Enhanced
  Environment Sampling
External Model Motivated Agents: Reinforcement Learning for Enhanced Environment Sampling
Rishav Bhagat
Jonathan C. Balloch
Zhiyu Lin
Julia Kim
Mark O. Riedl
41
0
0
28 Jun 2024
Efficient World Models with Context-Aware Tokenization
Efficient World Models with Context-Aware Tokenization
Vincent Micheli
Eloi Alonso
François Fleuret
OffRL
VLM
34
5
0
27 Jun 2024
Mixture of Experts in a Mixture of RL settings
Mixture of Experts in a Mixture of RL settings
Timon Willi
J. Obando-Ceron
Jakob Foerster
Karolina Dziugaite
Pablo Samuel Castro
MoE
41
7
0
26 Jun 2024
Bidirectional-Reachable Hierarchical Reinforcement Learning with
  Mutually Responsive Policies
Bidirectional-Reachable Hierarchical Reinforcement Learning with Mutually Responsive Policies
Yu-Juan Luo
Fuchun Sun
Tianying Ji
Xianyuan Zhan
27
0
0
26 Jun 2024
On the consistency of hyper-parameter selection in value-based deep
  reinforcement learning
On the consistency of hyper-parameter selection in value-based deep reinforcement learning
J. Obando-Ceron
J. G. Araújo
Aaron C. Courville
Pablo Samuel Castro
40
6
0
25 Jun 2024
Position: Benchmarking is Limited in Reinforcement Learning Research
Position: Benchmarking is Limited in Reinforcement Learning Research
Scott M. Jordan
Adam White
Bruno Castro da Silva
Martha White
Philip S. Thomas
OffRL
23
5
0
23 Jun 2024
KalMamba: Towards Efficient Probabilistic State Space Models for RL
  under Uncertainty
KalMamba: Towards Efficient Probabilistic State Space Models for RL under Uncertainty
P. Becker
Niklas Freymuth
Gerhard Neumann
Mamba
26
2
0
21 Jun 2024
SiT: Symmetry-Invariant Transformers for Generalisation in Reinforcement
  Learning
SiT: Symmetry-Invariant Transformers for Generalisation in Reinforcement Learning
Matthias Weissenbacher
Rishabh Agarwal
Yoshinobu Kawahara
OffRL
34
1
0
21 Jun 2024
CoDreamer: Communication-Based Decentralised World Models
CoDreamer: Communication-Based Decentralised World Models
Edan Toledo
Amanda Prorok
43
0
0
19 Jun 2024
Discovering Minimal Reinforcement Learning Environments
Discovering Minimal Reinforcement Learning Environments
Jarek Liesen
Chris Xiaoxuan Lu
Andrei Lupu
Jakob N. Foerster
Henning Sprekeler
R. T. Lange
OffRL
46
3
0
18 Jun 2024
Robust Model-Based Reinforcement Learning with an Adversarial Auxiliary
  Model
Robust Model-Based Reinforcement Learning with an Adversarial Auxiliary Model
Siemen Herremans
Ali Anwar
Siegfried Mercelis
47
2
0
14 Jun 2024
Dispelling the Mirage of Progress in Offline MARL through Standardised
  Baselines and Evaluation
Dispelling the Mirage of Progress in Offline MARL through Standardised Baselines and Evaluation
Claude Formanek
C. Tilbury
Louise Beyers
Jonathan P. Shock
Arnu Pretorius
OffRL
34
1
0
13 Jun 2024
Visual Representation Learning with Stochastic Frame Prediction
Visual Representation Learning with Stochastic Frame Prediction
Huiwon Jang
Dongyoung Kim
Junsu Kim
Jinwoo Shin
Pieter Abbeel
Younggyo Seo
39
2
0
11 Jun 2024
Previous
12345...8910
Next