Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1806.07857
Cited By
RUDDER: Return Decomposition for Delayed Rewards
20 June 2018
Jose A. Arjona-Medina
Michael Gillhofer
Michael Widrich
Thomas Unterthiner
Johannes Brandstetter
Sepp Hochreiter
Re-assign community
ArXiv
PDF
HTML
Papers citing
"RUDDER: Return Decomposition for Delayed Rewards"
40 / 40 papers shown
Title
Economic Battery Storage Dispatch with Deep Reinforcement Learning from Rule-Based Demonstrations
Manuel Sage
Martin Staniszewski
Yaoyao Fiona Zhao
26
2
0
06 Apr 2025
Human Implicit Preference-Based Policy Fine-tuning for Multi-Agent Reinforcement Learning in USV Swarm
H. Kim
Kanghoon Lee
J. Park
Jiachen Li
Jinkyoo Park
62
1
0
05 Mar 2025
Evolution and The Knightian Blindspot of Machine Learning
Joel Lehman
Elliot Meyerson
Tarek El-Gaaly
Kenneth O. Stanley
Tarin Ziyaee
86
1
0
22 Jan 2025
Instance Temperature Knowledge Distillation
Zhengbo Zhang
Yuxi Zhou
Jia Gong
Jun Liu
Zhigang Tu
34
2
0
27 Jun 2024
Informativeness of Reward Functions in Reinforcement Learning
R. Devidze
Parameswaran Kamalaruban
Adish Singla
29
2
0
10 Feb 2024
A User Study on Explainable Online Reinforcement Learning for Adaptive Systems
Andreas Metzger
Jan Laufer
Felix Feit
Klaus Pohl
OffRL
OnRL
24
1
0
09 Jul 2023
Seq2Seq Imitation Learning for Tactile Feedback-based Manipulation
Wenyan Yang
A. Angleraud
R. Pieters
Joni Pajarinen
Joni-Kristian Kämäräinen
32
6
0
05 Mar 2023
Preference Transformer: Modeling Human Preferences using Transformers for RL
Changyeon Kim
Jongjin Park
Jinwoo Shin
Honglak Lee
Pieter Abbeel
Kimin Lee
OffRL
35
61
0
02 Mar 2023
Bridging Physics-Informed Neural Networks with Reinforcement Learning: Hamilton-Jacobi-Bellman Proximal Policy Optimization (HJBPPO)
Amartya Mukherjee
Jun Liu
20
11
0
01 Feb 2023
Feature construction using explanations of individual predictions
Boštjan Vouk
Matej Guid
Marko Robnik-Šikonja
FAtt
27
10
0
23 Jan 2023
Hypernetworks for Zero-shot Transfer in Reinforcement Learning
S. Rezaei-Shoshtari
Charlotte Morissette
F. Hogan
Gregory Dudek
D. Meger
OffRL
17
14
0
28 Nov 2022
Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Function
Clément Bonnet
Laurence Midgley
Alexandre Laterre
24
1
0
19 Nov 2022
Agent-Time Attention for Sparse Rewards Multi-Agent Reinforcement Learning
Jennifer She
Jayesh K. Gupta
Mykel J. Kochenderfer
28
4
0
31 Oct 2022
Reactive Exploration to Cope with Non-Stationarity in Lifelong Reinforcement Learning
C. Steinparz
Thomas Schmied
Fabian Paischer
Marius-Constantin Dinu
Vihang Patil
Angela Bitto-Nemling
Hamid Eghbalzadeh
Sepp Hochreiter
CLL
24
11
0
12 Jul 2022
Off-Beat Multi-Agent Reinforcement Learning
Wei Qiu
Weixun Wang
R. Wang
Bo An
Yujing Hu
S. Obraztsova
Zinovi Rabinovich
Jianye Hao
Yingfeng Chen
Changjie Fan
OffRL
29
2
0
27 May 2022
A Globally Convergent Evolutionary Strategy for Stochastic Constrained Optimization with Applications to Reinforcement Learning
Youssef Diouane
Aurelien Lucchi
Vihang Patil
24
3
0
21 Feb 2022
Selective Credit Assignment
Veronica Chelu
Diana Borsa
Doina Precup
Hado van Hasselt
24
2
0
20 Feb 2022
Retrieval-Augmented Reinforcement Learning
Anirudh Goyal
A. Friesen
Andrea Banino
T. Weber
Nan Rosemary Ke
...
Michal Valko
Simon Osindero
Timothy Lillicrap
N. Heess
Charles Blundell
OffRL
32
53
0
17 Feb 2022
Bayesian sense of time in biological and artificial brains
Z. Fountas
Alexey Zakharov
32
0
0
14 Jan 2022
Mirror Learning: A Unifying Framework of Policy Optimisation
J. Kuba
Christian Schroeder de Witt
Jakob N. Foerster
23
24
0
07 Jan 2022
Model-Based Episodic Memory Induces Dynamic Hybrid Controls
Hung Le
Thommen George Karimpanal
Majid Abdolshah
T. Tran
Svetha Venkatesh
17
19
0
03 Nov 2021
Evaluating the progress of Deep Reinforcement Learning in the real world: aligning domain-agnostic and domain-specific research
J. Luis
E. Crawley
B. Cameron
OffRL
25
6
0
07 Jul 2021
Towards Practical Credit Assignment for Deep Reinforcement Learning
Vyacheslav Alipov
Riley Simmons-Edler
N.Yu. Putintsev
Pavel Kalinin
Dmitry Vetrov
OffRL
27
11
0
08 Jun 2021
An Information-Theoretic Perspective on Credit Assignment in Reinforcement Learning
Dilip Arumugam
Peter Henderson
Pierre-Luc Bacon
22
17
0
10 Mar 2021
A Survey on Deep Reinforcement Learning for Audio-Based Applications
S. Latif
Heriberto Cuayáhuitl
Farrukh Pervez
Fahad Shamshad
Hafiz Shehbaz Ali
Erik Cambria
OffRL
44
73
0
01 Jan 2021
Agent57: Outperforming the Atari Human Benchmark
Adria Puigdomenech Badia
Bilal Piot
Steven Kapturowski
Pablo Sprechmann
Alex Vitvitskyi
Daniel Guo
Charles Blundell
OffRL
13
509
0
30 Mar 2020
An empirical investigation of the challenges of real-world reinforcement learning
Gabriel Dulac-Arnold
Nir Levine
D. Mankowitz
Jerry Li
Cosmin Paduraru
Sven Gowal
Todd Hester
OffRL
31
120
0
24 Mar 2020
Explaining Deep Neural Networks and Beyond: A Review of Methods and Applications
Wojciech Samek
G. Montavon
Sebastian Lapuschkin
Christopher J. Anders
K. Müller
XAI
44
82
0
17 Mar 2020
Q-value Path Decomposition for Deep Multiagent Reinforcement Learning
Yaodong Yang
Jianye Hao
Guangyong Chen
Hongyao Tang
Yingfeng Chen
Yujing Hu
Changjie Fan
Zhongyu Wei
23
52
0
10 Feb 2020
A Survey of Deep Reinforcement Learning in Video Games
Kun Shao
Zhentao Tang
Yuanheng Zhu
Nannan Li
Dongbin Zhao
OffRL
AI4TS
37
188
0
23 Dec 2019
Automatic Design of CNNs via Differentiable Neural Architecture Search for PolSAR Image Classification
Hongwei Dong
Siyu Zhang
B. Zou
Lamei Zhang
16
47
0
16 Nov 2019
Towards Explainable Artificial Intelligence
Wojciech Samek
K. Müller
XAI
32
436
0
26 Sep 2019
Explaining and Interpreting LSTMs
L. Arras
Jose A. Arjona-Medina
Michael Widrich
G. Montavon
Michael Gillhofer
K. Müller
Sepp Hochreiter
Wojciech Samek
FAtt
AI4TS
18
79
0
25 Sep 2019
Ray Interference: a Source of Plateaus in Deep Reinforcement Learning
Tom Schaul
Diana Borsa
Joseph Modayil
Razvan Pascanu
11
63
0
25 Apr 2019
Sample-Efficient Model-Free Reinforcement Learning with Off-Policy Critics
Denis Steckelmacher
Hélène Plisnier
D. Roijers
A. Nowé
OffRL
18
17
0
11 Mar 2019
Optimizing Agent Behavior over Long Time Scales by Transporting Value
Chia-Chun Hung
Timothy Lillicrap
Josh Abramson
Yan Wu
M. Berk Mirza
Federico Carnevale
Arun Ahuja
Greg Wayne
21
121
0
15 Oct 2018
A Survey and Critique of Multiagent Deep Reinforcement Learning
Pablo Hernandez-Leal
Bilal Kartal
Matthew E. Taylor
OffRL
32
550
0
12 Oct 2018
Variance Reduction in Monte Carlo Counterfactual Regret Minimization (VR-MCCFR) for Extensive Form Games using Baselines
Martin Schmid
Neil Burch
Marc Lanctot
Matej Moravcík
Rudolf Kadlec
Michael Bowling
21
64
0
09 Sep 2018
Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update
Su Young Lee
Sung-Ik Choi
Sae-Young Chung
BDL
13
73
0
31 May 2018
Methods for Interpreting and Understanding Deep Neural Networks
G. Montavon
Wojciech Samek
K. Müller
FaML
234
2,238
0
24 Jun 2017
1