RUDDER: Return Decomposition for Delayed Rewards

20 June 2018
Jose A. Arjona-Medina
Michael Gillhofer
Michael Widrich
Thomas Unterthiner
Johannes Brandstetter
Sepp Hochreiter

Papers citing "RUDDER: Return Decomposition for Delayed Rewards"

40 / 40 papers shown
Economic Battery Storage Dispatch with Deep Reinforcement Learning from Rule-Based Demonstrations
Manuel Sage
Martin Staniszewski
Yaoyao Fiona Zhao
26
2
0
06 Apr 2025
Human Implicit Preference-Based Policy Fine-tuning for Multi-Agent Reinforcement Learning in USV Swarm
H. Kim
Kanghoon Lee
J. Park
Jiachen Li
Jinkyoo Park
62
1
0
05 Mar 2025
Evolution and The Knightian Blindspot of Machine Learning
Joel Lehman
Elliot Meyerson
Tarek El-Gaaly
Kenneth O. Stanley
Tarin Ziyaee
86
1
0
22 Jan 2025
Instance Temperature Knowledge Distillation
Zhengbo Zhang
Yuxi Zhou
Jia Gong
Jun Liu
Zhigang Tu
34
2
0
27 Jun 2024
Informativeness of Reward Functions in Reinforcement Learning
R. Devidze
Parameswaran Kamalaruban
Adish Singla
29
2
0
10 Feb 2024
A User Study on Explainable Online Reinforcement Learning for Adaptive Systems
Andreas Metzger
Jan Laufer
Felix Feit
Klaus Pohl
OffRL
OnRL
24
1
0
09 Jul 2023
Seq2Seq Imitation Learning for Tactile Feedback-based Manipulation
Wenyan Yang
A. Angleraud
R. Pieters
Joni Pajarinen
Joni-Kristian Kämäräinen
32
6
0
05 Mar 2023
Preference Transformer: Modeling Human Preferences using Transformers for RL
Changyeon Kim
Jongjin Park
Jinwoo Shin
Honglak Lee
Pieter Abbeel
Kimin Lee
OffRL
38
61
0
02 Mar 2023
Bridging Physics-Informed Neural Networks with Reinforcement Learning: Hamilton-Jacobi-Bellman Proximal Policy Optimization (HJBPPO)
Amartya Mukherjee
Jun Liu
20
11
0
01 Feb 2023
Feature construction using explanations of individual predictions
Boštjan Vouk
Matej Guid
Marko Robnik-Šikonja
FAtt
27
10
0
23 Jan 2023
Hypernetworks for Zero-shot Transfer in Reinforcement Learning
S. Rezaei-Shoshtari
Charlotte Morissette
F. Hogan
Gregory Dudek
D. Meger
OffRL
17
14
0
28 Nov 2022
Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Function
Clément Bonnet
Laurence Midgley
Alexandre Laterre
24
1
0
19 Nov 2022
Agent-Time Attention for Sparse Rewards Multi-Agent Reinforcement Learning
Jennifer She
Jayesh K. Gupta
Mykel J. Kochenderfer
31
4
0
31 Oct 2022
Reactive Exploration to Cope with Non-Stationarity in Lifelong Reinforcement Learning
C. Steinparz
Thomas Schmied
Fabian Paischer
Marius-Constantin Dinu
Vihang Patil
Angela Bitto-Nemling
Hamid Eghbalzadeh
Sepp Hochreiter
CLL
24
11
0
12 Jul 2022
Off-Beat Multi-Agent Reinforcement Learning
Wei Qiu
Weixun Wang
R. Wang
Bo An
Yujing Hu
S. Obraztsova
Zinovi Rabinovich
Jianye Hao
Yingfeng Chen
Changjie Fan
OffRL
29
2
0
27 May 2022
A Globally Convergent Evolutionary Strategy for Stochastic Constrained Optimization with Applications to Reinforcement Learning
Youssef Diouane
Aurelien Lucchi
Vihang Patil
24
3
0
21 Feb 2022
Selective Credit Assignment
Veronica Chelu
Diana Borsa
Doina Precup
Hado van Hasselt
26
2
0
20 Feb 2022
Retrieval-Augmented Reinforcement Learning
Anirudh Goyal
A. Friesen
Andrea Banino
T. Weber
Nan Rosemary Ke
...
Michal Valko
Simon Osindero
Timothy Lillicrap
N. Heess
Charles Blundell
OffRL
32
53
0
17 Feb 2022
Bayesian sense of time in biological and artificial brains
Z. Fountas
Alexey Zakharov
32
0
0
14 Jan 2022
Mirror Learning: A Unifying Framework of Policy Optimisation
J. Kuba
Christian Schroeder de Witt
Jakob N. Foerster
23
24
0
07 Jan 2022
Model-Based Episodic Memory Induces Dynamic Hybrid Controls
Hung Le
Thommen George Karimpanal
Majid Abdolshah
T. Tran
Svetha Venkatesh
19
19
0
03 Nov 2021
Evaluating the progress of Deep Reinforcement Learning in the real world: aligning domain-agnostic and domain-specific research
J. Luis
E. Crawley
B. Cameron
OffRL
25
6
0
07 Jul 2021
Towards Practical Credit Assignment for Deep Reinforcement Learning
Vyacheslav Alipov
Riley Simmons-Edler
N.Yu. Putintsev
Pavel Kalinin
Dmitry Vetrov
OffRL
29
11
0
08 Jun 2021
An Information-Theoretic Perspective on Credit Assignment in Reinforcement Learning
Dilip Arumugam
Peter Henderson
Pierre-Luc Bacon
24
17
0
10 Mar 2021
A Survey on Deep Reinforcement Learning for Audio-Based Applications
S. Latif
Heriberto Cuayáhuitl
Farrukh Pervez
Fahad Shamshad
Hafiz Shehbaz Ali
Erik Cambria
OffRL
47
73
0
01 Jan 2021
Agent57: Outperforming the Atari Human Benchmark
Adria Puigdomenech Badia
Bilal Piot
Steven Kapturowski
Pablo Sprechmann
Alex Vitvitskyi
Daniel Guo
Charles Blundell
OffRL
15
509
0
30 Mar 2020
An empirical investigation of the challenges of real-world reinforcement learning
Gabriel Dulac-Arnold
Nir Levine
D. Mankowitz
Jerry Li
Cosmin Paduraru
Sven Gowal
Todd Hester
OffRL
34
120
0
24 Mar 2020
Explaining Deep Neural Networks and Beyond: A Review of Methods and Applications
Wojciech Samek
G. Montavon
Sebastian Lapuschkin
Christopher J. Anders
K. Müller
XAI
44
82
0
17 Mar 2020
Q-value Path Decomposition for Deep Multiagent Reinforcement Learning
Yaodong Yang
Jianye Hao
Guangyong Chen
Hongyao Tang
Yingfeng Chen
Yujing Hu
Changjie Fan
Zhongyu Wei
23
52
0
10 Feb 2020
A Survey of Deep Reinforcement Learning in Video Games
Kun Shao
Zhentao Tang
Yuanheng Zhu
Nannan Li
Dongbin Zhao
OffRL
AI4TS
37
188
0
23 Dec 2019
Automatic Design of CNNs via Differentiable Neural Architecture Search for PolSAR Image Classification
Hongwei Dong
Siyu Zhang
B. Zou
Lamei Zhang
16
47
0
16 Nov 2019
Towards Explainable Artificial Intelligence
Wojciech Samek
K. Müller
XAI
32
436
0
26 Sep 2019
Explaining and Interpreting LSTMs
L. Arras
Jose A. Arjona-Medina
Michael Widrich
G. Montavon
Michael Gillhofer
K. Müller
Sepp Hochreiter
Wojciech Samek
FAtt
AI4TS
21
79
0
25 Sep 2019
Ray Interference: a Source of Plateaus in Deep Reinforcement Learning
Tom Schaul
Diana Borsa
Joseph Modayil
Razvan Pascanu
13
63
0
25 Apr 2019
Sample-Efficient Model-Free Reinforcement Learning with Off-Policy Critics
Denis Steckelmacher
Hélène Plisnier
D. Roijers
A. Nowé
OffRL
18
17
0
11 Mar 2019
Optimizing Agent Behavior over Long Time Scales by Transporting Value
Chia-Chun Hung
Timothy Lillicrap
Josh Abramson
Yan Wu
M. Berk Mirza
Federico Carnevale
Arun Ahuja
Greg Wayne
23
121
0
15 Oct 2018
A Survey and Critique of Multiagent Deep Reinforcement Learning
Pablo Hernandez-Leal
Bilal Kartal
Matthew E. Taylor
OffRL
32
550
0
12 Oct 2018
Variance Reduction in Monte Carlo Counterfactual Regret Minimization (VR-MCCFR) for Extensive Form Games using Baselines
Martin Schmid
Neil Burch
Marc Lanctot
Matej Moravcík
Rudolf Kadlec
Michael Bowling
23
64
0
09 Sep 2018
Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update
Su Young Lee
Sung-Ik Choi
Sae-Young Chung
BDL
15
73
0
31 May 2018
Methods for Interpreting and Understanding Deep Neural Networks
G. Montavon
Wojciech Samek
K. Müller
FaML
234
2,238
0
24 Jun 2017