What Can Learned Intrinsic Rewards Capture?

11 December 2019
Zeyu Zheng
Junhyuk Oh
Matteo Hessel
Zhongwen Xu
M. Kroiss
H. V. Hasselt
David Silver
Satinder Singh

Papers citing "What Can Learned Intrinsic Rewards Capture?"

37 / 37 papers shown
Black box meta-learning intrinsic rewards for sparse-reward environments
Octavio Pappalardo
Rodrigo Ramele
Juan Miguel Santos
OffRL
85
0
0
31 Jul 2024
Evolution of Rewards for Food and Motor Action by Simulating Birth and Death
Yuji Kanagawa
Kenji Doya
26
0
0
21 Jun 2024
Improving Dialogue Agents by Decomposing One Global Explicit Annotation with Local Implicit Multimodal Feedback
Dong Won Lee
Hae Won Park
Yoon Kim
C. Breazeal
Louis-Philippe Morency
104
0
0
17 Mar 2024
A Decision-Language Model (DLM) for Dynamic Restless Multi-Armed Bandit Tasks in Public Health
Nikhil Behari
Edwin Zhang
Yunfan Zhao
Aparna Taneja
Dheeraj M. Nagaraj
Milind Tambe
139
16
0
22 Feb 2024
General Policies, Subgoal Structure, and Planning Width
Blai Bonet
Hector Geffner
36
2
0
09 Nov 2023
Behavior Alignment via Reward Function Optimization
Dhawal Gupta
Yash Chandak
Scott M. Jordan
Philip S. Thomas
Bruno Castro da Silva
106
11
0
29 Oct 2023
Diversity is Strength: Mastering Football Full Game with Interactive Reinforcement Learning of Multiple AIs
Chenglu Sun
Shuo Shen
Sijia Xu
Weidong Zhang
52
1
0
28 Jun 2023
CLUE: Calibrated Latent Guidance for Offline Reinforcement Learning
Jinxin Liu
Lipeng Zu
Li He
Donglin Wang
OffRL
112
9
0
23 Jun 2023
Mastering Asymmetrical Multiplayer Game with Multi-Agent Asymmetric-Evolution Reinforcement Learning
Chenglu Sun
Yi-cui Zhang
Yu Zhang
Ziling Lu
Jingbin Liu
Si-Qi Xu
Weidong Zhang
35
0
0
20 Apr 2023
A Domain-Agnostic Approach for Characterization of Lifelong Learning Systems
Megan M. Baker
Alexander New
Mario Aguilar-Simon
Ziad Al-Halah
Sébastien M. R. Arnold
...
Zifan Xu
A. Yanguas-Gil
Harel Yedidsion
Shangqun Yu
Gautam K. Vallabha
81
18
0
18 Jan 2023
Human-Timescale Adaptation in an Open-Ended Task Space
Adaptive Agent Team
Jakob Bauer
Kate Baumli
Satinder Baveja
Feryal M. P. Behbahani
...
Jakub Sygnowski
K. Tuyls
Sarah York
Alexander Zacherl
Lei Zhang
LM&Ro, OffRL, AI4CE, LRM
137
119
0
18 Jan 2023
Reusable Options through Gradient-based Meta Learning
David Kuric
H. V. Hoof
88
0
0
22 Dec 2022
Hypernetworks for Zero-shot Transfer in Reinforcement Learning
S. Rezaei-Shoshtari
Charlotte Morissette
F. Hogan
Gregory Dudek
David Meger
OffRL
86
15
0
28 Nov 2022
Redeeming Intrinsic Rewards via Constrained Optimization
Eric Chen
Zhang-Wei Hong
Joni Pajarinen
Pulkit Agrawal
OnRL
104
27
0
14 Nov 2022
LECO: Learnable Episodic Count for Task-Specific Intrinsic Reward
DaeJin Jo
Sungwoong Kim
D. W. Nam
Taehwan Kwon
Seungeun Rho
Jongmin Kim
Donghoon Lee
OffRL
70
10
0
11 Oct 2022
An Investigation of the Bias-Variance Tradeoff in Meta-Gradients
Risto Vuorio
Jacob Beck
Shimon Whiteson
Jakob N. Foerster
Gregory Farquhar
79
7
0
22 Sep 2022
Language-Based Causal Representation Learning
Blai Bonet
Hector Geffner
79
0
0
12 Jul 2022
Learning Sketches for Decomposing Planning Problems into Subproblems of Bounded Width: Extended Version
Dominik Drexler
Jendrik Seipp
Hector Geffner
17
19
0
28 Mar 2022
Learning Synthetic Environments and Reward Networks for Reinforcement Learning
Fabio Ferreira
Thomas Nierhoff
Andreas Saelinger
Frank Hutter
38
4
0
06 Feb 2022
A Theoretical Understanding of Gradient Bias in Meta-Reinforcement Learning
Xidong Feng
Bo Liu
Jie Ren
Luo Mai
Rui Zhu
Haifeng Zhang
Jun Wang
Yaodong Yang
94
12
0
31 Dec 2021
Learning Long-Term Reward Redistribution via Randomized Return Decomposition
Zhizhou Ren
Ruihan Guo
Yuanshuo Zhou
Jian-wei Peng
125
38
0
26 Nov 2021
On the Expressivity of Markov Reward
David Abel
Will Dabney
Anna Harutyunyan
Mark K. Ho
Michael L. Littman
Doina Precup
Satinder Singh
86
85
0
01 Nov 2021
Wasserstein Distance Maximizing Intrinsic Control
Ishan Durugkar
Steven Hansen
Stephen Spencer
Volodymyr Mnih
97
6
0
28 Oct 2021
Hindsight Foresight Relabeling for Meta-Reinforcement Learning
Michael Wan
Jian-wei Peng
Tanmay Gangwani
107
6
0
18 Sep 2021
Target Languages (vs. Inductive Biases) for Learning to Act and Plan
Hector Geffner
73
6
0
15 Sep 2021
Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain
Jianye Hao
Tianpei Yang
Hongyao Tang
Chenjia Bai
Jinyi Liu
Zhaopeng Meng
Peng Liu
Zhen Wang
OffRL
86
102
0
14 Sep 2021
Neural Auto-Curricula
Xidong Feng
Oliver Slumbers
Ziyu Wan
Bo Liu
Stephen Marcus McAleer
Ying Wen
Jun Wang
Yaodong Yang
96
2
0
04 Jun 2021
Adversarial Intrinsic Motivation for Reinforcement Learning
Ishan Durugkar
Mauricio Tec
S. Niekum
Peter Stone
OOD
127
41
0
27 May 2021
Expressing and Exploiting the Common Subgoal Structure of Classical Planning Domains Using Sketches: Extended Version
Dominik Drexler
Jendrik Seipp
Hector Geffner
43
16
0
10 May 2021
Delayed Rewards Calibration via Reward Empirical Sufficiency
Yixuan Liu
Hu Wang
Xiaowei Wang
Xiaoyue Sun
Liuyue Jiang
Minhui Xue
40
0
0
21 Feb 2021
Adaptive Pairwise Weights for Temporal Credit Assignment
Zeyu Zheng
Risto Vuorio
Richard L. Lewis
Satinder Singh
47
5
0
09 Feb 2021
Towards Continual Reinforcement Learning: A Review and Perspectives
Khimya Khetarpal
Matthew D Riemer
Irina Rish
Doina Precup
CLL, OffRL
142
324
0
25 Dec 2020
Learning to Utilize Shaping Rewards: A New Approach of Reward Shaping
Yujing Hu
Weixun Wang
Hangtian Jia
Yixiang Wang
Yingfeng Chen
Jianye Hao
Feng Wu
Changjie Fan
OffRL
99
179
0
05 Nov 2020
Variational Dynamic for Self-Supervised Exploration in Deep Reinforcement Learning
Chenjia Bai
Peng Liu
Kaiyu Liu
Zhaoran Wang
Yingnan Zhao
Lingxiao Wang
SSL
53
18
0
17 Oct 2020
Discovering Reinforcement Learning Algorithms
Junhyuk Oh
Matteo Hessel
Wojciech M. Czarnecki
Zhongwen Xu
H. V. Hasselt
Satinder Singh
David Silver
81
129
0
17 Jul 2020
Meta-Gradient Reinforcement Learning with an Objective Discovered Online
Zhongwen Xu
H. V. Hasselt
Matteo Hessel
Junhyuk Oh
Satinder Singh
David Silver
82
78
0
16 Jul 2020
Task-Agnostic Exploration via Policy Gradient of a Non-Parametric State Entropy Estimate
Mirco Mutti
Lorenzo Pratissoli
Marcello Restelli
73
19
0
09 Jul 2020