ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1907.12392
  4. Cited By
A Unified Bellman Optimality Principle Combining Reward Maximization and
  Empowerment

A Unified Bellman Optimality Principle Combining Reward Maximization and Empowerment

26 July 2019
Felix Leibfried
Sergio Pascual-Diaz
Jordi Grau-Moya
ArXivPDFHTML

Papers citing "A Unified Bellman Optimality Principle Combining Reward Maximization and Empowerment"

32 / 32 papers shown
Title
Decision-Making with Auto-Encoding Variational Bayes
Decision-Making with Auto-Encoding Variational Bayes
Romain Lopez
Pierre Boyeau
Nir Yosef
Michael I. Jordan
Jeffrey Regier
BDL
346
10,591
0
17 Feb 2020
Mutual-Information Regularization in Markov Decision Processes and
  Actor-Critic Learning
Mutual-Information Regularization in Markov Decision Processes and Actor-Critic Learning
Felix Leibfried
Jordi Grau-Moya
52
22
0
11 Sep 2019
Soft Actor-Critic Algorithms and Applications
Soft Actor-Critic Algorithms and Applications
Tuomas Haarnoja
Aurick Zhou
Kristian Hartikainen
George Tucker
Sehoon Ha
...
Vikash Kumar
Henry Zhu
Abhishek Gupta
Pieter Abbeel
Sergey Levine
133
2,418
0
13 Dec 2018
Empowerment-driven Exploration using Mutual Information Estimation
Empowerment-driven Exploration using Mutual Information Estimation
Navneet Kumar
25
8
0
11 Oct 2018
Adversarial Imitation via Variational Inverse Reinforcement Learning
Adversarial Imitation via Variational Inverse Reinforcement Learning
A. H. Qureshi
Byron Boots
Michael C. Yip
52
61
0
17 Sep 2018
A unified strategy for implementing curiosity and empowerment driven
  reinforcement learning
A unified strategy for implementing curiosity and empowerment driven reinforcement learning
Ildefons Magrans de Abril
Ryota Kanai
60
20
0
18 Jun 2018
Maximum a Posteriori Policy Optimisation
Maximum a Posteriori Policy Optimisation
A. Abdolmaleki
Jost Tobias Springenberg
Yuval Tassa
Rémi Munos
N. Heess
Martin Riedmiller
71
477
0
14 Jun 2018
Deep Reinforcement Learning in a Handful of Trials using Probabilistic
  Dynamics Models
Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models
Kurtland Chua
Roberto Calandra
R. McAllister
Sergey Levine
BDL
210
1,272
0
30 May 2018
Reinforcement Learning and Control as Probabilistic Inference: Tutorial
  and Review
Reinforcement Learning and Control as Probabilistic Inference: Tutorial and Review
Sergey Levine
AI4CE
BDL
73
671
0
02 May 2018
Addressing Function Approximation Error in Actor-Critic Methods
Addressing Function Approximation Error in Actor-Critic Methods
Scott Fujimoto
H. V. Hoof
David Meger
OffRL
169
5,168
0
26 Feb 2018
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement
  Learning with a Stochastic Actor
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
284
8,313
0
04 Jan 2018
An Information-Theoretic Optimality Principle for Deep Reinforcement
  Learning
An Information-Theoretic Optimality Principle for Deep Reinforcement Learning
Felix Leibfried
Jordi Grau-Moya
Haitham Bou-Ammar
79
24
0
06 Aug 2017
Proximal Policy Optimization Algorithms
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
446
18,931
0
20 Jul 2017
A unified view of entropy-regularized Markov decision processes
A unified view of entropy-regularized Markov decision processes
Gergely Neu
Anders Jonsson
Vicencc Gómez
93
262
0
22 May 2017
Equivalence Between Policy Gradients and Soft Q-Learning
Equivalence Between Policy Gradients and Soft Q-Learning
John Schulman
Xi Chen
Pieter Abbeel
OffRL
80
345
0
21 Apr 2017
Bridging the Gap Between Value and Policy Based Reinforcement Learning
Bridging the Gap Between Value and Policy Based Reinforcement Learning
Ofir Nachum
Mohammad Norouzi
Kelvin Xu
Dale Schuurmans
152
470
0
28 Feb 2017
Reinforcement Learning with Deep Energy-Based Policies
Reinforcement Learning with Deep Energy-Based Policies
Tuomas Haarnoja
Haoran Tang
Pieter Abbeel
Sergey Levine
95
1,339
0
27 Feb 2017
OpenAI Gym
OpenAI Gym
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL
ODL
204
5,073
0
05 Jun 2016
Information Theoretically Aided Reinforcement Learning for Embodied
  Agents
Information Theoretically Aided Reinforcement Learning for Embodied Agents
Guido Montúfar
K. Zahedi
Nihat Ay
32
10
0
31 May 2016
Planning with Information-Processing Constraints and Model Uncertainty
  in Markov Decision Processes
Planning with Information-Processing Constraints and Model Uncertainty in Markov Decision Processes
Jordi Grau-Moya
Felix Leibfried
Tim Genewein
Daniel A. Braun
107
28
0
07 Apr 2016
Bounded Rational Decision-Making in Feedforward Neural Networks
Bounded Rational Decision-Making in Feedforward Neural Networks
Felix Leibfried
Daniel A. Braun
48
9
0
26 Feb 2016
Asynchronous Methods for Deep Reinforcement Learning
Asynchronous Methods for Deep Reinforcement Learning
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
189
8,833
0
04 Feb 2016
Taming the Noise in Reinforcement Learning via Soft Updates
Taming the Noise in Reinforcement Learning via Soft Updates
Roy Fox
Ari Pakman
Naftali Tishby
67
338
0
28 Dec 2015
Variational Information Maximisation for Intrinsically Motivated
  Reinforcement Learning
Variational Information Maximisation for Intrinsically Motivated Reinforcement Learning
S. Mohamed
Danilo Jimenez Rezende
DRL
SSL
86
401
0
29 Sep 2015
Deep Reinforcement Learning with Double Q-learning
Deep Reinforcement Learning with Double Q-learning
H. V. Hasselt
A. Guez
David Silver
OffRL
156
7,623
0
22 Sep 2015
Continuous control with deep reinforcement learning
Continuous control with deep reinforcement learning
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
310
13,214
0
09 Sep 2015
Trust Region Policy Optimization
Trust Region Policy Optimization
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
274
6,755
0
19 Feb 2015
Adam: A Method for Stochastic Optimization
Adam: A Method for Stochastic Optimization
Diederik P. Kingma
Jimmy Ba
ODL
1.6K
149,842
0
22 Dec 2014
Empowerment -- an Introduction
Empowerment -- an Introduction
Christoph Salge
C. Glackin
Daniel Polani
79
181
0
07 Oct 2013
Thermodynamics as a theory of decision-making with information
  processing costs
Thermodynamics as a theory of decision-making with information processing costs
Pedro A. Ortega
Daniel A. Braun
85
262
0
29 Apr 2012
Empowerment for Continuous Agent-Environment Systems
Empowerment for Continuous Agent-Environment Systems
T. Jung
Daniel Polani
Peter Stone
61
99
0
31 Jan 2012
Higher coordination with less control - A result of information
  maximization in the sensorimotor loop
Higher coordination with less control - A result of information maximization in the sensorimotor loop
K. Zahedi
Nihat Ay
R. Der
90
71
0
11 Oct 2009
1