ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 0810.3605
  4. Cited By
A Minimum Relative Entropy Principle for Learning and Acting
v1v2v3 (latest)

A Minimum Relative Entropy Principle for Learning and Acting

20 October 2008
Pedro A. Ortega
Daniel A. Braun
ArXiv (abs)PDFHTML

Papers citing "A Minimum Relative Entropy Principle for Learning and Acting"

26 / 26 papers shown
Title
Partition Tree Weighting for Non-Stationary Stochastic Bandits
Partition Tree Weighting for Non-Stationary Stochastic Bandits
Joel Veness
Marcus Hutter
Andras Gyorgy
Jordi Grau-Moya
45
0
0
26 Feb 2025
A Unifying Framework for Causal Imitation Learning with Hidden Confounders
A Unifying Framework for Causal Imitation Learning with Hidden Confounders
Daqian Shao
Thomas Kleine Buening
Marta Z. Kwiatkowska
CML
135
1
0
11 Feb 2025
Memory Sequence Length of Data Sampling Impacts the Adaptation of
  Meta-Reinforcement Learning Agents
Memory Sequence Length of Data Sampling Impacts the Adaptation of Meta-Reinforcement Learning Agents
Menglong Zhang
Fuyuan Qian
Quanying Liu
94
1
0
18 Jun 2024
Bayesian Learning of Optimal Policies in Markov Decision Processes with
  Countably Infinite State-Space
Bayesian Learning of Optimal Policies in Markov Decision Processes with Countably Infinite State-Space
Saghar Adler
V. Subramanian
47
2
0
05 Jun 2023
Shaking the foundations: delusions in sequence models for interaction
  and control
Shaking the foundations: delusions in sequence models for interaction and control
Pedro A. Ortega
M. Kunesch
Grégoire Delétang
Tim Genewein
Jordi Grau-Moya
...
Yutian Chen
Scott E. Reed
Marcus Hutter
Nando de Freitas
Shane Legg
91
64
0
20 Oct 2021
Algorithms for Causal Reasoning in Probability Trees
Algorithms for Causal Reasoning in Probability Trees
Tim Genewein
Tom McGrath
Grégoire Delétang
Vladimir Mikulik
Miljan Martic
Shane Legg
Pedro A. Ortega
TPMCML
55
16
0
23 Oct 2020
Action and Perception as Divergence Minimization
Action and Perception as Divergence Minimization
Danijar Hafner
Pedro A. Ortega
Jimmy Ba
Thomas Parr
Karl J. Friston
N. Heess
91
53
0
03 Sep 2020
Sophisticated Inference
Sophisticated Inference
Karl J. Friston
Lancelot Da Costa
Danijar Hafner
C. Hesp
Thomas Parr
89
101
0
07 Jun 2020
Efficient exploration of zero-sum stochastic games
Efficient exploration of zero-sum stochastic games
Carlos Martin
Tuomas Sandholm
41
5
0
24 Feb 2020
Exploration by Optimisation in Partial Monitoring
Exploration by Optimisation in Partial Monitoring
Tor Lattimore
Csaba Szepesvári
74
38
0
12 Jul 2019
Meta-learning of Sequential Strategies
Meta-learning of Sequential Strategies
Pedro A. Ortega
Jane X. Wang
Mark Rowland
Tim Genewein
Z. Kurth-Nelson
...
Yee Whye Teh
H. V. Hasselt
Nando de Freitas
M. Botvinick
Shane Legg
OffRL
123
101
0
08 May 2019
Bounded rational decision-making from elementary computations that
  reduce uncertainty
Bounded rational decision-making from elementary computations that reduce uncertainty
Sebastian Gottwald
Daniel A. Braun
81
33
0
08 Apr 2019
Expanding the Active Inference Landscape: More Intrinsic Motivations in
  the Perception-Action Loop
Expanding the Active Inference Landscape: More Intrinsic Motivations in the Perception-Action Loop
Martin Biehl
Christian Guckelsberger
Christoph Salge
Simón C. Smith
Daniel Polani
LRMAI4CE
150
26
0
21 Jun 2018
Information-gain computation
Information-gain computation
Anthony Di Franco
20
1
0
05 Jul 2017
Nonparametric General Reinforcement Learning
Nonparametric General Reinforcement Learning
Jan Leike
OffRL
113
26
0
28 Nov 2016
Planning with Information-Processing Constraints and Model Uncertainty
  in Markov Decision Processes
Planning with Information-Processing Constraints and Model Uncertainty in Markov Decision Processes
Jordi Grau-Moya
Felix Leibfried
Tim Genewein
Daniel A. Braun
143
28
0
07 Apr 2016
Thompson Sampling is Asymptotically Optimal in General Environments
Thompson Sampling is Asymptotically Optimal in General Environments
Jan Leike
Tor Lattimore
Laurent Orseau
Marcus Hutter
137
39
0
25 Feb 2016
Belief Flows of Robust Online Learning
Belief Flows of Robust Online Learning
Pedro A. Ortega
K. Crammer
Daniel D. Lee
35
0
0
26 May 2015
Thompson Sampling for Learning Parameterized Markov Decision Processes
Thompson Sampling for Learning Parameterized Markov Decision Processes
Aditya Gopalan
Shie Mannor
86
0
0
29 Jun 2014
Thompson Sampling for Complex Bandit Problems
Thompson Sampling for Complex Bandit Problems
Aditya Gopalan
Shie Mannor
Yishay Mansour
165
204
0
03 Nov 2013
Generalized Thompson Sampling for Sequential Decision-Making and Causal
  Inference
Generalized Thompson Sampling for Sequential Decision-Making and Causal Inference
Pedro A. Ortega
Daniel A. Braun
CML
131
52
0
18 Mar 2013
A Nonparametric Conjugate Prior Distribution for the Maximizing Argument
  of a Noisy Function
A Nonparametric Conjugate Prior Distribution for the Maximizing Argument of a Noisy Function
Pedro A. Ortega
Jordi Grau-Moya
Tim Genewein
David Balduzzi
Daniel A. Braun
113
2
0
09 Jun 2012
Thermodynamics as a theory of decision-making with information
  processing costs
Thermodynamics as a theory of decision-making with information processing costs
Pedro A. Ortega
Daniel A. Braun
141
263
0
29 Apr 2012
Information, Utility & Bounded Rationality
Information, Utility & Bounded Rationality
Pedro A. Ortega
Daniel A. Braun
97
2
0
28 Jul 2011
An axiomatic formalization of bounded rationality based on a
  utility-information equivalence
An axiomatic formalization of bounded rationality based on a utility-information equivalence
Pedro A. Ortega
Daniel A. Braun
100
2
0
06 Jul 2010
A conversion between utility and information
A conversion between utility and information
Pedro A. Ortega
Daniel A. Braun
125
21
0
26 Nov 2009
1