Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
0810.3605
Cited By
v1
v2
v3 (latest)
A Minimum Relative Entropy Principle for Learning and Acting
20 October 2008
Pedro A. Ortega
Daniel A. Braun
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"A Minimum Relative Entropy Principle for Learning and Acting"
26 / 26 papers shown
Title
Partition Tree Weighting for Non-Stationary Stochastic Bandits
Joel Veness
Marcus Hutter
Andras Gyorgy
Jordi Grau-Moya
45
0
0
26 Feb 2025
A Unifying Framework for Causal Imitation Learning with Hidden Confounders
Daqian Shao
Thomas Kleine Buening
Marta Z. Kwiatkowska
CML
135
1
0
11 Feb 2025
Memory Sequence Length of Data Sampling Impacts the Adaptation of Meta-Reinforcement Learning Agents
Menglong Zhang
Fuyuan Qian
Quanying Liu
94
1
0
18 Jun 2024
Bayesian Learning of Optimal Policies in Markov Decision Processes with Countably Infinite State-Space
Saghar Adler
V. Subramanian
47
2
0
05 Jun 2023
Shaking the foundations: delusions in sequence models for interaction and control
Pedro A. Ortega
M. Kunesch
Grégoire Delétang
Tim Genewein
Jordi Grau-Moya
...
Yutian Chen
Scott E. Reed
Marcus Hutter
Nando de Freitas
Shane Legg
91
64
0
20 Oct 2021
Algorithms for Causal Reasoning in Probability Trees
Tim Genewein
Tom McGrath
Grégoire Delétang
Vladimir Mikulik
Miljan Martic
Shane Legg
Pedro A. Ortega
TPM
CML
55
16
0
23 Oct 2020
Action and Perception as Divergence Minimization
Danijar Hafner
Pedro A. Ortega
Jimmy Ba
Thomas Parr
Karl J. Friston
N. Heess
91
53
0
03 Sep 2020
Sophisticated Inference
Karl J. Friston
Lancelot Da Costa
Danijar Hafner
C. Hesp
Thomas Parr
89
101
0
07 Jun 2020
Efficient exploration of zero-sum stochastic games
Carlos Martin
Tuomas Sandholm
39
5
0
24 Feb 2020
Exploration by Optimisation in Partial Monitoring
Tor Lattimore
Csaba Szepesvári
74
38
0
12 Jul 2019
Meta-learning of Sequential Strategies
Pedro A. Ortega
Jane X. Wang
Mark Rowland
Tim Genewein
Z. Kurth-Nelson
...
Yee Whye Teh
H. V. Hasselt
Nando de Freitas
M. Botvinick
Shane Legg
OffRL
123
101
0
08 May 2019
Bounded rational decision-making from elementary computations that reduce uncertainty
Sebastian Gottwald
Daniel A. Braun
81
33
0
08 Apr 2019
Expanding the Active Inference Landscape: More Intrinsic Motivations in the Perception-Action Loop
Martin Biehl
Christian Guckelsberger
Christoph Salge
Simón C. Smith
Daniel Polani
LRM
AI4CE
150
26
0
21 Jun 2018
Information-gain computation
Anthony Di Franco
18
1
0
05 Jul 2017
Nonparametric General Reinforcement Learning
Jan Leike
OffRL
113
26
0
28 Nov 2016
Planning with Information-Processing Constraints and Model Uncertainty in Markov Decision Processes
Jordi Grau-Moya
Felix Leibfried
Tim Genewein
Daniel A. Braun
143
28
0
07 Apr 2016
Thompson Sampling is Asymptotically Optimal in General Environments
Jan Leike
Tor Lattimore
Laurent Orseau
Marcus Hutter
137
39
0
25 Feb 2016
Belief Flows of Robust Online Learning
Pedro A. Ortega
K. Crammer
Daniel D. Lee
33
0
0
26 May 2015
Thompson Sampling for Learning Parameterized Markov Decision Processes
Aditya Gopalan
Shie Mannor
86
0
0
29 Jun 2014
Thompson Sampling for Complex Bandit Problems
Aditya Gopalan
Shie Mannor
Yishay Mansour
165
204
0
03 Nov 2013
Generalized Thompson Sampling for Sequential Decision-Making and Causal Inference
Pedro A. Ortega
Daniel A. Braun
CML
131
52
0
18 Mar 2013
A Nonparametric Conjugate Prior Distribution for the Maximizing Argument of a Noisy Function
Pedro A. Ortega
Jordi Grau-Moya
Tim Genewein
David Balduzzi
Daniel A. Braun
113
2
0
09 Jun 2012
Thermodynamics as a theory of decision-making with information processing costs
Pedro A. Ortega
Daniel A. Braun
141
263
0
29 Apr 2012
Information, Utility & Bounded Rationality
Pedro A. Ortega
Daniel A. Braun
95
2
0
28 Jul 2011
An axiomatic formalization of bounded rationality based on a utility-information equivalence
Pedro A. Ortega
Daniel A. Braun
98
2
0
06 Jul 2010
A conversion between utility and information
Pedro A. Ortega
Daniel A. Braun
125
21
0
26 Nov 2009
1