A Minimum Relative Entropy Principle for Learning and Acting

20 October 2008

Papers citing "A Minimum Relative Entropy Principle for Learning and Acting"

26 / 26 papers shown

Title
Partition Tree Weighting for Non-Stationary Stochastic Bandits Joel Veness Marcus Hutter Andras Gyorgy Jordi Grau-Moya 45 0 0 26 Feb 2025
A Unifying Framework for Causal Imitation Learning with Hidden Confounders Daqian Shao Thomas Kleine Buening Marta Z. Kwiatkowska CML 135 1 0 11 Feb 2025
Memory Sequence Length of Data Sampling Impacts the Adaptation of Meta-Reinforcement Learning Agents Menglong Zhang Fuyuan Qian Quanying Liu 94 1 0 18 Jun 2024
Bayesian Learning of Optimal Policies in Markov Decision Processes with Countably Infinite State-Space Saghar Adler V. Subramanian 47 2 0 05 Jun 2023
Shaking the foundations: delusions in sequence models for interaction and control Pedro A. Ortega M. Kunesch Grégoire Delétang Tim Genewein Jordi Grau-Moya ... Yutian Chen Scott E. Reed Marcus Hutter Nando de Freitas Shane Legg 91 64 0 20 Oct 2021
Algorithms for Causal Reasoning in Probability Trees Tim Genewein Tom McGrath Grégoire Delétang Vladimir Mikulik Miljan Martic Shane Legg Pedro A. Ortega TPM CML 55 16 0 23 Oct 2020
Action and Perception as Divergence Minimization Danijar Hafner Pedro A. Ortega Jimmy Ba Thomas Parr Karl J. Friston N. Heess 91 53 0 03 Sep 2020
Sophisticated Inference Karl J. Friston Lancelot Da Costa Danijar Hafner C. Hesp Thomas Parr 89 101 0 07 Jun 2020
Efficient exploration of zero-sum stochastic games Carlos Martin Tuomas Sandholm 39 5 0 24 Feb 2020
Exploration by Optimisation in Partial Monitoring Tor Lattimore Csaba Szepesvári 74 38 0 12 Jul 2019
Meta-learning of Sequential Strategies Pedro A. Ortega Jane X. Wang Mark Rowland Tim Genewein Z. Kurth-Nelson ... Yee Whye Teh H. V. Hasselt Nando de Freitas M. Botvinick Shane Legg OffRL 123 101 0 08 May 2019
Bounded rational decision-making from elementary computations that reduce uncertainty Sebastian Gottwald Daniel A. Braun 81 33 0 08 Apr 2019
Expanding the Active Inference Landscape: More Intrinsic Motivations in the Perception-Action Loop Martin Biehl Christian Guckelsberger Christoph Salge Simón C. Smith Daniel Polani LRM AI4CE 150 26 0 21 Jun 2018
Information-gain computation Anthony Di Franco 18 1 0 05 Jul 2017
Nonparametric General Reinforcement Learning Jan Leike OffRL 113 26 0 28 Nov 2016
Planning with Information-Processing Constraints and Model Uncertainty in Markov Decision Processes Jordi Grau-Moya Felix Leibfried Tim Genewein Daniel A. Braun 143 28 0 07 Apr 2016
Thompson Sampling is Asymptotically Optimal in General Environments Jan Leike Tor Lattimore Laurent Orseau Marcus Hutter 137 39 0 25 Feb 2016
Belief Flows of Robust Online Learning Pedro A. Ortega K. Crammer Daniel D. Lee 33 0 0 26 May 2015
Thompson Sampling for Learning Parameterized Markov Decision Processes Aditya Gopalan Shie Mannor 86 0 0 29 Jun 2014
Thompson Sampling for Complex Bandit Problems Aditya Gopalan Shie Mannor Yishay Mansour 165 204 0 03 Nov 2013
Generalized Thompson Sampling for Sequential Decision-Making and Causal Inference Pedro A. Ortega Daniel A. Braun CML 131 52 0 18 Mar 2013
A Nonparametric Conjugate Prior Distribution for the Maximizing Argument of a Noisy Function Pedro A. Ortega Jordi Grau-Moya Tim Genewein David Balduzzi Daniel A. Braun 113 2 0 09 Jun 2012
Thermodynamics as a theory of decision-making with information processing costs Pedro A. Ortega Daniel A. Braun 141 263 0 29 Apr 2012
Information, Utility & Bounded Rationality Pedro A. Ortega Daniel A. Braun 95 2 0 28 Jul 2011
An axiomatic formalization of bounded rationality based on a utility-information equivalence Pedro A. Ortega Daniel A. Braun 98 2 0 06 Jul 2010
A conversion between utility and information Pedro A. Ortega Daniel A. Braun 125 21 0 26 Nov 2009