ResearchTrend.AI
Compress and Control

19 November 2014
J. Veness
Marc G. Bellemare
Marcus Hutter
Alvin Chua
Guillaume Desjardins
    OffRL

Papers citing "Compress and Control"

10 / 10 papers shown
In-context learning and Occam's razor
Eric Elmoznino
Tom Marty
Tejas Kasetty
Léo Gagnon
Sarthak Mittal
Mahan Fathi
Dhanya Sridhar
Guillaume Lajoie
119
1
0
17 Oct 2024
Playing Atari with Deep Reinforcement Learning
Volodymyr Mnih
Koray Kavukcuoglu
David Silver
Alex Graves
Ioannis Antonoglou
Daan Wierstra
Martin Riedmiller
127
12,261
0
19 Dec 2013
Sparse Adaptive Dirichlet-Multinomial-like Processes
Marcus Hutter
64
15
0
16 May 2013
Cover Tree Bayesian Reinforcement Learning
Nikolaos Tziortziotis
Christos Dimitrakakis
K. Blekas
OffRL, BDL
76
21
0
08 May 2013
Partition Tree Weighting
J. Veness
Martha White
Michael Bowling
András Gyorgy
113
22
0
03 Nov 2012
The Arcade Learning Environment: An Evaluation Platform for General Agents
Marc G. Bellemare
Yavar Naddaf
J. Veness
Michael Bowling
120
3,020
0
19 Jul 2012
Efficient Bayes-Adaptive Reinforcement Learning using Sample-Based Search
A. Guez
David Silver
Peter Dayan
102
174
0
14 May 2012
Learning is planning: near Bayes-optimal reinforcement learning via Monte-Carlo tree search
J. Asmuth
Michael L. Littman
122
42
0
14 Feb 2012
Efficient Tracking of Large Classes of Experts
András Gyorgy
Tamás Linder
Gábor Lugosi
115
75
0
12 Oct 2011
Reinforcement Learning via AIXI Approximation
J. Veness
K. S. Ng
Marcus Hutter
David Silver
98
29
0
13 Jul 2010