ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1811.10928
  4. Cited By
Single-Agent Policy Tree Search With Guarantees

Single-Agent Policy Tree Search With Guarantees

27 November 2018
Laurent Orseau
Levi H. S. Lelis
Tor Lattimore
T. Weber
ArXivPDFHTML

Papers citing "Single-Agent Policy Tree Search With Guarantees"

7 / 7 papers shown
Title
Soft-Bayes: Prod for Mixtures of Experts with Log-Loss
Soft-Bayes: Prod for Mixtures of Experts with Log-Loss
Laurent Orseau
Tor Lattimore
Shane Legg
18
22
0
08 Jan 2019
Mastering Chess and Shogi by Self-Play with a General Reinforcement
  Learning Algorithm
Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm
David Silver
Thomas Hubert
Julian Schrittwieser
Ioannis Antonoglou
Matthew Lai
...
D. Kumaran
T. Graepel
Timothy Lillicrap
Karen Simonyan
Demis Hassabis
99
1,755
0
05 Dec 2017
Imagination-Augmented Agents for Deep Reinforcement Learning
Imagination-Augmented Agents for Deep Reinforcement Learning
T. Weber
S. Racanière
David P. Reichert
Lars Buesing
A. Guez
...
Razvan Pascanu
Peter W. Battaglia
Demis Hassabis
David Silver
Daan Wierstra
LM&Ro
65
552
0
19 Jul 2017
Asynchronous Methods for Deep Reinforcement Learning
Asynchronous Methods for Deep Reinforcement Learning
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
161
8,805
0
04 Feb 2016
The LAMA Planner: Guiding Cost-Based Anytime Planning with Landmarks
The LAMA Planner: Guiding Cost-Based Anytime Planning with Landmarks
Silvia Richter
Matthias Westphal
48
735
0
16 Jan 2014
The Fast Downward Planning System
The Fast Downward Planning System
M. Helmert
44
1,895
0
27 Sep 2011
The FF Planning System: Fast Plan Generation Through Heuristic Search
The FF Planning System: Fast Plan Generation Through Heuristic Search
Jörg Hoffmann
Bernhard Nebel
74
2,352
0
03 Jun 2011
1