ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1912.02807
  4. Cited By
Combining Q-Learning and Search with Amortized Value Estimates

Combining Q-Learning and Search with Amortized Value Estimates

5 December 2019
Jessica B. Hamrick
V. Bapst
Alvaro Sanchez-Gonzalez
Tobias Pfaff
T. Weber
Lars Buesing
Peter W. Battaglia
    OffRL
ArXivPDFHTML

Papers citing "Combining Q-Learning and Search with Amortized Value Estimates"

9 / 9 papers shown
Title
Decision Making in Non-Stationary Environments with Policy-Augmented
  Search
Decision Making in Non-Stationary Environments with Policy-Augmented Search
Ava Pettet
Yunuo Zhang
Baiting Luo
Kyle Wray
Hendrik Baier
Aron Laszka
Abhishek Dubey
Ayan Mukhopadhyay
12
3
0
06 Jan 2024
Optimal Robotic Assembly Sequence Planning: A Sequential Decision-Making Approach
Optimal Robotic Assembly Sequence Planning: A Sequential Decision-Making Approach
Kartik Nagpal
Negar Mehr
25
0
0
26 Oct 2023
Cognitive Architectures for Language Agents
Cognitive Architectures for Language Agents
T. Sumers
Shunyu Yao
Karthik Narasimhan
Thomas L. Griffiths
LLMAG
LM&Ro
54
153
0
05 Sep 2023
Graph-based Reinforcement Learning meets Mixed Integer Programs: An
  application to 3D robot assembly discovery
Graph-based Reinforcement Learning meets Mixed Integer Programs: An application to 3D robot assembly discovery
Niklas Funk
Svenja Menzenbach
Georgia Chalvatzaki
Jan Peters
31
13
0
08 Mar 2022
Evaluating model-based planning and planner amortization for continuous
  control
Evaluating model-based planning and planner amortization for continuous control
Arunkumar Byravan
Leonard Hasenclever
Piotr Trochim
M. Berk Mirza
Alessandro Davide Ialongo
...
Jost Tobias Springenberg
A. Abdolmaleki
N. Heess
J. Merel
Martin Riedmiller
55
17
0
07 Oct 2021
Learning Off-Policy with Online Planning
Learning Off-Policy with Online Planning
Harshit S. Sikchi
Wenxuan Zhou
David Held
OffRL
29
45
0
23 Aug 2020
Monte-Carlo Tree Search as Regularized Policy Optimization
Monte-Carlo Tree Search as Regularized Policy Optimization
Jean-Bastien Grill
Florent Altché
Yunhao Tang
Thomas Hubert
Michal Valko
Ioannis Antonoglou
Rémi Munos
19
73
0
24 Jul 2020
A Unifying Framework for Reinforcement Learning and Planning
A Unifying Framework for Reinforcement Learning and Planning
Thomas M. Moerland
Joost Broekens
Aske Plaat
Catholijn M. Jonker
OffRL
25
9
0
26 Jun 2020
Plan2Vec: Unsupervised Representation Learning by Latent Plans
Plan2Vec: Unsupervised Representation Learning by Latent Plans
Ge Yang
Amy Zhang
Ari S. Morcos
Joelle Pineau
Pieter Abbeel
Roberto Calandra
SSL
OffRL
28
27
0
07 May 2020
1