ResearchTrend.AI
Guided Cooperation in Hierarchical Reinforcement Learning via Model-based Rollout

24 September 2023
Haoran Wang
Zeshen Tang
Leya Yang
Yaoru Sun
Fang Wang
Siyu Zhang
Ye-Ting Chen

Papers citing "Guided Cooperation in Hierarchical Reinforcement Learning via Model-based Rollout"

18 / 18 papers shown
HG2P: Hippocampus-inspired High-reward Graph and Model-Free Q-Gradient Penalty for Path Planning and Motion Control
Haoran Wang
Yaoru Sun
Zeshen Tang
Haibo Shi
Chenyuan Jiao
64
0
0
12 Oct 2024
Looking Backward: Retrospective Backward Synthesis for Goal-Conditioned GFlowNets
Haoran He
C. Chang
Huazhe Xu
Ling Pan
108
7
0
03 Jun 2024
Imitating Graph-Based Planning with Goal-Conditioned Policies
Junsup Kim
Younggyo Seo
SungSoo Ahn
Kyunghwan Son
Jinwoo Shin
48
10
0
20 Mar 2023
Landmark-Guided Subgoal Generation in Hierarchical Reinforcement Learning
Junsup Kim
Younggyo Seo
Jinwoo Shin
60
59
0
26 Oct 2021
Model-based Policy Optimization with Unsupervised Model Adaptation
Jian Shen
Han Zhao
Weinan Zhang
Yong Yu
57
28
0
19 Oct 2020
LaND: Learning to Navigate from Disengagements
G. Kahn
Pieter Abbeel
Sergey Levine
30
52
0
09 Oct 2020
Plan2Vec: Unsupervised Representation Learning by Latent Plans
Ge Yang
Amy Zhang
Ari S. Morcos
Joelle Pineau
Pieter Abbeel
Roberto Calandra
SSL
OffRL
42
27
0
07 May 2020
Sparse Graphical Memory for Robust Planning
Scott Emmons
Ajay Jain
Michael Laskin
Thanard Kurutach
Pieter Abbeel
Deepak Pathak
53
50
0
13 Mar 2020
Hallucinative Topological Memory for Zero-Shot Visual Planning
Kara Liu
Thanard Kurutach
Christine Tung
Pieter Abbeel
Aviv Tamar
55
48
0
27 Feb 2020
When to Trust Your Model: Model-Based Policy Optimization
Michael Janner
Justin Fu
Marvin Zhang
Sergey Levine
OffRL
55
939
0
19 Jun 2019
Search on the Replay Buffer: Bridging Planning and Reinforcement Learning
Benjamin Eysenbach
Ruslan Salakhutdinov
Sergey Levine
OffRL
51
289
0
12 Jun 2019
Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction
Aviral Kumar
Justin Fu
George Tucker
Sergey Levine
OffRL
OnRL
76
1,044
0
03 Jun 2019
Near-Optimal Representation Learning for Hierarchical Reinforcement Learning
Ofir Nachum
S. Gu
Honglak Lee
Sergey Levine
53
208
0
02 Oct 2018
Semi-parametric Topological Memory for Navigation
Nikolay Savinov
Alexey Dosovitskiy
V. Koltun
44
379
0
01 Mar 2018
Model-Based Value Estimation for Efficient Model-Free Reinforcement Learning
Vladimir Feinberg
Alvin Wan
Ion Stoica
Michael I. Jordan
Joseph E. Gonzalez
Sergey Levine
OffRL
50
317
0
28 Feb 2018
Stochastic Neural Networks for Hierarchical Reinforcement Learning
Carlos Florensa
Yan Duan
Pieter Abbeel
BDL
67
360
0
10 Apr 2017
Improved Training of Wasserstein GANs
Ishaan Gulrajani
Faruk Ahmed
Martín Arjovsky
Vincent Dumoulin
Aaron Courville
GAN
130
9,509
0
31 Mar 2017
Continuous control with deep reinforcement learning
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
183
13,174
0
09 Sep 2015