Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation

20 April 2016
Tejas D. Kulkarni
Karthik Narasimhan
A. Saeedi
J. Tenenbaum

Papers citing "Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation"

50 / 493 papers shown
Title
Goal-conditioned Hierarchical Reinforcement Learning for Sample-efficient and Safe Autonomous Driving at Intersections
Yiou Huang
21
0
0
19 Jun 2025
Decoupled Hierarchical Reinforcement Learning with State Abstraction for Discrete Grids
Qingyu Xiao
Yuanlin Chang
Youtian Du
25
0
0
01 Jun 2025
Hierarchical Reinforcement Learning with Uncertainty-Guided Diffusional Subgoals
V. Wang
Tinghuai Wang
Joni Pajarinen
BDL
32
0
0
27 May 2025
Dual-Agent Reinforcement Learning for Automated Feature Generation
Wanfu Gao
Zengyao Man
Hanlin Pan
Kunpeng Liu
84
0
0
19 May 2025
Option-aware Temporally Abstracted Value for Offline Goal-Conditioned Reinforcement Learning
Hongjoon Ahn
Heewoong Choi
Jisu Han
Taesup Moon
OffRL
106
0
0
19 May 2025
Decentralized Traffic Flow Optimization Through Intrinsic Motivation
Himaja Papala
Daniel Polani
Stas Tiomkin
91
0
0
08 May 2025
Optimization of Infectious Disease Intervention Measures Based on Reinforcement Learning - Empirical analysis based on UK COVID-19 epidemic data
Baida Zhang
Yakai Chen
Huichun Li
Zhenghu Zu
61
0
0
07 May 2025
KEA: Keeping Exploration Alive by Proactively Coordinating Exploration Strategies
Shih-Min Yang
Martin Magnusson
J. A. Stork
Todor Stoyanov
75
0
0
23 Mar 2025
Temporal Representation Alignment: Successor Features Enable Emergent Compositionality in Robot Instruction Following
Vivek Myers
Bill Chunyuan Zheng
Anca Dragan
Kuan Fang
Sergey Levine
200
1
0
08 Feb 2025
MuST: Multi-Head Skill Transformer for Long-Horizon Dexterous Manipulation with Skill Progress
Kai Gao
Fan Wang
Erica Aduh
Dylan Randle
Jane Shi
152
0
0
04 Feb 2025
Extensive Exploration in Complex Traffic Scenarios using Hierarchical Reinforcement Learning
Zhihao Zhang
Ekim Yurtsever
Keith A. Redmill
86
0
0
28 Jan 2025
Proposing Hierarchical Goal-Conditioned Policy Planning in Multi-Goal Reinforcement Learning
Gavin B. Rens
84
0
0
03 Jan 2025
RFPPO: Motion Dynamic RRT based Fluid Field - PPO for Dynamic TF/TA Routing Planning
Rongkun Xue
Jing Yang
Yuyang Jiang
Yiming Feng
Zi Yang
101
0
0
31 Dec 2024
SPIRE: Synergistic Planning, Imitation, and Reinforcement Learning for Long-Horizon Manipulation
Zihan Zhou
Animesh Garg
Dieter Fox
Caelan Reed Garrett
Ajay Mandlekar
95
4
0
23 Oct 2024
Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration
Max Wilcoxson
Qiyang Li
Kevin Frans
Sergey Levine
SSL, OffRL, OnRL
189
0
0
23 Oct 2024
Unveiling Options with Neural Decomposition
Mahdi Alikhasi
Levi H. S. Lelis
89
0
0
15 Oct 2024
Choices are More Important than Efforts: LLM Enables Efficient Multi-Agent Exploration
Yun Qu
Boyuan Wang
Yuhang Jiang
Jianzhun Shao
Yixiu Mao
Cheems Wang
Chang Liu
Xiangyang Ji
131
5
0
03 Oct 2024
Simplified priors for Object-Centric Learning
Vihang Patil
Andreas Radler
Daniel Klotz
Sepp Hochreiter
OCL
86
0
0
01 Oct 2024
Hierarchical Reinforcement Learning for Temporal Abstraction of Listwise Recommendation
Luo Ji
Gao Liu
Mingyang Yin
Hongxia Yang
Jingren Zhou
53
0
0
11 Sep 2024
Coordinating Planning and Tracking in Layered Control Policies via Actor-Critic Learning
Fengjun Yang
Nikolai Matni
OffRL
72
0
0
03 Aug 2024
How to Choose a Reinforcement-Learning Algorithm
Fabian Bongratz
Vladimir Golkov
Lukas Mautner
Luca Della Libera
Frederik Heetmeyer
Felix Czaja
Julian Rodemann
Daniel Cremers
71
1
0
30 Jul 2024
SOAP-RL: Sequential Option Advantage Propagation for Reinforcement Learning in POMDP Environments
Shu Ishida
João F. Henriques
100
0
0
26 Jul 2024
MaxMI: A Maximal Mutual Information Criterion for Manipulation Concept Discovery
Pei Zhou
Yanchao Yang
77
1
0
21 Jul 2024
Value Internalization: Learning and Generalizing from Social Reward
Frieda Rong
Max Kleiman-Weiner
68
1
0
19 Jul 2024
Bidirectional-Reachable Hierarchical Reinforcement Learning with Mutually Responsive Policies
Yu-Juan Luo
Fuchun Sun
Tianying Ji
Xianyuan Zhan
59
0
0
26 Jun 2024
Probabilistic Subgoal Representations for Hierarchical Reinforcement learning
V. Wang
Tinghuai Wang
Wenyan Yang
Joni-Kristian Kämäräinen
Joni Pajarinen
BDL
57
4
0
24 Jun 2024
Act Better by Timing: A timing-Aware Reinforcement Learning for Autonomous Driving
Guanzhou Li
Jianping Wu
Yujing He
61
0
0
19 Jun 2024
EdgeTimer: Adaptive Multi-Timescale Scheduling in Mobile Edge Computing with Deep Reinforcement Learning
Yijun Hao
Shusen Yang
Fang Li
Yifan Zhang
Shibo Wang
Xuebin Ren
44
2
0
11 Jun 2024
LAGMA: LAtent Goal-guided Multi-Agent Reinforcement Learning
Hyungho Na
IL-Chul Moon
67
1
0
30 May 2024
SF-DQN: Provable Knowledge Transfer using Successor Feature for Deep Reinforcement Learning
Shuai Zhang
Heshan Devaka Fernando
Miao Liu
K. Murugesan
Songtao Lu
Pin-Yu Chen
Tianyi Chen
Meng Wang
75
2
0
24 May 2024
Spatio-temporal Value Semantics-based Abstraction for Dense Deep Reinforcement Learning
Jihui Nie
Dehui Du
Jiangnan Zhao
AI4CE
51
0
0
24 May 2024
Reinforcement learning
Florentin Wörgötter
94
2,526
0
16 May 2024
On Policy Reuse: An Expressive Language for Representing and Executing General Policies that Call Other Policies
Blai Bonet
Dominik Drexler
Hector Geffner
36
2
0
25 Mar 2024
M-HOF-Opt: Multi-Objective Hierarchical Output Feedback Optimization via Multiplier Induced Loss Landscape Scheduling
Xudong Sun
Nutan Chen
Alexej Gossmann
Yu Xing
Carla Feistner
...
Felix Drost
Daniele Scarcella
Lisa Beer
Carsten Marr
104
1
0
20 Mar 2024
Reinforcement Learning with Options and State Representation
Ayoub Ghriss
Masashi Sugiyama
A. Lazaric
21
0
0
16 Mar 2024
ALaRM: Align Language Models via Hierarchical Rewards Modeling
Yuhang Lai
Siyuan Wang
Shujun Liu
Xuanjing Huang
Zhongyu Wei
89
5
0
11 Mar 2024
Reinforcement Learning Based Oscillation Dampening: Scaling up Single-Agent RL algorithms to a 100 AV highway field operational test
Kathy Jang
Nathan Lichtlé
Eugene Vinitsky
Adit Shah
Matt Bunting
...
Dan Work
M. D. Monache
Jonathan Sprinkle
Jonathan W. Lee
Alexandre M. Bayen
65
8
0
26 Feb 2024
Empowering Large Language Model Agents through Action Learning
Haiteng Zhao
Chang Ma
Guoyin Wang
Jing Su
Lingpeng Kong
Jingjing Xu
Zhi-Hong Deng
Hongxia Yang
LM&Ro, LLMAG
92
12
0
24 Feb 2024
Hierarchical State Space Models for Continuous Sequence-to-Sequence Modeling
Raunaq M. Bhirangi
Chenyu Wang
Venkatesh Pattabiraman
Carmel Majidi
Abhinav Gupta
Tess Hellebrekers
Lerrel Pinto
88
13
0
15 Feb 2024
Informativeness of Reward Functions in Reinforcement Learning
R. Devidze
Parameswaran Kamalaruban
Adish Singla
72
2
0
10 Feb 2024
TopoNav: Topological Navigation for Efficient Exploration in Sparse Reward Environments
Jumman Hossain
A. Faridee
Nirmalya Roy
Jade Freeman
Timothy Gregory
Theron T. Trout
65
3
0
06 Feb 2024
Reinforcement Learning from Bagged Reward
Yuting Tang
Xin-Qiang Cai
Yao-Xiang Ding
Qiyu Wu
Guoqing Liu
Masashi Sugiyama
OffRL
79
0
0
06 Feb 2024
Sample Efficient Reinforcement Learning by Automatically Learning to Compose Subtasks
Shuai Han
Mehdi Dastani
Shihan Wang
OffRL
108
1
0
25 Jan 2024
Reconciling Spatial and Temporal Abstractions for Goal Representation
Mehdi Zadem
Sergio Mover
Sao Mai Nguyen
41
5
0
18 Jan 2024
NovelGym: A Flexible Ecosystem for Hybrid Planning and Learning Agents Designed for Open Worlds
Shivam Goel
Yichen Wei
Panagiotis Lymperopoulos
Matthias Scheutz
Jivko Sinapov
GNN
70
3
0
07 Jan 2024
Pangu-Agent: A Fine-Tunable Generalist Agent with Structured Reasoning
Filippos Christianos
Georgios Papoudakis
Matthieu Zimmer
Thomas Coste
Zhihao Wu
...
Yicheng Luo
Jianye Hao
Kun Shao
Haitham Bou-Ammar
Jun Wang
81
20
0
22 Dec 2023
Counting Reward Automata: Sample Efficient Reinforcement Learning Through the Exploitation of Reward Function Structure
Tristan Bester
Benjamin Rosman
Steven D. James
Geraud Nangue Tasse
61
1
0
18 Dec 2023
Auto MC-Reward: Automated Dense Reward Design with Large Language Models for Minecraft
Hao Li
Xue Yang
Zhaokai Wang
Xizhou Zhu
Jie Zhou
Yu Qiao
Xiaogang Wang
Hongsheng Li
Lewei Lu
Jifeng Dai
97
36
0
14 Dec 2023
A Multifidelity Sim-to-Real Pipeline for Verifiable and Compositional Reinforcement Learning
Cyrus Neary
Christian Ellis
Aryaman Singh Samyal
Craig T. Lennon
Ufuk Topcu
OffRL
437
0
0
02 Dec 2023
Hierarchical Reinforcement Learning for Power Network Topology Control
Blazej Manczak
Jan Viebahn
H. V. Hoof
58
7
0
03 Nov 2023