ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1709.04571
  4. Cited By
When Waiting is not an Option : Learning Options with a Deliberation
  Cost

When Waiting is not an Option : Learning Options with a Deliberation Cost

14 September 2017
J. Harb
Pierre-Luc Bacon
Martin Klissarov
Doina Precup
ArXivPDFHTML

Papers citing "When Waiting is not an Option : Learning Options with a Deliberation Cost"

36 / 36 papers shown
Title
Behaviour Discovery and Attribution for Explainable Reinforcement Learning
Rishav Rishav
Somjit Nath
Vincent Michalski
Samira Ebrahimi Kahou
FAtt
OffRL
73
0
0
19 Mar 2025
DIPPER: Direct Preference Optimization to Accelerate Primitive-Enabled Hierarchical Reinforcement Learning
DIPPER: Direct Preference Optimization to Accelerate Primitive-Enabled Hierarchical Reinforcement Learning
Utsav Singh
Souradip Chakraborty
Wesley A Suttle
Brian M. Sadler
Vinay P. Namboodiri
Amrit Singh Bedi
OffRL
53
0
0
03 Jan 2025
Subgoal Discovery Using a Free Energy Paradigm and State Aggregations
Subgoal Discovery Using a Free Energy Paradigm and State Aggregations
Amirhossein Mesbah
Reshad Hosseini
Seyed Pooya Shariatpanahi
M. N. Ahmadabadi
77
0
0
21 Dec 2024
LGR2: Language Guided Reward Relabeling for Accelerating Hierarchical Reinforcement Learning
LGR2: Language Guided Reward Relabeling for Accelerating Hierarchical Reinforcement Learning
Utsav Singh
Pramit Bhattacharyya
Vinay P. Namboodiri
LM&Ro
47
1
0
09 Jun 2024
Adaptive trajectory-constrained exploration strategy for deep
  reinforcement learning
Adaptive trajectory-constrained exploration strategy for deep reinforcement learning
Guojian Wang
Faguo Wu
Xiao Zhang
Ning Guo
Zhiming Zheng
41
3
0
27 Dec 2023
Multi-Agent Reinforcement Learning for Connected and Automated Vehicles
  Control: Recent Advancements and Future Prospects
Multi-Agent Reinforcement Learning for Connected and Automated Vehicles Control: Recent Advancements and Future Prospects
Min Hua
Dong Chen
Xinda Qi
Kun Jiang
Z. Liu
Quan Zhou
Hongming Xu
28
10
0
18 Dec 2023
CRISP: Curriculum inducing Primitive Informed Subgoal Prediction
CRISP: Curriculum inducing Primitive Informed Subgoal Prediction
Utsav Singh
Vinay P. Namboodiri
36
3
0
07 Apr 2023
Deep Laplacian-based Options for Temporally-Extended Exploration
Deep Laplacian-based Options for Temporally-Extended Exploration
Martin Klissarov
Marlos C. Machado
OffRL
26
19
0
26 Jan 2023
Dynamic Decision Frequency with Continuous Options
Dynamic Decision Frequency with Continuous Options
Amir-Hossein Karimi
Jun Jin
Jun Luo
A. R. Mahmood
Martin Jägersand
Samuele Tosatto
15
9
0
06 Dec 2022
Leveraging Sequentiality in Reinforcement Learning from a Single
  Demonstration
Leveraging Sequentiality in Reinforcement Learning from a Single Demonstration
Alexandre Chenu
Olivier Serris
Olivier Sigaud
Nicolas Perrin-Gilbert
28
4
0
09 Nov 2022
Humans decompose tasks by trading off utility and computational cost
Humans decompose tasks by trading off utility and computational cost
Carlos G. Correa
Mark K. Ho
Frederick Callaway
Nathaniel D. Daw
Thomas Griffiths
29
33
0
07 Nov 2022
Attention Option-Critic
Attention Option-Critic
Raviteja Chunduru
Doina Precup
22
8
0
07 Jan 2022
Flexible Option Learning
Flexible Option Learning
Martin Klissarov
Doina Precup
OffRL
41
26
0
06 Dec 2021
HAVEN: Hierarchical Cooperative Multi-Agent Reinforcement Learning with
  Dual Coordination Mechanism
HAVEN: Hierarchical Cooperative Multi-Agent Reinforcement Learning with Dual Coordination Mechanism
Zhiwei Xu
Yunpeng Bai
Bin Zhang
Dapeng Li
Guoliang Fan
22
23
0
14 Oct 2021
Temporally Abstract Partial Models
Temporally Abstract Partial Models
Khimya Khetarpal
Zafarali Ahmed
Gheorghe Comanici
Doina Precup
26
14
0
06 Aug 2021
TempoRL: Learning When to Act
TempoRL: Learning When to Act
André Biedenkapp
Raghunandan Rajan
Frank Hutter
Marius Lindauer
OffRL
21
27
0
09 Jun 2021
Discovery of Options via Meta-Learned Subgoals
Discovery of Options via Meta-Learned Subgoals
Vivek Veeriah
Tom Zahavy
Matteo Hessel
Zhongwen Xu
Junhyuk Oh
Iurii Kemaev
H. V. Hasselt
David Silver
Satinder Singh
29
33
0
12 Feb 2021
Learning Skills to Navigate without a Master: A Sequential Multi-Policy
  Reinforcement Learning Algorithm
Learning Skills to Navigate without a Master: A Sequential Multi-Policy Reinforcement Learning Algorithm
Ambedkar Dukkipati
Rajarshi Banerjee
Ranga Shaarad Ayyagari
Dhaval Parmar Udaybhai
21
6
0
30 Jan 2021
Relative Variational Intrinsic Control
Relative Variational Intrinsic Control
Kate Baumli
David Warde-Farley
Steven Hansen
Volodymyr Mnih
26
42
0
14 Dec 2020
Behavior Priors for Efficient Reinforcement Learning
Behavior Priors for Efficient Reinforcement Learning
Dhruva Tirumala
Alexandre Galashov
Hyeonwoo Noh
Leonard Hasenclever
Razvan Pascanu
...
Guillaume Desjardins
Wojciech M. Czarnecki
Arun Ahuja
Yee Whye Teh
N. Heess
37
39
0
27 Oct 2020
Early Anomaly Detection in Time Series: A Hierarchical Approach for
  Predicting Critical Health Episodes
Early Anomaly Detection in Time Series: A Hierarchical Approach for Predicting Critical Health Episodes
Vítor Cerqueira
Luís Torgo
Carlos Soares
AI4TS
11
8
0
22 Oct 2020
Data-efficient Hindsight Off-policy Option Learning
Data-efficient Hindsight Off-policy Option Learning
Markus Wulfmeier
Dushyant Rao
Roland Hafner
Thomas Lampe
A. Abdolmaleki
...
Michael Neunert
Dhruva Tirumala
Noah Y. Siegel
N. Heess
Martin Riedmiller
OffRL
31
47
0
30 Jul 2020
Resource-rational Task Decomposition to Minimize Planning Costs
Resource-rational Task Decomposition to Minimize Planning Costs
Carlos G. Correa
Mark K. Ho
Frederick Callaway
Thomas Griffiths
17
19
0
27 Jul 2020
Efficient Deep Reinforcement Learning via Adaptive Policy Transfer
Efficient Deep Reinforcement Learning via Adaptive Policy Transfer
Tianpei Yang
Jianye Hao
Zhaopeng Meng
Zongzhang Zhang
Yujing Hu
...
Changjie Fan
Weixun Wang
Wulong Liu
Zhaodong Wang
J. Peng
OffRL
22
12
0
19 Feb 2020
An Efficient Transfer Learning Framework for Multiagent Reinforcement
  Learning
An Efficient Transfer Learning Framework for Multiagent Reinforcement Learning
Tianpei Yang
Weixun Wang
Hongyao Tang
Jianye Hao
Zhaopeng Meng
...
Wulong Liu
Chen Zhang
Yujing Hu
Yingfeng Chen
Changjie Fan
26
22
0
19 Feb 2020
On the Role of Weight Sharing During Deep Option Learning
On the Role of Weight Sharing During Deep Option Learning
Matthew D Riemer
Ignacio Cases
Clemens Rosenbaum
Miao Liu
Gerald Tesauro
OffRL
11
18
0
31 Dec 2019
Compositional Transfer in Hierarchical Reinforcement Learning
Compositional Transfer in Hierarchical Reinforcement Learning
Markus Wulfmeier
A. Abdolmaleki
Roland Hafner
Jost Tobias Springenberg
Michael Neunert
Tim Hertweck
Thomas Lampe
Noah Y. Siegel
N. Heess
Martin Riedmiller
30
27
0
26 Jun 2019
DynoPlan: Combining Motion Planning and Deep Neural Network based
  Controllers for Safe HRL
DynoPlan: Combining Motion Planning and Deep Neural Network based Controllers for Safe HRL
Daniel Angelov
Yordan V. Hristov
S. Ramamoorthy
19
1
0
24 Jun 2019
Sub-policy Adaptation for Hierarchical Reinforcement Learning
Sub-policy Adaptation for Hierarchical Reinforcement Learning
Alexander C. Li
Carlos Florensa
I. Clavera
Pieter Abbeel
29
71
0
13 Jun 2019
Composing Task-Agnostic Policies with Deep Reinforcement Learning
Composing Task-Agnostic Policies with Deep Reinforcement Learning
A. H. Qureshi
Jacob J. Johnson
Yuzhe Qin
Taylor Henderson
Byron Boots
Michael C. Yip
OffRL
22
30
0
25 May 2019
DAC: The Double Actor-Critic Architecture for Learning Options
DAC: The Double Actor-Critic Architecture for Learning Options
Shangtong Zhang
Shimon Whiteson
30
72
0
29 Apr 2019
Discovering Options for Exploration by Minimizing Cover Time
Discovering Options for Exploration by Minimizing Cover Time
Yuu Jinnai
Jee Won Park
David Abel
George Konidaris
27
52
0
02 Mar 2019
The Termination Critic
The Termination Critic
Anna Harutyunyan
Will Dabney
Diana Borsa
N. Heess
Rémi Munos
Doina Precup
OffRL
24
48
0
26 Feb 2019
Finding Options that Minimize Planning Time
Finding Options that Minimize Planning Time
Yuu Jinnai
David Abel
D Ellis Hershkowitz
Michael Littman
George Konidaris
11
41
0
16 Oct 2018
Context-Aware Policy Reuse
Context-Aware Policy Reuse
Siyuan Li
Fangda Gu
Guangxiang Zhu
Chongjie Zhang
OffRL
30
36
0
11 Jun 2018
Data-Efficient Hierarchical Reinforcement Learning
Data-Efficient Hierarchical Reinforcement Learning
Ofir Nachum
S. Gu
Honglak Lee
Sergey Levine
OffRL
68
797
0
21 May 2018
1