Strategic Attentive Writer for Learning Macro-Actions

Strategic Attentive Writer for Learning Macro-Actions

15 June 2016

Koray Kavukcuoglu

ArXiv (abs)PDF HTML

Papers citing "Strategic Attentive Writer for Learning Macro-Actions"

15 / 15 papers shown

Title
OptionZero: Planning with Learned Options Po-Wei Huang Pei-Chiun Peng Hung Guei Ti-Rong Wu 102 1 0 23 Feb 2025
Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration Max Wilcoxson Qiyang Li Kevin Frans Sergey Levine SSL OffRL OnRL 172 0 0 23 Oct 2024
MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions Yekun Chai Haoran Sun Huang Fang Shuohuan Wang Yu Sun Hua Wu 459 4 0 03 Oct 2024
Learning to Factor Policies and Action-Value Functions: Factored Action Space Representations for Deep Reinforcement learning Sahil Sharma A. Suresh Rahul Ramesh Balaraman Ravindran OffRL 49 36 0 20 May 2017
The Option-Critic Architecture Pierre-Luc Bacon J. Harb Doina Precup OffRL 71 1,089 0 16 Sep 2016
A Deep Hierarchical Approach to Lifelong Learning in Minecraft Chen Tessler Shahar Givony Tom Zahavy D. Mankowitz Shie Mannor CLL 137 381 0 25 Apr 2016
Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation Tejas D. Kulkarni Karthik Narasimhan A. Saeedi J. Tenenbaum 74 1,137 0 20 Apr 2016
Asynchronous Methods for Deep Reinforcement Learning Volodymyr Mnih Adria Puigdomenech Badia M. Berk Mirza Alex Graves Timothy Lillicrap Tim Harley David Silver Koray Kavukcuoglu 207 8,881 0 04 Feb 2016
End-to-End Training of Deep Visuomotor Policies Sergey Levine Chelsea Finn Trevor Darrell Pieter Abbeel BDL 315 3,444 0 02 Apr 2015
Trust Region Policy Optimization John Schulman Sergey Levine Philipp Moritz Michael I. Jordan Pieter Abbeel 279 6,801 0 19 Feb 2015
DRAW: A Recurrent Neural Network For Image Generation Karol Gregor Ivo Danihelka Alex Graves Danilo Jimenez Rezende Daan Wierstra GAN DRL 178 1,962 0 16 Feb 2015
Auto-Encoding Variational Bayes Diederik P. Kingma Max Welling BDL 455 16,923 0 20 Dec 2013
Estimating or Propagating Gradients Through Stochastic Neurons for Conditional Computation Yoshua Bengio Nicholas Léonard Aaron Courville 396 3,157 0 15 Aug 2013
Generating Sequences With Recurrent Neural Networks Alex Graves GAN 167 4,039 0 04 Aug 2013
The Arcade Learning Environment: An Evaluation Platform for General Agents Marc G. Bellemare Yavar Naddaf J. Veness Michael Bowling 120 3,021 0 19 Jul 2012