1606.04695
Cited By
Strategic Attentive Writer for Learning Macro-Actions
15 June 2016
Alexander
A. Vezhnevets
Volodymyr Mnih
J. Agapiou
Simon Osindero
Alex Graves
Oriol Vinyals
Koray Kavukcuoglu
Papers citing
"Strategic Attentive Writer for Learning Macro-Actions"
15 / 15 papers shown
OptionZero: Planning with Learned Options
Po-Wei Huang
Pei-Chiun Peng
Hung Guei
Ti-Rong Wu
102
1
0
23 Feb 2025
Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration
Max Wilcoxson
Qiyang Li
Kevin Frans
Sergey Levine
SSL
OffRL
OnRL
172
0
0
23 Oct 2024
MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions
Yekun Chai
Haoran Sun
Huang Fang
Shuohuan Wang
Yu Sun
Hua Wu
459
4
0
03 Oct 2024
Learning to Factor Policies and Action-Value Functions: Factored Action Space Representations for Deep Reinforcement learning
Sahil Sharma
A. Suresh
Rahul Ramesh
Balaraman Ravindran
OffRL
49
36
0
20 May 2017
The Option-Critic Architecture
Pierre-Luc Bacon
J. Harb
Doina Precup
OffRL
71
1,089
0
16 Sep 2016
A Deep Hierarchical Approach to Lifelong Learning in Minecraft
Chen Tessler
Shahar Givony
Tom Zahavy
D. Mankowitz
Shie Mannor
CLL
137
381
0
25 Apr 2016
Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation
Tejas D. Kulkarni
Karthik Narasimhan
A. Saeedi
J. Tenenbaum
74
1,137
0
20 Apr 2016
Asynchronous Methods for Deep Reinforcement Learning
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
207
8,881
0
04 Feb 2016
End-to-End Training of Deep Visuomotor Policies
Sergey Levine
Chelsea Finn
Trevor Darrell
Pieter Abbeel
BDL
315
3,444
0
02 Apr 2015
Trust Region Policy Optimization
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
279
6,801
0
19 Feb 2015
DRAW: A Recurrent Neural Network For Image Generation
Karol Gregor
Ivo Danihelka
Alex Graves
Danilo Jimenez Rezende
Daan Wierstra
GAN
DRL
178
1,962
0
16 Feb 2015
Auto-Encoding Variational Bayes
Diederik P. Kingma
Max Welling
BDL
455
16,923
0
20 Dec 2013
Estimating or Propagating Gradients Through Stochastic Neurons for Conditional Computation
Yoshua Bengio
Nicholas Léonard
Aaron Courville
396
3,157
0
15 Aug 2013
Generating Sequences With Recurrent Neural Networks
Alex Graves
GAN
167
4,039
0
04 Aug 2013
The Arcade Learning Environment: An Evaluation Platform for General Agents
Marc G. Bellemare
Yavar Naddaf
J. Veness
Michael Bowling
120
3,021
0
19 Jul 2012