Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2009.04875
Cited By
v1
v2 (latest)
Importance Weighted Policy Learning and Adaptation
10 September 2020
Alexandre Galashov
Jakub Sygnowski
Guillaume Desjardins
Jan Humplik
Leonard Hasenclever
Rae Jeong
Yee Whye Teh
N. Heess
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Importance Weighted Policy Learning and Adaptation"
18 / 18 papers shown
Title
Q-Learning in enormous action spaces via amortized approximate maximization
T. Wiele
David Warde-Farley
A. Mnih
Volodymyr Mnih
55
60
0
22 Jan 2020
VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning
L. Zintgraf
K. Shiarlis
Maximilian Igl
Sebastian Schulze
Y. Gal
Katja Hofmann
Shimon Whiteson
OffRL
63
278
0
18 Oct 2019
Rapid Learning or Feature Reuse? Towards Understanding the Effectiveness of MAML
Aniruddh Raghu
M. Raghu
Samy Bengio
Oriol Vinyals
303
646
0
19 Sep 2019
Meta reinforcement learning as task inference
Jan Humplik
Alexandre Galashov
Leonard Hasenclever
Pedro A. Ortega
Yee Whye Teh
N. Heess
OffRL
112
128
0
15 May 2019
Information asymmetry in KL-regularized RL
Alexandre Galashov
Siddhant M. Jayakumar
Leonard Hasenclever
Dhruva Tirumala
Jonathan Richard Schwarz
Guillaume Desjardins
Wojciech M. Czarnecki
Yee Whye Teh
Razvan Pascanu
N. Heess
OffRL
59
103
0
03 May 2019
Challenges of Real-World Reinforcement Learning
Gabriel Dulac-Arnold
D. Mankowitz
Todd Hester
OffRL
82
549
0
29 Apr 2019
Exploiting Hierarchy for Learning and Transfer in KL-regularized RL
Dhruva Tirumala
Hyeonwoo Noh
Alexandre Galashov
Leonard Hasenclever
Arun Ahuja
Greg Wayne
Razvan Pascanu
Yee Whye Teh
N. Heess
OffRL
55
44
0
18 Mar 2019
Probabilistic Model-Agnostic Meta-Learning
Chelsea Finn
Kelvin Xu
Sergey Levine
BDL
271
672
0
07 Jun 2018
Learning by Playing - Solving Sparse Reward Tasks from Scratch
Martin Riedmiller
Roland Hafner
Thomas Lampe
Michael Neunert
Jonas Degrave
T. Wiele
Volodymyr Mnih
N. Heess
Jost Tobias Springenberg
87
448
0
28 Feb 2018
Meta-Reinforcement Learning of Structured Exploration Strategies
Abhishek Gupta
Russell Mendonca
YuXuan Liu
Pieter Abbeel
Sergey Levine
OffRL
110
347
0
20 Feb 2018
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures
L. Espeholt
Hubert Soyer
Rémi Munos
Karen Simonyan
Volodymyr Mnih
...
Vlad Firoiu
Tim Harley
Iain Dunning
Shane Legg
Koray Kavukcuoglu
222
1,605
0
05 Feb 2018
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
314
8,396
0
04 Jan 2018
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Chelsea Finn
Pieter Abbeel
Sergey Levine
OOD
825
11,937
0
09 Mar 2017
Reinforcement Learning with Deep Energy-Based Policies
Tuomas Haarnoja
Haoran Tang
Pieter Abbeel
Sergey Levine
110
1,342
0
27 Feb 2017
Learning to reinforcement learn
Jane X. Wang
Z. Kurth-Nelson
Dhruva Tirumala
Hubert Soyer
Joel Z Leibo
Rémi Munos
Charles Blundell
D. Kumaran
M. Botvinick
OffRL
97
982
0
17 Nov 2016
RL
2
^2
2
: Fast Reinforcement Learning via Slow Reinforcement Learning
Yan Duan
John Schulman
Xi Chen
Peter L. Bartlett
Ilya Sutskever
Pieter Abbeel
OffRL
99
1,027
0
09 Nov 2016
Safe and Efficient Off-Policy Reinforcement Learning
Rémi Munos
T. Stepleton
Anna Harutyunyan
Marc G. Bellemare
OffRL
138
617
0
08 Jun 2016
Playing Atari with Deep Reinforcement Learning
Volodymyr Mnih
Koray Kavukcuoglu
David Silver
Alex Graves
Ioannis Antonoglou
Daan Wierstra
Martin Riedmiller
127
12,261
0
19 Dec 2013
1