Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1611.02779
Cited By
RL
2
^2
2
: Fast Reinforcement Learning via Slow Reinforcement Learning
9 November 2016
Yan Duan
John Schulman
Xi Chen
Peter L. Bartlett
Ilya Sutskever
Pieter Abbeel
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"RL$^2$: Fast Reinforcement Learning via Slow Reinforcement Learning"
46 / 46 papers shown
Title
Fast and Robust: Task Sampling with Posterior and Diversity Synergies for Adaptive Decision-Makers in Randomized Environments
Yun Qu
Wenjie Wang
Yixiu Mao
Yiqin Lv
Xiangyang Ji
TTA
136
0
0
27 Apr 2025
Embodied World Models Emerge from Navigational Task in Open-Ended Environments
Li Jin
Liu Jia
48
0
0
15 Apr 2025
MoE-Loco: Mixture of Experts for Multitask Locomotion
Runhan Huang
Shaoting Zhu
Yilun Du
Hang Zhao
MoE
85
1
0
11 Mar 2025
Learning Policy Committees for Effective Personalization in MDPs with Diverse Tasks
Luise Ge
Michael Lanier
Anindya Sarkar
Bengisu Guresti
Yevgeniy Vorobeychik
Chongjie Zhang
178
0
0
26 Feb 2025
Task Aware Dreamer for Task Generalization in Reinforcement Learning
Chengyang Ying
Zhongkai Hao
Xinning Zhou
Hang Su
Songming Liu
Dong Yan
Jun Zhu
205
3
0
17 Feb 2025
Coreset-Based Task Selection for Sample-Efficient Meta-Reinforcement Learning
Donglin Zhan
Leonardo F. Toso
James Anderson
172
3
0
04 Feb 2025
Toward Task Generalization via Memory Augmentation in Meta-Reinforcement Learning
Kaixi Bao
Chenhao Li
Yarden As
Andreas Krause
Marco Hutter
OffRL
CLL
189
1
0
03 Feb 2025
Reinforcement Teaching
Alex Lewandowski
Calarina Muslimani
Dale Schuurmans
Matthew E. Taylor
Jun Luo
173
2
0
28 Jan 2025
SigmaRL: A Sample-Efficient and Generalizable Multi-Agent Reinforcement Learning Framework for Motion Planning
Jianye Xu
Pan Hu
Bassam Alrifaee
81
5
0
14 Aug 2024
Pretraining Decision Transformers with Reward Prediction for In-Context Multi-task Structured Bandit Learning
Subhojyoti Mukherjee
Josiah P. Hanna
Qiaomin Xie
Robert Nowak
201
2
0
07 Jun 2024
Zero-Shot Reinforcement Learning via Function Encoders
Tyler Ingebrand
Amy Zhang
Ufuk Topcu
OffRL
70
3
0
30 Jan 2024
DittoGym: Learning to Control Soft Shape-Shifting Robots
Suning Huang
Boyuan Chen
Huazhe Xu
Vincent Sitzmann
67
3
0
24 Jan 2024
Foundational Policy Acquisition via Multitask Learning for Motor Skill Generation
Satoshi Yamamori
Jun Morimoto
63
0
0
31 Aug 2023
Deep Model-Based Reinforcement Learning for High-Dimensional Problems, a Survey
Aske Plaat
W. Kosters
Mike Preuss
BDL
OffRL
84
17
0
11 Aug 2020
Agent Modelling under Partial Observability for Deep Reinforcement Learning
Georgios Papoudakis
Filippos Christianos
Stefano V. Albrecht
58
62
0
16 Jun 2020
Meta-Reinforcement Learning Robust to Distributional Shift via Model Identification and Experience Relabeling
Russell Mendonca
Xinyang Geng
Chelsea Finn
Sergey Levine
OOD
OffRL
78
40
0
12 Jun 2020
Learning to reinforcement learn for Neural Architecture Search
J. Gomez
Joaquin Vanschoren
57
8
0
09 Nov 2019
Sim-to-Real Robot Learning from Pixels with Progressive Nets
Andrei A. Rusu
Matej Vecerík
Thomas Rothörl
N. Heess
Razvan Pascanu
R. Hadsell
74
533
0
13 Oct 2016
Learning Modular Neural Network Policies for Multi-Task and Multi-Robot Transfer
Coline Devin
Abhishek Gupta
Trevor Darrell
Pieter Abbeel
Sergey Levine
OffRL
82
397
0
22 Sep 2016
Why is Posterior Sampling Better than Optimism for Reinforcement Learning?
Ian Osband
Benjamin Van Roy
BDL
76
260
0
01 Jul 2016
Progressive Neural Networks
Andrei A. Rusu
Neil C. Rabinowitz
Guillaume Desjardins
Hubert Soyer
J. Kirkpatrick
Koray Kavukcuoglu
Razvan Pascanu
R. Hadsell
CLL
AI4CE
77
2,446
0
15 Jun 2016
Learning to learn by gradient descent by gradient descent
Marcin Andrychowicz
Misha Denil
Sergio Gomez Colmenarejo
Matthew W. Hoffman
David Pfau
Tom Schaul
Brendan Shillingford
Nando de Freitas
108
2,006
0
14 Jun 2016
Matching Networks for One Shot Learning
Oriol Vinyals
Charles Blundell
Timothy Lillicrap
Koray Kavukcuoglu
Daan Wierstra
VLM
365
7,321
0
13 Jun 2016
Learning to Optimize
Ke Li
Jitendra Malik
58
256
0
06 Jun 2016
Predicting Personal Traits from Facial Images using Convolutional Neural Networks Augmented with Facial Landmark Information
Yoad Lewenberg
Valliappa Chockalingam
Satinder Singh
Honglak Lee
CVBM
57
304
0
29 May 2016
One-shot Learning with Memory-Augmented Neural Networks
Adam Santoro
Sergey Bartunov
M. Botvinick
Daan Wierstra
Timothy Lillicrap
77
525
0
19 May 2016
ViZDoom: A Doom-based AI Research Platform for Visual Reinforcement Learning
Michal Kempka
Marek Wydmuch
Grzegorz Runc
Jakub Toczek
Wojciech Ja'skowski
60
697
0
06 May 2016
Benchmarking Deep Reinforcement Learning for Continuous Control
Yan Duan
Xi Chen
Rein Houthooft
John Schulman
Pieter Abbeel
OffRL
79
1,693
0
22 Apr 2016
TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems
Martín Abadi
Ashish Agarwal
P. Barham
E. Brevdo
Zhiwen Chen
...
Pete Warden
Martin Wattenberg
Martin Wicke
Yuan Yu
Xiaoqiang Zheng
269
11,152
0
14 Mar 2016
Weight Normalization: A Simple Reparameterization to Accelerate Training of Deep Neural Networks
Tim Salimans
Diederik P. Kingma
ODL
192
1,941
0
25 Feb 2016
Asynchronous Methods for Deep Reinforcement Learning
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
197
8,851
0
04 Feb 2016
Actor-Mimic: Deep Multitask and Transfer Reinforcement Learning
Emilio Parisotto
Jimmy Lei Ba
Ruslan Salakhutdinov
OffRL
86
599
0
19 Nov 2015
Policy Distillation
Andrei A. Rusu
Sergio Gomez Colmenarejo
Çağlar Gülçehre
Guillaume Desjardins
J. Kirkpatrick
Razvan Pascanu
Volodymyr Mnih
Koray Kavukcuoglu
R. Hadsell
81
692
0
19 Nov 2015
Learning Continuous Control Policies by Stochastic Value Gradients
N. Heess
Greg Wayne
David Silver
Timothy Lillicrap
Yuval Tassa
Tom Erez
97
560
0
30 Oct 2015
One-Shot Learning of Manipulation Skills with Online Dynamics Adaptation and Neural Network Priors
Justin Fu
Sergey Levine
Pieter Abbeel
OffRL
59
160
0
23 Sep 2015
Continuous control with deep reinforcement learning
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
318
13,237
0
09 Sep 2015
Embed to Control: A Locally Linear Latent Dynamics Model for Control from Raw Images
Manuel Watter
Jost Tobias Springenberg
Joschka Boedecker
Martin Riedmiller
BDL
68
845
0
24 Jun 2015
High-Dimensional Continuous Control Using Generalized Advantage Estimation
John Schulman
Philipp Moritz
Sergey Levine
Michael I. Jordan
Pieter Abbeel
OffRL
90
3,406
0
08 Jun 2015
End-to-End Training of Deep Visuomotor Policies
Sergey Levine
Chelsea Finn
Trevor Darrell
Pieter Abbeel
BDL
311
3,434
0
02 Apr 2015
Trust Region Policy Optimization
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
277
6,767
0
19 Feb 2015
From Pixels to Torques: Policy Learning with Deep Dynamical Models
Niklas Wahlström
Thomas B. Schon
M. Deisenroth
72
189
0
08 Feb 2015
Example Selection For Dictionary Learning
Tomoki Tsuchida
G. Cottrell
66
47
0
18 Dec 2014
Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling
Junyoung Chung
Çağlar Gülçehre
Kyunghyun Cho
Yoshua Bengio
581
12,704
0
11 Dec 2014
On the Properties of Neural Machine Translation: Encoder-Decoder Approaches
Kyunghyun Cho
B. V. Merrienboer
Dzmitry Bahdanau
Yoshua Bengio
AI4CE
AIMat
244
6,776
0
03 Sep 2014
Exact solutions to the nonlinear dynamics of learning in deep linear neural networks
Andrew M. Saxe
James L. McClelland
Surya Ganguli
ODL
165
1,844
0
20 Dec 2013
(More) Efficient Reinforcement Learning via Posterior Sampling
Ian Osband
Daniel Russo
Benjamin Van Roy
116
533
0
04 Jun 2013
1