Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2008.02790
Cited By
Decoupling Exploration and Exploitation for Meta-Reinforcement Learning without Sacrifices
6 August 2020
Emmy Liu
Aditi Raghunathan
Percy Liang
Chelsea Finn
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Decoupling Exploration and Exploitation for Meta-Reinforcement Learning without Sacrifices"
40 / 40 papers shown
Title
Towards a Reward-Free Reinforcement Learning Framework for Vehicle Control
Jielong Yang
Daoyuan Huang
57
0
0
21 Feb 2025
Think Smarter not Harder: Adaptive Reasoning with Inference Aware Optimization
Zishun Yu
Tengyu Xu
Di Jin
Karthik Abinav Sankararaman
Yun He
...
Eryk Helenowski
Chen Zhu
Sinong Wang
Hao Ma
Han Fang
LRM
101
7
0
29 Jan 2025
Pretraining Decision Transformers with Reward Prediction for In-Context Multi-task Structured Bandit Learning
Subhojyoti Mukherjee
Josiah P. Hanna
Qiaomin Xie
Robert Nowak
124
2
0
07 Jun 2024
Offline Meta Learning of Exploration
Ron Dorfman
Idan Shenfeld
Aviv Tamar
OffRL
13
20
0
06 Aug 2020
Learning Abstract Models for Strategic Exploration and Fast Reward Transfer
Emmy Liu
Ramtin Keramati
Sudarshan Seshadri
Kelvin Guu
Panupong Pasupat
Emma Brunskill
Percy Liang
OffRL
32
5
0
12 Jul 2020
An Imitation Learning Approach for Cache Replacement
Emmy Liu
Milad Hashemi
Kevin Swersky
Parthasarathy Ranganathan
Junwhan Ahn
11
87
0
29 Jun 2020
Meta-Model-Based Meta-Policy Optimization
Takuya Hiraoka
Takahisa Imagawa
Voot Tangkaratt
Takayuki Osa
Takashi Onishi
Yoshimasa Tsuruoka
OffRL
22
8
0
04 Jun 2020
Learning Adaptive Exploration Strategies in Dynamic Environments Through Informed Policy Regularization
Pierre-Alexandre Kamienny
Matteo Pirotta
A. Lazaric
Thibault Lavril
Nicolas Usunier
Ludovic Denoyer
60
18
0
06 May 2020
MAME : Model-Agnostic Meta-Exploration
Swaminathan Gurumurthy
Sumit Kumar
Katia Sycara
30
15
0
11 Nov 2019
Meta-World: A Benchmark and Evaluation for Multi-Task and Meta Reinforcement Learning
Tianhe Yu
Deirdre Quillen
Zhanpeng He
Ryan Julian
Avnish Narayan
Hayden Shively
Adithya Bellathur
Karol Hausman
Chelsea Finn
Sergey Levine
OffRL
174
1,145
0
24 Oct 2019
VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning
L. Zintgraf
K. Shiarlis
Maximilian Igl
Sebastian Schulze
Y. Gal
Katja Hofmann
Shimon Whiteson
OffRL
41
273
0
18 Oct 2019
Meta-Q-Learning
Rasool Fakoor
Pratik Chaudhari
Stefano Soatto
Alex Smola
OffRL
51
144
0
30 Sep 2019
Environment Probing Interaction Policies
Wenxuan Zhou
Lerrel Pinto
Abhinav Gupta
38
67
0
26 Jul 2019
A Unified Bellman Optimality Principle Combining Reward Maximization and Empowerment
Felix Leibfried
Sergio Pascual-Diaz
Jordi Grau-Moya
76
28
0
26 Jul 2019
Watch, Try, Learn: Meta-Learning from Demonstrations and Reward
Allan Zhou
Eric Jang
Daniel Kappler
Alexander Herzog
Mohi Khansari
Paul Wohlhart
Yunfei Bai
Mrinal Kalakrishnan
Sergey Levine
Chelsea Finn
52
49
0
07 Jun 2019
Meta reinforcement learning as task inference
Jan Humplik
Alexandre Galashov
Leonard Hasenclever
Pedro A. Ortega
Yee Whye Teh
N. Heess
OffRL
75
126
0
15 May 2019
Guided Meta-Policy Search
Russell Mendonca
Abhishek Gupta
Rosen Kralev
Pieter Abbeel
Sergey Levine
Chelsea Finn
23
57
0
01 Apr 2019
Efficient Off-Policy Meta-Reinforcement Learning via Probabilistic Context Variables
Kate Rakelly
Aurick Zhou
Deirdre Quillen
Chelsea Finn
Sergey Levine
OffRL
64
652
0
19 Mar 2019
NoRML: No-Reward Meta Learning
Yuxiang Yang
Ken Caluwaerts
Atil Iscen
Jie Tan
Chelsea Finn
26
26
0
04 Mar 2019
Learning to Generalize from Sparse and Underspecified Rewards
Rishabh Agarwal
Chen Liang
Dale Schuurmans
Mohammad Norouzi
OffRL
65
96
0
19 Feb 2019
Unsupervised Control Through Non-Parametric Discriminative Rewards
David Warde-Farley
T. Wiele
Tejas D. Kulkarni
Catalin Ionescu
Steven Hansen
Volodymyr Mnih
DRL
OffRL
SSL
58
176
0
28 Nov 2018
Exploration by Random Network Distillation
Yuri Burda
Harrison Edwards
Amos Storkey
Oleg Klimov
78
1,310
0
30 Oct 2018
ProMP: Proximal Meta-Policy Search
Jonas Rothfuss
Dennis Lee
I. Clavera
Tamim Asfour
Pieter Abbeel
51
208
0
16 Oct 2018
Learning to Adapt in Dynamic, Real-World Environments Through Meta-Reinforcement Learning
Anusha Nagabandi
I. Clavera
Simin Liu
R. Fearing
Pieter Abbeel
Sergey Levine
Chelsea Finn
87
540
0
30 Mar 2018
Meta Reinforcement Learning with Latent Variable Gaussian Processes
Steindór Sæmundsson
Katja Hofmann
M. Deisenroth
BDL
OffRL
AI4CE
69
141
0
20 Mar 2018
Meta-Reinforcement Learning of Structured Exploration Strategies
Abhishek Gupta
Russell Mendonca
YuXuan Liu
Pieter Abbeel
Sergey Levine
OffRL
62
342
0
20 Feb 2018
Diversity is All You Need: Learning Skills without a Reward Function
Benjamin Eysenbach
Abhishek Gupta
Julian Ibarz
Sergey Levine
58
1,075
0
16 Feb 2018
Evolved Policy Gradients
Rein Houthooft
Richard Y. Chen
Phillip Isola
Bradly C. Stadie
Filip Wolski
Jonathan Ho
Pieter Abbeel
71
226
0
13 Feb 2018
Curiosity-driven Exploration by Self-supervised Prediction
Deepak Pathak
Pulkit Agrawal
Alexei A. Efros
Trevor Darrell
LRM
SSL
89
2,416
0
15 May 2017
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Chelsea Finn
Pieter Abbeel
Sergey Levine
OOD
737
11,793
0
09 Mar 2017
Deep Variational Information Bottleneck
Alexander A. Alemi
Ian S. Fischer
Joshua V. Dillon
Kevin Patrick Murphy
71
1,697
0
01 Dec 2016
Variational Intrinsic Control
Karol Gregor
Danilo Jimenez Rezende
Daan Wierstra
DRL
OffRL
44
426
0
22 Nov 2016
Learning to reinforcement learn
Jane X. Wang
Z. Kurth-Nelson
Dhruva Tirumala
Hubert Soyer
Joel Z Leibo
Rémi Munos
Charles Blundell
D. Kumaran
M. Botvinick
OffRL
56
974
0
17 Nov 2016
RL
2
^2
2
: Fast Reinforcement Learning via Slow Reinforcement Learning
Yan Duan
John Schulman
Xi Chen
Peter L. Bartlett
Ilya Sutskever
Pieter Abbeel
OffRL
57
1,011
0
09 Nov 2016
Learning to learn by gradient descent by gradient descent
Marcin Andrychowicz
Misha Denil
Sergio Gomez Colmenarejo
Matthew W. Hoffman
David Pfau
Tom Schaul
Brendan Shillingford
Nando de Freitas
65
2,000
0
14 Jun 2016
Unifying Count-Based Exploration and Intrinsic Motivation
Marc G. Bellemare
S. Srinivasan
Georg Ostrovski
Tom Schaul
D. Saxton
Rémi Munos
154
1,465
0
06 Jun 2016
One-shot Learning with Memory-Augmented Neural Networks
Adam Santoro
Sergey Bartunov
M. Botvinick
Daan Wierstra
Timothy Lillicrap
42
525
0
19 May 2016
Dueling Network Architectures for Deep Reinforcement Learning
Ziyun Wang
Tom Schaul
Matteo Hessel
H. V. Hasselt
Marc Lanctot
Nando de Freitas
OffRL
56
3,742
0
20 Nov 2015
Deep Reinforcement Learning with Double Q-learning
H. V. Hasselt
A. Guez
David Silver
OffRL
100
7,590
0
22 Sep 2015
Adam: A Method for Stochastic Optimization
Diederik P. Kingma
Jimmy Ba
ODL
519
149,474
0
22 Dec 2014
1