2106.06842
Cited By
Recomposing the Reinforcement Learning Building Blocks with Hypernetworks
12 June 2021
Shai Keynan
Elad Sarafian
Sarit Kraus
OffRL
Papers citing
"Recomposing the Reinforcement Learning Building Blocks with Hypernetworks"
43 / 43 papers shown
Title
Learning a Diffusion Model Policy from Rewards via Q-Score Matching
Michael Psenka
Alejandro Escontrela
Pieter Abbeel
Yi-An Ma
DiffM
120
29
0
17 Feb 2025
Principled Weight Initialization for Hypernetworks
Oscar Chang
Lampros Flokas
Hod Lipson
62
75
0
13 Dec 2023
D2RL: Deep Dense Architectures in Reinforcement Learning
Samarth Sinha
Homanga Bharadhwaj
A. Srinivas
Animesh Garg
OffRL
AI4CE
64
56
0
19 Oct 2020
Continual Model-Based Reinforcement Learning with Hypernetworks
Yizhou Huang
Kevin Xie
Homanga Bharadhwaj
Florian Shkurti
CLL
49
48
0
25 Sep 2020
Explicit Gradient Learning
Mor Sinay
Elad Sarafian
Y. Louzoun
Noam Agmon
Sarit Kraus
OffRL
40
8
0
09 Jun 2020
On the Modularity of Hypernetworks
Tomer Galanti
Lior Wolf
36
5
0
23 Feb 2020
Meta Reinforcement Learning with Autonomous Inference of Subtask Dependencies
Sungryull Sohn
Hyunjae Woo
Jongwook Choi
Honglak Lee
OffRL
62
36
0
01 Jan 2020
On approximating ∇f with neural networks
Saeed Saremi
FedML
41
22
0
28 Oct 2019
Meta-Q-Learning
Rasool Fakoor
Pratik Chaudhari
Stefano Soatto
Alex Smola
OffRL
70
144
0
30 Sep 2019
Deep Meta Functionals for Shape Representation
G. Littwin
Lior Wolf
3DPC
45
81
0
17 Aug 2019
Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction
Aviral Kumar
Justin Fu
George Tucker
Sergey Levine
OffRL
OnRL
104
1,044
0
03 Jun 2019
Continual learning with hypernetworks
J. Oswald
Christian Henning
Benjamin Grewe
João Sacramento
CLL
59
354
0
03 Jun 2019
A Generative Model for Sampling High-Performance and Diverse Weights for Neural Networks
Lior Deutsch
Erik Nijkamp
Yu Yang
35
16
0
07 May 2019
Efficient Off-Policy Meta-Reinforcement Learning via Probabilistic Context Variables
Kate Rakelly
Aurick Zhou
Deirdre Quillen
Chelsea Finn
Sergey Levine
OffRL
78
652
0
19 Mar 2019
Model-Based Reinforcement Learning for Atari
Lukasz Kaiser
Mohammad Babaeizadeh
Piotr Milos
B. Osinski
R. Campbell
...
Sergey Levine
Afroz Mohiuddin
Ryan Sepassi
George Tucker
Henryk Michalewski
OffRL
109
851
0
01 Mar 2019
HyperGAN: A Generative Model for Diverse, Performant Neural Networks
Neale Ratzlaff
Fuxin Li
54
64
0
30 Jan 2019
Off-Policy Deep Reinforcement Learning without Exploration
Scott Fujimoto
David Meger
Doina Precup
OffRL
BDL
187
1,586
0
07 Dec 2018
Meta-Learning with Latent Embedding Optimization
Andrei A. Rusu
Dushyant Rao
Jakub Sygnowski
Oriol Vinyals
Razvan Pascanu
Simon Osindero
R. Hadsell
130
1,366
0
16 Jul 2018
HyperNets and their application to learning spatial transformations
A. Potapov
O. Shcherbakov
I. Zhdanov
S. Rodionov
Nikolai Skorobogatko
17
3
0
12 Jul 2018
QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning
Tabish Rashid
Mikayel Samvelyan
Christian Schroeder de Witt
Gregory Farquhar
Jakob N. Foerster
Shimon Whiteson
128
1,662
0
30 Mar 2018
Addressing Function Approximation Error in Actor-Critic Methods
Scott Fujimoto
H. V. Hoof
David Meger
OffRL
164
5,121
0
26 Feb 2018
Spectral Normalization for Generative Adversarial Networks
Takeru Miyato
Toshiki Kataoka
Masanori Koyama
Yuichi Yoshida
ODL
153
4,421
0
16 Feb 2018
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
243
8,236
0
04 Jan 2018
A Deeper Look at Experience Replay
Shangtong Zhang
R. Sutton
OffRL
VLM
66
271
0
04 Dec 2017
HyperNetworks with statistical filtering for defending adversarial examples
Zhun Sun
Mete Ozay
Takayuki Okatani
AAML
38
16
0
06 Nov 2017
SMASH: One-Shot Model Architecture Search through HyperNetworks
Andrew Brock
Theodore Lim
J. Ritchie
Nick Weston
117
762
0
17 Aug 2017
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
288
18,685
0
20 Jul 2017
Learning to Learn: Meta-Critic Networks for Sample Efficient Learning
Flood Sung
Li Zhang
Tao Xiang
Timothy M. Hospedales
Yongxin Yang
OffRL
52
127
0
29 Jun 2017
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Chelsea Finn
Pieter Abbeel
Sergey Levine
OOD
781
11,793
0
09 Mar 2017
HyperNetworks
David R Ha
Andrew M. Dai
Quoc V. Le
115
1,603
0
27 Sep 2016
Densely Connected Convolutional Networks
Gao Huang
Zhuang Liu
Laurens van der Maaten
Kilian Q. Weinberger
PINN
3DV
691
36,599
0
25 Aug 2016
OpenAI Gym
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL
ODL
188
5,056
0
05 Jun 2016
A Systematic Evaluation and Benchmark for Person Re-Identification: Features, Metrics, and Datasets
Srikrishna Karanam
Mengran Gou
Ziyan Wu
Angels Rates-Borras
Mario Sznaier
Richard J. Radke
78
1,008
0
31 May 2016
Data-Efficient Off-Policy Policy Evaluation for Reinforcement Learning
Philip S. Thomas
Emma Brunskill
OffRL
307
573
0
04 Apr 2016
Weight Normalization: A Simple Reparameterization to Accelerate Training of Deep Neural Networks
Tim Salimans
Diederik P. Kingma
ODL
158
1,933
0
25 Feb 2016
Deep Reinforcement Learning with a Natural Language Action Space
Ji He
Jianshu Chen
Xiaodong He
Jianfeng Gao
Lihong Li
Li Deng
Mari Ostendorf
71
245
0
14 Nov 2015
Deep Reinforcement Learning with Double Q-learning
H. V. Hasselt
A. Guez
David Silver
OffRL
146
7,590
0
22 Sep 2015
Continuous control with deep reinforcement learning
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
267
13,174
0
09 Sep 2015
Training Very Deep Networks
R. Srivastava
Klaus Greff
Jürgen Schmidhuber
116
1,675
0
22 Jul 2015
High-Dimensional Continuous Control Using Generalized Advantage Estimation
John Schulman
Philipp Moritz
Sergey Levine
Michael I. Jordan
Pieter Abbeel
OffRL
63
3,368
0
08 Jun 2015
Trust Region Policy Optimization
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
254
6,722
0
19 Feb 2015
Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift
Sergey Ioffe
Christian Szegedy
OOD
398
43,154
0
11 Feb 2015
Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification
Kaiming He
Xiangyu Zhang
Shaoqing Ren
Jian Sun
VLM
246
18,534
0
06 Feb 2015